(2025-Current) - Intern @Télécom Paris
- High-quality, expressive music generation in the audio domain, informed by structure.
- Dataset processing.
(2023-2025) - Researcher @CPqD
- Improved a hybrid ASR system by adapting its LM on synthetic domain-specific text generated by LoRA-fine-tuned LLMs.
- Evaluated the fairness of the company’s ASR on multi-accented Brazilian Portuguese speech.
- Developed an accurate and efficient two-stage SSL-based speech emotion recognition system.
- Enriched the company’s call center customer profiler by developing a SOTA speech age and gender classifier.
(2021-2023) - Master's Fellow @CPqD
- Implemented customer-oriented neural expressive TTS models for Brazilian Portuguese.
- Enabled customers to edit synthesized audio with character-level prosody control on an ONNX FastPitch model.
- Conducted perceptual experiments to evaluate speech naturalness, emotion intensity, and speaker similarity.
(2020-2021) - Intern @LPS
- Enhanced lab automation by designing neural speech command recognition systems.
- Encapsulated the command recognition system in a local private LoRa network for IoT applications.
- Enabled long-distance voice control by developing a wearable prototype with an embedded microphone.
Skills
- Deep Learning: PyTorch, TensorFlow, scikit-learn, Hugging Face, ONNX, Gradio
- Frameworks: Lightning, Amphion, Coqui, SpeechBrain, ESPnet, Kaldi
- Tools: Docker, Git, Cloud (AWS, GCP)
- Programming: Python, C, C++, MATLAB, LaTeX
- Languages: Portuguese, English, French
- Competences: Paper Implementation, Experiment Design, Team Collaboration
Education
(2021-2024) M.Sc. Computer Engineering @UNICAMP
(2019-2020) Excellence Scholarship Exchange Student @Télécom Paris
(2017-2021) B.Sc. Electronic Engineering @UFPB
Recent posts
Seeing Sounds: Spectrogram Art
Currently writing… Will be out soon!