“Stop thinking about (art) works as objects and start thinking about them as triggers for experiences. What makes a work (of art) good for you is not something that s already inside it but something that happens inside you.”- Brian Eno

HMAPS-Conformer

HMAPS-Conformer

Enhancing SOTA Speech Deepfake Detection with GEMAPS Acoustic Features. Entry to the ASVspoof5 challenge.

Repository

TTS objective metrics

TTS Objective Metrics

A compilation of the objective metrics used in several text-to-speech (TTS) papers.

Repository

Timbre Perturbation for ASR

Timbre Perturbation for ASR

Experimented with timbre perturbation as data augmentation for a Wav2Vec2. Entry to Kaggle’s Bengali.AI ASR challenge.

Notebook

aitmospheric

a i t m o s p h e r i c

Generating soundscapes with VQ‑VAEs for compositional use and inspiration. Entry to the 1st Sound of AI Hackathon.

Live Demo

Cross-Speaker Style Transfer

Cross-Speaker Style Transfer

A fork from Coqui-AI (🐸TTS) used to research about expressive TTS.

Repository

BirdCLEF 2024

BirdCLEF 2024

EffNet Ensemble. Entry to the BirdCLEF2024 challenge.

Repository