Tag: audio
4 posts
Paper Review: Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
My review of the paper Audio Flamingo 2 An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Ab...
Paper Review: Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
My review of the paper Voicebox Text-Guided Multilingual Universal Speech Generation at Scale
Paper Review: MMS: Scaling Speech Technology to 1000+ languages
My review of the paper MMS Scaling Speech Technology to 1000+ languages
Paper Review: NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
My review of the paper NaturalSpeech 2 Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers