Tag: tts Paper Review: NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models (11 Mar 2024) Paper Review: Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale (23 Jun 2023) Paper Review: MMS: Scaling Speech Technology to 1000+ languages (25 May 2023) All tags paperreview (191) deeplearning (187) cv (76) nlp (74) llm (65) transformer (35) multimodal (22) blogpost (19) pretraining (16) sota (15) imagesegmentation (14) finetuning (11) attention (11) pytorch (10) objectdetection (10) career (9) rl (8) imagegeneration (8) diffusion (8) video (7) stablediffusion (7) agent (7) life (6) datascience (6) vlm (5) timeseries (5) speech (5) selfsupervised (5) optimization (5) ner (4) mllm (4) languages (4) gan (4) audio (4) yolo (3) visual (3) tts (3) tokenization (3) superresolution (3) styletransfer (3) rnn (3) recommender (3) reasoning (3) kaggle (3) imagecaptioning (3) gnn (3) distillation (3) bert (3) augmentation (3) videogeneration (2) transferlearning (2) simulation (2) relationextraction (2) ranking (2) rag (2) qa (2) mamba (2) machinelearning (2) jobsearch (2) graph (2) gpt (2) generation (2) fewshotlearning (2) dpo (2) competition (2) cnn (2) classification (2) weaksupervision (1) unet (1) textgeneration (1) tensorflow (1) tabular (1) swa (1) summarization (1) speechtranslation (1) speechtospeech (1) speechrecognition (1) sentenceembeddings (1) semisupervised (1) scaling (1) robustness (1) robotics (1) recurrent (1) realtime (1) quantization (1) promptengineering (1) objecttracking (1) nlg (1) nas (1) motivation (1) motiontracking (1) mlp (1) mentoring (1) memoryoptimization (1) languagetranslation (1) jigsaw (1) interview (1) instructlearning (1) inferencespeed (1) imagetextmatching (1) imagerestoration (1) imageinpainting (1) forecasting (1) flowmatching (1) fail (1) evaluation (1) entitylinking (1) endtoend (1) embedding (1) efficiency (1) depthestimation (1) curriculumlearning (1) contrastivelearning (1) coco (1) clip (1) chatbot (1) captioning (1) books (1) autoencoder (1) asr (1) architecture (1) annotation (1) anchorfree (1) alignment (1) advice (1) adversarial (1) activationfunction (1)