Tag: pretraining
17 posts
Beyond Positional Bias: How DroPE Unlocks Zero-Shot Long Context in LLMs
A review of DroPE, a simple but counterintuitive method that extends LLM context length by dropping positional embeddings
Paper Review: DINOv3
Meta's self-supervised vision model trained on 17 billion images, introducing Gram anchoring to prevent feature degradation
Paper Review: SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Google's upgraded vision-language encoders that add self-supervised learning and online data curation to SigLIP, delivering gains in semantic understanding, localization, and dense features
Paper Review: CogVLM: Visual Expert for Pretrained Language Models
My review of the paper CogVLM: Visual Expert for Pretrained Language Models
Paper Review: Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Yann LeCun's I-JEPA learns semantic image representations by predicting masked patch features — no hand-crafted data augmentations required
Paper Review: The effectiveness of MAE pre-pretraining for billion-scale pretraining
My review of the paper The effectiveness of MAE pre-pretraining for billion-scale pretraining
Paper Review: DarkBERT: A Language Model for the Dark Side of the Internet
My review of the paper DarkBERT: A Language Model for the Dark Side of the Internet
Paper Review: DINOv2: Learning Robust Visual Features without Supervision
How Meta built all-purpose visual features by scaling self-supervised pretraining to a curated 142M-image dataset, producing features that work across tasks without fine-tuning
Paper Review: NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
My review of the paper NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
Paper Review: Efficient Visual Pretraining with Contrastive Detection
My review of the paper Efficient Visual Pretraining with Contrastive Detection
Paper Review: CoAtNet: Marrying Convolution and Attention for All Data Sizes
My review of the paper CoAtNet: Marrying Convolution and Attention for All Data Sizes
Paper Review: ByT5: Towards a token-free future with pre-trained byte-to-byte models
My review of the paper ByT5: Towards a token-free future with pre-trained byte-to-byte models
Paper Review: Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence
My review of the paper Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence
Paper Review: Are Pre-trained Convolutions Better than Pre-trained Transformers?
My review of the paper Are Pre-trained Convolutions Better than Pre-trained Transformers?
Paper Review: LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
My review of the paper LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
Paper Review: ReXNet: Diminishing Representational Bottleneck on Convolutional Neural Network
My review of the paper ReXNet: Diminishing Representational Bottleneck on Convolutional Neural Network
Paper Review: VirTex: Learning Visual Representations from Textual Annotations
My review of the paper VirTex: Learning Visual Representations from Textual Annotations