Paper Review: DINOv2: Learning Robust Visual Features without Supervision
20 April 2023
How Meta built all-purpose visual features by scaling self-supervised pretraining to a curated 142M-image dataset, producing frozen features that outperform OpenCLIP on most image- and pixel-level benchmarks without any fine-tuning.