Paper Review: Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
15 June 2023
Yann LeCun's I-JEPA learns semantic image representations by predicting masked patch features — no data augmentation needed — training a ViT-Huge on ImageNet in under 72 hours on 16 GPUs.