Tag: finetuning
- Paper Review: Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference (23 Dec 2024)
- Paper Review: Diffusion Model Alignment Using Direct Preference Optimization (27 Nov 2023)
- Paper Review: Zephyr: Direct Distillation of LM Alignment (30 Oct 2023)
- Paper Review: InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining (16 Oct 2023)
- Paper Review: QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models (05 Oct 2023)
- Paper Review: Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback (10 Aug 2023)
- Paper Review: Llama 2: Open Foundation and Fine-Tuned Chat Models (20 Jul 2023)
- Paper Review: Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning (17 Jul 2023)
- Paper Review: Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision (19 Jun 2023)
- Paper Review: QLoRA: Efficient Finetuning of Quantized LLMs (01 Jun 2023)
- Paper Review: Chain of Hindsight Aligns Language Models with Feedback (30 May 2023)