Paper Review: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
27 January 2025
My review of the paper DeepSeek-R1 Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Data science, career and other topics
27 January 2025
My review of the paper DeepSeek-R1 Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
13 January 2025
My review of the paper STAR Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
06 January 2025
My review of the paper Training Large Language Models to Reason in a Continuous Latent Space
28 December 2024
12 years of studying foreign languages with Anki
23 December 2024
My review of the paper Smarter, Better, Faster, Longer A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
16 December 2024
My review of the paper Byte Latent Transformer Patches Scale Better Than Tokens