Paper Review: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
27 January 2025
My review of the paper DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Data science, career and other topics
13 January 2025
My review of the paper STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
06 January 2025
My review of the paper Training Large Language Models to Reason in a Continuous Latent Space
28 December 2024
12 years of studying foreign languages with Anki
23 December 2024
My review of the paper Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
16 December 2024
My review of the paper Byte Latent Transformer: Patches Scale Better Than Tokens