Andrey Lukyanenko's personal site

Paper Review: Titans: Learning to Memorize at Test Time

03 February 2025

A new architecture that pairs attention with a learnable long-term memory module, scaling to 2M+ tokens and outperforming Transformers on language modeling, reasoning, genomics, and time series.

paperreview deeplearning llm nlp

Paper Review: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

27 January 2025

How pure reinforcement learning (without supervised fine-tuning) can teach LLMs to reason, producing open-source models that rival OpenAI-o1 on math and coding benchmarks.

paperreview deeplearning llm rl

Paper Review: STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

13 January 2025

My review of the paper STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution.

paperreview deeplearning cv video

Paper Review: Training Large Language Models to Reason in a Continuous Latent Space

06 January 2025

Coconut lets LLMs reason in latent space instead of generating text tokens, enabling breadth-first exploration of reasoning paths and better performance on tasks requiring backtracking.

paperreview deeplearning nlp llm

12 years of studying foreign languages with Anki

28 December 2024

blogpost life languages

Paper Review: Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

23 December 2024

BERT rebuilt with modern tricks (2 trillion training tokens, an 8192-token context length, Flash Attention, and rotary embeddings), delivering state-of-the-art classification and retrieval with the best speed/memory efficiency among encoders.

paperreview deeplearning nlp transformer
