Blogposts

Data science, career and other topics

Browse by category:

All Posts Paper Reviews Blog Posts

Paper Review: Goku: Flow Based Video Generative Foundation Models

17 February 2025

My review of the paper Goku Flow Based Video Generative Foundation Models

paperreview deeplearning transformer imagegeneration

Paper Review: Titans: Learning to Memorize at Test Time

03 February 2025

A new architecture that pairs attention with a learnable long-term memory module, scaling to 2M+ tokens and outperforming Transformers on language modeling, reasoning, genomics, and time series.

paperreview deeplearning llm nlp

Paper Review: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

27 January 2025

How pure reinforcement learning (without supervised fine-tuning) can teach LLMs to reason, producing open-source models that rival OpenAI-o1 on math and coding benchmarks.

paperreview deeplearning llm rl

Paper Review: STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

13 January 2025

My review of the paper STAR Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

paperreview deeplearning cv video

Paper Review: Training Large Language Models to Reason in a Continuous Latent Space

06 January 2025

Coconut lets LLMs reason in latent space instead of generating text tokens, enabling breadth-first exploration of reasoning paths and better performance on tasks requiring backtracking.

paperreview deeplearning nlp llm

12 years of studying foreign languages with Anki

28 December 2024

12 years of studying foreign languages with Anki

blogpost life languages

Newer Posts Older Posts