Tag: llm – Andrey Lukyanenko

Jun 15, 2026

MiniMax Sparse Attention: Per-Group Block Selection for Cheap Million-Token Inference

MiniMax Sparse Attention is a practical sparse-attention design for million-token LLMs - it uses a lightweight learne...

paperreview deeplearning llm attention

Jun 10, 2026

Testing MiniMax M3 on real tasks: repo refactor, screenshot debugging, and Spotify recommendations

A hands-on look at MiniMax M3 through Claude Code — what its new MiniMax Sparse Attention (MSA) is and how it differs...

blogpost ai llm claude

Jun 09, 2026

Book Review: 50 ML Projects to Understand LLMs

A review of Mike X Cohen's 50 ML Projects to Understand LLMs, a hands-on book that uses code, statistics, and controlle...

blogpost books llm interpretability

May 18, 2026

Testing MiniMax M2.7 via API on three real ML and coding workflows

An evaluation of MiniMax M2.7 used through Claude Code on three workflows I run regularly — writing code for a Kaggle...

blogpost ai llm claude

Apr 24, 2026

DeepSeek-V4 Review: Why Million-Token Context Needs Efficient Attention, Not Just Larger Windows

DeepSeek V4 pairs a hybrid sparse-attention stack with on-policy distillation across domain specialists to bring 1M-t...

paperreview deeplearning llm moe

Apr 20, 2026

FIPO: Teaching LLMs Which Thoughts Actually Matter

FIPO is an RL algorithm that fixes one of the core limitations of RL for LLM reasoning: credit assignment. Instead of...

paperreview deeplearning llm rl

Apr 09, 2026

Book Review: Unlocking Data with Generative AI and RAG, Second Edition

A review of Keith Bourne's second edition of Unlocking Data with Generative AI and RAG, covering the running example th...

blogpost books llm rag

Apr 06, 2026

Book Review: A Practical Guide to Reinforcement Learning from Human Feedback

A review of Sandip Kulkarni's book on RLHF, covering its strengths as a structured learning resource, its reliance on b...

blogpost books rl rlhf

Mar 16, 2026

Collaborative Reinforcement Learning: Why HACRL Trains Models in Teams Instead of Isolation

HACRL proposes a new paradigm for reinforcement learning - instead of training models in isolation, multiple agents c...

paperreview deeplearning rl llm

Feb 23, 2026

Beyond Positional Bias: How DroPE Unlocks Zero-Shot Long Context in LLMs

A review of DroPE, a simple but counterintuitive method that extends LLM context length by dropping positional embedd...

paperreview deeplearning llm attention

Feb 16, 2026

Kimi K2.5 Review: Native Multimodality and Agent Swarms at 1 Trillion Parameters

A deep-dive review of Kimi K2.5, a next-generation open multimodal model that combines native vision-language trainin...

paperreview deeplearning llm vlm

Jan 26, 2026

Paper Review: mHC: Manifold-Constrained Hyper-Connections

My review of the paper mHC Manifold-Constrained Hyper-Connections

paperreview deeplearning architecture llm

Nov 24, 2025

Paper Review: SAM 3: Segment Anything with Concepts

Meta's unified model for detecting, segmenting, and tracking objects using text or image prompts — trained on 4M conc...

paperreview deeplearning imagesegmentation llm

Nov 17, 2025

Paper Review: HunyuanImage 3.0 Technical Report

My review of the paper HunyuanImage 3.0 Technical Report

paperreview deeplearning llm imagegeneration

Nov 03, 2025

Paper Review: Chronos-2: From Univariate to Universal Forecasting

Chronos-2 extends zero-shot time series forecasting to multivariate and covariate settings with a new group attention...

paperreview deeplearning llm timeseries

Oct 27, 2025

Paper Review: The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

A biologically inspired LLM built as a graph of spiking neurons with Hebbian learning — it matches GPT-2 scaling whil...

paperreview deeplearning nlp llm

Sep 15, 2025

Paper Review: Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

My review of the paper Sharing is Caring Efficient LM Post-Training with Collective RL Experience Sharing

paperreview deeplearning nlp llm

Aug 04, 2025

Paper Review: Group Sequence Policy Optimization

My review of the paper Group Sequence Policy Optimization

paperreview deeplearning llm rl

Jul 28, 2025

Paper Review: Subliminal Learning: Language models transmit behavioral traits via hidden signals in data

My review of the paper Subliminal Learning Language models transmit behavioral traits via hidden signals in data

paperreview deeplearning llm distillation

Jun 30, 2025

Paper Review: ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

My review of the paper ProRL Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

paperreview deeplearning llm rl

Jun 09, 2025

Paper Review: Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Only ~20% of tokens actually matter when training LLMs to reason with RL. Updating the low-entropy majority actively ...

paperreview deeplearning llm rl

Jun 02, 2025

Paper Review: SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

My review of the paper SWE-rebench An Automated Pipeline for Task Collection and Decontaminated Evaluation of Softwar...

paperreview deeplearning llm evaluation

May 26, 2025

Paper Review: Visual Planning: Let's Think Only with Images

My review of the paper Visual Planning Let's Think Only with Images

paperreview deeplearning llm rl

May 15, 2025

Paper Review: AlphaEvolve: A coding agent for scientific and algorithmic discovery

DeepMind's autonomous coding agent that evolves algorithms through LLM-driven iteration — it discovered the first imp...

paperreview deeplearning agent nlp

Apr 28, 2025

Paper Review: AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents

My review of the paper AgentA/B Automated and Scalable Web A/BTesting with Interactive LLM Agents

paperreview deeplearning agent nlp

Apr 21, 2025

Paper Review: M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

My review of the paper M1 Towards Scalable Test-Time Compute with Mamba Reasoning Models

paperreview deeplearning rnn distillation

Mar 10, 2025

Paper Review: Large Language Diffusion Models

LLaDA replaces autoregressive token generation with diffusion-based masked prediction, rivaling LLaMA3 8B while natur...

paperreview deeplearning nlp transformer

Feb 03, 2025

Paper Review: Titans: Learning to Memorize at Test Time

A new architecture that pairs attention with a learnable long-term memory module, scaling to 2M+ tokens and outperfor...

paperreview deeplearning llm nlp

Jan 27, 2025

Paper Review: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

How pure reinforcement learning (without supervised fine-tuning) can teach LLMs to reason, producing open-source mode...

paperreview deeplearning llm rl

Jan 06, 2025

Paper Review: Training Large Language Models to Reason in a Continuous Latent Space

Coconut lets LLMs reason in latent space instead of generating text tokens, enabling breadth-first exploration of rea...

paperreview deeplearning nlp llm

Dec 16, 2024

Paper Review: Byte Latent Transformer: Patches Scale Better Than Tokens

My review of the paper Byte Latent Transformer Patches Scale Better Than Tokens

paperreview deeplearning nlp llm

Dec 09, 2024

Paper Review: Reverse Thinking Makes LLMs Stronger Reasoners

My review of the paper Reverse Thinking Makes LLMs Stronger Reasoners

paperreview deeplearning nlp llm

Nov 25, 2024

Paper Review: Project Sid: Many-agent simulations toward AI civilization

What happens when you put 1k AI agents in Minecraft and let them self-organize? They form governments, transmit cultu...

paperreview deeplearning nlp llm

Nov 11, 2024

Paper Review: Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

My review of the paper Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

paperreview deeplearning nlp llm

Oct 29, 2024

Paper Review: Unbounded: A Generative Infinite Game of Character Life Simulation

My review of the paper Unbounded A Generative Infinite Game of Character Life Simulation

paperreview deeplearning nlp llm

Sep 23, 2024

Paper Review: Training Language Models to Self-Correct via Reinforcement Learning

My review of the paper Training Language Models to Self-Correct via Reinforcement Learning

paperreview deeplearning rl llm

Sep 04, 2024

Paper Review: Agentic Retrieval-Augmented Generation for Time Series Analysis

My review of the paper Agentic Retrieval-Augmented Generation for Time Series Analysis

paperreview deeplearning llm timeseries

Aug 19, 2024

Paper Review: Winning Amazon KDD Cup24

My review of the paper Winning Amazon KDD Cup24

paperreview deeplearning llm qa

Aug 12, 2024

Paper Review: Wolf: Captioning Everything with a World Summarization Framework

My review of the paper Wolf Captioning Everything with a World Summarization Framework

paperreview deeplearning llm vlm

Jul 22, 2024

Paper Review: RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

My review of the paper RankRAG Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

paperreview deeplearning llm rag

Jul 15, 2024

Paper Review: Unveiling Encoder-Free Vision-Language Models

My review of the paper Unveiling Encoder-Free Vision-Language Models

paperreview deeplearning llm vlm

Jul 01, 2024

Paper Review: Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning

My review of the paper Husky A Unified, Open-Source Language Agent for Multi-Step Reasoning

paperreview deeplearning llm agent

May 06, 2024

Paper Review: FlowMind: Automatic Workflow Generation with LLMs

My review of the paper FlowMind Automatic Workflow Generation with LLMs

paperreview deeplearning llm agent

Apr 15, 2024

Paper Review: Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models

My review of the paper Ferret-v2 An Improved Baseline for Referring and Grounding with Large Language Models

paperreview deeplearning llm cv

Mar 25, 2024

Paper Review: Chronos: Learning the Language of Time Series

Amazon's framework that tokenizes time series data for pretrained language models, enabling zero-shot forecasting tha...

paperreview deeplearning llm timeseries

Feb 12, 2024

Paper Review: Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

My review of the paper Lag-Llama Towards Foundation Models for Probabilistic Time Series Forecasting

paperreview deeplearning llm timeseries

Jan 15, 2024

Paper Review: Ferret: Refer and Ground Anything Anywhere at Any Granularity

My review of the paper Ferret Refer and Ground Anything Anywhere at Any Granularity

paperreview deeplearning llm cv

Jan 08, 2024

Paper Review: DocLLM: A layout-aware generative language model for multimodal document understanding

My review of the paper DocLLM A layout-aware generative language model for multimodal document understanding

paperreview deeplearning llm attention

Dec 18, 2023

Paper Review: Pixel Aligned Language Models

My review of the paper Pixel Aligned Language Models

paperreview deeplearning llm cv

Nov 23, 2023

Paper Review: Orca 2: Teaching Small Language Models How to Reason

My review of the paper Orca 2 Teaching Small Language Models How to Reason

paperreview deeplearning nlp llm

Nov 20, 2023

Paper Review: Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models

My review of the paper Chain-of-Note Enhancing Robustness in Retrieval-Augmented Language Models

paperreview deeplearning nlp llm

Nov 13, 2023

Paper Review: Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM

My review of the paper Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM

paperreview deeplearning llm qa

Nov 06, 2023

Paper Review: Collaborative Large Language Model for Recommender Systems

My review of the paper Collaborative Large Language Model for Recommender Systems

paperreview deeplearning llm recommender

Oct 30, 2023

Paper Review: Zephyr: Direct Distillation of LM Alignment

My review of the paper Zephyr Direct Distillation of LM Alignment

paperreview deeplearning nlp llm

Oct 23, 2023

Paper Review: Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

My review of the paper Self-RAG Learning to Retrieve, Generate, and Critique through Self-Reflection

paperreview deeplearning llm nlp

Oct 19, 2023

Paper Review: PaLI-3 Vision Language Models: Smaller, Faster, Stronger

My review of the paper PaLI-3 Vision Language Models Smaller, Faster, Stronger

paperreview deeplearning llm vlm

Oct 16, 2023

Paper Review: InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining

My review of the paper InstructRetro Instruction Tuning post Retrieval-Augmented Pretraining

paperreview deeplearning llm nlp

Oct 12, 2023

Paper Review: Mistral 7B

My review of the paper Mistral 7B

paperreview deeplearning llm nlp

Oct 09, 2023

Paper Review: Think before you speak: Training Language Models With Pause Tokens

My review of the paper Think before you speak Training Language Models With Pause Tokens

paperreview deeplearning llm nlp

Oct 05, 2023

Paper Review: QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

My review of the paper QA-LoRA Quantization-Aware Low-Rank Adaptation of Large Language Models

paperreview deeplearning llm nlp

Sep 28, 2023

Paper Review: DreamLLM: Synergistic Multimodal Comprehension and Creation

My review of the paper DreamLLM Synergistic Multimodal Comprehension and Creation

paperreview deeplearning llm cv

Sep 21, 2023

Paper Review: Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

My review of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

paperreview deeplearning llm promptengineering

Sep 04, 2023

Paper Review: RecMind: Large Language Model Powered Agent For Recommendation

My review of the paper RecMind Large Language Model Powered Agent For Recommendation

paperreview deeplearning llm

Aug 28, 2023

Paper Review: Giraffe: Adventures in Expanding Context Lengths in LLMs

My review of the paper Giraffe Adventures in Expanding Context Lengths in LLMs

paperreview deeplearning nlp llm

Aug 24, 2023

Paper Review: OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

My review of the paper OBELISC An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

paperreview deeplearning nlp llm

Aug 10, 2023

Paper Review: Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

A systematic survey of what's broken in RLHF — from reward hacking to evaluation gaps — and what techniques can fix, ...

paperreview deeplearning nlp llm

Aug 10, 2023

Paper Review: UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition

My review of the paper UniversalNER Targeted Distillation from Large Language Models for Open Named Entity Recognition

paperreview deeplearning nlp llm

Aug 07, 2023

Paper Review: Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

My review of the paper Skeleton-of-Thought Large Language Models Can Do Parallel Decoding

paperreview deeplearning nlp llm

Jul 24, 2023

Paper Review: Retentive Network: A Successor to Transformer for Large Language Models

My review of the paper Retentive Network A Successor to Transformer for Large Language Models

paperreview deeplearning nlp transformer

Jul 03, 2023

Paper Review: Multilingual End to End Entity Linking

My review of the paper Multilingual End to End Entity Linking

paperreview deeplearning nlp llm

Jun 19, 2023

Paper Review: Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

My review of the paper Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

paperreview deeplearning nlp llm

May 30, 2023

Paper Review: Chain of Hindsight Aligns Language Models with Feedback

My review of the paper Chain of Hindsight Aligns Language Models with Feedback

paperreview deeplearning nlp llm

May 18, 2023

Paper Review: DarkBERT: A Language Model for the Dark Side of the Internet

My review of the paper DarkBERT A Language Model for the Dark Side of the Internet

paperreview deeplearning nlp pretraining

May 08, 2023

Paper Review: Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

My review of the paper Distilling Step-by-Step Outperforming Larger Language Models with Less Training Data and Small...

paperreview deeplearning nlp distillation

Mar 20, 2023

Paper Review: Hyena Hierarchy: Towards Larger Convolutional Language Models

My review of the paper Hyena Hierarchy Towards Larger Convolutional Language Models

paperreview deeplearning nlp cv

Mar 13, 2023

Paper Review: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

My review of the paper Visual ChatGPT Talking, Drawing and Editing with Visual Foundation Models

paperreview deeplearning nlp transformer