Andrey Lukyanenko's personal site

Collaborative Reinforcement Learning: Why HACRL Trains Models in Teams Instead of Isolation

16 March 2026

HACRL proposes a new paradigm for reinforcement learning: instead of training in isolation, multiple agents collaborate by sharing successful trajectories during training. This simple idea enables more efficient exploration and improves performance across heterogeneous models.

paperreview deeplearning rl llm

Beyond Positional Bias: How DroPE Unlocks Zero-Shot Long Context in LLMs

23 February 2026

A review of DroPE, a simple but counterintuitive method that extends LLM context length by dropping positional embeddings at inference and achieves strong zero-shot long-context generalization without retraining.

paperreview deeplearning llm attention

Kimi K2.5 Review: Native Multimodality and Agent Swarms at 1 Trillion Parameters

16 February 2026

A deep-dive review of Kimi K2.5, a next-generation open multimodal model that combines native vision-language training with parallel agent orchestration. This post explains why Agent Swarm and joint multimodal optimization matter and how K2.5 meaningfully differs from today's top closed and open models.

paperreview deeplearning llm vlm

Paper Review: PaperBanana: Automating Academic Illustration for AI Scientists

09 February 2026

My review of the paper "PaperBanana: Automating Academic Illustration for AI Scientists"

paperreview deeplearning agent vlm

Paper Review: mHC: Manifold-Constrained Hyper-Connections

26 January 2026

My review of the paper "mHC: Manifold-Constrained Hyper-Connections"

paperreview deeplearning architecture llm

Top-10 ML papers I read in 2025

24 December 2025

Top-10 ML and AI papers I read in 2025

paperreview deeplearning blogpost

Blogposts

Browse by category: