Tag: llm

68 posts

Mar 16, 2026
Collaborative Reinforcement Learning: Why HACRL Trains Models in Teams Instead of Isolation
HACRL proposes a new paradigm for reinforcement learning: instead of training models in isolation, multiple agents c...
paperreview deeplearning rl llm
Feb 23, 2026
Beyond Positional Bias: How DroPE Unlocks Zero-Shot Long Context in LLMs
A review of DroPE, a simple but counterintuitive method that extends LLM context length by dropping positional embedd...
paperreview deeplearning llm attention
Feb 16, 2026
Kimi K2.5 Review: Native Multimodality and Agent Swarms at 1 Trillion Parameters
A deep-dive review of Kimi K2.5, a next-generation open multimodal model that combines native vision-language trainin...
paperreview deeplearning llm vlm
Jan 26, 2026
Paper Review: mHC: Manifold-Constrained Hyper-Connections
My review of the paper mHC Manifold-Constrained Hyper-Connections
paperreview deeplearning architecture llm
Nov 24, 2025
Paper Review: SAM 3: Segment Anything with Concepts
Meta's unified model for detecting, segmenting, and tracking objects using text or image prompts — trained on 4M conc...
paperreview deeplearning imagesegmentation llm
Nov 17, 2025
Paper Review: HunyuanImage 3.0 Technical Report
My review of the paper HunyuanImage 3.0 Technical Report
paperreview deeplearning llm imagegeneration
Nov 03, 2025
Paper Review: Chronos-2: From Univariate to Universal Forecasting
Chronos-2 extends zero-shot time series forecasting to multivariate and covariate settings with a new group attention...
paperreview deeplearning llm timeseries
Oct 27, 2025
Paper Review: The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
A biologically inspired LLM built as a graph of spiking neurons with Hebbian learning — it matches GPT-2 scaling whil...
paperreview deeplearning nlp llm
Sep 15, 2025
Paper Review: Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
My review of the paper Sharing is Caring Efficient LM Post-Training with Collective RL Experience Sharing
paperreview deeplearning nlp llm
Aug 04, 2025
Paper Review: Group Sequence Policy Optimization
My review of the paper Group Sequence Policy Optimization
paperreview deeplearning llm rl
Jul 28, 2025
Paper Review: Subliminal Learning: Language models transmit behavioral traits via hidden signals in data
My review of the paper Subliminal Learning Language models transmit behavioral traits via hidden signals in data
paperreview deeplearning llm distillation
Jun 30, 2025
Paper Review: ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
My review of the paper ProRL Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
paperreview deeplearning llm rl
Jun 09, 2025
Paper Review: Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Only ~20% of tokens actually matter when training LLMs to reason with RL. Updating the low-entropy majority actively ...
paperreview deeplearning llm rl
Jun 02, 2025
Paper Review: SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents
My review of the paper SWE-rebench An Automated Pipeline for Task Collection and Decontaminated Evaluation of Softwar...
paperreview deeplearning llm evaluation
May 26, 2025
Paper Review: Visual Planning: Let's Think Only with Images
My review of the paper Visual Planning Let's Think Only with Images
paperreview deeplearning llm rl
May 15, 2025
Paper Review: AlphaEvolve: A coding agent for scientific and algorithmic discovery
DeepMind's autonomous coding agent that evolves algorithms through LLM-driven iteration — it discovered the first imp...
paperreview deeplearning agent nlp
Apr 28, 2025
Paper Review: AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents
My review of the paper AgentA/B Automated and Scalable Web A/BTesting with Interactive LLM Agents
paperreview deeplearning agent nlp
Apr 21, 2025
Paper Review: M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
My review of the paper M1 Towards Scalable Test-Time Compute with Mamba Reasoning Models
paperreview deeplearning rnn distillation
Mar 10, 2025
Paper Review: Large Language Diffusion Models
LLaDA replaces autoregressive token generation with diffusion-based masked prediction, rivaling LLaMA3 8B while natur...
paperreview deeplearning nlp transformer
Feb 03, 2025
Paper Review: Titans: Learning to Memorize at Test Time
A new architecture that pairs attention with a learnable long-term memory module, scaling to 2M+ tokens and outperfor...
paperreview deeplearning llm nlp
Jan 27, 2025
Paper Review: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
How pure reinforcement learning (without supervised fine-tuning) can teach LLMs to reason, producing open-source mode...
paperreview deeplearning llm rl
Jan 06, 2025
Paper Review: Training Large Language Models to Reason in a Continuous Latent Space
Coconut lets LLMs reason in latent space instead of generating text tokens, enabling breadth-first exploration of rea...
paperreview deeplearning nlp llm
Dec 16, 2024
Paper Review: Byte Latent Transformer: Patches Scale Better Than Tokens
My review of the paper Byte Latent Transformer Patches Scale Better Than Tokens
paperreview deeplearning nlp llm
Dec 09, 2024
Paper Review: Reverse Thinking Makes LLMs Stronger Reasoners
My review of the paper Reverse Thinking Makes LLMs Stronger Reasoners
paperreview deeplearning nlp llm
Nov 25, 2024
Paper Review: Project Sid: Many-agent simulations toward AI civilization
What happens when you put 1k AI agents in Minecraft and let them self-organize? They form governments, transmit cultu...
paperreview deeplearning nlp llm
Nov 11, 2024
Paper Review: Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
My review of the paper Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
paperreview deeplearning nlp llm
Oct 29, 2024
Paper Review: Unbounded: A Generative Infinite Game of Character Life Simulation
My review of the paper Unbounded A Generative Infinite Game of Character Life Simulation
paperreview deeplearning nlp llm
Sep 23, 2024
Paper Review: Training Language Models to Self-Correct via Reinforcement Learning
My review of the paper Training Language Models to Self-Correct via Reinforcement Learning
paperreview deeplearning rl llm
Sep 04, 2024
Paper Review: Agentic Retrieval-Augmented Generation for Time Series Analysis
My review of the paper Agentic Retrieval-Augmented Generation for Time Series Analysis
paperreview deeplearning llm timeseries
Aug 19, 2024
Paper Review: Winning Amazon KDD Cup24
My review of the paper Winning Amazon KDD Cup24
paperreview deeplearning llm qa
Aug 12, 2024
Paper Review: Wolf: Captioning Everything with a World Summarization Framework
My review of the paper Wolf Captioning Everything with a World Summarization Framework
paperreview deeplearning llm vlm
Jul 22, 2024
Paper Review: RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
My review of the paper RankRAG Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
paperreview deeplearning llm rag
Jul 15, 2024
Paper Review: Unveiling Encoder-Free Vision-Language Models
My review of the paper Unveiling Encoder-Free Vision-Language Models
paperreview deeplearning llm vlm
Jul 01, 2024
Paper Review: Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
My review of the paper Husky A Unified, Open-Source Language Agent for Multi-Step Reasoning
paperreview deeplearning llm agent
May 06, 2024
Paper Review: FlowMind: Automatic Workflow Generation with LLMs
My review of the paper FlowMind Automatic Workflow Generation with LLMs
paperreview deeplearning llm agent
Apr 15, 2024
Paper Review: Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
My review of the paper Ferret-v2 An Improved Baseline for Referring and Grounding with Large Language Models
paperreview deeplearning llm cv
Mar 25, 2024
Paper Review: Chronos: Learning the Language of Time Series
Amazon's framework that tokenizes time series data for pretrained language models, enabling zero-shot forecasting tha...
paperreview deeplearning llm timeseries
Feb 12, 2024
Paper Review: Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
My review of the paper Lag-Llama Towards Foundation Models for Probabilistic Time Series Forecasting
paperreview deeplearning llm timeseries
Jan 15, 2024
Paper Review: Ferret: Refer and Ground Anything Anywhere at Any Granularity
My review of the paper Ferret Refer and Ground Anything Anywhere at Any Granularity
paperreview deeplearning llm cv
Jan 08, 2024
Paper Review: DocLLM: A layout-aware generative language model for multimodal document understanding
My review of the paper DocLLM A layout-aware generative language model for multimodal document understanding
paperreview deeplearning llm attention
Dec 18, 2023
Paper Review: Pixel Aligned Language Models
My review of the paper Pixel Aligned Language Models
paperreview deeplearning llm cv
Nov 23, 2023
Paper Review: Orca 2: Teaching Small Language Models How to Reason
My review of the paper Orca 2 Teaching Small Language Models How to Reason
paperreview deeplearning nlp llm
Nov 20, 2023
Paper Review: Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models
My review of the paper Chain-of-Note Enhancing Robustness in Retrieval-Augmented Language Models
paperreview deeplearning nlp llm
Nov 13, 2023
Paper Review: Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
My review of the paper Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
paperreview deeplearning llm qa
Nov 06, 2023
Paper Review: Collaborative Large Language Model for Recommender Systems
My review of the paper Collaborative Large Language Model for Recommender Systems
paperreview deeplearning llm recommender
Oct 30, 2023
Paper Review: Zephyr: Direct Distillation of LM Alignment
My review of the paper Zephyr Direct Distillation of LM Alignment
paperreview deeplearning nlp llm
Oct 23, 2023
Paper Review: Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
My review of the paper Self-RAG Learning to Retrieve, Generate, and Critique through Self-Reflection
paperreview deeplearning llm nlp
Oct 19, 2023
Paper Review: PaLI-3 Vision Language Models: Smaller, Faster, Stronger
My review of the paper PaLI-3 Vision Language Models Smaller, Faster, Stronger
paperreview deeplearning llm vlm
Oct 16, 2023
Paper Review: InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining
My review of the paper InstructRetro Instruction Tuning post Retrieval-Augmented Pretraining
paperreview deeplearning llm nlp
Oct 12, 2023
Paper Review: Mistral 7B
My review of the paper Mistral 7B
paperreview deeplearning llm nlp
Oct 09, 2023
Paper Review: Think before you speak: Training Language Models With Pause Tokens
My review of the paper Think before you speak Training Language Models With Pause Tokens
paperreview deeplearning llm nlp
Oct 05, 2023
Paper Review: QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
My review of the paper QA-LoRA Quantization-Aware Low-Rank Adaptation of Large Language Models
paperreview deeplearning llm nlp
Sep 28, 2023
Paper Review: DreamLLM: Synergistic Multimodal Comprehension and Creation
My review of the paper DreamLLM Synergistic Multimodal Comprehension and Creation
paperreview deeplearning llm cv
Sep 21, 2023
Paper Review: Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
My review of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
paperreview deeplearning llm promptengineering
Sep 04, 2023
Paper Review: RecMind: Large Language Model Powered Agent For Recommendation
My review of the paper RecMind Large Language Model Powered Agent For Recommendation
paperreview deeplearning llm
Aug 28, 2023
Paper Review: Giraffe: Adventures in Expanding Context Lengths in LLMs
My review of the paper Giraffe Adventures in Expanding Context Lengths in LLMs
paperreview deeplearning nlp llm
Aug 24, 2023
Paper Review: OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
My review of the paper OBELISC An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
paperreview deeplearning nlp llm
Aug 10, 2023
Paper Review: Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
A systematic survey of what's broken in RLHF — from reward hacking to evaluation gaps — and what techniques can fix, ...
paperreview deeplearning nlp llm
Aug 10, 2023
Paper Review: UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition
My review of the paper UniversalNER Targeted Distillation from Large Language Models for Open Named Entity Recognition
paperreview deeplearning nlp llm
Aug 07, 2023
Paper Review: Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding
My review of the paper Skeleton-of-Thought Large Language Models Can Do Parallel Decoding
paperreview deeplearning nlp llm
Jul 24, 2023
Paper Review: Retentive Network: A Successor to Transformer for Large Language Models
My review of the paper Retentive Network A Successor to Transformer for Large Language Models
paperreview deeplearning nlp transformer
Jul 03, 2023
Paper Review: Multilingual End to End Entity Linking
My review of the paper Multilingual End to End Entity Linking
paperreview deeplearning nlp llm
Jun 19, 2023
Paper Review: Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
My review of the paper Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
paperreview deeplearning nlp llm
May 30, 2023
Paper Review: Chain of Hindsight Aligns Language Models with Feedback
My review of the paper Chain of Hindsight Aligns Language Models with Feedback
paperreview deeplearning nlp llm
May 18, 2023
Paper Review: DarkBERT: A Language Model for the Dark Side of the Internet
My review of the paper DarkBERT A Language Model for the Dark Side of the Internet
paperreview deeplearning nlp pretraining
May 08, 2023
Paper Review: Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
My review of the paper Distilling Step-by-Step Outperforming Larger Language Models with Less Training Data and Small...
paperreview deeplearning nlp distillation
Mar 20, 2023
Paper Review: Hyena Hierarchy: Towards Larger Convolutional Language Models
My review of the paper Hyena Hierarchy Towards Larger Convolutional Language Models
paperreview deeplearning nlp cv
Mar 13, 2023
Paper Review: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
My review of the paper Visual ChatGPT Talking, Drawing and Editing with Visual Foundation Models
paperreview deeplearning nlp transformer
