Tag: agent

8 posts

Mar 16, 2026
Collaborative Reinforcement Learning: Why HACRL Trains Models in Teams Instead of Isolation
HACRL proposes a new paradigm for reinforcement learning - instead of training models in isolation, multiple agents c...
paperreview deeplearning rl llm
Feb 09, 2026
Paper Review: PaperBanana: Automating Academic Illustration for AI Scientists
My review of the paper PaperBanana Automating Academic Illustration for AI Scientists
paperreview deeplearning agent vlm
May 15, 2025
Paper Review: AlphaEvolve: A coding agent for scientific and algorithmic discovery
DeepMind's autonomous coding agent that evolves algorithms through LLM-driven iteration — it discovered the first imp...
paperreview deeplearning agent nlp
Apr 28, 2025
Paper Review: AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents
My review of the paper AgentA/B Automated and Scalable Web A/BTesting with Interactive LLM Agents
paperreview deeplearning agent nlp
Nov 25, 2024
Paper Review: Project Sid: Many-agent simulations toward AI civilization
What happens when you put 1k AI agents in Minecraft and let them self-organize? They form governments, transmit cultu...
paperreview deeplearning nlp llm
Nov 11, 2024
Paper Review: Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
My review of the paper Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
paperreview deeplearning nlp llm
Jul 01, 2024
Paper Review: Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
My review of the paper Husky A Unified, Open-Source Language Agent for Multi-Step Reasoning
paperreview deeplearning llm agent
May 06, 2024
Paper Review: FlowMind: Automatic Workflow Generation with LLMs
My review of the paper FlowMind Automatic Workflow Generation with LLMs
paperreview deeplearning llm agent

← All tags