Tag: agent
8 posts
Collaborative Reinforcement Learning: Why HACRL Trains Models in Teams Instead of Isolation
HACRL proposes a new paradigm for reinforcement learning: instead of training models in isolation, multiple agents c...
Paper Review: PaperBanana: Automating Academic Illustration for AI Scientists
My review of the paper PaperBanana: Automating Academic Illustration for AI Scientists
Paper Review: AlphaEvolve: A coding agent for scientific and algorithmic discovery
DeepMind's autonomous coding agent that evolves algorithms through LLM-driven iteration — it discovered the first imp...
Paper Review: AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents
My review of the paper AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents
Paper Review: Project Sid: Many-agent simulations toward AI civilization
What happens when you put 1k AI agents in Minecraft and let them self-organize? They form governments, transmit cultu...
Paper Review: Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
My review of the paper Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper Review: Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
My review of the paper Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Paper Review: FlowMind: Automatic Workflow Generation with LLMs
My review of the paper FlowMind: Automatic Workflow Generation with LLMs