Paper Review: Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
09 June 2025
My review of the paper Beyond the 80/20 Rule High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning