Tag: efficiency
3 posts
Gamma-World: Simplex Agent Encoding and Hub Attention for Multi-Agent World Models
A review of Gamma-World, NVIDIA's generative multi-agent world model that produces shared, action-controllable video ...
Beyond Positional Bias: How DroPE Unlocks Zero-Shot Long Context in LLMs
A review of DroPE, a simple but counterintuitive method that extends LLM context length by dropping positional embedd...
Paper Review: Linformer: Self-Attention with Linear Complexity
My review of the paper Linformer Self-Attention with Linear Complexity.