Tag: visual – Andrey Lukyanenko

Feb 16, 2026

Kimi k2.5 Review: Native Multimodality and Agent Swarms at 1 Trillion Parameters

A deep-dive review of Kimi K2.5, a next-generation open multimodal model that combines native vision-language trainin...

paperreview deeplearning llm vlm

Feb 09, 2026

Paper Review: PaperBanana: Automating Academic Illustration for AI Scientists

My review of the paper PaperBanana Automating Academic Illustration for AI Scientists

paperreview deeplearning agent vlm

Mar 13, 2023

Paper Review: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

My review of the paper Visual ChatGPT Talking, Drawing and Editing with Visual Foundation Models

paperreview deeplearning nlp transformer

Jun 14, 2020

Paper Review: VirTex: Learning Visual Representations from Textual Annotations

My review of the paper VirTex Learning Visual Representations from Textual Annotations.

paperreview imagecaptioning cv visual