Paper Review: DocLLM: A layout-aware generative language model for multimodal document understanding
08 January 2024
My review of the paper DocLLM: A layout-aware generative language model for multimodal document understanding
Data science, career and other topics
25 December 2023
My review of the paper StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
18 December 2023
My review of the paper Pixel Aligned Language Models
12 December 2023
My review of the paper EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
07 December 2023
My review of the paper Translatotron 3: Speech to Speech Translation with Monolingual Data
04 December 2023
My review of the paper Adversarial Diffusion Distillation