Paper Review: Long-Short Transformer Efficient Transformers for Language and Vision
12 July 2021
My review of the paper Long-Short Transformer Efficient Transformers for Language and Vision
Data science, career and other topics
12 July 2021
My review of the paper Long-Short Transformer Efficient Transformers for Language and Vision
18 June 2021
My review of the paper Semi-Autoregressive Transformer for Image Captioning
10 June 2021
My review of the paper CoAtNet Marrying Convolution and Attention for All Data Sizes
02 June 2021
My review of the paper ByT5 Towards a token-free future with pre-trained byte-to-byte models
21 May 2021
My review of the paper Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence
10 May 2021
My review of the paper Are Pre-trained Convolutions Better than Pre-trained Transformers?