Awesome-Visual-Captioning
Awesome-Visual-Captioning
Table of Contents
- ACL-2021
- CVPR-2021
- AAAI-2021
- ACMMM-2020
- NeurIPS-2020
- ECCV-2020
- CVPR-2020
- ACL-2020
- AAAI-2020
- ACL-2019
- NeurIPS-2019
- ICCV-2019
- CVPR-2019
- AAAI-2019
Paper Roadmap
ACL-2021
Video Captioning
- Hierarchical Context-aware Network for Dense Video Event Captioning
- Video Paragraph Captioning as a Text Summarization Task
- O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning paper
CVPR-2021
Video Captioning
- Open-Book Video Captioning With Retrieve-Copy-Generate Network. [paper]
- Towards Diverse Paragraph Captioning for Untrimmed Videos. [paper]
AAAI-2021
Video Captioning
- Non-Autoregressive Coarse-to-Fine Video Captioning. [paper]
- Semantic Grouping Network for Video Captioning. [paper] [code]
- Augmented Partial Mutual Learning with Frame Masking for Video Captioning. [paper]
ACMMM-2020
NeurIPS-2020
- Prophet Attention: Predicting Attention with Future Attention for Improved Image Captioning. [paper]
- RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning. [paper]
- Diverse Image Captioning with Context-Object Split Latent Spaces. [paper]
ECCV-2020
Video Captioning
- Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos.
Spotlight[paper] [code] - Character Grounding and Re-Identification in Story of Videos and Text Descriptions.
Spotlight[paper] [code] - Identity-Aware Multi-Sentence Video Description. [paper]
CVPR-2020
Video Captioning
- Object Relational Graph With Teacher-Recommended Learning for Video Captioning [paper]
Ziqi Zhang, Yaya Shi, Chunfeng Yuan, Bing Li, Peijin Wang, Weiming Hu, Zheng-Jun Zha - Spatio-Temporal Graph for Video Captioning With Knowledge Distillation [paper] [code]
Boxiao Pan, Haoye Cai, De-An Huang, Kuan-Hui Lee, Adrien Gaidon, Ehsan Adeli, Juan Carlos Niebles - Better Captioning With Sequence-Level Exploration [paper]
Jia Chen, Qin Jin - Syntax-Aware Action Targeting for Video Captioning [code]
Qi Zheng, Chaoyue Wang, Dacheng Tao
ACL-2020
Video Captioning
- MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning [paper] [code]
Jie Lei, Liwei Wang, Yelong Shen, Dong Yu, Tamara Berg and Mohit Bansal
AAAI-2020
Video Captioning
- An Efficient Framework for Dense Video Captioning
Maitreya Suin (Indian Institute of Technology Madras)*; Rajagopalan Ambasamudram (Indian Institute of Technology Madras)
Awesome-Visual-Captioning的更多相关文章
- [CVPR2017] Visual Translation Embedding Network for Visual Relation Detection 论文笔记
http://www.ee.columbia.edu/ln/dvmm/publications/17/zhang2017visual.pdf Visual Translation Embedding ...
- Image Captioning 经典论文合辑
Image Caption: Automatically describing the content of an image domain:CV+NLP Category:(by myself, y ...
- CVPR 2017 Paper list
CVPR2017 paper list Machine Learning 1 Spotlight 1-1A Exclusivity-Consistency Regularized Multi-View ...
- 论文:Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering-阅读总结
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering-阅读总结 笔记不能简单的抄写文中 ...
- ( 转) Awesome Image Captioning
Awesome Image Captioning 2018-12-03 19:19:56 From: https://github.com/zhjohnchan/awesome-image-capti ...
- Paper Read: Convolutional Image Captioning
Convolutional Image Captioning 2018-11-04 20:42:07 Paper: http://openaccess.thecvf.com/content_cvpr_ ...
- Paper Reading - Deep Captioning with Multimodal Recurrent Neural Networks ( m-RNN ) ( ICLR 2015 ) ★
Link of the Paper: https://arxiv.org/pdf/1412.6632.pdf Main Points: The authors propose a multimodal ...
- Paper Reading - CNN+CNN: Convolutional Decoders for Image Captioning
Link of the Paper: https://arxiv.org/abs/1805.09019 Innovations: The authors propose a CNN + CNN fra ...
- Paper Reading - Learning to Evaluate Image Captioning ( CVPR 2018 ) ★
Link of the Paper: https://arxiv.org/abs/1806.06422 Innovations: The authors propose a novel learnin ...
- Paper Reading - Convolutional Image Captioning ( CVPR 2018 )
Link of the Paper: https://arxiv.org/abs/1711.09151 Motivation: LSTM units are complex and inherentl ...
随机推荐
- 【Java】【常用类】SimpleDateFormat 简单日期格式化类
Date类的API不易于国际化,大部分基本摈弃了 java.text.SimpleDateFormate 不和语言环境有关的方式来格式化和解析日期的具体类 支持 文本转格式,格式转文本 public ...
- 【Mybatis】Bonus02 补充
关于主键生成问题 Mybatis的主键生成是基于JDBC的使用主键[getGeneratedKeys()]方法 也就是说,必须要JDBC驱动的支持才行 @Test public void junitT ...
- 【IDEA】回退操作记录
参考自: https://www.cnblogs.com/zeussbook/p/9207970.html 找不到代码错误,又有很多已经写好的东西,不好全部删除 只要能记得确切的操作时间就行了 可以翻 ...
- 美国小伙: "American Guy: Only communism can save America!"
视频地址: https://www.youtube.com/watch?v=Y_WQnXFh8ss 2024大选在即,又是拜登对阵特朗普的旧日重现.在角逐谁的对手反对者更多的畸形内耗中,有一个名为 M ...
- 【转载】 pytorch reproducibility —— pytorch代码的可复现性
原文地址: https://www.jianshu.com/p/96767683beb6 作者:kelseyh来源:简书 ======================================= ...
- 内网穿透之实践记录,使用花生壳进行内外穿透,场景:在家远程ssh连接到公司电脑或学校服务器
今天在网上闲逛的时候看到这样一个内网穿透的软件,ngrok, https://gitee.com/kxwinxp/ngrok 记得10多年前自己在读大学的时候曾经好一段时间在研究内网穿透技术,最后发现 ...
- SpringBoot Session共享,配置不生效问题排查 → 你竟然在代码里下毒!
开心一刻 快 8 点了,街边卖油条的还没来,我只能给他打电话 大哥在电话中说到:劳资卖了这么多年油条,从来都是自由自在,自从特么认识了你,居然让我有了上班的感觉! Session 共享 SpringB ...
- python 音频处理(2)——提取PPG特征之whisper库的使用(2.1)
提取PPG特征之--whisper库的使用(2.1) 1 安装对应的包 方法一(自用): 直接pip即可: pip install openai-whisper 成功后如下图所示 方法二: 当时用了他 ...
- 运用Npcap库实现SYN半开放扫描
Npcap 是一款高性能的网络捕获和数据包分析库,作为 Nmap 项目的一部分,Npcap 可用于捕获.发送和分析网络数据包.本章将介绍如何使用 Npcap 库来实现半开放扫描功能.TCP SYN 半 ...
- MFC的CBitmapButton的使用指南
注意:此按钮使用前应该将按钮的属性:Owner Draw->True 注意:此按钮使用前应该将按钮的属性:Owner Draw->True 注意:此按钮使用前应该将按钮的属性:Owner ...