Awesome-Visual-Captioning
Awesome-Visual-Captioning
Table of Contents
- ACL-2021
- CVPR-2021
- AAAI-2021
- ACMMM-2020
- NeurIPS-2020
- ECCV-2020
- CVPR-2020
- ACL-2020
- AAAI-2020
- ACL-2019
- NeurIPS-2019
- ICCV-2019
- CVPR-2019
- AAAI-2019
Paper Roadmap
ACL-2021
Video Captioning
- Hierarchical Context-aware Network for Dense Video Event Captioning
- Video Paragraph Captioning as a Text Summarization Task
- O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning paper
CVPR-2021
Video Captioning
- Open-Book Video Captioning With Retrieve-Copy-Generate Network. [paper]
- Towards Diverse Paragraph Captioning for Untrimmed Videos. [paper]
AAAI-2021
Video Captioning
- Non-Autoregressive Coarse-to-Fine Video Captioning. [paper]
- Semantic Grouping Network for Video Captioning. [paper] [code]
- Augmented Partial Mutual Learning with Frame Masking for Video Captioning. [paper]
ACMMM-2020
NeurIPS-2020
- Prophet Attention: Predicting Attention with Future Attention for Improved Image Captioning. [paper]
- RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning. [paper]
- Diverse Image Captioning with Context-Object Split Latent Spaces. [paper]
ECCV-2020
Video Captioning
- Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos.
Spotlight[paper] [code] - Character Grounding and Re-Identification in Story of Videos and Text Descriptions.
Spotlight[paper] [code] - Identity-Aware Multi-Sentence Video Description. [paper]
CVPR-2020
Video Captioning
- Object Relational Graph With Teacher-Recommended Learning for Video Captioning [paper]
Ziqi Zhang, Yaya Shi, Chunfeng Yuan, Bing Li, Peijin Wang, Weiming Hu, Zheng-Jun Zha - Spatio-Temporal Graph for Video Captioning With Knowledge Distillation [paper] [code]
Boxiao Pan, Haoye Cai, De-An Huang, Kuan-Hui Lee, Adrien Gaidon, Ehsan Adeli, Juan Carlos Niebles - Better Captioning With Sequence-Level Exploration [paper]
Jia Chen, Qin Jin - Syntax-Aware Action Targeting for Video Captioning [code]
Qi Zheng, Chaoyue Wang, Dacheng Tao
ACL-2020
Video Captioning
- MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning [paper] [code]
Jie Lei, Liwei Wang, Yelong Shen, Dong Yu, Tamara Berg and Mohit Bansal
AAAI-2020
Video Captioning
- An Efficient Framework for Dense Video Captioning
Maitreya Suin (Indian Institute of Technology Madras)*; Rajagopalan Ambasamudram (Indian Institute of Technology Madras)
Awesome-Visual-Captioning的更多相关文章
- [CVPR2017] Visual Translation Embedding Network for Visual Relation Detection 论文笔记
http://www.ee.columbia.edu/ln/dvmm/publications/17/zhang2017visual.pdf Visual Translation Embedding ...
- Image Captioning 经典论文合辑
Image Caption: Automatically describing the content of an image domain:CV+NLP Category:(by myself, y ...
- CVPR 2017 Paper list
CVPR2017 paper list Machine Learning 1 Spotlight 1-1A Exclusivity-Consistency Regularized Multi-View ...
- 论文:Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering-阅读总结
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering-阅读总结 笔记不能简单的抄写文中 ...
- ( 转) Awesome Image Captioning
Awesome Image Captioning 2018-12-03 19:19:56 From: https://github.com/zhjohnchan/awesome-image-capti ...
- Paper Read: Convolutional Image Captioning
Convolutional Image Captioning 2018-11-04 20:42:07 Paper: http://openaccess.thecvf.com/content_cvpr_ ...
- Paper Reading - Deep Captioning with Multimodal Recurrent Neural Networks ( m-RNN ) ( ICLR 2015 ) ★
Link of the Paper: https://arxiv.org/pdf/1412.6632.pdf Main Points: The authors propose a multimodal ...
- Paper Reading - CNN+CNN: Convolutional Decoders for Image Captioning
Link of the Paper: https://arxiv.org/abs/1805.09019 Innovations: The authors propose a CNN + CNN fra ...
- Paper Reading - Learning to Evaluate Image Captioning ( CVPR 2018 ) ★
Link of the Paper: https://arxiv.org/abs/1806.06422 Innovations: The authors propose a novel learnin ...
- Paper Reading - Convolutional Image Captioning ( CVPR 2018 )
Link of the Paper: https://arxiv.org/abs/1711.09151 Motivation: LSTM units are complex and inherentl ...
随机推荐
- 【Spring】05 注解开发
环境搭建 配置ApplicationContext.xml容器文件[半注解实现] <?xml version="1.0" encoding="UTF-8" ...
- 【Web】实现页面自动刷新的功能
技术发现自: https://www.bilibili.com/video/BV14v411b7JS?p=8 摘要自CSDN帖子: https://blog.csdn.net/senbar/artic ...
- 【SpringCloud】 Re02 Nacos
运行Nacos注册中心 win版Nacos在bin目录下打开cmd 执行此命令以运行单机模式的Nacos startup.cmd -m standalone 控制台输出: Microsoft Wind ...
- 【Shiro】06 自定义Realm授权实现
创建一个激活的用户类: public class ActiverUser { private User user; private List<String> roleList; priva ...
- 【Vue】11 VueRouter Part1 概述 & 入门
什么是路由? 即通过互联网把信息从源地址传输到目的地址的活动 路由决定数据包从来源到目的地的路径 转送将输入端的数据转移到合适的输出端 后端路由: 早起网站开发全部由服务器渲染,例如 Java的JSP ...
- 再探 游戏 《 2048 》 —— AI方法—— 缘起、缘灭(1) —— Firefox浏览器下自动运行游戏篇
四年前曾经写过一过博客: 对 游戏 < 2048 > 的一些思考 虽然过去几年了,但是这个游戏一直没有搞懂该怎么使用AI算法来进行求解,这里再次对这个问题进行一些探索. ========= ...
- CPU端多进程/多线程调用CUDA是否可以加速???
相关: NVIDIA显卡cuda的多进程服务--MPS(Multi-Process Service) tensorflow1.x--如何在C++多线程中调用同一个session会话 tensorflo ...
- js map方法处理返回数据,获取指定数据简写方法
map方法处理返回数据,获取指定数据简写方法 前言 后端返回数据为数组列表时,通常比较全面,包含了很多不需要的数据,可以通过 map 方法处理返回数据,筛选出想要的数据 例如 // 返回数据 res ...
- BST二叉查找树的接口设计
/*************************************************************************************************** ...
- Sql语句的两表联合查询
string sql = "select mID,mName,mSex,mAge,(select fzName from TxlFenZu where ID=mFenZu) as mFenZ ...