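CVPR 2020 Paper Collection (reposted from Zhihu: https://zhuanlan.zhihu.com/p/112337176)
CVPR 2020 accepted 1,470 papers. Based on what has been released so far, 人工智能学社 (AI Society) has compiled roughly 100 of them to share with readers.
Open-source code status: see the note under each paper; 15 papers currently have public code. (The list is updated continuously; follow for updates.)
Main algorithm areas: image and video processing, image classification, detection and segmentation, visual object tracking, video content analysis, human pose estimation, model acceleration, neural architecture search (NAS), generative adversarial networks (GAN), optical character recognition (OCR), face recognition, 3D reconstruction, and others. The contents are as follows:
# Table of Contents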
# Image Processing
1. Deep Image Harmonization via Domain Verification
Paper: Deep Image Harmonization via Domain Verification
Code: bcmi/Image_Harmonization_Datasets
2. Learning to Shade Hand-drawn Sketches
Paper: Learning to Shade Hand-drawn Sketches
3. Generalized ODIN: Detecting Out-of-distribution Image without Learning from Out-of-distribution Data
Paper: Generalized ODIN: Detecting Out-of-distribution Image without Learning from Out-of-distribution Data
4. Single Image Reflection Removal through Cascaded Refinement
Paper: https://arxiv.org/abs/1911.06634
5. RoutedFusion: Learning Real-time Depth Map Fusion
Paper: https://arxiv.org/pdf/2001.04388.pdf
# Image Classification
1. Towards Robust Image Classification Using Sequential Attention Models
Paper: Towards Robust Image Classification Using Sequential Attention Models
2. Self-training with Noisy Student improves ImageNet classification
Paper: Self-training with Noisy Student improves ImageNet classification
3. Image Matching across Wide Baselines: From Paper to Practice
Paper: Image Matching across Wide Baselines: From Paper to Practice
4. Improved Few-Shot Visual Classification
Paper: https://arxiv.org/pdf/1912.03432.pdf
5. A General and Adaptive Robust Loss Function
Paper: A General and Adaptive Robust Loss Function
6. Making Better Mistakes: Leveraging Class Hierarchies with Deep Networks
Paper: Making Better Mistakes: Leveraging Class Hierarchies with Deep Networks
# Object Detection and Segmentation

1. Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector
Paper: Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector
2. Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection
Paper: https://arxiv.org/abs/1912.02424
3. Semi-Supervised Semantic Image Segmentation with Self-correcting Networks
Paper: Semi-Supervised Semantic Image Segmentation with Self-correcting Networks
4. Deep Snake for Real-Time Instance Segmentation
Paper: Deep Snake for Real-Time Instance Segmentation
5. SketchGCN: Semantic Sketch Segmentation with Graph Convolutional Networks
Paper: SketchGCN: Semantic Sketch Segmentation with Graph Convolutional Networks
6. xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation
Paper: xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation
7. CenterMask: Real-Time Anchor-Free Instance Segmentation
Paper: CenterMask: Real-Time Anchor-Free Instance Segmentation
8. PolarMask: Single Shot Instance Segmentation with Polar Representation
Paper: PolarMask: Single Shot Instance Segmentation with Polar Representation
9. BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation
Paper: BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation
# Visual Object Tracking

1. ROAM: Recurrently Optimizing Tracking Model
Paper: ROAM: Recurrently Optimizing Tracking Model
# Video Content Analysis (Understanding)

1. Hierarchical Conditional Relation Networks for Video Question Answering
Paper: Hierarchical Conditional Relation Networks for Video Question Answering
2. Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
Paper: Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
Code: bbrattoli/ZeroShotVideoClassification
3. Action Modifiers: Learning from Adverbs in Instructional Videos
Paper: Action Modifiers: Learning from Adverbs in Instructional Videos
4. Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning
Paper: Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning
5. Blurry Video Frame Interpolation
Paper: Blurry Video Frame Interpolation
6. Object Relational Graph with Teacher-Recommended Learning for Video Captioning
Paper: Object Relational Graph with Teacher-Recommended Learning for Video Captioning
7. Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs
Paper: Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs
8. Learning Representations by Predicting Bags of Visual Words
Paper: Learning Representations by Predicting Bags of Visual Words
9. Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
Paper: Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
# Human Keypoint Detection and Pose Estimation

1. Distribution-Aware Coordinate Representation for Human Pose Estimation
Paper: Distribution-Aware Coordinate Representation for Human Pose Estimation
2. VIBE: Video Inference for Human Body Pose and Shape Estimation
Paper: VIBE: Video Inference for Human Body Pose and Shape Estimation
3. The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation
Paper: The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation
4. Optimal least-squares solution to the hand-eye calibration problem
Paper: Optimal least-squares solution to the hand-eye calibration problem
5. D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry
Paper: D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry
6. Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
Paper: Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
7. PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation
Paper: https://arxiv.org/abs/1911.04231
8. 4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras
Paper: 4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras
# Model Compression and Acceleration
1. GPU-Accelerated Mobile Multi-view Style Transfer
Paper: GPU-Accelerated Mobile Multi-view Style Transfer
# Neural Network Architecture Design and Search (NAS)

1. GhostNet: More Features from Cheap Operations
Paper: GhostNet: More Features from Cheap Operations
2. CARS: Continuous Evolution for Efficient Neural Architecture Search
Paper: https://arxiv.org/pdf/1909.04977.pdf
3. Visual Commonsense R-CNN
Paper: https://arxiv.org/abs/2002.12204
4. Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are Failing to Reproduce Spectral Distributions
5. AdderNet: Do We Really Need Multiplications in Deep Learning?
Paper: https://arxiv.org/pdf/1912.13200
6. Filter Grafting for Deep Neural Networks
Paper: https://arxiv.org/pdf/2001.05868.pdf
# Generative Adversarial Networks (GAN)

1. Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models
Paper: Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models
2. MSG-GAN: Multi-Scale Gradient GAN for Stable Image Synthesis
Paper: MSG-GAN: Multi-Scale Gradient GAN for Stable Image Synthesis
3. Robust Design of Deep Neural Networks against Adversarial Attacks based on Lyapunov Theory
Paper: Robust Design of Deep Neural Networks against Adversarial Attacks based on Lyapunov Theory
# 3D Point Clouds & 3D Reconstruction

1. PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
Paper: PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
2. PF-Net: Point Fractal Network for 3D Point Cloud Completion
Paper: PF-Net: Point Fractal Network for 3D Point Cloud Completion
3. Learning multiview 3D point cloud registration
Paper: Learning multiview 3D point cloud registration
4. Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image
5. In Perfect Shape: Certifiably Optimal 3D Shape Reconstruction from 2D Landmarks
Paper: https://arxiv.org/pdf/1911.11924.pdf
6. RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
Paper: RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
7. C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds
Paper: C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds
8. Representations, Metrics and Statistics For Shape Analysis of Elastic Graphs
Paper: Representations, Metrics and Statistics For Shape Analysis of Elastic Graphs
9. Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion
Paper: Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion
# Optical Character Recognition (OCR)
1. ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
Paper: ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
Code: https://github.com/Yuliang-Liu/bezier_curve_text_spotting, https://github.com/aim-uofa/adet
# Transfer Learning

1. Meta-Transfer Learning for Zero-Shot Super-Resolution
Paper: Meta-Transfer Learning for Zero-Shot Super-Resolution
2. Transferring Dense Pose to Proximal Animal Classes
Paper: Transferring Dense Pose to Proximal Animal Classes
# Weakly Supervised & Unsupervised Learning
1. Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
Paper: Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
2. Disentangling Physical Dynamics from Unknown Factors for Unsupervised Video Prediction
Paper: Disentangling Physical Dynamics from Unknown Factors for Unsupervised Video Prediction
3. Rethinking the Route Towards Weakly Supervised Object Localization
Paper: Rethinking the Route Towards Weakly Supervised Object Localization
4. NestedVAE: Isolating Common Factors via Weak Supervision
Paper: NestedVAE: Isolating Common Factors via Weak Supervision
# Face Recognition
1. Towards Universal Representation Learning for Deep Face Recognition
Paper: Towards Universal Representation Learning for Deep Face Recognition
2. Suppressing Uncertainties for Large-Scale Facial Expression Recognition
Paper: Suppressing Uncertainties for Large-Scale Facial Expression Recognition
Code: kaiwang960112/Self-Cure-Network
3. Face X-ray for More General Face Forgery Detection
Paper: https://arxiv.org/pdf/1912.13458.pdf
# Graph Neural Networks (GNN)
1. Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction
2. Bundle Adjustment on a Graph Processor
Paper: Bundle Adjustment on a Graph Processor
# Vision & Language Joint Tasks
1. Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training
Paper: Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training
2. 12-in-1: Multi-Task Vision and Language Representation Learning
Paper: 12-in-1: Multi-Task Vision and Language Representation Learning
3. Hierarchical Conditional Relation Networks for Video Question Answering
Paper: Hierarchical Conditional Relation Networks for Video Question Answering
# Other Topics
1. What it Thinks is Important is Important: Robustness Transfers through Input Gradients
Paper: https://arxiv.org/abs/1912.05699
2. Holistically-Attracted Wireframe Parsing
Paper: Holistically-Attracted Wireframe Parsing
3. Attentive Context Normalization for Robust Permutation-Equivariant Learning
Paper: Attentive Context Normalization for Robust Permutation-Equivariant Learning
5. ClusterFit: Improving Generalization of Visual Representations
Paper: ClusterFit: Improving Generalization of Visual Representations
6. Learning in the Frequency Domain
Paper: Learning in the Frequency Domain
7. A Characteristic Function Approach to Deep Implicit Generative Modeling
Paper: A Characteristic Function Approach to Deep Implicit Generative Modeling
8. Auto-Encoding Twin-Bottleneck Hashing
Paper: Auto-Encoding Twin-Bottleneck Hashing
# Bundled Paper Download
Link: https://pan.baidu.com/s/1lo3smbFWiBSNnut9JssYaQ
Extraction code: send the message cvpr2020 to the official account to receive it.