2018AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

dgi 2024-09-29 07:26:10 原文

论文标题：AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

来源/作者机构情况：

谷歌，http://www.cs.toronto.edu/~dross/

UC Berkeley

解决问题/主要思想贡献：

贡献了一个新的动作分类的数据集

成果/优点：

分类更加多，单人，多人，人和物体的动作三大类。还有时间和空间上更加精确的标定

人类动作识别数据集AVA（atomic visual actions，原子视觉动作），提供扩展视频序列中每个人的多个动作标签，精确标注多人动作，我们将动作标签限制在固定的3s时间内。
[电影」和「电视」类别，选择来自不同国家的专业演员。我们对每个视频抽取 15 分钟进行分析，并统一将 15 分钟视频分割成 300 个非重叠的 3 秒片段。采样遵循保持动作序列的时间顺序这一策略。

数据集地址：https://research.google.com/ava/ 需要科学链接

缺点：

反思改进/灵感：

#############################################################

论文主要内容与关键点：

论文主要部分：

1. Introduction

数据集的基本参数：连续三秒长，80种不同的动作类型

2. Related work 动作类数据集

静态动作数据集，以及这些数据记的缺点：失去了时间的特征

3. Data collection：

4. Characteristics of the AVA dataset

5. Experiments

6. Conclusion

目前的研究方法，在AVA数据集都还没有取得SOFA的结果，说明视频动作分类还需要研究出更好的算法出来。

代码实现：

https://github.com/tensorflow/models/tree/master/research/object_detection

2018AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions的更多相关文章

[WinForm] - "更新 DataSet 应用程序集对象失败，Visual Studio 自动重启" 之解决
背景在 WinForm 解决方案中,更新 DataSet 应用程序集对象失败,Visual Studio 自动重启. 试一试 1. 更新 .xsd 时打开对应的 .Designer.cs.2. 如果 ...
论文列表 for Action recognition
要读的论文: https://www.cnblogs.com/hizhaolei/p/10565405.html 骨架动作识别论文汇总 https://blog.csdn.net/bianxuewei ...
【AI科技大本营】
从AutoML.机器学习新算法.底层计算.对抗性攻击.模型应用与底层理解,到开源数据集.Tensorflow和TPU,Google Brain 负责人Jeff Dean发长文来总结他们2017年所做的 ...
Research Guide for Video Frame Interpolation with Deep Learning
Research Guide for Video Frame Interpolation with Deep Learning This blog is from: https://heartbeat ...
6 Tools To Jump Start Your Video Content Marketing
http://www.forbes.com/sites/drewhendricks/2014/10/16/6-tools-to-jump-start-your-video-content-market ...
cvpr2015papers
@http://www-cs-faculty.stanford.edu/people/karpathy/cvpr2015papers/ CVPR 2015 papers (in nicer forma ...
ECCV 2014 Results (16 Jun, 2014) 结果已出
Accepted Papers Title Primary Subject Area ID 3D computer vision 93 UPnP: An optimal O(n) soluti ...
大规模视觉识别挑战赛ILSVRC2015各团队结果和方法 Large Scale Visual Recognition Challenge 2015
Large Scale Visual Recognition Challenge 2015 (ILSVRC2015) Legend: Yellow background = winner in thi ...
### Paper about Event Detection
Paper about Event Detection. #@author: gr #@date: 2014-03-15 #@email: forgerui@gmail.com 看一些相关的论文. 1 ...

随机推荐

What are the differences between struct and class in C++?
Question: This question was already asked in the context of C#/.Net. Now I'd like to learn the diffe ...
两个inline-block消除间距和对齐（vertical-align）
一.神奇的两个inline-block 很初级的问题,无聊决定写一个故事. 故事的主人公很简单,两个inline-block元素.代码如下,为了看起来简单明了,写得很简陋.效果图如右.发现有两个问题. ...
PHP7.27: object
http://www.devshed.com/c/a/PHP/PHP-Services-Layers-Data-Mappers/ https://stackoverflow.com/questions ...
原生JS强大DOM选择器querySelector与querySelectorAll
在传统的 JavaScript 开发中,查找 DOM 往往是开发人员遇到的第一个头疼的问题,原生的 JavaScript 所提供的 DOM 选择方法并不多,仅仅局限于通过 tag, name, id ...
洛谷P2572 [SCOI2010]序列操作(ODT)
题解题意题目链接 Sol ODT板子题..... // luogu-judger-enable-o2 #include<bits/stdc++.h> #define LL long l ...
vue.js 键盘enter事件的使用
在监听键盘事件时,我们经常需要检查常见的键值.Vue 允许为 v-on 在监听键盘事件时添加按键修饰符: <!-- 只有在 `keyCode` 是 13 时调用 `vm.submit()` -- ...
关于input的焦点事件
关于input的焦点事件 $(".scanf_integral").focus(function(){//获取焦点//获取焦点后触发的事件 }) $(".scanf_in ...
SD从零开始64-特异的业务交易(Special Business Transactions)
紧迫订单Rush Orders 紧迫订单和现金销售是用在从工厂销售流程可能用于当客户需要求即刻从货场获得他们的货物时的销售凭据种类: 在即刻交货的销售凭据种类中,即刻交货符号和交货种类DF是设置的:当 ...
Installing Language Tool in TexStudio
This is a recent and more detailed solution for Windows users. Make sure the last version of TeXstud ...
java代码代替xml实现图片
1.使用StateListDrawable替换selector public static StateListDrawable getSelector(Drawable normalDrawable, ...