Deep Learning Methods for Vision: CVPR 2012 Tutorial
CVPR 2012 Tutorial, 9:00am-5:30pm, Sunday June 17th, Ballroom D (full day)
Rob Fergus (NYU),
Overview
Hand-designed features such as SIFT and HOG underpin many successful object recognition approaches. However, these capture only low-level edge information, and it has proven difficult to design features that effectively capture mid-level cues (e.g. edge intersections) or high-level representations (e.g. object parts). Recent developments in machine learning, known as "Deep Learning", have shown how hierarchies of features can be learned in an unsupervised manner directly from data. This tutorial will describe these feature learning approaches, as applied to images and video. The tutorial will start by motivating the need to learn features, rather than hand-craft them. It will then introduce several basic architectures, explaining how they learn features and showing how they can be "stacked" into hierarchies that extract multiple layers of representation. Throughout, links will be drawn between these methods and existing approaches to recognition, particularly those involving hierarchical representations. The final part of the tutorial will examine the performance currently obtained by feature learning approaches on a range of standard vision benchmarks, highlighting their strengths and weaknesses.
Schedule
| Time | Session | Materials | Speaker | Duration |
| --- | --- | --- | --- | --- |
| 9:00am | Introduction | PPT | Fergus | 1h |
| 10:00am | Coffee Break | | | 30m |
| 10:30am | Sparse Coding | PPT | Yu | 1h |
| 11:30am | Neural Networks | PDF, Code | Ranzato | 1h |
| 12:30pm | Lunch | | | 1h |
| 1:30pm | Restricted Boltzmann Machines | PDF | Lee | 1h |
| 2:30pm | Deep Boltzmann Machines | PDF | Salakhutdinov | 30m |
| 3:00pm | Coffee Break | | | 30m |
| 3:30pm | Transfer Learning | PDF | Salakhutdinov | 30m |
| 4:00pm | Motion and Video | PDF | Taylor | 1h |
| 5:00pm | Summary / Q & A | | | 30m |
Speaker Biographies
Rob Fergus
Rob Fergus is an Assistant Professor of Computer Science at the Courant Institute of Mathematical Sciences, New York University. He received a Masters in Electrical Engineering with Prof. Pietro Perona at Caltech, before completing a PhD with Prof. Andrew Zisserman at the University of Oxford in 2005. Before coming to NYU, he spent two years as a post-doc in the Computer Science and Artificial Intelligence Lab (CSAIL) at MIT, working with Prof. William Freeman. He has received several awards including a CVPR best paper prize (2003), a Sloan Fellowship (2011) and an NSF Career award (2012).
Honglak Lee
Honglak Lee is currently an Assistant Professor of Computer Science at the University of Michigan, Ann Arbor. He received his PhD from Stanford University, advised by Andrew Ng. His research interests lie in machine learning and its application to a range of perception problems in the fields of artificial intelligence, such as computer vision, robotics, audio recognition, and text processing.
Marc'Aurelio Ranzato
Marc'Aurelio Ranzato is currently a Research Scientist at Google. Before joining Google in the fall of 2011, he was a post-doctoral fellow in Machine Learning at the University of Toronto, working with Geoffrey Hinton. He did his Ph.D. in Computer Science at New York University in Yann LeCun's group. His interests include Machine Learning, Computer Vision and, more generally, Artificial Intelligence. He has worked on unsupervised learning algorithms, in particular hierarchical models and deep networks.
Ruslan Salakhutdinov
Ruslan Salakhutdinov received his PhD in machine learning from the University of Toronto in 2009. After spending two post-doctoral years at the Massachusetts Institute of Technology Artificial Intelligence Lab, he joined the University of Toronto as an Assistant Professor in the Departments of Statistics and Computer Science. Dr. Salakhutdinov's primary interests lie in statistical machine learning, Bayesian statistics, probabilistic graphical models, and large-scale optimization. He is a recipient of the NSERC Postdoctoral Fellowship and the Canada Graduate Scholarship, and is a Scholar of the Canadian Institute for Advanced Research.
Graham Taylor
Graham Taylor recently joined the University of Guelph as an Assistant Professor of Engineering. He was previously a postdoc at NYU, working with Chris Bregler, Rob Fergus, and Yann LeCun. He completed his PhD at the University of Toronto in 2009, co-advised by Geoffrey Hinton and Sam Roweis. His interests are in statistical machine learning and biologically-inspired computer vision, with an emphasis on unsupervised learning and time series analysis. Much of his work studies human movement.
Kai Yu
Kai Yu recently joined Baidu as Director of the Multimedia Department, in charge of search technologies and products involving video, speech and music. Previously, he was head of the Media Analytics Department of NEC Labs in Silicon Valley, California, leading the development of intelligent systems for machine learning, image recognition, multimedia search, video surveillance, recommendation, data mining, and human-computer interfaces. He obtained his PhD in Computer Science at the University of Munich, Germany.
Acknowledgments
This work was partially supported by the National Science Foundation Career Award #1149633. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.
from: http://cs.nyu.edu/~fergus/tutorials/deep_learning_cvpr12/