Course syllabus: http://vision.stanford.edu/teaching/cs131_fall1718/syllabus.html Course positioning: Related fields: What is (computer) vision? 1. A scientific field that extracts information out of digital images. 2. Building algorithms that can understand the content of image a…
Outline What is color? The result of interaction between physical light in the environment and our visual system. A psychological property of our visual experiences when we look at objects and lights, not a physical property of those objects or lights. Hu…
Background reading: Forsyth and Ponce, Computer Vision Chapter 7 Image sampling and quantization Types of images: binary, gray scale, color Resolution: DPI: dots per inch, spatial pixel density Image histograms: histogram of an image provides the fre…
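As a rough illustration of the image histogram mentioned above, the sketch below counts how many pixels take each intensity value in a grayscale image. The function name `gray_histogram`, the `levels` parameter, and the tiny sample image are assumptions made for this example; nested lists stand in for an image array so that the code has no dependencies.

```python
def gray_histogram(image, levels=256):
    """Count how many pixels take each intensity value in [0, levels)."""
    counts = [0] * levels
    for row in image:
        for value in row:
            counts[value] += 1
    return counts


# Hypothetical 3x4 grayscale image quantized to 4 gray levels (0..3).
image = [
    [0, 1, 1, 3],
    [2, 1, 0, 3],
    [3, 3, 1, 0],
]

hist = gray_histogram(image, levels=4)
print(hist)       # one count per gray level
print(sum(hist))  # equals the number of pixels in the image
```

The counts always sum to the number of pixels, which is a quick sanity check when computing a histogram by hand.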
Deep learning So far this week Edge detection RANSAC SIFT K-Means Linear classifier Mean-shift PCA/Eigenfaces Image features Current Research Learning hierarchical representations from data End-to-end learning: raw inputs to predictions can use a small set of…
Vectors and matrices What is a matrix/vector? Vectors and matrices are just collections of ordered numbers that represent something: movements in space, scaling factors, pixel brightness, etc. We'll define some common uses and standard operations on them. Vectors: column vectors/row vectors Uses: Vectors can…
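I really love this book by Richard Szeliski. Every chapter (after Chapter 3) describes the relatively recent results in its research area in detail, and there are a great many references; if you are interested, you can read the cited papers. Point operators Point operations are the simplest class of image processing operations, such as simple contrast and brightness adjustments. Pixel transform g(x) = a f(x) + b The parameters a and b are sometimes said to control contrast and brightness. My OpenCV column has an example of a simple contrast and brightness adjustment that uses exactly this formula…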
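The pixel transform g(x) = a f(x) + b can be sketched in a few lines. This is not the example from the author's OpenCV column; the function name `pixel_transform`, the sample values, and the clamping to the 8-bit range [0, 255] are assumptions made for this illustration.

```python
def pixel_transform(image, a=1.0, b=0.0, max_value=255):
    """Apply g(x) = a * f(x) + b to every pixel.

    Results are rounded and clamped to [0, max_value]; the clamping is an
    assumption for 8-bit images, not part of the formula itself.
    """
    return [
        [min(max_value, max(0, int(round(a * value + b)))) for value in row]
        for row in image
    ]


# Hypothetical 2x3 grayscale image.
image = [
    [0, 50, 100],
    [150, 200, 250],
]

brighter = pixel_transform(image, a=1.0, b=20)        # b shifts brightness
higher_contrast = pixel_transform(image, a=1.5, b=0)  # a scales contrast

print(brighter)
print(higher_contrast)
```

With a = 1 the transform only shifts intensities, while a > 1 stretches them apart; in both cases values that would exceed 255 saturate at the top of the range.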
In the 1960s, the legendary Stanford artificial intelligence pioneer, John McCarthy, famously gave a graduate student the job of “solving” computer vision as a summer project. It has occupied an entire community of academic researchers for the past 4…
Participate in Reproducible Research General Image Processing OpenCV (C/C++ code, BSD lic) Image manipulation, matrix manipulation, transforms Torch3Vision (C/C++ code, BSD lic) Basic image processing, matrix manipulation and feature extraction algor…
The Chinese translation has been published; for details see: http://blog.csdn.net/ZhangRelay/article/category/6506865 Learning ROS for Robotics Programming Second Edition study notes (5) indigo computer vision FireWire IEEE1394 cameras: none USB cameras -- sudo apt-get install ros-indigo-usb-cam -- roslaunch chapt…
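This post summarizes my understanding of image storage formats. All data in a computer is binary, and images are no exception. This is how the OpenCV documentation describes it; in code, an image is stored as a matrix, as in the figure below (BGR format): The smallest unit of an image is the pixel. Here each pixel's value is given in BGR (the blue, green, red notation), and mixing B, G, and R can produce the values of visible light. For a single-channel image (for example, an image after grayscale conversion), there is only one column per pixel. For reference, the visible light spectrum: Because all objects are made of atoms, atoms are always in motion, and that motion produces light waves, different objects produce different light waves, and humans…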
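The matrix layout described above can be illustrated with a tiny BGR image. This is only a sketch: nested lists stand in for the OpenCV matrix, the helper names `flatten` and `to_gray` are hypothetical, and the grayscale weights (0.114 for B, 0.587 for G, 0.299 for R) are the commonly used luma weights, assumed here rather than taken from the post.

```python
# A hypothetical 2x2 image: rows x columns x channels, channel order B, G, R.
image = [
    [[255, 0, 0], [0, 255, 0]],      # blue pixel, green pixel
    [[0, 0, 255], [255, 255, 255]],  # red pixel, white pixel
]


def flatten(image):
    """Row-major view with channels interleaved: B G R B G R ..."""
    return [value for row in image for pixel in row for value in pixel]


def to_gray(image):
    """Collapse each B, G, R triple into a single channel value."""
    return [
        [int(round(0.114 * b + 0.587 * g + 0.299 * r)) for b, g, r in row]
        for row in image
    ]


flat = flatten(image)
gray = to_gray(image)

print(flat)  # 12 values: 3 per pixel
print(gray)  # 4 values: 1 per pixel
```

The three-channel image needs three values per pixel, while the grayscale result keeps a single value per pixel, which matches the single-channel case described above.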