A street-scene dataset for autonomous driving recently released by UC Berkeley. It is claimed to be larger than Cityscapes and to generalize better.

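Semantic Instance Segmentation

The dataset has 40 object categories in total.

Comparison with Cityscapes

The street-scene data comes from US cities

Models trained on it are therefore more familiar with American street scenes.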

Image tags

Time of day: Daytime, Nighttime, Dawn/Dusk;

Scene: Residential, Highway, City street, Parking lot, Gas station, Tunnel;

Weather: Clear, Partly cloudy, Overcast, Rainy, Snowy, Foggy;
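As a rough illustration of how these image tags might be used, the sketch below counts images per tag value and filters by a tag combination. The record layout (a dict with `timeofday`, `scene`, and `weather` keys) and the sample values are assumptions made for this example, not the dataset's documented schema.

```python
from collections import Counter

# Hypothetical per-image tag records; the key names are assumptions for this sketch.
images = [
    {"name": "a.jpg", "timeofday": "daytime", "scene": "city street", "weather": "clear"},
    {"name": "b.jpg", "timeofday": "nighttime", "scene": "highway", "weather": "rainy"},
    {"name": "c.jpg", "timeofday": "daytime", "scene": "tunnel", "weather": "clear"},
]


def count_by_tag(records, tag):
    """Count how many records carry each value of the given tag."""
    return Counter(record[tag] for record in records)


def filter_by_tags(records, **wanted):
    """Return the records whose tags match every key=value pair in `wanted`."""
    return [r for r in records if all(r.get(k) == v for k, v in wanted.items())]


weather_counts = count_by_tag(images, "weather")
clear_daytime = filter_by_tags(images, timeofday="daytime", weather="clear")
print(weather_counts)
print([r["name"] for r in clear_daytime])
```

Filtering like this is one way to build scene-specific subsets, for example a night-time-only validation split.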

Label Maps

The semantic segmentation annotations are stored as label maps, not as training indices.
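The distinction can be sketched as follows: a label map stores one raw label ID per pixel, while training indices are a compact 0..N-1 numbering of the kind a loss function typically expects. The ID values, the `ID_TO_TRAIN_INDEX` table, and the ignore value below are invented for illustration; the real mapping should come from the dataset's own label definitions.

```python
# Hypothetical mapping from raw label IDs to contiguous training indices.
ID_TO_TRAIN_INDEX = {7: 0, 8: 1, 26: 2}
IGNORE_INDEX = 255  # assumed value for pixels excluded from training


def to_train_indices(label_map):
    """Convert a 2D label map (rows of raw label IDs) to training indices."""
    return [
        [ID_TO_TRAIN_INDEX.get(label_id, IGNORE_INDEX) for label_id in row]
        for row in label_map
    ]


# Toy 2x3 label map; ID 0 has no entry in the table and becomes IGNORE_INDEX.
label_map = [
    [7, 7, 8],
    [26, 0, 8],
]
train_map = to_train_indices(label_map)
print(train_map)
```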

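Better generalization

Testing a Dilated Residual Network (with identical hyperparameters) on the two datasets gives the relationship shown in the table below:

Train	Test	Accuracy
DeepDrive	DeepDrive	High
DeepDrive	Cityscapes	Low
Cityscapes	DeepDrive	Low
Cityscapes	Cityscapes	High

When a model is trained and tested on the same dataset, results are good, but accuracy drops significantly when it is tested on the other dataset. A model trained on DeepDrive performs worse on the Cityscapes test set, yet some of its results are better than those of a model trained on that specific scenario. This suggests that the dataset covers more kinds of scenes, so models trained on it should generalize better.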

Source for the above: https://arxiv.org/abs/1805.04687
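The comparison above only reports accuracy as "High" or "Low". For readers who want to run a similar cross-dataset check themselves, a common segmentation metric is mean intersection-over-union (mIoU). The sketch below is a plain-Python version on flattened label lists; choosing mIoU is an assumption of this example, and the toy values are made up rather than taken from any experiment.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union over classes present in pred or target."""
    ious = []
    for cls in range(num_classes):
        intersection = sum(1 for p, t in zip(pred, target) if p == cls and t == cls)
        union = sum(1 for p, t in zip(pred, target) if p == cls or t == cls)
        if union > 0:  # skip classes absent from both prediction and ground truth
            ious.append(intersection / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy flattened per-pixel labels (made-up values, not real results).
target = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(round(score, 3))
```

Computing this score for every train/test pairing would fill in a numeric version of the four-row table.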

Dataset details

File structure:

bdd100k
|   seg
|    |  images
|    |    |  train
|    |    |  val
|    |    |  test
|    |  color_labels
|    |    |  train
|    |    |  val
|    |  labels
|    |    |  train
|    |    |  val
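Before running any per-file checks, it can help to confirm that the directory layout listed above is actually in place. The following is a minimal sketch; the `missing_dirs` helper and the `EXPECTED_DIRS` list are names introduced for this example, and the root path `bdd100k` should be adjusted to wherever the archive was extracted.

```python
from pathlib import Path

# Sub-directories taken from the layout listed above.
EXPECTED_DIRS = [
    "seg/images/train", "seg/images/val", "seg/images/test",
    "seg/color_labels/train", "seg/color_labels/val",
    "seg/labels/train", "seg/labels/val",
]


def missing_dirs(root):
    """Return the expected sub-directories that do not exist under `root`."""
    root = Path(root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]


# "bdd100k" is the dataset root from the listing; adjust to the local path.
for d in missing_dirs("bdd100k"):
    print(f"missing: {d}")
```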

A Python 3 script to check the integrity of the dataset

import os
import sys

# Run from inside the bdd100k/seg directory: python3 checkdata.py <train|val>
if len(sys.argv) != 2:
    print('Usage: python3 checkdata.py <train|val>')
    sys.exit(1)

dataset_category = sys.argv[1]
if dataset_category not in {'train', 'val'}:
    print(f'Invalid argument "{dataset_category}"')
    sys.exit(2)

# Expected number of files in each split
data_size = 7000 if dataset_category == 'train' else 1000

dir_root = '.'
dir_color = os.path.join(dir_root, 'color_labels', dataset_category)
dir_imgs = os.path.join(dir_root, 'images', dataset_category)
dir_label = os.path.join(dir_root, 'labels', dataset_category)

# os.listdir returns names in arbitrary order, so sort before comparing by position
color_names = sorted(os.listdir(dir_color))
img_names = sorted(os.listdir(dir_imgs))
label_names = sorted(os.listdir(dir_label))

assert len(color_names) == len(img_names) == len(label_names) == data_size

for color_name, img_name, label_name in zip(color_names, img_names, label_names):
    # The three files for one image share the same ID prefix
    prefix_color = color_name.split('_')[0]
    prefix_img = img_name.split('.')[0]
    prefix_label = label_name.split('_')[0]
    assert prefix_color == prefix_img == prefix_label, f'{prefix_color}, {prefix_img}, {prefix_label}'

print('All Good!')

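The JSON files containing the segmentation polygon information have not been released yet, so for now only segmentation is possible, not detection + segmentation. The detection-only annotation files, however, are already provided, and a viewer tool can be used to inspect the annotated bounding boxes and the three image tags (time of day, scene, weather).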
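For readers who prefer to inspect the detection annotations programmatically rather than through a viewer, the sketch below pulls the bounding boxes and image tags out of a JSON record. The field names (`name`, `attributes`, `labels`, `category`, `box2d` with `x1`/`y1`/`x2`/`y2`) are an assumption about the file layout and the sample record is made up; check them against the actual annotation files before relying on this.

```python
import json

# A made-up record in the layout assumed for this sketch.
sample = """
[
  {
    "name": "example.jpg",
    "attributes": {"weather": "clear", "scene": "city street", "timeofday": "daytime"},
    "labels": [
      {"category": "car", "box2d": {"x1": 10.0, "y1": 20.0, "x2": 110.0, "y2": 80.0}},
      {"category": "lane"}
    ]
  }
]
"""


def boxes_per_image(frames):
    """Yield (image name, tags, list of (category, x1, y1, x2, y2)) per frame."""
    for frame in frames:
        boxes = []
        for label in frame.get("labels", []):
            box = label.get("box2d")
            if box is None:  # skip annotations that carry no rectangle
                continue
            boxes.append((label["category"], box["x1"], box["y1"], box["x2"], box["y2"]))
        yield frame["name"], frame.get("attributes", {}), boxes


results = list(boxes_per_image(json.loads(sample)))
for name, tags, boxes in results:
    print(name, tags, boxes)
```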

Current pitfalls in the official code

https://github.com/ucbdrive/bdd-data/issues/17

https://github.com/ucbdrive/bdd-data/issues/5

https://github.com/ucbdrive/bdd-data/issues/15

Of these, issue #15 is still unresolved.


A First Look at the Deep Drive Dataset