getting started with building a ROS simulation platform for Deep Reinforcement Learning
Apparently, this ongoing work is to make a preparation for futural research on Deep Reinforcement Learning. The goal of this work is to build a simulation platform that can insert the Deep Reinforcement Learning algorithms as a robot motion planning or navigation module.
I spent all day to position what I should do in this part of work. With an ambiguous goal and a big picture of the whole project, I am almost lost in the information I encountered in knowing about the related fields and knowledge.
In the end of the day, I hope what is now shaped in my mind is enough close to what my boss need. Below are what I believe to explain my work in the future.
Overview of the project
The project finally hopes to determine the robot's motion trajectory in real time by the method of Deep Reinforcement Learning. Given hundreds of different indoor environments the robot is trained to have the ability to move to a specified target without explicitly programmed navigation and obstacle avoidance algorithms.The following paper
The following paper are some related work that I believe could be useful.
Target-driven visual navigation in indoor scenes using deep reinforcement learning
Active Object Localization with Deep Reinforcement Learning
Deep Neural Network for Real-Time Autonomous Indoor Navigation
Autonomous Navigation Planning with ROS
And a git book is quite complete in AI:
https://www.gitbook.com/book/leonardoaraujosantos/artificial-inteligence/details
ROS simulation
Gathering or building virtual indoor worlds
...
Replacing some modules in Navigation stack for Deep Reinforcement Learning algorithms
Navigation stack is a commonly used navigation module in ROS platform and SLAM tasks, of which the architecture is shown below. I believe some modules shown in this diagram can be replaced by DRL algorithm which I still need to dig deeper to determine.

A tutorial might walk through me all the way from building a customised robot to navigating this robot in Gazebo is offered here, which I would follow in the next few days and understand more on the relationship among those modules.
[Tutorial] Getting Starting with Autonomous Robots in ROS via Simulations
...
getting started with building a ROS simulation platform for Deep Reinforcement Learning的更多相关文章
- 中文译文:Minerva-一种可扩展的高效的深度学习训练平台(Minerva - A Scalable and Highly Efficient Training Platform for Deep Learning)
Minerva:一个可扩展的高效的深度学习训练平台 zoerywzhou@gmail.com http://www.cnblogs.com/swje/ 作者:Zhouwan 2015-12-1 声明 ...
- Awesome Reinforcement Learning
Awesome Reinforcement Learning A curated list of resources dedicated to reinforcement learning. We h ...
- AI佳作解读系列(三)——深度学习中的合成数据研究
Below are some investigation resources for synthetic datasets: 1. Synthetic datasets vs. real images ...
- Evolutionary approaches towards AI: past, present, and future
Evolutionary approaches towards AI: past, present, and future 2019-10-06 07:28:13 This blog is from: ...
- 斯坦福CS课程列表
http://exploredegrees.stanford.edu/coursedescriptions/cs/ CS 101. Introduction to Computing Principl ...
- 【机器学习Machine Learning】资料大全
昨天总结了深度学习的资料,今天把机器学习的资料也总结一下(友情提示:有些网站需要"科学上网"^_^) 推荐几本好书: 1.Pattern Recognition and Machi ...
- 【深度学习Deep Learning】资料大全
最近在学深度学习相关的东西,在网上搜集到了一些不错的资料,现在汇总一下: Free Online Books by Yoshua Bengio, Ian Goodfellow and Aaron C ...
- (转) Awesome Deep Learning
Awesome Deep Learning Table of Contents Free Online Books Courses Videos and Lectures Papers Tutori ...
- Top Deep Learning Projects in github
Top Deep Learning Projects A list of popular github projects related to deep learning (ranked by sta ...
随机推荐
- Jquery中$(document).ready()与传统JavaScript中的window.onload方法的区别(2016/8/3)
Jquery中$(document).ready()的作用类似于传统JavaScript中的window.onload方法,不过与window.onload方法还是有区别的. 1.执行时间 ...
- linux下xargs命令用法详解 【转】
转自:http://blog.chinaunix.net/uid-128922-id-289992.html xargs在linux中是个很有用的命令,它经常和其他命令组合起来使用,非常的灵活. xa ...
- 使用percona xtradb cluster的IST方式添加新节点
使用percona xtradb cluster的IST(Incremental State Transfer)特性添加新节点,防止新节点加入时使用SST(State SnapShop Transfe ...
- Maven invalid task...
执行maven构建项目报错: Invalid task '‐DgroupId=*': you must specify a valid lifecycle phase, or a goal in th ...
- ACM题目————字串数
Description 一个A和两个B一共可以组成三种字符串:"ABB","BAB","BBA". 给定若干字母和它们相应的个数,计算一共可 ...
- css2---必须学的经典---定位问题
定位 position body<html<文档 定位是相对文档的 不是相对 body 或 html 1.position:relative 相对定位 a.不能使内嵌元素支持宽高 b. ...
- 鼠标放上去,div高度随文字增加,并显示剩余的文字。
/*这里是鼠标放上去显示全名 */ .kb2wText{display:block; height:20px; width:150px; line-height:20px; color:#0 ...
- 多校赛3- Painter 分类: 比赛 2015-07-29 19:58 3人阅读 评论(0) 收藏
D - Painter Time Limit:1000MS Memory Limit:65536KB 64bit IO Format:%I64d & %I64u Submit Status P ...
- opencv通过dll调用matlab函数,图片作为参数
[blog 项目实战派]opencv通过dll调用matlab函数,图片作为参数 前文介绍了如何“csharp通过dll调用opencv函数,图片作为参数”.而在实 ...
- JAVA基础知识之JVM-——反射和泛型
泛型和Class类 在反射中使用泛型Class<T>可以避免强制类型转换,下面是一个简单例子,如果不使用泛型的话,需要显示转换, package aop; import java.util ...