Getting Started with Building a ROS Simulation Platform for Deep Reinforcement Learning

xpharry 2024-10-17 03:37:02

This ongoing work is preparation for future research on Deep Reinforcement Learning (DRL). The goal is to build a simulation platform into which Deep Reinforcement Learning algorithms can be plugged as a robot motion planning or navigation module.

I spent the whole day working out what I should do in this part of the work. With an ambiguous goal and only a big picture of the whole project, I was almost lost in the information I came across while reading up on the related fields.

At the end of the day, I hope the picture that has taken shape in my mind is close enough to what my boss needs. Below is what I believe describes my upcoming work.

Overview of the project

The project ultimately aims to determine the robot's motion trajectory in real time using Deep Reinforcement Learning. Trained across hundreds of different indoor environments, the robot should learn to move to a specified target without explicitly programmed navigation or obstacle avoidance algorithms.
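To make the training setup concrete, here is a minimal sketch of the kind of episodic interface a DRL agent would interact with. It is only an illustration under loud assumptions: the class `GridNavEnv`, the four discrete actions, and the reward values are all hypothetical, and a toy grid stands in for the Gazebo/ROS worlds the real platform would use.

```python
import random
from typing import List, Tuple

# Hypothetical toy stand-in for a simulated indoor world; a real platform
# would wrap Gazebo/ROS instead of this grid.
Cell = Tuple[int, int]

# Four discrete moves: up, down, left, right (an assumption for illustration).
ACTIONS: List[Cell] = [(0, 1), (0, -1), (-1, 0), (1, 0)]


class GridNavEnv:
    """Episodic navigation task: reach `target` without hitting obstacles."""

    def __init__(self, size: int, obstacles: List[Cell], target: Cell,
                 max_steps: int = 50):
        self.size = size
        self.obstacles = set(obstacles)
        self.target = target
        self.max_steps = max_steps
        self.pos: Cell = (0, 0)
        self.steps = 0

    def reset(self) -> Tuple[int, int, int, int]:
        """Start a new episode at the origin and return the first observation."""
        self.pos = (0, 0)
        self.steps = 0
        return self._observe()

    def _observe(self) -> Tuple[int, int, int, int]:
        # Observation: robot position followed by target position.
        return (self.pos[0], self.pos[1], self.target[0], self.target[1])

    def step(self, action: int):
        """Apply one action; return (observation, reward, done)."""
        dx, dy = ACTIONS[action]
        nxt = (self.pos[0] + dx, self.pos[1] + dy)
        self.steps += 1
        in_bounds = 0 <= nxt[0] < self.size and 0 <= nxt[1] < self.size
        if not in_bounds or nxt in self.obstacles:
            # Collision with a wall or obstacle: penalize and end the episode.
            return self._observe(), -1.0, True
        self.pos = nxt
        if self.pos == self.target:
            # Reached the target: reward and end the episode.
            return self._observe(), 1.0, True
        # Small per-step cost encourages short paths; stop at the step limit.
        return self._observe(), -0.01, self.steps >= self.max_steps


def run_random_episode(env: GridNavEnv, seed: int = 0) -> float:
    """Roll out a random policy; a DRL agent would replace the random choice."""
    rng = random.Random(seed)
    env.reset()
    total, done = 0.0, False
    while not done:
        _, reward, done = env.step(rng.randrange(len(ACTIONS)))
        total += reward
    return total
```

Training over "hundreds of different indoor environments" would then amount to constructing many such environments with different obstacle layouts and targets, and letting the agent collect episodes from each.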

The following papers are related work that I believe could be useful.

Target-driven visual navigation in indoor scenes using deep reinforcement learning

Active Object Localization with Deep Reinforcement Learning

Deep Neural Network for Real-Time Autonomous Indoor Navigation

Autonomous Navigation Planning with ROS

There is also a GitBook with fairly complete coverage of AI:

https://www.gitbook.com/book/leonardoaraujosantos/artificial-inteligence/details

ROS simulation

Gathering or building virtual indoor worlds

...

Replacing some modules in Navigation stack for Deep Reinforcement Learning algorithms

The Navigation stack is a commonly used navigation module on the ROS platform and in SLAM tasks; its architecture is shown below. I believe some of the modules in this diagram can be replaced by a DRL algorithm, but I still need to dig deeper to determine which ones.
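One way to picture such a replacement, purely as a sketch: assume, for illustration only, that the module being swapped out is the one that turns sensor readings and a goal into velocity commands. Which module actually gets replaced is still undecided, and every name below (`Observation`, `placeholder_policy`, `make_velocity_source`, the speed limits) is hypothetical. The hand-written rule only marks the slot where a trained network would go.

```python
import math
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass
class Observation:
    """Inputs a learned policy might consume (hypothetical layout)."""
    ranges: Sequence[float]   # laser scan distances, meters
    goal_distance: float      # distance to the goal, meters
    goal_bearing: float       # angle to the goal in the robot frame, radians


# A policy maps an observation to (linear m/s, angular rad/s).
Policy = Callable[[Observation], Tuple[float, float]]


def placeholder_policy(obs: Observation) -> Tuple[float, float]:
    """Stand-in for a trained network, NOT a learned policy.

    It slows down near the closest obstacle and turns toward the goal,
    only so that the surrounding plumbing can be exercised.
    """
    nearest = min(obs.ranges) if obs.ranges else math.inf
    linear = min(obs.goal_distance, nearest)
    angular = obs.goal_bearing
    return linear, angular


def make_velocity_source(policy: Policy,
                         max_linear: float = 0.5,
                         max_angular: float = 1.0) -> Policy:
    """Wrap any policy so its output stays within the robot's speed limits."""
    def command(obs: Observation) -> Tuple[float, float]:
        linear, angular = policy(obs)
        # Clamp: no reversing, and bounded forward and turning speed.
        linear = max(0.0, min(max_linear, linear))
        angular = max(-max_angular, min(max_angular, angular))
        return linear, angular
    return command
```

Keeping the policy behind a plain function type means a trained model can later be dropped in without touching the clamping wrapper. In a ROS node, the resulting pair would typically be published as a velocity command for the base, but that wiring is left out here.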

A tutorial that should walk me all the way from building a customised robot to navigating it in Gazebo is linked here. I will follow it over the next few days to better understand the relationships among those modules.

[Tutorial] Getting Starting with Autonomous Robots in ROS via Simulations

...
