【CUDA开发】 Check failed: error == cudaSuccess (8 vs. 0) invalid device function

最近在复现R-CNN一系列的实验时，配置代码环境真是花费了不少时间。由于对MATLAB不熟悉，实验采用的都是github上rbg大神的Python版本。在配置Faster
R-CNN时，编译没有问题，一运行 ./tools/demo.py --net zf 就会出现如下错误：

<span style="font-size:14px;">Loaded network ./data/faster_rcnn_models/ZF_faster_rcnn_final.caffemodel

F1008 roi_pooling_layer.cu:91] Check failed: error == cudaSuccess (8 vs. 0) invalid device function

*** Check failure stack trace: *** </span>

但是采用CPU mode运行时可以成功。

最后在https://github.com/rbgirshick/py-faster-rcnn/issues/2
找到了我想要的答案，有兴趣的可以慢慢阅读。

不想看的话，就直接按照我下面的方式修改。

一般情况下都是因为显卡的计算能力不同而导致的，修改 py-faster-rcnn/lib/setup.py 的第135行，将arch改为与你显卡相匹配的数值，（比如我的GTX 760，计算能力是3.0，就将sm_35改成了sm_30）然后删除utils/bbox.c，nms/cpu_nms.c ，nms/gpu_nms.cpp 重新编译即可

我看到有些人说还有其他的问题，那么可以在最开始的makefile.config文件中就开始修改，不过我没有试过,具体步骤如下

<span style="font-size:14px;">As below, there is my solution (thress steps):
1 if you're using the GPU instance on AWS, then please change the architecture setting into:
# CUDA architecture setting: going with all of them.
# For CUDA < 6.0, comment the *_50 lines for compatibility.
CUDA_ARCH := -gencode arch=compute_30,code=sm_30 \
-gencode arch=compute_50,code=sm_50 \
-gencode arch=compute_50,code=compute_50
Because the GPU in AWS does not support compute_35
2 I changed sm_35 into sm_30 in lib/setup.py file
3 cd lib, remove these files: utils/bbox.c nms/cpu_nms.c nms/gpu_nms.cpp, if they exist.
And then make && cd ../caffe/ && make clean && make -j8 && make pycaffe -j8 </span>

【CUDA开发】 Check failed: error == cudaSuccess (8 vs. 0) invalid device function的更多相关文章

caffe运行错误： im2col.cu:61] Check failed: error == cudaSuccess (8 vs. 0) invalid device function
错误: im2col.cu:61] Check failed: error == cudaSuccess (8 vs. 0) invalid device function 原因:由于Makefil ...
配置SSD-caffe测试时出现“Check failed: error == cudaSuccess (10 vs. 0) invalid device ordinal”解决方案
这是由于GPU数量不匹配造成的,如果训练自己的数据,那么我们只需要将solver.prototxt文件中的device_id项改为自己的GPU块数,一块就是0,两块就是1,以此类推. 但是SSD配置时 ...
caffe 训练时，出现错误：Check failed: error == cudaSuccess (4 vs. 0) unspecified launch failure
I0415 15:03:37.603461 27311 solver.cpp:42] Solver scaffolding done.I0415 15:03:37.603549 27311 solve ...
Caffe 分类问题 Check failed: error == cudaSuccess (2 vs. 0) out of memory
如果图片过大,需要适当缩小batch_size的值,否则使用GPU时可能超出其缓存大小而报错
check failed status == cudnn_status_success (4 vs. 0) cudnn_status_internal_error
Check failed: error == cudaSuccess (30 vs. 0) unknown error 这个有可能是显存不足造成的,或者网络参数不对造成的 check failed ...
目标检测faster rcnn error == cudaSuccess (2 vs. 0) out of memory
想尝试更深更强的网络,或者自己写了一个费显存的层,发现1080 ti的11G显存不够用了,老师报显存不够怎么办? Check failed: error == cudaSuccess (2 vs. ...
Check failed: status == CUBLAS_STATUS_SUCCESS (11 vs. 0) CUBLAS_STATUS_MAPPING_ERROR
I0930 21:23:15.115576 30918 solver.cpp:281] Learning Rate Policy: multistepF0930 21:23:17.263314 310 ...
CUDA报错： Cannot create Cublas handle. Cublas won't be available. 以及：Check failed: status == CUBLAS_STATUS_SUCCESS (1 vs. 0) CUBLAS_STATUS_NOT_INITIALIZED
Error描述: aita@aita-Alienware-Area-51-R5:~/AITA2/daisida/ssd-github/caffe$ make runtest -j8 .build_re ...
windows7下解决caffe check failed registry.count(type) == 1(0 vs. 1) unknown layer type问题
在Windows7下调用vs2013生成的Caffe静态库时经常会提示Check failed: registry.count(type) == 1 (0 vs. 1) Unknown layer t ...

随机推荐

DT6.0关于SQL注入漏洞修复问题
阿里云安全平台提示:Destoon SQL注入,关于: Destoon的/mobile/guestbook.php中$do->add($post);这行代码对参数$post未进行正确转义,导致黑 ...
py3+requests+json+xlwt，爬取拉勾招聘信息
在拉勾搜索职位时,通过谷歌F12抓取请求信息发现请求是一个post请求,参数为: 返回的是json数据有了上面的基础,我们就可以构造请求了然后对获取到的响应反序列化,这样就获取到了json格式的 ...
nginx1.15.10配置使用非https访问返回403
nginx版本号:nginx version: nginx/1.15.10 server { listen 443 default ssl; server_name app.test.com; if ...
java项目部署
本文章只为帮助大家学习项目的发布,为基础篇,在此给大家示范在window环境下的项目部署及运维. 以下版本为讲解示例,可自行改至匹配版本. 服务器:window service2008 R2 Stan ...
SpringBoot第三节(thymeleaf的配置与SpringBoot注解大全)
Springboot默认是不支持JSP的,默认使用thymeleaf模板引擎.所以这里介绍一下Springboot使用Thymeleaf的实例以及遇到的问题. 1.配置与使用 1.1:在applica ...
计蒜之道百度AI小课堂-上升子序列
计蒜之道百度AI小课堂-上升子序列题目描述给一个长度为 $n$ 的数组 $a$ .试将其划分为两个严格上升子序列,并使其长度差最小. 输入格式输入包含多组数据. 数据的第一行为一个正整 ...
C++之Lambda研究
目录目录 1 1. 前言 1 2. 示例1 1 3. 示例2 2 4. 示例3 3 5. 示例4 3 6. 示例5 6 7. 匿名类规则 6 8. 参考资料 7 1. 前言本文代码测试环境为“GC ...
【后缀数组】【LuoguP4051】 [JSOI2007]字符加密
题目链接题目描述喜欢钻研问题的JS 同学,最近又迷上了对加密方法的思考.一天,他突然想出了一种他认为是终极的加密办法:把需要加密的信息排成一圈,显然,它们有很多种不同的读法. 例如'JSOI07' ...
干货 | 10分钟搞懂branch and bound（分支定界）算法的代码实现附带java代码
Outline 前言 Example-1 Example-2 运行说明 00 前言前面一篇文章我们讲了branch and bound算法的相关概念.可能大家对精确算法实现的印象大概只有一个,调用求 ...
jmeter待解决55大问题
客户交付一个性能测试项目,阐述实施流程. 解释5个常用的性能指标的名称与具体含义. 写出5个jmeter中常用函数,并对其中2个举例说明用法. 简述jmeter的工作原理? 什么是集合点?设置集合点有 ...

【CUDA开发】 Check failed: error == cudaSuccess (8 vs. 0) invalid device function

【CUDA开发】 Check failed: error == cudaSuccess (8 vs. 0) invalid device function的更多相关文章

随机推荐

热门专题