psatck

  ‌pstack命令‌是一个在Linux系统中用于查看进程堆栈信息的工具。

  写了一个服务端死锁程序,如下:

  

#include <iostream>
#include <thread>
#include <mutex>
#include <chrono>
#include <sys/socket.h>
#include <netinet/in.h>
#include <unistd.h>
#include <cstring> // 定义两个互斥锁
std::mutex mutex1;
std::mutex mutex2; // 处理客户端请求的函数
void handle_client(int client_socket, int client_id) {
std::cout << "Client " << client_id << ": Connected" << std::endl; // 模拟客户端请求处理
if (client_id == 1) {
// 客户端 1:先获取 mutex1,再获取 mutex2
std::cout << "Client " << client_id << ": Trying to lock mutex1..." << std::endl;
std::lock_guard<std::mutex> lock1(mutex1);
std::this_thread::sleep_for(std::chrono::seconds(5)); // 增加锁的持有时间
std::cout << "Client " << client_id << ": Locked mutex1, now trying to lock mutex2..." << std::endl; // 尝试获取 mutex2
std::lock_guard<std::mutex> lock2(mutex2); // 死锁发生点
std::this_thread::sleep_for(std::chrono::seconds(5)); // 模拟更多工作
std::cout << "Client " << client_id << ": Locked both mutex1 and mutex2" << std::endl;
} else if (client_id == 2) {
// 客户端 2:先获取 mutex2,再获取 mutex1
std::cout << "Client " << client_id << ": Trying to lock mutex2..." << std::endl;
std::lock_guard<std::mutex> lock2(mutex2);
std::this_thread::sleep_for(std::chrono::seconds(5)); // 增加锁的持有时间
std::cout << "Client " << client_id << ": Locked mutex2, now trying to lock mutex1..." << std::endl; // 尝试获取 mutex1
std::lock_guard<std::mutex> lock1(mutex1); // 死锁发生点
std::this_thread::sleep_for(std::chrono::seconds(5)); // 模拟更多工作
std::cout << "Client " << client_id << ": Locked both mutex1 and mutex2" << std::endl;
} // 关闭客户端连接
close(client_socket);
std::cout << "Client " << client_id << ": Disconnected" << std::endl;
} // TCP 服务器主函数
void start_server(int port) {
int server_fd, new_socket;
struct sockaddr_in address;
int opt = 1;
int addrlen = sizeof(address); // 创建 socket 文件描述符
if ((server_fd = socket(AF_INET, SOCK_STREAM, 0)) == 0) {
perror("socket failed");
exit(EXIT_FAILURE);
} // 设置 SO_REUSEADDR 选项
if (setsockopt(server_fd, SOL_SOCKET, SO_REUSEADDR | SO_REUSEPORT, &opt, sizeof(opt))) {
perror("setsockopt failed");
exit(EXIT_FAILURE);
} // 绑定 socket 到指定端口
address.sin_family = AF_INET;
address.sin_addr.s_addr = INADDR_ANY;
address.sin_port = htons(port); if (bind(server_fd, (struct sockaddr *)&address, sizeof(address)) < 0) {
perror("bind failed");
exit(EXIT_FAILURE);
} // 监听连接
if (listen(server_fd, 3) < 0) {
perror("listen failed");
exit(EXIT_FAILURE);
} std::cout << "Server started on port " << port << ". Waiting for connections..." << std::endl; int client_id = 1; // 用于区分不同客户端 while (true) {
// 接受新的客户端连接
if ((new_socket = accept(server_fd, (struct sockaddr *)&address, (socklen_t*)&addrlen)) < 0) {
perror("accept failed");
continue;
} // 为每个客户端创建一个新线程
std::thread client_thread(handle_client, new_socket, client_id++);
client_thread.detach(); // 分离线程,允许其独立运行
}
} int main() {
int port = 8080;
start_server(port);
return 0;
}

tcp_deadlock_server.cpp

  编译:g++ -std=c++11 -pthread -o tcp_deadlock_server tcp_deadlock_server.cpp -g  

用telnet(telnet 127.1 8080)连两次就会死锁,服务端输出如下:  

Server started on port 8080. Waiting for connections...
Client 1: Connected
Client 1: Trying to lock mutex1...
Client 2: Connected
Client 2: Trying to lock mutex2...
Client 1: Locked mutex1, now trying to lock mutex2...
Client 2: Locked mutex2, now trying to lock mutex1...

  pstack调试死锁

  ps查看进程ID,然后pstack + 进程ID : pstack  915 > pstack_out,将输出重定向到文件,好看一些:

  

Thread 3 (LWP 919):
#0 0x0000fffcc23821dc in ?? () from /lib64/libpthread.so.0
#1 0x0000fffcc237b060 in pthread_mutex_lock () from /lib64/libpthread.so.0
#2 0x00000000004012c4 in __gthread_mutex_lock (__mutex=0x420240 <mutex1>) at /usr/include/c++/7.3.0/aarch64-linux-gnu/bits/gthr-default.h:748
#3 0x0000000000401a88 in std::mutex::lock (this=0x420240 <mutex1>) at /usr/include/c++/7.3.0/bits/std_mutex.h:103
#4 0x0000000000401b34 in std::lock_guard<std::mutex>::lock_guard (this=0xfffcc19ce810, __m=...) at /usr/include/c++/7.3.0/bits/std_mutex.h:162
#5 0x00000000004015ac in handle_client (client_socket=5, client_id=2) at tcp_deadlock_server.cpp:38
#6 0x0000000000402308 in std::__invoke_impl<void, void (*)(int, int), int, int> (__f=@0x248c23e0: 0x401310 <handle_client(int, int)>, __args#0=@0x248c23dc: 5, __args#1=@0x248c23d8: 2) at /usr/include/c++/7.3.0/bits/invoke.h:60
#7 0x0000000000401e18 in std::__invoke<void (*)(int, int), int, int> (__fn=@0x248c23e0: 0x401310 <handle_client(int, int)>, __args#0=@0x248c23dc: 5, __args#1=@0x248c23d8: 2) at /usr/include/c++/7.3.0/bits/invoke.h:95
#8 0x00000000004029cc in std::thread::_Invoker<std::tuple<void (*)(int, int), int, int> >::_M_invoke<0ul, 1ul, 2ul> (this=0x248c23d8) at /usr/include/c++/7.3.0/thread:234
#9 0x0000000000402970 in std::thread::_Invoker<std::tuple<void (*)(int, int), int, int> >::operator() (this=0x248c23d8) at /usr/include/c++/7.3.0/thread:243
#10 0x0000000000402950 in std::thread::_State_impl<std::thread::_Invoker<std::tuple<void (*)(int, int), int, int> > >::_M_run (this=0x248c23d0) at /usr/include/c++/7.3.0/thread:186
#11 0x0000fffcc257e134 in ?? () from /lib64/libstdc++.so.6
#12 0x0000fffcc23788cc in ?? () from /lib64/libpthread.so.0
#13 0x0000fffcc22ba1ec in ?? () from /lib64/libc.so.6
Thread 2 (LWP 917):
#0 0x0000fffcc23821dc in ?? () from /lib64/libpthread.so.0
#1 0x0000fffcc237b060 in pthread_mutex_lock () from /lib64/libpthread.so.0
#2 0x00000000004012c4 in __gthread_mutex_lock (__mutex=0x420270 <mutex2>) at /usr/include/c++/7.3.0/aarch64-linux-gnu/bits/gthr-default.h:748
#3 0x0000000000401a88 in std::mutex::lock (this=0x420270 <mutex2>) at /usr/include/c++/7.3.0/bits/std_mutex.h:103
#4 0x0000000000401b34 in std::lock_guard<std::mutex>::lock_guard (this=0xfffcc21de820, __m=...) at /usr/include/c++/7.3.0/bits/std_mutex.h:162
#5 0x0000000000401450 in handle_client (client_socket=4, client_id=1) at tcp_deadlock_server.cpp:27
#6 0x0000000000402308 in std::__invoke_impl<void, void (*)(int, int), int, int> (__f=@0x248c2290: 0x401310 <handle_client(int, int)>, __args#0=@0x248c228c: 4, __args#1=@0x248c2288: 1) at /usr/include/c++/7.3.0/bits/invoke.h:60
#7 0x0000000000401e18 in std::__invoke<void (*)(int, int), int, int> (__fn=@0x248c2290: 0x401310 <handle_client(int, int)>, __args#0=@0x248c228c: 4, __args#1=@0x248c2288: 1) at /usr/include/c++/7.3.0/bits/invoke.h:95
#8 0x00000000004029cc in std::thread::_Invoker<std::tuple<void (*)(int, int), int, int> >::_M_invoke<0ul, 1ul, 2ul> (this=0x248c2288) at /usr/include/c++/7.3.0/thread:234
#9 0x0000000000402970 in std::thread::_Invoker<std::tuple<void (*)(int, int), int, int> >::operator() (this=0x248c2288) at /usr/include/c++/7.3.0/thread:243
#10 0x0000000000402950 in std::thread::_State_impl<std::thread::_Invoker<std::tuple<void (*)(int, int), int, int> > >::_M_run (this=0x248c2280) at /usr/include/c++/7.3.0/thread:186
#11 0x0000fffcc257e134 in ?? () from /lib64/libstdc++.so.6
#12 0x0000fffcc23788cc in ?? () from /lib64/libpthread.so.0
#13 0x0000fffcc22ba1ec in ?? () from /lib64/libc.so.6
Thread 1 (LWP 915):
#0 0x0000fffcc23827c4 in accept () from /lib64/libpthread.so.0
#1 0x0000000000401868 in start_server (port=8080) at tcp_deadlock_server.cpp:89
#2 0x00000000004018f8 in main () at tcp_deadlock_server.cpp:102

pstack 输出

  能看到一共三个线程,Thread 3 (LWP 919)卡在pthread_mutex_lock:#5  0x00000000004015ac in handle_client (client_socket=5, client_id=2) at tcp_deadlock_server.cpp:38,

  Thread 2 (LWP 917)卡在pthread_mutex_lock:#5  0x0000000000401450 in handle_client (client_socket=4, client_id=1) at tcp_deadlock_server.cpp:27

  就发现了死锁在的位置  

  gdb

  不用c++11又写了一个程序,如下:

  

#include <iostream>
#include <cstring>
#include <unistd.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <arpa/inet.h>
#include <pthread.h>
#include <errno.h>
#include <cstdlib>
#include <fcntl.h>
#include <stdio.h> // 定义两个互斥锁
pthread_mutex_t mutex1 = PTHREAD_MUTEX_INITIALIZER;
pthread_mutex_t mutex2 = PTHREAD_MUTEX_INITIALIZER; // 处理客户端请求的函数
void* handle_client(void* arg) {
int client_socket = *(static_cast<int*>(arg));
free(arg); // 释放传递给线程的动态分配的内存 std::cout << "Client connected with socket: " << client_socket << std::endl; // 模拟客户端请求处理
if (client_socket == 4) {
// 客户端 1:先获取 mutex1,再获取 mutex2
std::cout << "Client " << client_socket << ": Trying to lock mutex1..." << std::endl;
pthread_mutex_lock(&mutex1);
usleep(5000000); // 休眠 5 秒
std::cout << "Client " << client_socket << ": Locked mutex1, now trying to lock mutex2..." << std::endl; // 尝试获取 mutex2
pthread_mutex_lock(&mutex2); // 死锁发生点
usleep(5000000); // 休眠 5 秒
std::cout << "Client " << client_socket << ": Locked both mutex1 and mutex2" << std::endl; // 释放互斥锁
pthread_mutex_unlock(&mutex2);
pthread_mutex_unlock(&mutex1);
} else if (client_socket == 5) {
// 客户端 2:先获取 mutex2,再获取 mutex1
std::cout << "Client " << client_socket << ": Trying to lock mutex2..." << std::endl;
pthread_mutex_lock(&mutex2);
usleep(5000000); // 休眠 5 秒
std::cout << "Client " << client_socket << ": Locked mutex2, now trying to lock mutex1..." << std::endl; // 尝试获取 mutex1
pthread_mutex_lock(&mutex1); // 死锁发生点
usleep(5000000); // 休眠 5 秒
std::cout << "Client " << client_socket << ": Locked both mutex1 and mutex2" << std::endl; // 释放互斥锁
pthread_mutex_unlock(&mutex1);
pthread_mutex_unlock(&mutex2);
} // 关闭客户端连接
close(client_socket);
std::cout << "Client disconnected with socket: " << client_socket << std::endl; pthread_exit(NULL);
} // TCP 服务器主函数
void start_server(int port) {
int server_fd, new_socket;
struct sockaddr_in address;
int opt = 1;
int addrlen = sizeof(address); // 创建 socket 文件描述符
if ((server_fd = socket(AF_INET, SOCK_STREAM, 0)) == 0) {
perror("socket failed");
exit(EXIT_FAILURE);
} // 设置 SO_REUSEADDR 选项
if (setsockopt(server_fd, SOL_SOCKET, SO_REUSEADDR, &opt, sizeof(opt))) {
perror("setsockopt failed");
exit(EXIT_FAILURE);
} // 绑定 socket 到指定端口
address.sin_family = AF_INET;
address.sin_addr.s_addr = INADDR_ANY;
address.sin_port = htons(port); if (bind(server_fd, (struct sockaddr *)&address, sizeof(address)) < 0) {
perror("bind failed");
exit(EXIT_FAILURE);
} // 监听连接
if (listen(server_fd, 3) < 0) {
perror("listen failed");
exit(EXIT_FAILURE);
} std::cout << "Server started on port " << port << ". Waiting for connections..." << std::endl; int client_id = 1; // 用于区分不同客户端 while (true) {
// 接受新的客户端连接
if ((new_socket = accept(server_fd, (struct sockaddr *)&address, (socklen_t*)&addrlen)) < 0) {
perror("accept failed");
continue;
} // 为每个客户端创建一个新线程
pthread_t thread;
int* client_socket_ptr = new int(new_socket); // 动态分配存储套接字描述符的内存
if (pthread_create(&thread, NULL, handle_client, static_cast<void*>(client_socket_ptr)) != 0) {
perror("pthread_create failed");
delete client_socket_ptr; // 如果线程创建失败,释放内存
close(new_socket);
continue;
} // 分离线程,允许其独立运行
pthread_detach(thread); // 为了测试死锁,只接受前两个客户端连接
if (client_id >= 3) {
close(new_socket); // 关闭多余的连接
continue;
}
client_id++;
}
} int main() {
int port = 8080;
start_server(port);
return 0;
}

tcp_deadlock_server_c++0x.cpp

  编译运行telnet测试跟上面一样

  gdb调试死锁

  ps查看进程ID,然后gdb跟进程:gdb -p 11560

  查看所有线程:info threads,进入线程:thread 2 ,然后bt查看线程堆栈,切换另一个线程如上,就能看到两个线程都卡在了lock,具体调试步骤如下:

  

(gdb) info threads
3 Thread 0x7fb6c0115700 (LWP 11562) 0x0000003b5200dff4 in __lll_lock_wait () from /lib64/libpthread.so.0
2 Thread 0x7fb6bf714700 (LWP 11564) 0x0000003b5200dff4 in __lll_lock_wait () from /lib64/libpthread.so.0
* 1 Thread 0x7fb6c0117720 (LWP 11560) 0x0000003b5200e7ed in accept () from /lib64/libpthread.so.0
(gdb) thread 2
[Switching to thread 2 (Thread 0x7fb6bf714700 (LWP 11564))]#0 0x0000003b5200dff4 in __lll_lock_wait () from /lib64/libpthread.so.0
(gdb) bt
#0 0x0000003b5200dff4 in __lll_lock_wait () from /lib64/libpthread.so.0
#1 0x0000003b52009328 in _L_lock_854 () from /lib64/libpthread.so.0
#2 0x0000003b520091f7 in pthread_mutex_lock () from /lib64/libpthread.so.0
#3 0x0000000000400f9f in handle_client (arg=0x13a8010) at tcp_deadlock_server.cpp:48
#4 0x0000003b520077f1 in start_thread () from /lib64/libpthread.so.0
#5 0x0000003b51ce570d in clone () from /lib64/libc.so.6
(gdb) thread 3
[Switching to thread 3 (Thread 0x7fb6c0115700 (LWP 11562))]#0 0x0000003b5200dff4 in __lll_lock_wait () from /lib64/libpthread.so.0
(gdb) bt
#0 0x0000003b5200dff4 in __lll_lock_wait () from /lib64/libpthread.so.0
#1 0x0000003b52009328 in _L_lock_854 () from /lib64/libpthread.so.0
#2 0x0000003b520091f7 in pthread_mutex_lock () from /lib64/libpthread.so.0
#3 0x0000000000400eb2 in handle_client (arg=0x13a8010) at tcp_deadlock_server.cpp:33
#4 0x0000003b520077f1 in start_thread () from /lib64/libpthread.so.0
#5 0x0000003b51ce570d in clone () from /lib64/libc.so.6

gdb调试死锁

  总结

  pstack和gdb都使用 ptrace() 系统调用来附着到目标进程。ptrace() 允许 GDB 暂停目标进程的执行,读取和修改其内存及寄存器,并捕获系统调用。

c++死锁调试 ,gdb pstack的更多相关文章

  1. linux-c 调试 gdb

    GDB(GNU Debugger) gcc -g –o testarg testarg.c //可执行文件中带上调试信息,用于后续的gdb调试 gdb testarg l; list //显示源程序 ...

  2. 调试多线程 & 查死锁的bug & gcore命令 & gdb对多线程的调试 & gcore & pstack & 调试常用命令

    gdb thread apply all bt 如果你发现有那么几个栈停在 pthread_wait 或者类似调用上,大致就可以得出结论:就是它们几个儿女情长,耽误了整个进程. 注意gdb的版本要高于 ...

  3. GDB死锁调试

    1.测试代码 代码中开启两个线程,加锁后轮流输出数据,其中一个线程误将pthread_mutex_unlock(),写成pthread_mutex_lock()代码如下: int g_tickets ...

  4. Nginx代码调试——gdb工具

    参考网上的资料,写了一个configprint模块,其功能为打印输出location配置内容,并计数访问次数. 代码链接如下:https://github.com/PaulWeiHan/nginx_m ...

  5. Linux下C语言的调试 - gdb

    调试是每个程序员都会面临的问题. 如何提高程序员的调试效率, 更好更快地定位程序中的问题从而加快程序开发的进度, 是大家共同面对的问题. 可能Windows用户顺口就会说出:用VC呗 :-) , 它提 ...

  6. 一、C语言调试—— gdb 的使用

    1.1 gdb 调试工具常用命令 list:展开调试的源代码,缩写 l: break:设置断点,缩写为 b: info break:查看断点信息,缩写为 i b delete:删除断点 print:打 ...

  7. Qt Creator + MinGW 在windows 下的调试GDB停止工作解决

    Qt的安装配置请参考本博客本分类下的其他相关文章,本文主要整理在调试时候遇到的问题和解决方法供遇到同样问题的同学参考.由于我之前也没有任何Qt的开发基础,做的也是硬件方面设计,所以这方面基础还是比较薄 ...

  8. Linux程序调试GDB——数据查看

    查看栈信息 当程序被停住了,首先要确认的就是程序是在哪儿被断住的.这个一般是通过查看调用栈信息来看的.在gdb中,查看调用栈的命令是backtrace,可以简写为bt. (gdb) bt    #0 ...

  9. Android emulator中C代码的调试——gdb/gdbservers时遇到的坑

    版权声明:本文为博主原创文章,未经博主允许不得转载. 先写个helloworld吧,在Android源码树中创建文件夹external/helloworld,加入文件: // helloworld.c ...

  10. C 调试 gdb常用命令

    gdb常用命令: [root@redhat home]#gdb 调试文件:启动gdb (gdb) l :(字母l)从第一行开始列出源码 (gdb) break n :在第n行处设置断点 (gdb) b ...

随机推荐

  1. 【赵渝强老师】Docker Swarm实现服务的滚动更新

    一.什么是Docker Swarm? Docker Swarm是Docker官方提供的一款集群管理工具,其主要作用是把若干台Docker主机抽象为一个整体,并且通过一个入口统一管理这些Docker主机 ...

  2. 【赵渝强老师】使用MongoDB的Web控制台

    MongoDB可以通过web界面监控数据库,默认情况下该选项是关闭的,需要在启动的时候开启.启用web 控制台,需要在启动mongodb的时候,加上:--httpinterface 启动MongoDB ...

  3. 如何更改Wordpress语言为中文

    在使用WordPress的时候,一般安装默认语言是英文,可以在后台设置里面直接修改站点语言为简体中文,当后台没有语言选项框的这一栏,如下图所示,该怎么办呢? 这个时候我们可以找到文件wp-config ...

  4. 17 模块subprocess、re

    1. subprocess模块 1.1 概念 subprocess模块启动一个新进程,并连接到它们的输入/输出/错误管道,从而获取返回值 简单理解:可以远程连接电脑(socket模块) 1.2 Pop ...

  5. Android Qcom USB Driver学习(一)

    该系列文章总目录链接与各部分简介: Android Qcom USB Driver学习(零) USB接口类型 Android终端上常用的USB接口:TypeC(现在的主流),MicroB(以前的设备) ...

  6. iOS Masonry使用小结

    一.Masonry简介 Masonry是一个轻量级的布局框架,它拥有自己的描述语法(采用更优雅的链式语法封装)来自动布局,具有很好可读性且同时支持iOS和Max OS X等. 二.Masonry的基本 ...

  7. C# Webapi 简单的依赖注入-构造函数

    控制器部分: using Microsoft.AspNetCore.Mvc; using WebApplication1.IServices; using WebApplication1.Utilit ...

  8. base64 是什么,有什么作用?

    base64 是图片编码的一种形式,可以替代图片的url进行网络访问和请求等操作: 使用图片的url形式操作图片,每次都要请求一次网络,因为每次请求都是一个http:都是一个网络开销,都是对服务器的负 ...

  9. 0602-nn.Module

    0602-nn.Module 目录 一.nn.Module 1.1 构建一层网络--全连接层 1.2 构建多层网络--多层感知机 pytorch完整教程目录:https://www.cnblogs.c ...

  10. docker打包镜像,上传镜像仓库,使用rancher发布

    步骤一.首先将项目打包放在指定目录下 项目jar包名称为  micro-app.jar 步骤二.将jar包名称改为指定名称,执行命令    docker build -t micro-gateway: ...