Linux内核同步原语之原子操作
避免对同一数据的并发访问(通常由中断、对称多处理器、内核抢占等引起)称为同步。
——题记
内核源码:Linux-2.6.38.8.tar.bz2
目标平台:ARM体系结构
原子操作确保对同一数据的“读取-修改-写入”操作在它的执行期间不会被打断,要么全部执行完成,要么根本不会执行。例如在ARM上对全局变量的++运算至少要经历以下三步:
- ldr r3, [r3, #0]
- add r2, r3, #1
- str r2, [r3, #0]

ldr r3, [r3, #0]
add r2, r3, #1
str r2, [r3, #0]
这就给并发访问制造了可能,所以在编写内核代码时需要给有可能被并发访问的变量加上原子操作。
Linux内核提供了两组原子操作接口,一组用于整数,一组用于位操作。
1、原子整数操作
针对整数的原子操作只能对atomic_t类型的数据进行处理。atomic_t定义如下:
- /* linux-2.6.38.8/include/linux/types.h */
- typedef struct {
- int counter;
- } atomic_t;

/* linux-2.6.38.8/include/linux/types.h */
typedef struct {
int counter;
} atomic_t;
下面将详细分析所有原子整数操作函数在ARMv6之前各种CPU上的实现(定义在linux-2.6.38.8/arch/arm/include/asm/atomic.h文件中)。
- #define ATOMIC_INIT(i) { (i) }

#define ATOMIC_INIT(i) { (i) }
ATOMIC_INIT用于把atomic_t变量初始化为i,如下例把a初始化为2:
- atomic_t a = ATOMIC_INIT(2);

atomic_t a = ATOMIC_INIT(2);
- #define atomic_read(v) (*(volatile int *)&(v)->counter)
- #define atomic_set(v,i) (((v)->counter) = (i))

#define atomic_read(v) (*(volatile int *)&(v)->counter)
#define atomic_set(v,i) (((v)->counter) = (i))
atomic_read(v)用于读取atomic_t变量*v的值,而atomic_set(v,i)用于把atomic_t变量*v设置为i。
- static inline int atomic_add_return(int i, atomic_t *v)
- {
- unsigned long flags;
- int val;
- raw_local_irq_save(flags);
- val = v->counter;
- v->counter = val += i;
- raw_local_irq_restore(flags);
- return val;
- }
- #define atomic_add(i, v) (void) atomic_add_return(i, v)

static inline int atomic_add_return(int i, atomic_t *v)
{
unsigned long flags;
int val; raw_local_irq_save(flags);
val = v->counter;
v->counter = val += i;
raw_local_irq_restore(flags); return val;
}
#define atomic_add(i, v) (void) atomic_add_return(i, v)
atomic_add(i, v)用于把atomic_t变量*v加i。这里的原子性实现只是简单地通过raw_local_irq_save函数来禁止中断。
- /* linux-2.6.38.8/include/linux/irqflags.h */
- #define raw_local_irq_save(flags) \
- do { \
- typecheck(unsigned long, flags); \
- flags = arch_local_irq_save(); \
- } while (0)
- /* linux-2.6.38.8/arch/arm/include/asm/irqflags.h */
- static inline unsigned long arch_local_irq_save(void)
- {
- unsigned long flags, temp;
- asm volatile(
- " mrs %0, cpsr @ arch_local_irq_save\n"
- " orr %1, %0, #128\n"
- " msr cpsr_c, %1"
- : "=r" (flags), "=r" (temp)
- :
- : "memory", "cc");
- return flags;
- }

/* linux-2.6.38.8/include/linux/irqflags.h */
#define raw_local_irq_save(flags) \
do { \
typecheck(unsigned long, flags); \
flags = arch_local_irq_save(); \
} while (0)
/* linux-2.6.38.8/arch/arm/include/asm/irqflags.h */
static inline unsigned long arch_local_irq_save(void)
{
unsigned long flags, temp; asm volatile(
" mrs %0, cpsr @ arch_local_irq_save\n"
" orr %1, %0, #128\n"
" msr cpsr_c, %1"
: "=r" (flags), "=r" (temp)
:
: "memory", "cc");
return flags;
}
typecheck函数用来确保参数flags的数据类型为unsigned long,arch_local_irq_save函数通过内嵌汇编设置当前程序状态寄存器的I位为1,从而禁止IRQ。待加操作完成后,再通过raw_local_irq_restore函数使能中断。
- static inline int atomic_sub_return(int i, atomic_t *v)
- {
- unsigned long flags;
- int val;
- raw_local_irq_save(flags);
- val = v->counter;
- v->counter = val -= i;
- raw_local_irq_restore(flags);
- return val;
- }
- #define atomic_sub(i, v) (void) atomic_sub_return(i, v)

static inline int atomic_sub_return(int i, atomic_t *v)
{
unsigned long flags;
int val; raw_local_irq_save(flags);
val = v->counter;
v->counter = val -= i;
raw_local_irq_restore(flags); return val;
}
#define atomic_sub(i, v) (void) atomic_sub_return(i, v)
atomic_sub(i, v)用于把atomic_t变量*v减i。
- #define atomic_inc(v) atomic_add(1, v)
- #define atomic_dec(v) atomic_sub(1, v)
- #define atomic_inc_and_test(v) (atomic_add_return(1, v) == 0)
- #define atomic_dec_and_test(v) (atomic_sub_return(1, v) == 0)
- #define atomic_inc_return(v) (atomic_add_return(1, v))
- #define atomic_dec_return(v) (atomic_sub_return(1, v))
- #define atomic_sub_and_test(i, v) (atomic_sub_return(i, v) == 0)
- #define atomic_add_negative(i,v) (atomic_add_return(i, v) < 0)

#define atomic_inc(v) atomic_add(1, v)
#define atomic_dec(v) atomic_sub(1, v) #define atomic_inc_and_test(v) (atomic_add_return(1, v) == 0)
#define atomic_dec_and_test(v) (atomic_sub_return(1, v) == 0)
#define atomic_inc_return(v) (atomic_add_return(1, v))
#define atomic_dec_return(v) (atomic_sub_return(1, v))
#define atomic_sub_and_test(i, v) (atomic_sub_return(i, v) == 0) #define atomic_add_negative(i,v) (atomic_add_return(i, v) < 0)
这些函数只是上面所讲函数的封装。
- static inline int atomic_cmpxchg(atomic_t *v, int old, int new)
- {
- int ret;
- unsigned long flags;
- raw_local_irq_save(flags);
- ret = v->counter;
- if (likely(ret == old))
- v->counter = new;
- raw_local_irq_restore(flags);
- return ret;
- }
- static inline int atomic_add_unless(atomic_t *v, int a, int u)
- {
- int c, old;
- c = atomic_read(v);
- while (c != u && (old = atomic_cmpxchg((v), c, c + a)) != c)
- c = old;
- return c != u;
- }
- #define atomic_inc_not_zero(v) atomic_add_unless((v), 1, 0)

static inline int atomic_cmpxchg(atomic_t *v, int old, int new)
{
int ret;
unsigned long flags; raw_local_irq_save(flags);
ret = v->counter;
if (likely(ret == old))
v->counter = new;
raw_local_irq_restore(flags); return ret;
}
static inline int atomic_add_unless(atomic_t *v, int a, int u)
{
int c, old; c = atomic_read(v);
while (c != u && (old = atomic_cmpxchg((v), c, c + a)) != c)
c = old;
return c != u;
}
#define atomic_inc_not_zero(v) atomic_add_unless((v), 1, 0)
atomic_inc_not_zero(v)用于将atomic_t变量*v加1,并测试加1后的*v是否不为零,如果不为零则返回真。
注意,这里所讲函数的参数v都是atomic_t类型数据的地址。
2、原子位操作
在CONFIG_SMP和__ARMEB__都未定义的情况下,原子位操作相关函数的定义如下:
- /* linux-2.6.38.8/arch/arm/include/asm/bitops.h */
- #define ATOMIC_BITOP_LE(name,nr,p) \
- (__builtin_constant_p(nr) ? \
- ____atomic_##name(nr, p) : \
- _##name##_le(nr,p))
- #define set_bit(nr,p) ATOMIC_BITOP_LE(set_bit,nr,p)
- #define clear_bit(nr,p) ATOMIC_BITOP_LE(clear_bit,nr,p)
- #define change_bit(nr,p) ATOMIC_BITOP_LE(change_bit,nr,p)
- #define test_and_set_bit(nr,p) ATOMIC_BITOP_LE(test_and_set_bit,nr,p)
- #define test_and_clear_bit(nr,p) ATOMIC_BITOP_LE(test_and_clear_bit,nr,p)
- #define test_and_change_bit(nr,p) ATOMIC_BITOP_LE(test_and_change_bit,nr,p)

/* linux-2.6.38.8/arch/arm/include/asm/bitops.h */
#define ATOMIC_BITOP_LE(name,nr,p) \
(__builtin_constant_p(nr) ? \
____atomic_##name(nr, p) : \
_##name##_le(nr,p)) #define set_bit(nr,p) ATOMIC_BITOP_LE(set_bit,nr,p)
#define clear_bit(nr,p) ATOMIC_BITOP_LE(clear_bit,nr,p)
#define change_bit(nr,p) ATOMIC_BITOP_LE(change_bit,nr,p)
#define test_and_set_bit(nr,p) ATOMIC_BITOP_LE(test_and_set_bit,nr,p)
#define test_and_clear_bit(nr,p) ATOMIC_BITOP_LE(test_and_clear_bit,nr,p)
#define test_and_change_bit(nr,p) ATOMIC_BITOP_LE(test_and_change_bit,nr,p)
相对应的非原子位操作函数定义在linux-2.6.38.8/include/asm-generic/bitops/non-atomic.h文件中,它们的名字前都多带有两个下划线,如set_bit的非原子操作函数为__set_bit。
原子位操作函数根据参数nr是否为常量分为两种实现方式:
(1)、nr为常量时
- /* linux-2.6.38.8/arch/arm/include/asm/bitops.h */
- static inline void ____atomic_set_bit(unsigned int bit, volatile unsigned long *p)
- {
- unsigned long flags;
- unsigned long mask = 1UL << (bit & 31);
- p += bit >> 5;
- raw_local_irq_save(flags);
- *p |= mask;
- raw_local_irq_restore(flags);
- }
- static inline void ____atomic_clear_bit(unsigned int bit, volatile unsigned long *p)
- {
- unsigned long flags;
- unsigned long mask = 1UL << (bit & 31);
- p += bit >> 5;
- raw_local_irq_save(flags);
- *p &= ~mask;
- raw_local_irq_restore(flags);
- }
- static inline void ____atomic_change_bit(unsigned int bit, volatile unsigned long *p)
- {
- unsigned long flags;
- unsigned long mask = 1UL << (bit & 31);
- p += bit >> 5;
- raw_local_irq_save(flags);
- *p ^= mask;
- raw_local_irq_restore(flags);
- }

/* linux-2.6.38.8/arch/arm/include/asm/bitops.h */
static inline void ____atomic_set_bit(unsigned int bit, volatile unsigned long *p)
{
unsigned long flags;
unsigned long mask = 1UL << (bit & 31); p += bit >> 5; raw_local_irq_save(flags);
*p |= mask;
raw_local_irq_restore(flags);
}
static inline void ____atomic_clear_bit(unsigned int bit, volatile unsigned long *p)
{
unsigned long flags;
unsigned long mask = 1UL << (bit & 31); p += bit >> 5; raw_local_irq_save(flags);
*p &= ~mask;
raw_local_irq_restore(flags);
}
static inline void ____atomic_change_bit(unsigned int bit, volatile unsigned long *p)
{
unsigned long flags;
unsigned long mask = 1UL << (bit & 31); p += bit >> 5; raw_local_irq_save(flags);
*p ^= mask;
raw_local_irq_restore(flags);
}
在32位机上,参数bit的理想取值为0到31,例如取值为1,就表示对*p的bit[1]进行操作。
当bit取值大于31时,函数操作的就不是你想要操作的那个变量*p了(通过p += bit>> 5;语句实现),所以在实际的应用中应该要确保bit的取值在合理的范围内。
*p |= mask;语句实现bit位的置位。
*p &= ~mask;语句实现bit位的清零。
*p ^= mask;语句实现bit位的翻转,即*p的bit位为0时被置位,为1时则被清零。
- static inline int
- ____atomic_test_and_set_bit(unsigned int bit, volatile unsigned long *p)
- {
- unsigned long flags;
- unsigned int res;
- unsigned long mask = 1UL << (bit & 31);
- p += bit >> 5;
- raw_local_irq_save(flags);
- res = *p;
- *p = res | mask;
- raw_local_irq_restore(flags);
- return (res & mask) != 0;
- }
- static inline int
- ____atomic_test_and_clear_bit(unsigned int bit, volatile unsigned long *p)
- {
- unsigned long flags;
- unsigned int res;
- unsigned long mask = 1UL << (bit & 31);
- p += bit >> 5;
- raw_local_irq_save(flags);
- res = *p;
- *p = res & ~mask;
- raw_local_irq_restore(flags);
- return (res & mask) != 0;
- }
- static inline int
- ____atomic_test_and_change_bit(unsigned int bit, volatile unsigned long *p)
- {
- unsigned long flags;
- unsigned int res;
- unsigned long mask = 1UL << (bit & 31);
- p += bit >> 5;
- raw_local_irq_save(flags);
- res = *p;
- *p = res ^ mask;
- raw_local_irq_restore(flags);
- return (res & mask) != 0;
- }

static inline int
____atomic_test_and_set_bit(unsigned int bit, volatile unsigned long *p)
{
unsigned long flags;
unsigned int res;
unsigned long mask = 1UL << (bit & 31); p += bit >> 5; raw_local_irq_save(flags);
res = *p;
*p = res | mask;
raw_local_irq_restore(flags); return (res & mask) != 0;
}
static inline int
____atomic_test_and_clear_bit(unsigned int bit, volatile unsigned long *p)
{
unsigned long flags;
unsigned int res;
unsigned long mask = 1UL << (bit & 31); p += bit >> 5; raw_local_irq_save(flags);
res = *p;
*p = res & ~mask;
raw_local_irq_restore(flags); return (res & mask) != 0;
}
static inline int
____atomic_test_and_change_bit(unsigned int bit, volatile unsigned long *p)
{
unsigned long flags;
unsigned int res;
unsigned long mask = 1UL << (bit & 31); p += bit >> 5; raw_local_irq_save(flags);
res = *p;
*p = res ^ mask;
raw_local_irq_restore(flags); return (res & mask) != 0;
}
这三个函数增加了return (res & mask) != 0;语句,用来判断*p的bit位原值是否为1,如果为1则函数返回1,否则返回0。
(2)、nr不为常量时
当nr不为常量时,原子位操作函数的定义如下:
- /* linux-2.6.38.8/arch/arm/include/asm/bitops.h */
- extern void _set_bit_le(int nr, volatile unsigned long * p);
- extern void _clear_bit_le(int nr, volatile unsigned long * p);
- extern void _change_bit_le(int nr, volatile unsigned long * p);
- extern int _test_and_set_bit_le(int nr, volatile unsigned long * p);
- extern int _test_and_clear_bit_le(int nr, volatile unsigned long * p);
- extern int _test_and_change_bit_le(int nr, volatile unsigned long * p);

/* linux-2.6.38.8/arch/arm/include/asm/bitops.h */
extern void _set_bit_le(int nr, volatile unsigned long * p);
extern void _clear_bit_le(int nr, volatile unsigned long * p);
extern void _change_bit_le(int nr, volatile unsigned long * p);
extern int _test_and_set_bit_le(int nr, volatile unsigned long * p);
extern int _test_and_clear_bit_le(int nr, volatile unsigned long * p);
extern int _test_and_change_bit_le(int nr, volatile unsigned long * p);
它们都是通过汇编语言来实现的,定义如下:
- /* linux-2.6.38.8/arch/arm/lib/setbit.S */
- ENTRY(_set_bit_le)
- bitop orr
- ENDPROC(_set_bit_le)
- /* linux-2.6.38.8/arch/arm/lib/clearbit.S */
- ENTRY(_clear_bit_le)
- bitop bic
- ENDPROC(_clear_bit_le)
- /* linux-2.6.38.8/arch/arm/lib/changebit.S */
- ENTRY(_change_bit_le)
- bitop eor
- ENDPROC(_change_bit_le)
- /* linux-2.6.38.8/arch/arm/lib/testsetbit.S */
- ENTRY(_test_and_set_bit_le)
- testop orreq, streqb
- ENDPROC(_test_and_set_bit_le)
- /* linux-2.6.38.8/arch/arm/lib/testclearbit.S */
- ENTRY(_test_and_clear_bit_le)
- testop bicne, strneb
- ENDPROC(_test_and_clear_bit_le)
- /* linux-2.6.38.8/arch/arm/lib/testchangebit.S */
- ENTRY(_test_and_change_bit_le)
- testop eor, strb
- ENDPROC(_test_and_change_bit_le)

/* linux-2.6.38.8/arch/arm/lib/setbit.S */
ENTRY(_set_bit_le)
bitop orr
ENDPROC(_set_bit_le) /* linux-2.6.38.8/arch/arm/lib/clearbit.S */
ENTRY(_clear_bit_le)
bitop bic
ENDPROC(_clear_bit_le) /* linux-2.6.38.8/arch/arm/lib/changebit.S */
ENTRY(_change_bit_le)
bitop eor
ENDPROC(_change_bit_le) /* linux-2.6.38.8/arch/arm/lib/testsetbit.S */
ENTRY(_test_and_set_bit_le)
testop orreq, streqb
ENDPROC(_test_and_set_bit_le) /* linux-2.6.38.8/arch/arm/lib/testclearbit.S */
ENTRY(_test_and_clear_bit_le)
testop bicne, strneb
ENDPROC(_test_and_clear_bit_le) /* linux-2.6.38.8/arch/arm/lib/testchangebit.S */
ENTRY(_test_and_change_bit_le)
testop eor, strb
ENDPROC(_test_and_change_bit_le)
使用ENTRY和ENDPROC两个宏来定义一个名为name的函数,如下所示:
- /* linux-2.6.38.8/include/linux/linkage.h */
- #define ENTRY(name) \
- .globl name; \
- ALIGN; \
- name:
- #define ALIGN __ALIGN
- #define END(name) \
- .size name, .-name
- /* linux-2.6.38.8/arch/arm/include/asm/linkage.h */
- #define __ALIGN .align 0
- #define ENDPROC(name) \
- .type name, %function; \
- END(name)

/* linux-2.6.38.8/include/linux/linkage.h */
#define ENTRY(name) \
.globl name; \
ALIGN; \
name: #define ALIGN __ALIGN #define END(name) \
.size name, .-name /* linux-2.6.38.8/arch/arm/include/asm/linkage.h */
#define __ALIGN .align 0 #define ENDPROC(name) \
.type name, %function; \
END(name)
而汇编代码实现的宏bitop和testop被相应函数所调用,并传递给它们相应的参数,代码如下所示:
- /* linux-2.6.38.8/arch/arm/lib/bitops.h */
- .macro bitop, instr
- and r2, r0, #7
- mov r3, #1
- mov r3, r3, lsl r2
- save_and_disable_irqs ip
- ldrb r2, [r1, r0, lsr #3]
- \instr r2, r2, r3
- strb r2, [r1, r0, lsr #3]
- restore_irqs ip
- mov pc, lr
- .endm
- .macro testop, instr, store
- add r1, r1, r0, lsr #3
- and r3, r0, #7
- mov r0, #1
- save_and_disable_irqs ip
- ldrb r2, [r1]
- tst r2, r0, lsl r3
- \instr r2, r2, r0, lsl r3
- \store r2, [r1]
- moveq r0, #0
- restore_irqs ip
- mov pc, lr
- .endm
Linux内核同步原语之原子操作的更多相关文章
- Linux内核同步原语之原子操作【转】
转自:http://blog.csdn.net/npy_lp/article/details/7262388 避免对同一数据的并发访问(通常由中断.对称多处理器.内核抢占等引起)称为同步. ——题记 ...
- Linux内核同步机制之原子操作
1.前言 原子操作指的是该操作不会在执行完毕之前被任何其它任务或事件打断,它是最小的执行单位,不会有比它更小的执行单位,原子实际上使用了物理学中物质微粒的概念,在Linux内核中,原子操作需要硬件的支 ...
- Linux内核同步机制--转发自蜗窝科技
Linux内核同步机制之(一):原子操作 http://www.wowotech.net/linux_kenrel/atomic.html 一.源由 我们的程序逻辑经常遇到这样的操作序列: 1.读一个 ...
- Linux内核同步
Linux内核剖析 之 内核同步 主要内容 1.内核请求何时以交错(interleave)的方式执行以及交错程度如何. 2.内核所实现的基本同步机制. 3.通常情况下如何使用内核提供的同步机制. 内核 ...
- Linux内核同步 - spin_lock
一.前言 在linux kernel的实现中,经常会遇到这样的场景:共享数据被中断上下文和进程上下文访问,该如何保护呢?如果只有进程上下文的访问,那么可以考虑使用semaphore或者mutex的锁机 ...
- Linux内核同步机制之(四):spin lock【转】
转自:http://www.wowotech.net/kernel_synchronization/spinlock.html 一.前言 在linux kernel的实现中,经常会遇到这样的场景:共享 ...
- [内核同步]浅析Linux内核同步机制
转自:http://blog.csdn.net/fzubbsc/article/details/37736683?utm_source=tuicool&utm_medium=referral ...
- Linux内核同步机制
http://blog.csdn.net/bullbat/article/details/7376424 Linux内核同步控制方法有很多,信号量.锁.原子量.RCU等等,不同的实现方法应用于不同的环 ...
- 浅析Linux内核同步机制
非常早之前就接触过同步这个概念了,可是一直都非常模糊.没有深入地学习了解过,最近有时间了,就花时间研习了一下<linux内核标准教程>和<深入linux设备驱动程序内核机制>这 ...
随机推荐
- Jmeter如何连接数据库Mysql
1. 下载jdbc 的驱动 mysql-connector-java-5.1.41-bin.jar,并将该驱动放到路径: D:\apache-jmeter-3.0\lib\ 2. 打开Jmeter 工 ...
- Python网络编程socket
网络编程之socket 看到本篇文章的题目是不是很疑惑,what is this?,不要着急,但是记住一说网络编程,你就想socket,socket是实现网络编程的工具,那么什么是socket,什么是 ...
- 洛谷 P3391 【模板】文艺平衡树(Splay)
题目背景 这是一道经典的Splay模板题——文艺平衡树. 题目描述 您需要写一种数据结构(可参考题目标题),来维护一个有序数列,其中需要提供以下操作:翻转一个区间,例如原有序序列是5 4 3 2 1, ...
- 用Racket语言写了一个万花筒的程序
用Racket语言写了一个万花筒的程序 来源:https://blog.csdn.net/chinazhangyong/article/details/79362394 https://github. ...
- 【BZOJ3105】新Nim游戏(线性基)
[BZOJ3105]新Nim游戏(线性基) 题面 BZOJ Description 传统的Nim游戏是这样的:有一些火柴堆,每堆都有若干根火柴(不同堆的火柴数量可以不同).两个游戏者轮流操作,每次可以 ...
- 测试开发面试的Linux面试题总结之一:vim使用方法
现在做测试没有说不用到linux,大部分公司都会涉及到,作为测试经常使用linux最常见手段就是查看日志,帮助开发定位问题,这是目前最常见的测试当中使用linux方法,今天就讲一讲vim文本编辑器的使 ...
- redis的自带VM(虚拟内存)
Redis支持采用VM技术,以达到当数据超过设置的可使用的物理内存的时候能够正常运行.当数据超过物理内存的时候,把一部分数据写入磁盘中的一块空间来代替物理内存. vm-enabled no ...
- DotNet,PHP,Java的数据库连接代码大全(带演示代码)
C#数据库连接字符串 Web.config文件 <connectionStrings> <!--SQLServer数据库连接--> <add name="con ...
- Codeforces 582C. Superior Periodic Subarrays(数学+计数)
首先可以把 i mod n=j mod n的看成是同一类,i mod s=j mod s的也看成是同一类,也就是i mod gcd(s,n)的是同一类,很好理解,但是不会数学证明...大概可以想成数轴 ...
- mysql数据库----下载安装、操作
一.mysql概述 1.什么是数据库 ? 答:数据的仓库,如:在ATM的示例中我们创建了一个 db 目录,称其为数据库 2.什么是 MySQL.Oracle.SQLite.Access.MS SQL ...