Concepts:

Excerpted from: http://preshing.com/20120913/acquire-and-release-semantics/

Acquire semantics is a property which can only apply to operations which read from shared memory, whether they are read-modify-write operations or plain loads. The operation is then considered a read-acquire. Acquire semantics prevent memory reordering of the read-acquire with any read or write operation which follows it in program order.

Release semantics is a property which can only apply to operations which write to shared memory, whether they are read-modify-write operations or plain stores. The operation is then considered a write-release. Release semantics prevent memory reordering of the write-release with any read or write operation which precedes it in program order.
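The two definitions above can be sketched as a minimal message-passing pair in C++11. The names payload, ready, producer, consumer and run_message_passing are hypothetical and chosen only for illustration; this is a sketch of the idea, not code from the quoted article.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

int payload = 0;                  // plain, non-atomic shared data
std::atomic<bool> ready{false};   // guard variable

void producer() {
    payload = 42;                                  // ordinary write
    ready.store(true, std::memory_order_release);  // write-release: the write above cannot sink below it
}

int consumer() {
    // read-acquire: the read of payload below cannot be hoisted above this load
    while (!ready.load(std::memory_order_acquire)) {
        std::this_thread::yield();
    }
    return payload;
}

// Hypothetical driver: runs one producer and one consumer, returns what the consumer saw.
int run_message_passing() {
    payload = 0;
    ready.store(false, std::memory_order_relaxed);
    int seen = -1;
    std::thread c([&seen] { seen = consumer(); });
    std::thread p(producer);
    p.join();
    c.join();
    assert(seen == 42);  // the acquire load that read true synchronizes with the release store
    return seen;
}
```

Once the consumer's acquire load reads the value written by the release store, everything the producer wrote before that store is visible to the consumer.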

Acquire and Release Fences

First things first: Acquire and release fences are considered low-level lock-free operations. If you stick with higher-level, sequentially consistent atomic types, such as volatile variables in Java 5+, or default atomics in C++11, you don’t need acquire and release fences. The tradeoff is that sequentially consistent types are slightly less scalable or performant for some algorithms.

On the other hand, if you’ve developed for multicore devices in the days before C++11, you might feel an affinity for acquire and release fences. Perhaps, like me, you remember struggling with the placement of some lwsync intrinsics while synchronizing threads on Xbox 360. What’s cool is that once you understand acquire and release fences, you actually see what we were trying to accomplish using those platform-specific fences all along.

Acquire and release fences, as you might imagine, are standalone memory fences, which means that they aren’t coupled with any particular memory operation. So, how do they work?

An acquire fence prevents the memory reordering of any read which precedes it in program order with any read or write which follows it in program order.

A release fence prevents the memory reordering of any read or write which precedes it in program order with any write which follows it in program order.

In other words, in terms of the barrier types explained here, an acquire fence serves as both a #LoadLoad + #LoadStore barrier, while a release fence functions as both a #LoadStore + #StoreStore barrier. That’s all they purport to do.
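As a rough illustration of standalone fences, the sketch below moves the ordering out of the atomic operations (which are all relaxed) and into two fences. The names data, flag, writer, reader and run_fence_pair are assumptions made for this example.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

int data = 0;              // plain shared data
std::atomic<int> flag{0};  // only ever accessed with relaxed ordering here

void writer() {
    data = 7;
    // release fence: reads and writes above stay ahead of the store below
    std::atomic_thread_fence(std::memory_order_release);
    flag.store(1, std::memory_order_relaxed);
}

int reader() {
    while (flag.load(std::memory_order_relaxed) != 1) {
        std::this_thread::yield();
    }
    // acquire fence: the load above stays ahead of the reads and writes below
    std::atomic_thread_fence(std::memory_order_acquire);
    return data;
}

// Hypothetical driver: returns the value the reader observed.
int run_fence_pair() {
    data = 0;
    flag.store(0, std::memory_order_relaxed);
    int seen = -1;
    std::thread r([&seen] { seen = reader(); });
    std::thread w(writer);
    w.join();
    r.join();
    assert(seen == 7);
    return seen;
}
```

The fences are not tied to flag specifically; they order whatever memory operations surround them, which is what distinguishes them from a store-release or load-acquire on a single variable.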

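A LoadLoad barrier ensures that loads before it are not reordered with loads after it; a StoreStore barrier does the same for stores. On PowerPC both are provided by lwsync (lightweight sync).

StoreLoad is the most expensive barrier. It ensures that every store before the barrier is visible to other processors before any load after the barrier is performed. This is usually done by draining the store buffer and relying on cache coherence, not by flushing caches to main memory. On PowerPC it requires the full sync instruction.

Programming interface:

C++11 usage: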
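Acquire and release fences do not include a StoreLoad barrier. In C++11 the portable way to get one is a memory_order_seq_cst fence (which compilers typically map to sync on PowerPC). The sketch below is the classic store-then-load test; the names x, y, store_load_trial and check_store_load are hypothetical.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

std::atomic<int> x{0}, y{0};

// One trial: each thread stores to its own variable, then loads the other one.
// Returns how many of the two threads saw the other's store (1 or 2).
int store_load_trial() {
    x.store(0, std::memory_order_relaxed);
    y.store(0, std::memory_order_relaxed);
    int r1 = -1, r2 = -1;
    std::thread t1([&r1] {
        x.store(1, std::memory_order_relaxed);
        std::atomic_thread_fence(std::memory_order_seq_cst);  // StoreLoad: store above, load below
        r1 = y.load(std::memory_order_relaxed);
    });
    std::thread t2([&r2] {
        y.store(1, std::memory_order_relaxed);
        std::atomic_thread_fence(std::memory_order_seq_cst);
        r2 = x.load(std::memory_order_relaxed);
    });
    t1.join();
    t2.join();
    return r1 + r2;
}

// Repeats the trial; with the seq_cst fences in place, r1 == 0 && r2 == 0 must never occur.
void check_store_load(int trials) {
    for (int i = 0; i < trials; ++i) {
        assert(store_load_trial() >= 1);
    }
}
```

If the two fences were weakened to acquire or release, the standard would allow both threads to read 0, because nothing would stop each load from being performed before the preceding store became visible.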

#include <atomic>
std::atomic_thread_fence(std::memory_order_acquire);

std::atomic_thread_fence(std::memory_order_release);

C11 usage:

#include <stdatomic.h>
atomic_thread_fence(memory_order_acquire);

atomic_thread_fence(memory_order_release);

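Taking C11 as the example, here is what each value of the memory_order enum defined in <stdatomic.h> means:

enum memory_order {
    memory_order_relaxed,  /* guarantees only that the operation itself is atomic; imposes no ordering on other memory accesses */
    memory_order_consume,  /* data-dependency ordering; among common CPUs only DEC Alpha needs an explicit barrier for it */
    memory_order_acquire,  /* later reads and writes cannot be reordered before this load */
    memory_order_release,  /* earlier reads and writes cannot be reordered after this store */
    memory_order_acq_rel,  /* acquire and release combined, for read-modify-write operations */
    memory_order_seq_cst   /* acq_rel plus a single total order over all seq_cst operations */
};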

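Differences between the variants of the C11 compare-and-exchange operations:

weak and strong

Inside a retry loop, the weak version can perform better on some platforms. Outside a loop, use the strong version, because the weak version may fail spuriously: it can return false even when the compared values are equal.

With and without the _explicit suffix

The versions without the _explicit suffix always use the strongest memory order, memory_order_seq_cst.

The _explicit versions take two extra parameters, succ and fail. succ specifies the memory order applied when the comparison succeeds; fail specifies the memory order applied when the comparison fails. Because a failed comparison performs only a load, fail cannot be memory_order_release or memory_order_acq_rel.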
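A minimal sketch of the weak/strong distinction, using the C++11 member functions compare_exchange_weak and compare_exchange_strong (the C11 equivalents are atomic_compare_exchange_weak and atomic_compare_exchange_strong). The helper names fetch_double, try_claim and check_cas_helpers are hypothetical.

```cpp
#include <atomic>
#include <cassert>

// Retry loop: a spurious failure of the weak version only costs one more iteration.
// Returns the value that was replaced.
int fetch_double(std::atomic<int>& a) {
    int old = a.load();
    while (!a.compare_exchange_weak(old, old * 2)) {
        // on failure, old now holds the current value; just try again
    }
    return old;
}

// One-shot attempt: use strong so that a failure really means "owner was not 0".
bool try_claim(std::atomic<int>& owner, int my_id) {
    int expected = 0;  // 0 means unowned in this sketch
    return owner.compare_exchange_strong(expected, my_id);
}

// Single-threaded check of both helpers.
void check_cas_helpers() {
    std::atomic<int> v{21};
    assert(fetch_double(v) == 21 && v.load() == 42);
    std::atomic<int> owner{0};
    assert(try_claim(owner, 5));   // first claim wins
    assert(!try_claim(owner, 6));  // second claim fails, owner stays 5
}
```

With compare_exchange_weak in try_claim, a caller could be told the claim failed even though owner was still 0, which is why one-shot uses call for the strong version.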
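To show where succ and fail go, here is a sketch of a lock-free stack push using std::atomic_compare_exchange_weak_explicit, the C++11 counterpart of the C11 function of the same name. The Node type, head and push are assumptions made for this example, and only push is shown (a safe pop needs more care).

```cpp
#include <atomic>
#include <cassert>

struct Node {
    int value;
    Node* next;
};

std::atomic<Node*> head{nullptr};

void push(Node* n) {
    n->next = std::atomic_load_explicit(&head, std::memory_order_relaxed);
    while (!std::atomic_compare_exchange_weak_explicit(
               &head, &n->next, n,
               std::memory_order_release,     // succ: publish n->value and n->next together with the new head
               std::memory_order_relaxed)) {  // fail: nothing to publish, n->next was refreshed, retry
    }
}
```

A release order on success is enough here because push only publishes data; a relaxed order on failure is enough because the loop does nothing with the refreshed n->next except retry.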

The English explanations of each value are a bit of a mouthful:

Value	Explanation	Summary
memory_order_relaxed	Relaxed ordering: there are no constraints on reordering of memory accesses around the atomic variable.	Guarantees only the atomicity of the operation
memory_order_consume	Consume operation: no reads or writes in the current thread dependent on the value currently loaded can be reordered before this load. This ensures that writes to dependent variables in other threads that release the same atomic variable are visible in the current thread. On most platforms, this affects compiler optimization only.	In short, a data dependency barrier; weaker than acquire. Most CPUs preserve data-dependency order automatically (Alpha is the exception)
memory_order_acquire	Acquire operation: no reads or writes in the current thread can be reordered before this load. This ensures that all writes in other threads that release the same atomic variable are visible in the current thread.	Everything another thread wrote before its release of the same variable becomes visible
memory_order_release	Release operation: no reads or writes in the current thread can be reordered after this store. This ensures that all writes in the current thread are visible in other threads that acquire the same atomic variable.	All writes before this release are visible to another thread once it acquires the same variable; after a consume, only the writes that the loaded value carries a dependency to are visible
memory_order_acq_rel	Acquire-release operation: no reads or writes in the current thread can be reordered before this load or after this store. The operation is a read-modify-write operation. It is ensured that all writes in other threads that release the same atomic variable are visible before the modification, and the modification is visible in other threads that acquire the same atomic variable.	Acquire and release combined: the read part acts as an acquire, the write part as a release
memory_order_seq_cst	Sequential ordering. The operation has the same semantics as acquire-release operation, and additionally has sequentially-consistent operation ordering.	Goes one step further than acquire-release: all seq_cst operations also appear in a single total order that every thread agrees on. Often implemented with a full memory fence, so heavy use can become a performance bottleneck

Key point: when memory_order_consume (a data dependency barrier) is needed
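For the weakest entry in the table, memory_order_relaxed, a typical use is a counter where only atomicity matters. The sketch below uses the hypothetical function name count_with_relaxed.

```cpp
#include <atomic>
#include <cassert>
#include <thread>
#include <vector>

// Each worker bumps the shared counter with relaxed ordering.
// No increment is lost, but the increments order nothing else in memory.
long count_with_relaxed(int num_threads, int increments_per_thread) {
    std::atomic<long> hits{0};
    std::vector<std::thread> workers;
    for (int t = 0; t < num_threads; ++t) {
        workers.emplace_back([&hits, increments_per_thread] {
            for (int i = 0; i < increments_per_thread; ++i) {
                hits.fetch_add(1, std::memory_order_relaxed);
            }
        });
    }
    for (auto& w : workers) {
        w.join();  // join() provides the synchronization needed for the final read
    }
    return hits.load(std::memory_order_relaxed);
}
```

The final value is exact because fetch_add is an atomic read-modify-write regardless of the memory order; what relaxed gives up is any guarantee about other variables.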

1）

A=

<data dependency barrier>

B=*A

2）

A=

<data dependency barrier>

C=B[A]
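Case 1 above (B = *A) corresponds to publishing a pointer and then dereferencing it. A minimal sketch, assuming a hypothetical Config type and helper names; note that current mainstream compilers simply treat memory_order_consume as memory_order_acquire, and recent C++ standards discourage its use.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

struct Config {
    int limit;
};

std::atomic<Config*> current{nullptr};

void publish(Config* c) {
    c->limit = 100;                                // initialize the object first
    current.store(c, std::memory_order_release);   // then publish the pointer
}

int read_limit() {
    Config* p;
    // A = load: the pointer value itself carries the dependency
    while ((p = current.load(std::memory_order_consume)) == nullptr) {
        std::this_thread::yield();
    }
    return p->limit;  // B = *A: depends on the loaded pointer, so it sees the initialized field
}

// Hypothetical driver: returns the limit observed by the reading thread.
int run_publish_consume() {
    static Config cfg{0};
    current.store(nullptr, std::memory_order_relaxed);
    int seen = -1;
    std::thread r([&seen] { seen = read_limit(); });
    std::thread w([] { publish(&cfg); });
    w.join();
    r.join();
    assert(seen == 100);
    return seen;
}
```

Only accesses that depend on p are ordered by the consume load; an unrelated global written by the publisher before the store would need acquire instead.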

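Question: now that ready-made atomic variables exist, is atomic_thread_fence still useful?

Yes. In the example below, the loads use only relaxed ordering at first, which guarantees atomicity and nothing more. Only when a loaded value satisfies the condition is an acquire fence issued, which ensures that do_work() happens after the read of mailbox[i].

Example from http://en.cppreference.com/w/cpp/atomic/atomic_thread_fence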

#include <atomic>

const int num_mailboxes = 32;  // value used in the cppreference example
std::atomic<int> mailbox[num_mailboxes];

// The writer threads update non-atomic shared data and then update mailbox[i] as follows;
// receiver_id is the id of the reader the message is meant for
std::atomic_store_explicit(&mailbox[i], receiver_id, std::memory_order_release);

// Reader thread needs to check all mailbox[i], but only needs to sync with one
for (int i = 0; i < num_mailboxes; ++i) {
    if (std::atomic_load_explicit(&mailbox[i], std::memory_order_relaxed) == my_id) {
        std::atomic_thread_fence(std::memory_order_acquire); // synchronize with just one writer
        do_work(i); // guaranteed to observe everything done in the writer thread before
                    // the atomic_store_explicit()
    }
}
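The fragment above is not a complete program. A self-contained variant might look like the sketch below, where the mailbox size, the mailbox_data array and the functions post, receive and run_mailbox are all hypothetical additions.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

constexpr int kNumMailboxes = 4;          // hypothetical size for this sketch
std::atomic<int> mailbox[kNumMailboxes];  // static storage, so every slot starts at 0 (empty)
int mailbox_data[kNumMailboxes];          // non-atomic payload guarded by mailbox[i]

// Writer: fill the payload, then address the slot to a receiver with a release store.
void post(int slot, int receiver_id, int value) {
    mailbox_data[slot] = value;
    std::atomic_store_explicit(&mailbox[slot], receiver_id, std::memory_order_release);
}

// Reader: poll every slot with cheap relaxed loads; pay for a fence only on a match.
int receive(int my_id) {
    for (;;) {
        for (int i = 0; i < kNumMailboxes; ++i) {
            if (std::atomic_load_explicit(&mailbox[i], std::memory_order_relaxed) == my_id) {
                std::atomic_thread_fence(std::memory_order_acquire);  // sync with that one writer
                return mailbox_data[i];
            }
        }
        std::this_thread::yield();
    }
}

// Hypothetical driver: one reader with id 7, one writer posting 99 to slot 2.
int run_mailbox() {
    int got = -1;
    std::thread reader([&got] { got = receive(7); });
    std::thread writer([] { post(2, 7, 99); });
    writer.join();
    reader.join();
    assert(got == 99);
    return got;
}
```

Using acquire loads for every slot would also be correct, but on weakly ordered hardware it would pay the ordering cost on every poll instead of once per delivered message.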


