erlang 分布式数据库Mnesia 实现及应用

- linear hash

ETS/DETS/mnesia 都使用了linear hash算法

http://en.wikipedia.org/wiki/Linear_hashing

redis dict 的实现类似于linear hash，渐进式rehash，保证操作是O(1)。不过除了每次操作时执行一个bucket的rehash，而且每100ms内使用1ms 执行加快rehash进程。

虽然虽然rehash过程渐进式的，但在key space过大时，同时使用LRU过期，buckets 这个大数组的malloc 就能让refis卡上一阵子。

曾遇到的一个案例：现网redis使用主备自动切换模式，有段时间老无故自动切换。排查发现是key space 1000kw+，切换时大量evict，bluckets 需要malloc一个*2的，也就是10M* 24 * 2 = 480M内存，内存一直处于满地状态，靠着LRU替换，此时需要清理出这么大一块，导致redis 实例数秒停止响应导致切换。从这个案例和内存利用率来看，redis 使用时尽量保证keyspace 别太大吧。

- ETS

Erlang内置数据库挑战7000WQPS

http://blog.yufeng.info/archives/876

ETS 实现很简单，就一个内存字典。使用读写锁，只读情况下达到很高的TPS，曾在我老T420笔记本测试过字典在单核心情况下读写400w/s。从这个测试数据看ETS 的读操作其实和全局内存字典读取速度差不多，效率很高。写性能因为全局锁的关系，不可避免受限且并发越高性能越差。建议对写入频繁ETS做分表操作。

- DETS

ETS的落地存储方式，有单表2G大小限制，可以有cache 但默认cache 0 也就是默认读写都操作磁盘。

前面说到DETS 是基于linear hash 存储，hash 方式不是很磁盘友好、不是文件块 cache友好；cache 只是作为行级索引，没有块级索引。

总的说DETS 和真正完整的存储引擎还有一定差距，单独使用价值不大，所以基本都是用于基于它的Mnesia集群版本来使。

Since all operations performed by Dets are disk operations, it is important to realize that a single look-up operation involves a series of disk seek and read operations. For this reason, the Dets functions are much slower than the corresponding Ets functions, although Dets exports a similar interface.

Dets organizes data as a linear hash list and the hash list grows gracefully as more data is inserted into the table. Space management on the file is performed by what is called a buddy system. The current implementation keeps the entire buddy system in RAM, which implies that if the table gets heavily fragmented, quite some memory can be used up. The only way to defragment a table is to close it and then open it again with the repair option set to force.

- Mnesia

基于ETS/DETS, 的纯erlang 实现的强大分布式数据库，而disc Mnesia 表大小受dets 限制，但可以使用fragmentation，frag 类似于分区表。

使用LevelDB 替换DETS（1/4启动时间，1/2冲突，1/3 内存占用）

Mnesia Backend Plugin Framework and a LevelDB-based Plugin: Roland Karlsson, Malcolm Matalka

https://www.youtube.com/watch?v=xBcgrIOzGbQ

whatsapp：

disc_copies tables

Partitioned islands and fragmented tables

All operations run async_dirty

Use key hashing to collapse all ops per key

to a single process

http://stackoverflow.com/questions/23180484/why-big-companies-use-mnesia-instead-of-using-riak-or-couchdb

First of all, mnesia has no 2 gigabyte limit. It is limited on a 32bit architecture, but hardly any are present anymore for real work. And on 64bit, you are not limited to 2 gigabyte. I have seen databases on the order of several hundred gigabytes. The only problem is the initial start-up time for those.

Mnesia is built to handle:

Very low latency K/V lookup, not necessarily linearizible.
Proper transactions with linearizible changes (C in the CAP theorem). These are allowed to run at a much worse latency as they are expected to be relatively rare.
On-line schema change
Survival even if nodes fail in a cluster (where cluster is smallish, say 10-50 machines at most)

The design is such that you avoid a separate process since data is in the Erlang system already. You have QLC for datalog-like queries. And you have the ability to store any Erlang term.

Mnesia fares well if the above is what you need. Its limits are:

You can't get a machine with more than 2 terabytes of memory. And loading 2 teras from scratch is going to be slow.
Since it is a CP system and not an AP system, the loss of nodes requires manual intervention. You may not need transactions as well. You might also want to be able to seamlessly add more nodes to the system and so on. For this, Riak is a better choice.
It uses optimistic locking which gives trouble if many processes tries to access the same row in a transaction.

erlang 分布式数据库Mnesia 实现及应用的更多相关文章

开源分布式数据库中间件MyCat源码分析系列
MyCat是当下很火的开源分布式数据库中间件,特意花费了一些精力研究其实现方式与内部机制,在此针对某些较为重要的源码进行粗浅的分析,希望与感兴趣的朋友交流探讨. 本源码分析系列主要针对代码实现,配置. ...
分布式数据库的四分结构设计 BCDE
首先,对关系型数据库的表进行四种分类定义: Basis 根基,Content 内容, Description 说明, Extension 扩展. Basis:Baisis 表是唯一的,为了实现标准而得 ...
分布式数据库中的Paxos 算法
分布式数据库中的Paxos 算法 http://baike.baidu.com/link?url=ChmfvtXRZQl7X1VmRU6ypsmZ4b4MbQX1pelw_VenRLnFpq7rMvY ...
Distributed4：SQL Server 分布式数据库性能测试
我使用三台SQL Server 2012 搭建分布式数据库,将一年的1.4亿条数据大致均匀存储在这三台Server中,每台Server 存储4个月的数据,Physical Server的配置基本相同, ...
Distributed3：SQL Server 创建分布式数据库
分布式数据库的优势是将IO分散在不同的Physical Disk上,每次查询都由多台Server的CPU,I/O共同负载,通过各节点并行处理数据来提高性能,劣势是消耗大量的网络带宽资源,管理难度大.在 ...
【Java EE 学习 30】【闪回】【导入导出】【管理用户安全】【分布式数据库】【数据字典】【方案】
一.闪回 1.可能的误操作 (1)错误的删除了记录 (2)错误的删除了表 (3)查询历史记录 (4)撤销已经提交了的事务. 2.对应着以上四种类型的误操作,有四种闪回类型 (1)闪回表:将表回退到过去 ...
云时代的分布式数据库：阿里分布式数据库服务DRDS
发表于2015-07-15 21:47| 10943次阅读| 来源<程序员>杂志| 27 条评论| 作者王晶昱 <程序员>杂志数据库DRDS分布式沈询摘要:伴随着系统性能.成 ...
Erlang 103 Erlang分布式编程
Outline 笔记系列 Erlang环境和顺序编程Erlang并发编程Erlang分布式编程YawsErlang/OTP 日期变更说明 2014-11-23 A Outl ...
SQL Server分布式数据库技术(LinkedServer,CT,SSB)
SQL Server自定义业务功能的数据同步在不同业务需求的驱动下,数据库的模块化拆分将会面临一些比较特殊的业务逻辑处理需求.例如,在数据库层面的数据同步需求.同步过程中,可能会有一些比较复杂的业务 ...

随机推荐

如何创建一个简单的Visual Studio Code扩展
注:本文提到的代码示例下载地址>How to create a simple extension for VS Code VS Code 是微软推出的一款轻量级的代码编辑器,免费,开源,支持多种 ...
SQLite3源程序分析之虚拟机
前言最早的虚拟机可追溯到IBM的VM/370,到上个世纪90年代,在计算机程序设计语言领域又出现一件革命性的事情——Java语言的出现,它与c++最大的不同在于它必须在Java虚拟机上运行.Java ...
mybatis判断传入list大小
<if test="tenantIds.size() > 0"> AND A.PROC_TARGET_ID IN <foreach collection=& ...
【BZOJ 4581】【Usaco2016 Open】Field Reduction
http://www.lydsy.com/JudgeOnline/problem.php?id=4581 考虑$O(n^3)$暴力. 实际上枚举最靠边的三个点就可以了,最多有12个点. 还是暴力= ...
vue-Resource（与后端数据交互）
单来说,vue-resource就像jQuery里的$.ajax,用来和后端交互数据的.可以放在created或者ready里面运行来获取或者更新数据... vue-resource文档:https: ...
java 中正则表达式匹配
String str = "#a#,#b#"; String reg="\\#+[^\\#]+\\#+"; Pattern p=Pattern.compile( ...
SQLAlchemy(一)
说明 SQLAlchemy只是一个翻译的过程,我们通过类来操作数据库,他会将我们的对应数据转换成SQL语句. 运用ORM创建表 #!/usr/bin/env python #! -*- coding: ...
mongodb未授权访问漏洞
catalogue . mongodb安装 . 未授权访问漏洞 . 漏洞修复及加固 . 自动化检测点 1. mongodb安装 apt-get install mongodb 0x1: 创建数据库目录 ...
利用scrapy和MongoDB来开发一个爬虫
今天我们利用scrapy框架来抓取Stack Overflow里面最新的问题(),并且将这些问题保存到MongoDb当中,直接提供给客户进行查询. 安装在进行今天的任务之前我们需要安装二个框架,分别 ...
AJAX学习笔记
AJAX不是一种编程语言,AJAX是一种实现网页异步加载的技术,即不刷新网页也能部分的更新网页的内容.如:提交表单信息,通过ajax可以不刷新页面来使得人们明白如何正确的填写信息,判断填写信息的错误或 ...

erlang 分布式数据库Mnesia 实现及应用

erlang 分布式数据库Mnesia 实现及应用的更多相关文章

随机推荐

热门专题