Cassandra Issue with Tombstone
1. Cassandra is quicker than postgre and have lower change to lose data. Cassandra doesn't have foreign keys, locking mechanism and etcs, so that it's quicker on writes.
2. Everything in cassandra is a write. Insert/update/delete is also write.
3. Setting a column to null/ deleting a column will create a tomestone; Deleting a row/primary key/partitio will create a single row tomestone
4. Could adjust tombstone_warn_threshold and tombstone_failure_threshold in cassandra.yaml.
5. Could adjust gc_grace_seconds when creating table
6. Hitting tombstone limit only happens per query.
Related Attributes
Delete will create tombstones
tombstone_warn_threshold: 1000 (default), could be found in cassandra.yaml
tombstone_failure_threshold: 100000 (default), could be found in cassandra.yaml
tombstone_compaction_interval: table attribute
min_compaction_threshold: table attribute #Compaction will only be eligible after min_compaction_threshold SSTables exist, by default it’s 4.
gc_grace_seconds: table attribute
snapshot_before_compaction: false
Check table attributes here http://docs.datastax.com/en/cassandra/2.1/cassandra/reference/referenceTableAttributes.html
Could consider using DateTieredCompactionStrategy instead of the default SizeTieredCompactionStrategy.
Cassandra MBean
Use Jconsole to remotely connect to:
hostname:7199
e.g. localhost:7199
Check/change the TombstoneFailureThreshold attribute inside StorageService MBean.
Force a flush and compaction
sudo nodetool -h localhost -p 7199 -u OC_APP_RAINBOWDBA -pw a3c224d4b89192d2ea3ea943dd7e9648 flush rainbowdba undeliveredmessage
sudo nodetool -h localhost -p 7199 -u OC_APP_RAINBOWDBA -pw a3c224d4b89192d2ea3ea943dd7e9648 compact rainbowdba undeliveredmessage
Deleted rows will only disappear when gc_grace_seconds time passed and a flush and compaction has been forced
Truncating Table
Truncating a table is an immediate operation and won’t leave any tomestones.
Don’t insert Null into columns
Inserting a null value to the column will leave a cell tomestone. Deleting a partition/row will also create a single row tombstone.
Deleting a partition will create a partition tomestone and override the existing cell tomestones. This only happens in memory table not on the disk. Not sure whether creating a partition tomestone will cause a compaction of the cell tomestones on disk.
Using TTL
insert into undeliveredmessage("id", "message","type") values('1','message','RAVEN') using ttl 5;
This query will result in 3 tomestone cells and one row tombstone.
Cassandra partition size limitation
In Cassandra, the maximum number of cells (rows x columns) in a single partition is 2 billion.
Additionally, a single column value may not be larger than 2GB. Partitions greater than 100Mb can cause significant pressure on the heap.
Performance Test
Test script TestCassandraPerformance.java could be found in
Cassandra version: 2.2.3, cqlsh 5.0.1
1. TombstoneFailureThreshold = 500
Seems persist 102000 rows and then delete them won’t hit the limit of the tomestone.
2. TombstoneFailureThreshold = 1
insert into undeliveredmessage("id","message","sent","type") values('3', 'message3', True, null);
and then select * from undeliveredmessage is fine
2. TombstoneFailureThreshold = 1
insert into undeliveredmessage("id","message","sent","type") values('3', 'message3', null, null);
and then select * from undeliveredmessage will hit the tomestone limit
|
deleted rows number |
existing rows number |
locally recovery time |
vector 2 recovery time |
|
150_000 * 9 |
Operation Timed Out |
Operation Timed Out |
|
|
150_000 * 5 |
9_072 ms |
10_152 ms |
|
|
150_000 * 3 |
5_679 ms |
7_957 ms |
|
|
150_000 * 2 |
3_025 ms |
5_218 ms |
|
|
150_000 |
1_326 ms |
1_879 ms |
|
|
150_000 |
158 ms |
333 ms |
|
|
150_000 *2 |
562 ms |
1_963 ms |
|
|
150_000 *3 |
2_223 ms |
3_833 ms |
|
|
150_000 *5 |
3_476 ms |
9_726 ms |
|
|
150_000 *10 |
Operation Timed Out |
Operation Timed Out |
|
|
150_000 |
150_000 |
1_321 ms |
3_735 ms |
|
150_000 *2 |
150_000 |
1_893 ms |
4_939 ms |
Note that we will hit timeout issue when having 150_000 *10 deleted rows in the table.
Hitting tombstone limit
For Dash you should see
com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried for query failed (tried: localhost/127.0.0.1:9042 (com.datastax.driver.core.exceptions.ReadTimeoutException: Cassandra timeout during read query at consistency ONE (1 responses were required but only 0 replica responded)))
at com.datastax.driver.core.ControlConnection.reconnectInternal(ControlConnection.java:223)
For query in command line, you should see something like:
Traceback (most recent call last):
File "/usr/bin/cqlsh.py", line 1172, in perform_simple_statement
rows = future.result(self.session.default_timeout)
File "/usr/share/cassandra/lib/cassandra-driver-internal-only-2.7.2.zip/cassandra-driver-2.7.2/cassandra/cluster.py", line 3347, in result
raise self._final_exception
ReadFailure: code=1300 [Replica(s) failed to execute read] message="Operation failed - received 0 responses and 1 failures" info={'failures': 1, 'received_responses': 0, 'required_responses': 1, 'consistency': 'ONE'}
Cassandra Issue with Tombstone的更多相关文章
- Cassandra issue - "The clustering keys ordering is wrong for @EmbeddedId"
在Java连接Cassandra的情况下, 当使用组合主键时, 默认第一个是Partition Key, 后续的均为Clustering Key. 如果有多个Clustering Key, 在Java ...
- akka-typed(10) - event-sourcing, CQRS实战
在前面的的讨论里已经介绍了CQRS读写分离模式的一些原理和在akka-typed应用中的实现方式.通过一段时间akka-typed的具体使用对一些经典akka应用的迁移升级,感觉最深的是EvenSou ...
- Cassandra简介
在前面的一篇文章<图形数据库Neo4J简介>中,我们介绍了一种非常流行的图形数据库Neo4J的使用方法.而在本文中,我们将对另外一种类型的NoSQL数据库——Cassandra进行简单地介 ...
- Cassandra 计数器counter类型和它的限制
文档基础 Cassandra 2.* CQL3.1 翻译多数来自这个文档 更新于2015年9月7日,最后有参考资料 作为Cassandra的一种类型之一,Counter类型算是限制最多的一个.Coun ...
- 闲聊cassandra
原创,转载请注明出处 今天聊聊cassandra,里面用了不少分布式系统设计的经典算法比如consistent hashing, bloom filter, merkle tree, sstable, ...
- 开源软件:NoSql数据库 - 图数据库 Cassandra
转载原文:http://www.cnblogs.com/loveis715/p/5299495.html Cassandra简介 在前面的一篇文章<图形数据库Neo4J简介>中,我们介绍了 ...
- Cassandra User 问题汇总(1)------------repair
Cassandra Repair 问题 问1: 文档建议每周或者每月跑一次full repair.那么如果我是使用partition rangerepair,是否还有必要在cluster的每个节点上定 ...
- 从Stage角度看cassandra write
声明 文章发布于CSDN cassandra concurrent 具体实现 cassandra并发技术文中介绍了java的concurrent实现,这里介绍cassandra如何基于java实现ca ...
- Cassandra 原理介绍
Cassandra最初源自Facebook,结合了Google BigTable面向列的特性和[Amazon Dynamo](http://en.wikipedia.org/wiki/Dynamo(s ...
随机推荐
- 性能调优之MYSQL高并发优化下
三.算法的优化 尽量避免使用游标,因为游标的效率较差,如果游标操作的数据超过1万行,那么就应该考虑改写..使用基于游标的方法或临时表方法之前,应先寻找基于集的解决方案来解决问题,基于集的方法通常更有效 ...
- 4月6日--js生成随机数列
newarr=[1,2,3,4,5,6] function randomsort(a,b){ return Math.random()>0.5?-1:1;}//用Math.random()函数生 ...
- 07 The VC Dimension
当N大于等于2,k大于等于3时, 易得:mH(N)被Nk-1给bound住. VC维:最小断点值-1/H能shatter的最大k值. 这里的k指的是存在k个输入能被H给shatter,不是任意k个输入 ...
- linux常用脚本
转载于http://justcoding.iteye.com/blog/1943504 我们在运维中,尤其是linux运维,都知道脚本的重要性,脚本会让我们的 运维事半功倍,所以学会写脚本是我们每个l ...
- Java中ArrayList,Vector,LinkedList,HashMap,HashTable,HashSet对比及总结
1.所有的集合的父类都是Collection的接口 2.Set List Map 区别 A 在Set里面:无法添加元素的顺序,所以Set里面的元素不能重复 B 在List中:有索引号,类似于数组, ...
- --save 和 --save-dev的区别
--save是对生产环境所需依赖的声明(开发应用中使用的框架,库,比如jquery,bootstrap等) --save-dev是对开发环境所需依赖的声明(构建工具,测试工具,比如babel,gulp ...
- Robotframe work学习之初(二)
一.F5帮助 Robot Framework 并没有像其它框架一样提供一份完整的 API 文档,所以,我们没办法通过官方 API文档进行习.RIDE 提供了 F5 快捷键来打开帮助文档. search ...
- spring的MVC执行原理
spring的MVC执行原理 1.spring mvc将所有的请求都提交给DispatcherServlet,它会委托应用系统的其他模块负责对请求 进行真正的处理工作. 2.DispatcherSer ...
- Unity -JsonUtility的使用
今天,为大家分享一下unity上的Json序列化,应该一说到这个词语,我们肯定会觉得,这应该是很常用的一个功能点:诚然,我们保存数据的时候,也许会用到json序列化,所以,我们有必要快速了解一下它的简 ...
- GPIO的配置过程
今天看到一篇很好的博文,,看这里:http://www.cnblogs.com/crazyxu/archive/2011/10/14/2212337.html 下面总结一下,加深一下理解. 要使用GP ...