HBase Performance Tuning: Compression Testing

Overview:

1. Sequential write

2. Sequential read

3. Random write

4. Random read

5. Scan

0 Performance testing tool

hbase org.apache.hadoop.hbase.PerformanceEvaluation

Usage: java org.apache.hadoop.hbase.PerformanceEvaluation \

[--nomapred] [--rows=ROWS] [--table=NAME] \

[--compress=TYPE] [--blockEncoding=TYPE] [-D<property=value>]* <command> <nclients>

Options:

nomapred Run multiple clients using threads (rather than use mapreduce)

rows Rows each client runs. Default: One million

sampleRate Execute test on a sample of total rows. Only supported by randomRead. Default: 1.0

table Alternate table name. Default: 'TestTable'

compress Compression type to use (GZ, LZO, ...). Default: 'NONE'

flushCommits Used to determine if the test should flush the table. Default: false

writeToWAL Set writeToWAL on puts. Default: True

presplit Create presplit table. Recommended for accurate perf analysis (see guide). Default: disabled

inmemory Tries to keep the HFiles of the CF inmemory as far as possible. Not guaranteed that reads are always served from memory. Default: false

latency Set to report operation latencies. Currently only supported by randomRead test. Default: False

Note: -D properties will be applied to the conf used.

For example:

-Dmapred.output.compress=true

-Dmapreduce.task.timeout=60000

Command:

filterScan Run scan test using a filter to find a specific row based on it's value (make sure to use --rows=20)

randomRead Run random read test

randomSeekScan Run random seek and scan 100 test

randomWrite Run random write test

scan Run scan test (read every row)

scanRange10 Run random seek scan with both start and stop row (max 10 rows)

scanRange100 Run random seek scan with both start and stop row (max 100 rows)

scanRange1000 Run random seek scan with both start and stop row (max 1000 rows)

scanRange10000 Run random seek scan with both start and stop row (max 10000 rows)

sequentialRead Run sequential read test

sequentialWrite Run sequential write test

Args:

nclients Integer. Required. Total number of clients (and HRegionServers)

running: 1 <= value <= 500

Examples:

To run a single evaluation client:

$ bin/hbase org.apache.hadoop.hbase.PerformanceEvaluation sequentialWrite 1

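1 Sequential write test

Test baseline: 10 concurrent clients with --rows=2000000 (per the usage text above, --rows is the number of rows each client runs).

1.1 Sequential write without compression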

hbase org.apache.hadoop.hbase.PerformanceEvaluation --rows=2000000 --nomapred --table=none_test sequentialWrite 10

1.2 Sequential write with LZO

hbase org.apache.hadoop.hbase.PerformanceEvaluation --rows=2000000 --nomapred --compress=LZO --table=none_test sequentialWrite 10
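The two write runs can be wrapped in a small script so that both variants use identical parameters. This is only a sketch under stated assumptions: the table name lzo_test is hypothetical (a separate table per compression type is assumed so that each run creates its own table with the requested setting), and the script only prints the commands unless RUN=1 is set.

```shell
# Sketch: sequential-write benchmark with and without LZO compression.
# Assumption: lzo_test is an illustrative table name, not taken from the article.
PE_CLASS="org.apache.hadoop.hbase.PerformanceEvaluation"
ROWS=2000000
CLIENTS=10

# Print the PerformanceEvaluation command line for a compression type and table.
pe_write_cmd() {
  local compress="$1" table="$2"
  echo "hbase ${PE_CLASS} --rows=${ROWS} --nomapred --compress=${compress} --table=${table} sequentialWrite ${CLIENTS}"
}

for spec in "NONE:none_test" "LZO:lzo_test"; do
  # Split "TYPE:table" into its two halves.
  cmd="$(pe_write_cmd "${spec%%:*}" "${spec##*:}")"
  echo "+ ${cmd}"
  # Dry run by default; set RUN=1 to execute against a real cluster.
  if [ "${RUN:-0}" = "1" ]; then
    time ${cmd}
  fi
done
```

Running it as `RUN=1 sh script.sh` on a node with the hbase client installed would execute both runs and report the wall-clock time of each.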

1.3 Comparison with and without compression

Metric	No compression	LZO compression
Average time to insert 1 million rows
File size (10 million rows)	19.2G	4.7G
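
The file sizes in the table translate directly into a compression ratio. The short calculation below uses only the two measured values (19.2G and 4.7G); the helper names ratio and percent_of_raw are made up for this illustration.

```shell
# Compression ratio: uncompressed size divided by compressed size.
ratio() {
  awk -v raw="$1" -v lzo="$2" 'BEGIN { printf "%.2f\n", raw / lzo }'
}

# Compressed size as a percentage of the uncompressed size.
percent_of_raw() {
  awk -v raw="$1" -v lzo="$2" 'BEGIN { printf "%.1f\n", 100 * lzo / raw }'
}

echo "compression ratio: $(ratio 19.2 4.7)x"
echo "LZO size as % of uncompressed: $(percent_of_raw 19.2 4.7)%"
```

For this data set LZO shrinks the stored files to roughly a quarter of their uncompressed size, a ratio of about 4.1x.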

2 Sequential read test

2.1 Sequential read without compression

2.2 Sequential read with LZO

2.3 Comparison with and without compression
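
The article does not list the read commands. By analogy with the write test, a sequential read run might look like the sketch below. It assumes the tables were already populated by the write test, that the same baseline (--rows=2000000, 10 clients) applies, and that the LZO data lives in a table named lzo_test, which is a hypothetical name. Compression is a property of the existing table, so no --compress flag is passed here. The script only prints the commands unless RUN=1 is set.

```shell
# Sketch: sequential-read benchmark against previously written tables.
# Assumption: lzo_test is an illustrative table name, not taken from the article.
PE_CLASS="org.apache.hadoop.hbase.PerformanceEvaluation"

# Print the PerformanceEvaluation command line for reading one table.
pe_read_cmd() {
  local table="$1"
  echo "hbase ${PE_CLASS} --rows=2000000 --nomapred --table=${table} sequentialRead 10"
}

for table in none_test lzo_test; do
  cmd="$(pe_read_cmd "${table}")"
  echo "+ ${cmd}"
  # Dry run by default; set RUN=1 to execute against a real cluster.
  if [ "${RUN:-0}" = "1" ]; then
    time ${cmd}
  fi
done
```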
