073 HBASE的读写以及client API

一：读写思想

1.系统表

　　hbase：namespace

　　　　存储hbase中所有的namespace的信息

　　hbase：meta　　　

　　　　rowkey:hbase中所有表的region的名称
　　　　column：regioninfo：region的名称，region的范围
　　　　server：该region在哪台regionserver上

2.读写流程

　　tbname,rowkey -> region -> regionserver -> store -> storefile

　　但是这些都是加载过meta表之后，然后meta表如何寻找？

3.读的流程　　

　　-》根据表名和rowkey找到对应的region
　　-》zookeeper中存储了meta表的region信息
　　-》从meta表中获取相应的region的信息
　　-》找到对应的regionserver
　　-》查找对应的region
　　-》读memstore
　　-》storefile

4.写的流程　　

　　-》根据表名和rowkey找到对应的region
　　-》zookeeper中存储了meta表的region信息
　　-》从meta表中获取相应的region的信息
　　-》找到对应的regionserver
　　-》正常情况
　　-》WAL（write ahead log预写日志），一个regionserver维护一个hlog
　　-》memstore (达到一定大小，flush到磁盘)
　　-》当多个storefile达到一定大小以后，会进行compact，合并成一个storefile
　　-》当单个storefile达到一定大小以后，会进行split操作，等分割region

5.注意点

　　关于版本的合并和删除是在compact阶段完成的。hbase只负责数据的增加存储
　　hmaster短暂的不参与实际的读写

二：HBase Client API 的书写

1.添加依赖

2.添加配置文件

　　core-site.xml

　　hdfs-site.xml

　　hbase-site.xml

　　log4j.properties

　　regionservers

3.get的书写

4.put的书写

5.delete的书写

　　注意全部删除：

6.scan的书写

7.过滤条件的scan的书写

三：复制源代码

 package com.beifeng.bigdat;

 import java.io.IOException;

 import org.apache.hadoop.conf.Configuration;

 import org.apache.hadoop.hbase.Cell;

 import org.apache.hadoop.hbase.CellUtil;

 import org.apache.hadoop.hbase.HBaseConfiguration;

 import org.apache.hadoop.hbase.client.Delete;

 import org.apache.hadoop.hbase.client.Get;

 import org.apache.hadoop.hbase.client.HTable;

 import org.apache.hadoop.hbase.client.Put;

 import org.apache.hadoop.hbase.client.Result;

 import org.apache.hadoop.hbase.client.ResultScanner;

 import org.apache.hadoop.hbase.client.Scan;

 import org.apache.hadoop.hbase.filter.Filter;

 import org.apache.hadoop.hbase.filter.PrefixFilter;

 import org.apache.hadoop.hbase.util.Bytes;

 public class HbaseClientTest {

     public static HTable getTable(String name) throws Exception{

         Configuration conf=HBaseConfiguration.create();

         HTable table=new HTable(conf,name);

         return table;

     }

     public static void getData(HTable table) throws Exception{

         Get get=new Get(Bytes.toBytes("103"));

         get.addFamily(Bytes.toBytes("info"));

         Result rs=table.get(get);

         for(Cell cell:rs.rawCells()){

             System.out.println(

                     Bytes.toString(CellUtil.cloneFamily(cell))+"--"+

                     Bytes.toString(CellUtil.cloneQualifier(cell))+"---"+

                     Bytes.toString(CellUtil.cloneValue(cell))+"----"+

                     cell.getTimestamp()

                     );

             System.out.println("----------------------------------------------");

         }

     }

     public static void putData(HTable table) throws Exception{

         Put put=new Put(Bytes.toBytes("103"));

         put.add(Bytes.toBytes("info"),

                 Bytes.toBytes("name"),

                 Bytes.toBytes("zhaoliu"));

         table.put(put);

         getData(table);

     }

     public static void deleteData(HTable table) throws Exception{

         Delete delete =new Delete(Bytes.toBytes("103"));

         delete.deleteColumns(Bytes.toBytes("info"), Bytes.toBytes("name"));

         table.delete(delete);

         getData(table);

     }

     public static void scanData(HTable table) throws Exception{

         Scan scan =new Scan();

         ResultScanner rs=table.getScanner(scan);

         for(Result r:rs){

             System.out.println(Bytes.toString(r.getRow()));

             for(Cell cell:r.rawCells()){

                 System.out.println(

                         Bytes.toString(CellUtil.cloneFamily(cell))+"---"+

                         Bytes.toString(CellUtil.cloneQualifier(cell))+"---"+

                         Bytes.toString(CellUtil.cloneValue(cell))+"--"+

                         cell.getTimestamp()

                     );

             System.out.println();

             }

         }

     }

     public static void filterScan(HTable table) throws Exception{

         Scan scan =new Scan();

         Filter filter=new PrefixFilter(Bytes.toBytes("10"));

         scan.setFilter(filter);

         scan.setCacheBlocks(true);

         scan.setCaching(1000);

         scan.setBatch(100);

         ResultScanner rs=table.getScanner(scan);

         for(Result r:rs){

             System.out.println(Bytes.toString(r.getRow()));

             for(Cell cell:r.rawCells()){

                 System.out.println(

                         Bytes.toString(CellUtil.cloneFamily(cell))+"---"+

                         Bytes.toString(CellUtil.cloneQualifier(cell))+"---"+

                         Bytes.toString(CellUtil.cloneValue(cell))+"--"+

                         cell.getTimestamp()

                     );

             System.out.println();

             }

         }

     }

     public static void main(String[] args) throws Exception {

         HTable table=getTable("nstest1:tb1");

         //getData(table);

         //putData(table);

         //deleteData(table);

         //scanData(table);

         filterScan(table);

     }

 }

073 HBASE的读写以及client API的更多相关文章

HBASE的读写以及client API
一:读写思想 1.系统表 hbase:namespace 存储hbase中所有的namespace的信息 hbase:meta rowkey:hbase中所有表的region的名称 column:re ...
HBase 二次开发 java api和demo
1. 试用thrift python/java以及hbase client api.结论例如以下: 1.1 thrift的安装和公布繁琐.可能会遇到未知的错误,且hbase.thrift的版本 ...
hbase的读写过程
hbase的读写过程: hbase的架构: Hbase真实数据hbase真实数据存储在hdfs上,通过配置文件的hbase.rootdir属性可知,文件在/user/hbase/下hdfs dfs - ...
Hbase的读写流程
HBase读写流程 1.HBase读数据流程 HRegionServer保存着meta表以及表数据,要访问表数据,首先Client先去访问zookeeper,从zookeeper里面获取meta表所在 ...
HBase 数据读写流程
HBase 数据读写流程 2016-10-18 杜亦舒读数据 HBase的表是按行拆分为一个个 region 块儿,这些块儿被放置在各个 regionserver 中假设现在想在用户表中获取 ro ...
ecshop /api/client/api.php、/api/client/includes/lib_api.php SQL Injection Vul
catalog . 漏洞描述 . 漏洞触发条件 . 漏洞影响范围 . 漏洞代码分析 . 防御方法 . 攻防思考 1. 漏洞描述 ECShop存在一个盲注漏洞,问题存在于/api/client/api. ...
Memcached Java Client API详解
针对Memcached官方网站提供的java_memcached-release_2.0.1版本进行阅读分析,Memcached Java客户端lib库主要提供的调用类是SockIOPool和MemC ...
Jersey(1.19.1) - Client API, Uniform Interface Constraint
The Jersey client API is a high-level Java based API for interoperating with RESTful Web services. I ...
Jersey(1.19.1) - Client API, Ease of use and reusing JAX-RS artifacts
Since a resource is represented as a Java type it makes it easy to configure, pass around and inject ...

随机推荐

MySQL - 日常操作二备份还原
登录mysql的命令 # 格式: mysql -h 主机地址 -u 用户名 -p 用户密码 mysql -h 110. -P3306 -uroot -p mysql -uroot -p -S /dat ...
出现fonts/fontawesome-webfont.woff?v=4.5.0 net::ERR_ABORTED
虽然网页正常显示和运行,但是有2个字体文件出现404错误. 原因:服务器没有配置MIME类型而已. 1. 在IIS网站中,找打网站对应的MIME类型,双击. 2.能看到此网站对应的MIME类型,点击右 ...
python基础知识~logger模块
一配置文件模块 import logging ->导入模块 logger = logging.getLogger('mylogger') ->初始化类二创建句柄 1 文件句柄 fh = ...
Caffe2 Detectron安装错误记录
caffe2 caffe2的安装方法有几种.其中最方便的是conda install.但是要求必须安装Anaconda. conda install -c caffe2 caffe2-cuda8.0- ...
maven私服内容补充
1.添加阿里云中央仓库注意Download Remote Indexes选项为True 1.登陆nexus私服(默认账号密码:admin/admin123) 2.点击右侧Repositories 3 ...
【ARTS】01_12_左耳听风-20190128~20190203
ARTS: Algrothm: leetcode算法题目 Review: 阅读并且点评一篇英文技术文章 Tip/Techni: 学习一个技术技巧 Share: 分享一篇有观点和思考的技术文章 Algo ...
使用neo4j-import工具导入数据
从Neo4j2.2版本开始,系统就自带了一个大数据量的导入工具:neo4j-import,可支持并行.可扩展的大规模csv数据导入(本例版本为:3.4.7版本) 1.前提条件关闭neo4j 无法在原 ...
linux内核capable源代码分析【转】
转自:https://blog.csdn.net/sanwenyublog/article/details/50856849 linux内核里对于进程的权限管理有一个很重要的函数capable,以前看 ...
ES系列三、基本知识准备
一.基础概念 1.索引索引(index)是elasticsearch的一个逻辑存储,可以理解为关系型数据库中的数据库,es可以把索引数据存放到一台服务器上,也可以sharding后存到多台服务器上, ...
BootStrap学习从现在开始
前言原文链接 http://aehyok.com/Blog/Detail/6.html 当下最流行的前端开发框架Bootstrap,可大大简化网站开发过程,从而深受广大开发者的喜欢.本文总结了Boo ...

073 HBASE的读写以及client API

073 HBASE的读写以及client API的更多相关文章

随机推荐

热门专题