presto-cli通过hive查询hdfs

1. 启动hive metastore

2. 启动hive thrift接口

参考：http://www.cnblogs.com/kisf/p/7497261.html

3. 下载presto server

wget https://repo1.maven.org/maven2/com/facebook/presto/presto-server/0.186/presto-server-0.186.tar.gz

4. 配置

config.properties

coordinator=true

node-scheduler.include-coordinator=true

http-server.http.port=20000

query.max-memory=40GB

query.max-memory-per-node=4GB

exchange.http-client.request-timeout=20s

http-server.threads.max=400

discovery-server.enabled=true

discovery.uri=http://10.183.225.158:20000

　　该配置将coordinate与worker配置再一台机器上。集群部署建议分开：

etc/config.properties包含 Presto Server 相关的配置，每一个 Presto Server 可以同时作为 coordinator 和 worker 使用。你可以将他们配置在一个节点上，但是，在一个大的集群上建议分开配置以提高性能。

coordinator 的最小配置：

coordinator=true

node-scheduler.include-coordinator=false

http-server.http.port=

task.max-memory=1GB

discovery-server.enabled=true

discovery.uri= http://xx1:9090

worker 的最小配置：

coordinator=false

http-server.http.port=

task.max-memory=1GB

discovery.uri= http://xx2:9090

可选的，作为测试，你可以在一个节点上同时配置两者：

coordinator=true

node-scheduler.include-coordinator=true

http-server.http.port=

task.max-memory=1GB

discovery-server.enabled=true

discovery.uri=http://xx3:9090

node.properties

node.environment=xxxoipresto01

node.data-dir=/data/slot0/presto/presto-01-coordinator

node.id=presto-01-coord

jvm.properties

-server

-Xmx10G

-XX:+UseConcMarkSweepGC

-XX:+ExplicitGCInvokesConcurrent

-XX:+CMSClassUnloadingEnabled

-XX:+AggressiveOpts

-XX:+HeapDumpOnOutOfMemoryError

-XX:ReservedCodeCacheSize=600M

-XX:NativeMemoryTracking=summary

-Dcom.sun.management.jmxremote

-Dcom.sun.management.jmxremote.authenticate=false

-Dcom.sun.management.jmxremote.ssl=false

-Dcom.sun.management.jmxremote.port=20001

-Dcom.sun.management.jmxremote.rmi.port=20001

log.properties

com.facebook.presto=INFO

etc/catalog/hive.properties

connector.name=hive-hadoop2

hive.metastore.uri=thrift://hadoop1:9083

hive.config.resources=/xxx/soft/hadoop-2.7.3/etc/hadoop/core-site.xml,/xxx/soft/hadoop-2.7.3/etc/hadoop/hdfs-site.xml

hive.metastore.authentication.type=KERBEROS

hive.metastore.service.principal=hive/hadoop1@JENKIN.COM

hive.metastore.client.principal=presto@JENKIN.COM

hive.metastore.client.keytab=/etc/presto/presto.keytab

hive.hdfs.authentication.type=KERBEROS

hive.hdfs.impersonation.enabled=false

hive.hdfs.presto.principal=presto@JENKIN.COM

hive.hdfs.presto.keytab=/etc/presto/presto.keytab

4. 启动presto coordinate

/usr/bin/nohup /xxx/soft/lesports-presto-server-0.176/bin/launcher run --node-config /etc/presto/01/node.properties --jvm-config /etc/presto/01/jvm.config --config /etc/presto/01/config.properties --log-levels-file /etc/presto/01/log.properties >/xxx/soft/lesports-presto-server-0.176/stdout.log 2>&1 &

　　这里为了做集群隔离把配置文件放到了另外的目录。

启动是否成功看日志，或者http://10.183.225.158:20000/看管理界面，或者：

netstat -nlp | grep 20000

5. 下载presto-cli

wget https://repo1.maven.org/maven2/com/facebook/presto/presto-cli/0.186/presto-cli-0.186-executable.jar

6. 改名为presto-cli

mv presto-cli-0.186-executable.jar presto-cli

7. 启动

./presto-cli --server hadoop1:20000 --catalog hive --schema jenkindb

　　10.183.225.158的hostname为hadoop1

8. 查询

presto:jenkindb> show tables;

  Table

----------

 jenkintb

 test1

(2 rows)

Query 20171020_023712_00002_a78ua, FINISHED, 1 node

Splits: 18 total, 18 done (100.00%)

0:01 [2 rows, 49B] [2 rows/s, 51B/s]

presto:jenkindb> select * from jenkintb;

 id |   name

----+----------

  1 | jenkin

  2 | jenkin.k

  3 | anne

(3 rows)

Query 20171020_023724_00003_a78ua, FINISHED, 1 node

Splits: 17 total, 17 done (100.00%)

0:00 [3 rows, 27B] [8 rows/s, 72B/s]

hive导入数据参考：http://www.cnblogs.com/kisf/p/7497261.html

问题解决：

问题：failed: No worker nodes available　

解决：presto coordinate config.properties增加 node-scheduler.include-coordinator=true

问题：org.apache.hadoop.ipc.RemoteException: User: presto@JENKIN.COM is not allowed to impersonate root

解决：hive.properties增加 hive.hdfs.impersonation.enabled=false

presto-cli通过hive查询hdfs的更多相关文章

cdh5.7 做完HA后hive 查询出现异常： expected: hdfs://nameservice
异常信息如下: select * from b_pt_pr_customer_address_info limit 19; FAILED: SemanticException Unable to de ...
使用shell+awk完成Hive查询结果格式化输出
好久不写,一方面是工作原因,有些东西没发直接发,另外的也是习惯给丢了,内因所致.今天是个好日子,走起! btw,实际上这种格式化输出应该不只限于某一种需求,差不多是通用的. 需求: --基本的:当前H ...
hive查询遇到java.io.EOFException: Unexpected end of input stream错误
hive查询遇到java.io.EOFException: Unexpected end of input stream错误原因基本上有两个: 空文件不完整的文件解决办法: 删除对应文件- 参考 ...
hive查询ncdc天气数据
使用hive查询ncdc天气数据在hive中将ncdc天气数据导入,然后执行查询shell,可以让hive自动生成mapredjob,快速去的想要的数据结果. 1. 在hive中创建ncdc表,这个 ...
hive 报错/tmp/hive on HDFS should be writable. Current permissions are: rwx--x--x
启动hive时报例如以下错误:/tmp/hive on HDFS should be writable. Current permissions are: rwx--x--x 这是/tmp/hive文 ...
hive查询不加分区的一个异常
今天下午有同事反馈她提交了了一个SQL后,hive 查询就停止响应了. 我看了下,发现hiveserver确实hug住了.听过查看日志,发现了一个牛逼的SQL, 这个SQL很简单: select a. ...
hive查询语句入门（hive DDL）
hive DDL 启动hadoop /apps/hadoop/sbin/start-all.sh 开启MySQL库,用于存放hive的元数据 sudo service mysql start 启动hi ...
求解：为什么impala实现hive查询可以使用ifnull()函数，不可以使用length() 函数
求大神解惑,找了很久都没有找到为什么??? hive支持length() 函数,不支持ifnull()函数??? impala实现hive查询支持ifnull()函数,不支持length() 函数 ...
hive查询结果输出到hdfs上
insert overwrite directory "/mapredOutput/UserYesterdayInterest/${hiveconf:day}"row format ...

随机推荐

eclispe创建gradle项目
1.打开eclipse,选择Help——>install from Catalog,安装如图所示的gradle 2.右击空白处,new——>other——>Gradle——>G ...
微信开放平台全网发布时，检测失败 —— C#
主要就是三个:返回API文本消息,返回普通文本消息,发送事件消息 --会出现失败的情况 (后续补充说明:出现检测出错,不一定是代码出现了问题,也有可能是1.微信方面检测时出现服务器请求失败,2.我 ...
delphi 获取本机IP地址和MAC地址
unit NetFunc; interface uses SysUtils, Windows, dialogs, winsock, Classes, ComObj, WinInet, Variants ...
AVL树与红黑树
平衡树是平时经常使用数据结构. C++/JAVA中的set与map都是通过红黑树实现的. 通过了解平衡树的实现原理,可以更清楚的理解map和set的使用场景. 下面介绍AVL树和红黑树. 1. AVL ...
Ubuntu14.04下安装DevStack
虚拟机中的网络配置 NET8 为nat net2 为host-only 虚拟机网络配置 # The primary network interface vmnet nat type auto eth0 ...
Thinkphp的cookie的怎么玩？
在使用COOKIE的时候,首先要对COOKIE进行加密,加密方式采用:异位或的方式进行加密: // 异位或加密 1是加密 0 是解密 function encrytion($value,$type=0 ...
java之面向对象三大特征（封装，继承，多态）
一.封装封装是指将对象的状态信息隐藏在对象内部,不允许外部程序直接访问对象内部信息,而是通过该类提供的对外方法进行内部信息的操作和访问. 封装可以达到以下目的: 1)隐藏类的实现细节 2)让使用者只 ...
委托（Func与Action）
1.平时我们如果要用到委托一般都是先声明一个委托类型,比如: private delegate string Say(); string说明适用于这个委托的方法的返回类型是string类型,委托名Sa ...
poj3349 Snowflake Snow Snowflakes【HASH】
Snowflake Snow Snowflakes Time Limit: 4000MS Memory Limit: 65536K Total Submissions: 49991 Accep ...
使用colmap进行稠密重建
colmap应该是目前state-of-art的增量式SFM方案,可以方便的对一系列二维图片进行三维重建不用对摄像机进行标定,只需要从不同角度对重建场景或物体进行拍摄得到一系列图像作为输入首先需要 ...

presto-cli通过hive查询hdfs

presto-cli通过hive查询hdfs的更多相关文章

随机推荐

热门专题