CDH/Hadoop 5.15 installation steps
This guide covers the main steps for installing CDH 5.15 on Linux (CentOS 6.10). The installation method is a manual install using the Cloudera Manager tarball and parcels. The newest release at the time of writing is CDH 6.0.1, but it does not support CentOS 6.10, so I had to install 5.15.
Software components list:
- CDH parcel: CDH-5.15.1-1.cdh5.15.1.p0.4-el6.parcel
- CM (Cloudera Manager): cloudera-manager-el6-cm5.15.1_x86_64.tar.gz
- MySQL 5.7.73 and JDBC driver: mysql-connector-java-8.0.11.jar
- JVM: jdk-8u181-linux-x64.rpm
Here is how to get them:
JVM:
wget http://download.oracle.com/otn-pub/java/jdk/8u181-b13/96a7b8442fe848ef90c96a2fad6ed6d1/jdk-8u181-linux-x64.rpm?AuthParam=1539265359_30b3a4b9e17f3ed3b0962980168c2721
CDH:
wget http://archive.cloudera.com/cm5/cm/5/cloudera-manager-el6-cm5.15.1_x86_64.tar.gz
wget http://archive.cloudera.com/cdh5/parcels/latest/CDH-5.15.1-1.cdh5.15.1.p0.4-el6.parcel
wget http://archive.cloudera.com/cdh5/parcels/latest/CDH-5.15.1-1.cdh5.15.1.p0.4-el6.parcel.sha1
wget http://archive.cloudera.com/cdh5/parcels/5.15.1.4/manifest.json
MySQL Yum Repo:
wget https://repo.mysql.com//mysql57-community-release-el6-11.noarch.rpm
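Since the parcel is large, it may be worth checking the download against the .sha1 file before going further. The following is a minimal sketch, not part of any Cloudera tooling: verify_sha1 is a hypothetical helper name, and it assumes the .sha1 file holds the hex digest as its first whitespace-delimited field.

```shell
#!/bin/sh
# Compare a file's SHA-1 digest with the digest stored in a companion file.
# verify_sha1 is a hypothetical helper, not a Cloudera command.
verify_sha1() {
    file=$1
    sha_file=$2
    # Assumption: the first field of the checksum file is the hex digest.
    expected=$(awk '{print $1; exit}' "$sha_file")
    actual=$(sha1sum "$file" | awk '{print $1}')
    if [ "$expected" = "$actual" ]; then
        echo "OK: $file"
    else
        echo "MISMATCH: $file" >&2
        return 1
    fi
}

# Example (run in the download directory):
# verify_sha1 CDH-5.15.1-1.cdh5.15.1.p0.4-el6.parcel CDH-5.15.1-1.cdh5.15.1.p0.4-el6.parcel.sha1
```

A non-zero return status means the digests differ, in which case downloading the parcel again is the simplest fix.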
Hardware:
- The more powerful, the better.
Machine/OS Setup:
- Make sure the time is synchronized across machines (configure ntp if needed).
- Network: set the hostnames you like. I have 4 servers: ns01, ns02, ns03, ns04. Add them to the /etc/hosts file as below. Make sure you have high bandwidth across the nodes; mine is 210 MB/sec.
192.168.0.79 ns04
192.168.0.77 ns02
192.168.0.232 ns01
192.168.0.114 ns03
- Tune swap (each node) by adding the vm.swappiness line to /etc/sysctl.conf:
vim /etc/sysctl.conf
vm.swappiness = 10
- Disable SELinux: set SELINUX=disabled in /etc/selinux/config.
- Turn off the firewall (each node):
service iptables stop
chkconfig iptables off
- SSH without password:
- Run ssh-keygen -t rsa on each node. Press Enter when prompted.
- Run cp .ssh/id_rsa.pub .ssh/pub_key.ns0$n on each node to save the public key to a separate file, where $n is the number of the machine.
- Copy those files to one of the machines, such as ns01.
- Run cat pub_key.ns0* >> authorized_keys on ns01.
- Broadcast the file to all other machines.
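The last SSH step, broadcasting the merged authorized_keys file, can be sketched as a small loop. This is only an illustration under assumptions that are not stated above: the root account, the /root/.ssh path, and the helper names run and broadcast_keys are all mine. It prints the commands by default (dry run) so you can review them first.

```shell
#!/bin/sh
# Dry run by default: commands are printed, not executed.
# Set DRY_RUN=0 in the environment to actually run them.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Push the merged authorized_keys file from ns01 to the given hosts.
# Assumption: the keys live under /root/.ssh and you log in as root.
broadcast_keys() {
    for host in "$@"; do
        run scp /root/.ssh/authorized_keys "root@$host:/root/.ssh/authorized_keys"
    done
}

broadcast_keys ns02 ns03 ns04
```

Until the keys are in place, scp will still ask for each node's password once.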

JVM setup (each node)
Do this on each machine. Use the rpm distribution to install (rpm -ivh jdk*.rpm) and set JAVA_HOME to /usr/java/latest. When I used the jdk*.tar.gz distribution instead, I got an error like 'deploy client configuration for spark' during setup.
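One way to set JAVA_HOME for every login shell is a small profile snippet. This is a sketch of one option, not necessarily how I set it; the helper name write_java_profile and the /etc/profile.d/java.sh location are assumptions.

```shell
#!/bin/sh
# Write a profile snippet that exports JAVA_HOME for login shells.
# write_java_profile and the target path are illustrative assumptions.
write_java_profile() {
    target=$1
    # Quoted heredoc delimiter: $JAVA_HOME and $PATH are written literally.
    cat > "$target" <<'EOF'
export JAVA_HOME=/usr/java/latest
export PATH="$JAVA_HOME/bin:$PATH"
EOF
}

# Example (as root, on each node):
# write_java_profile /etc/profile.d/java.sh
```

Log in again (or source the file) for the variable to take effect.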
MySQL Installation/Setup (one node, ns01)
- Install and init MySQL:
https://opensourcedbms.com/dbms/installing-mysql-5-7-on-centosredhatfedora/
- SQL to create databases/users for Hadoop:
create database hive DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
create database oozie DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
create database hue DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
create database activity DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
GRANT ALL PRIVILEGES ON hive.* to 'hive'@'%' identified by 'hive' with grant option ;
flush privileges ;
GRANT ALL PRIVILEGES ON oozie.* to 'oozie'@'%' identified by 'oozie' with grant option ;
flush privileges ;
GRANT ALL PRIVILEGES ON hue.* to 'hue'@'%' identified by 'hue' with grant option ;
flush privileges ;
GRANT ALL PRIVILEGES ON activity.* to 'activity'@'%' identified by 'activity' with grant option ;
flush privileges ;
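Because the statements above follow one pattern per service, they can also be generated in a loop. The sketch below only prints SQL; gen_service_sql is a hypothetical helper name. Like the statements above, it uses the database name as both the user name and the password, which is fine for a test cluster only.

```shell
#!/bin/sh
# Print CREATE DATABASE / GRANT statements for each service database.
# The user name and password are set to the database name, as above.
gen_service_sql() {
    for db in "$@"; do
        echo "create database $db DEFAULT CHARSET utf8 COLLATE utf8_general_ci;"
        echo "GRANT ALL PRIVILEGES ON $db.* to '$db'@'%' identified by '$db' with grant option;"
    done
    echo "flush privileges;"
}

gen_service_sql hive oozie hue activity
# To apply instead of just printing:
# gen_service_sql hive oozie hue activity | mysql -uroot -p
```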
CDH Install
- Untar the CM tarball to the target folder /opt (ns01 node).
tar -xzf cloudera-manager-el6-cm5.15.1_x86_64.tar.gz -C /opt
- Copy the MySQL JDBC driver to avoid 'java.lang.ClassNotFoundException: com.mysql.jdbc.Driver' (ns01 node).
cp mysql-connector-java-8.0.11.jar /opt/cm-5.15.1/share/cmf/lib/
cp mysql-connector-java-8.0.11.jar /opt/cloudera/parcels/CDH-5.15.1-1.cdh5.15.1.p0.4/lib/hive/lib/
(Also copy the MySQL driver to the proper location for any other component that needs a MySQL connection to create and initialize its database and tables.)
- Create the Cloudera Manager DB (ns01 node).
(Run in MySQL)
GRANT ALL PRIVILEGES ON scm.* to 'scm'@'%' identified by 'scm' with grant option ;
(Run in Shell)
/opt/cm-5.15.1/share/cmf/schema/scm_prepare_database.sh mysql -hlocalhost -uroot -p*** --scm-host ns01 scm scm scm
- Copy the Hadoop parcel to the parcel-repo folder and rename (mv) the *.sha1 file to *.sha (ns01 node).
[Screenshot: resulting contents of the parcel-repo folder.]
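The parcel staging step can be sketched as follows. The helper name stage_parcel is hypothetical, and /opt/cloudera/parcel-repo is an assumed location for the parcel-repo folder, so adjust it to your layout. manifest.json is copied as well, on the assumption that it belongs next to the parcel, since it is part of the download list.

```shell
#!/bin/sh
# Copy a parcel, its checksum, and manifest.json into a parcel repo folder.
# stage_parcel is a hypothetical helper; the repo path is an assumption.
stage_parcel() {
    src_dir=$1
    repo_dir=$2
    parcel=$3
    mkdir -p "$repo_dir"
    cp "$src_dir/$parcel" "$repo_dir/"
    # Store the checksum under the .sha name instead of .sha1.
    cp "$src_dir/$parcel.sha1" "$repo_dir/$parcel.sha"
    cp "$src_dir/manifest.json" "$repo_dir/"
    # List the result so it can be checked by eye.
    ls -1 "$repo_dir"
}

# Example (run in the download directory on ns01):
# stage_parcel . /opt/cloudera/parcel-repo CDH-5.15.1-1.cdh5.15.1.p0.4-el6.parcel
```

The sketch copies rather than moves, so the original downloads stay in place if something goes wrong.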

- Update the Cloudera Manager server host in the agent config so the agents can connect to it (ns01 node). Set server_host to ns01:
vim /opt/cm-5.15.1/etc/cloudera-scm-agent/config.ini
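If you prefer not to edit the file by hand, the same change can be made with sed. This is a minimal sketch: set_server_host is a hypothetical helper, and it assumes GNU sed (for -i) and that config.ini already has a line starting with server_host=.

```shell
#!/bin/sh
# Point the agent at the Cloudera Manager server by rewriting server_host.
# set_server_host is a hypothetical helper. Assumptions: GNU sed (-i) and
# an existing "server_host=..." line in the config file.
set_server_host() {
    config=$1
    host=$2
    sed -i "s/^server_host=.*/server_host=$host/" "$config"
    # Print the resulting line as a quick confirmation.
    grep '^server_host=' "$config"
}

# Example:
# set_server_host /opt/cm-5.15.1/etc/cloudera-scm-agent/config.ini ns01
```

Doing this before copying /opt/cm-5.15.1 to the other machines means every agent gets the same setting.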
- Copy Cloudera Manager to the other machines as well. You can tar the folder /opt/cm-5.15.1 and scp it to the other machines.
- Add the cloudera-scm user (each node):
useradd --system --home /opt/cm-5.15.1/run/cloudera-scm-server --no-create-home --shell=/bin/false --comment "Cloudera SCM User" cloudera-scm
- Reboot all machines before starting CM. I wrote a simple script, rebootAll.sh, to do it.
- Run the Service and Configuration Manager (SCM) on the server node and the agent nodes. You can also run the agent on the server node, which gives you one more node to install Hadoop on.
/opt/cm-5.15.1/etc/init.d/cloudera-scm-server start (run on only 1 node)
/opt/cm-5.15.1/etc/init.d/cloudera-scm-agent start (run on all nodes)
- Launch CM from your browser at http://ns01:7180
- Wait several seconds if needed. If you can see the login page (credentials: admin/admin), there should be no big problem. Some screenshots of the UI:
[Screenshot: hosts list, showing my 4 nodes.]
[Screenshot: the repository you have configured.]
[Screenshot: assigning roles to the nodes.]
