CDH- 测试mr

cdh的mr样例算法的jar包在

[zc.lee@ip---- hadoop-0.20-mapreduce]$ pwd

/opt/cloudera/parcels/CDH-5.10.-.cdh5.10.0.p0./lib/hadoop-0.20-mapreduce

查看该目录下的文件

[zc.lee@ip---- hadoop-0.20-mapreduce]$ ll

total

drwxr-xr-x  root root    Jan    bin

-rw-r--r--  root root  Jan    CHANGES.txt

drwxr-xr-x  root root    Jan    cloudera

lrwxrwxrwx  root root      Jul    conf -> /etc/hadoop/conf

drwxr-xr-x  root root    Jan    contrib

drwxr-xr-x  root root    Jan    example-confs

lrwxrwxrwx  root root      Jul    hadoop-ant-2.6.-mr1-cdh5.10.0.jar -> ../../jars/hadoop-ant-2.6.-mr1-cdh5.10.0.jar

lrwxrwxrwx  root root      Jul    hadoop-ant-mr1.jar -> hadoop-ant-2.6.-mr1-cdh5.10.0.jar

lrwxrwxrwx  root root      Jul    hadoop-core-2.6.-mr1-cdh5.10.0.jar -> ../../jars/hadoop-core-2.6.-mr1-cdh5.10.0.jar

lrwxrwxrwx  root root      Jul    hadoop-core-mr1.jar -> hadoop-core-2.6.-mr1-cdh5.10.0.jar

lrwxrwxrwx  root root      Jul    hadoop-examples-2.6.-mr1-cdh5.10.0.jar -> ../../jars/hadoop-examples-2.6.-mr1-cdh5.10.0.jar

lrwxrwxrwx  root root      Jul    hadoop-examples.jar -> hadoop-examples-mr1.jar

lrwxrwxrwx  root root      Jul    hadoop-examples-mr1.jar -> hadoop-examples-2.6.-mr1-cdh5.10.0.jar

lrwxrwxrwx  root root      Jul    hadoop-test-2.6.-mr1-cdh5.10.0.jar -> ../../jars/hadoop-test-2.6.-mr1-cdh5.10.0.jar

lrwxrwxrwx  root root      Jul    hadoop-test-mr1.jar -> hadoop-test-2.6.-mr1-cdh5.10.0.jar

lrwxrwxrwx  root root      Jul    hadoop-tools-2.6.-mr1-cdh5.10.0.jar -> ../../jars/hadoop-tools-2.6.-mr1-cdh5.10.0.jar

lrwxrwxrwx  root root      Jul    hadoop-tools-mr1.jar -> hadoop-tools-2.6.-mr1-cdh5.10.0.jar

drwxr-xr-x  root root    Jan    include

drwxr-xr-x  root root    Jan    lib

-rw-r--r--  root root   Jan    LICENSE.txt

-rw-r--r--  root root     Jan    NOTICE.txt

-rw-r--r--  root root    Jan    README.txt

drwxr-xr-x  root root    Jan    sbin

drwxr-xr-x  root root    Jan    webapps

可以用hadoop-examples.jar里面的wordcount做测试

#hadoop jar hadoop-examples.jar

可以看到里面都有些上面可以使用的类

An example program must be given as the first argument.

Valid program names are:

aggregatewordcount: An Aggregate based map/reduce program that counts the words in the input files.

aggregatewordhist: An Aggregate based map/reduce program that computes the histogram of the words in the input files.

bbp: A map/reduce program that uses Bailey-Borwein-Plouffe to compute exact digits of Pi.

dbcount: An example job that count the pageview counts from a database.

distbbp: A map/reduce program that uses a BBP-type formula to compute exact bits of Pi.

grep: A map/reduce program that counts the matches of a regex in the input.

join: A job that effects a join over sorted, equally partitioned datasets

multifilewc: A job that counts words from several files.

pentomino: A map/reduce tile laying program to find solutions to pentomino problems.

pi: A map/reduce program that estimates Pi using a quasi-Monte Carlo method.

randomtextwriter: A map/reduce program that writes 10GB of random textual data per node.

randomwriter: A map/reduce program that writes 10GB of random data per node.

secondarysort: An example defining a secondary sort to the reduce.

sort: A map/reduce program that sorts the data written by the random writer.

sudoku: A sudoku solver.

teragen: Generate data for the terasort

terasort: Run the terasort

teravalidate: Checking results of terasort

wordcount: A map/reduce program that counts the words in the input files.

wordmean: A map/reduce program that counts the average length of the words in the input files.

wordmedian: A map/reduce program that counts the median length of the words in the input files.

wordstandarddeviation: A map/reduce program that counts the standard deviation of the length of the words in the input files.

这里我直接取wordcount类来做测试，首先上传文件到hdfs准备好计算

hdfs dfs -mkdir /user/zc.lee/input/

hdfs dfs -put /user/PG/conf/type.txt /user/zc.lee/input/

开始计算

hadoop jar hadoop-examples.jar wordcount /user/zc.lee/input/type.txt /user/zc.lee/ouputtest

检查结果

hdfs dfs -text /user/zc.lee/ouputtest/*

CDH- 测试mr的更多相关文章

Mac OS X上搭建伪分布式CDH版本Hadoop开发环境
最近在研究数据挖掘相关的东西,在本地 Mac 环境搭建了一套伪分布式的 hadoop 开发环境,采用CDH发行版本,省时省心. 参考来源 How-to: Install CDH on Mac OSX ...
Windows下Eclipse提交MR程序到HadoopCluster
作者:Syn良子出处:http://www.cnblogs.com/cssdongl 欢迎转载,转载请注明出处. 以前Eclipse上写好的MapReduce项目经常是打好包上传到Hadoop测试集 ...
Hadoop安装-部署-测试
一:准备Linux环境[安装略] a.修改主机名 vim /etc/sysconfig/network NETWORKING= ...
关于素数：求不超过n的素数，素数的判定(Miller Rabin 测试)
关于素数的基本介绍请参考百度百科here和维基百科here的介绍首先介绍几条关于素数的基本定理: 定理1:如果n不是素数,则n至少有一个( 1, sqrt(n) ]范围内的的因子定理2:如果n不是 ...
Hadoop 中利用 mapreduce 读写 mysql 数据
Hadoop 中利用 mapreduce 读写 mysql 数据有时候我们在项目中会遇到输入结果集很大,但是输出结果很小,比如一些 pv.uv 数据,然后为了实时查询的需求,或者一些 OLAP ...
Hadoop-2.2.0 （传 hadoop-2.2.0.tar.gz）
配置hadoop 2.1 上传hadoop包 2.2 解压hadoop包首先在根目录下创建一个cloud目录 mkdir /cloud tar -zxvf hadoop-2.2.0.tar.gz - ...
Creating a Hadoop-2.x project in Eclipse
Creating a Hadoop-2.x project in Eclipse hortonworks:MapReduce Ports http://docs.hortonworks.com/HDP ...
通过mapreduce把mysql的数据读取到hdfs
前面讲过了怎么通过mapreduce把mysql的一张表的数据放到另外一张表中,这次讲的是把mysql的数据读取到hdfs里面去具体怎么搭建环境我这里就不多说了.参考通过mapreduce把mys ...
通过mapreduce把mysql的一张表的数据导到另外一张表中
怎么安装hadoop集群我在这里就不多说了,我这里安装的是三节点的集群先在主节点安装mysql 启动mysql 登录mysql 创建数据库,创建表格,先把数据加载到表格 t ,表格t2是空的 mys ...

随机推荐

【问题记录】web项目访问时出现404
请一定检查一下项目的Context root是否是你访问时使用的. Context root设置为/时,可以直接用ip+端口访问. Context root设置为项目名的,访问时请带上项目名. 设置方 ...
Android内容提供者
一个应用中的数据库对别人是不会提供直接的访问的,而是提供接口给别人访问,但是一般应用开发的时候都是去获取别人的数据,而不是自己提供数据. 继承ContentProvider: 在Menifest中注册 ...
centos7 ACL
Linux文件权限与属性详解之 ACL Linux文件权限与属性详解之一般权限Linux文件权限与属性详解之 ACLLinux文件权限与属性详解之 SUID.SGID & SBI ...
js父页面和子页面之间传值
今天和朋友一块讨论,怎样通过js在父页面和子页面之间传值的问题,总结例如以下: 需求描写叙述:父页面有多个子页面.实如今父页面点击子页面,传值到子页面. 看着非常easy,试了好久.主要纠结在怎样获取 ...
【BZOJ2422】Times 树状数组
[BZOJ2422]Times Description 小y作为一名资深的dotaer,对视野的控制有着深刻的研究.每个单位在一段特定的时间内会出现在小y的视野内,除此之外的时间都在小y看不到的地方. ...
[转]为 windows cmd 设置代理
为 windows cmd 设置代理转自:http://blog.csdn.net/lovelyelfpop/article/details/69586366 通过cmd命令行执行某些命令,如果这些 ...
宇视摄像机/NVR OCX插件插件安装出现：Failed to register ocx, error code 14001 错误的解决方法
最近在使用EasyNVR接入海康.宇视的摄像机进行景观直播的项目时,需要进入宇视设备进行音视频编码参数的调整,要说呢,海康的产品好就是要好很多: 海康的设备后台管理页面,不需要装插件也能进去,而且能调 ...
zookeeper snowflake 实战
目录写在前面 1.1.1. 集群节点的命名服务 1.1.2. snowflake 的ID算法改造 SnowFlake算法的优点: SnowFlake算法的缺点: 写在最后疯狂创客圈亿级流量高并 ...
c#的const可以用于引用类型吗
答案是可以的.不过用const修饰的类实例只能是null. class A{ public int a=0; } class B{ const A constA=null; const object ...
我的Android进阶之旅------>Android疯狂连连看游戏的实现之加载界面图片和实现游戏Activity(四)
正如在<我的Android进阶之旅------>Android疯狂连连看游戏的实现之状态数据模型(三)>一文中看到的,在AbstractBoard的代码中,当程序需要创建N个Piec ...

CDH- 测试mr

CDH- 测试mr的更多相关文章

随机推荐

热门专题