hadoop2.2编程:Tool, ToolRunner, GenericOptionsParser, Configuration
继承关系:
1. java.util
Interface Map.Entry<K,V>
description:
public static interface Map.Entry<K,V>
methods:
| Modifier and Type | Method and Description |
|---|---|
boolean |
equals(Object o)
Compares the specified object with this entry for equality.
|
K |
getKey()
Returns the key corresponding to this entry.
|
V |
getValue()
Returns the value corresponding to this entry.
|
int |
hashCode()
Returns the hash code value for this map entry.
|
V |
setValue(V value)
Replaces the value corresponding to this entry with the specified value (optional operation).
|
2.java.lang.Object
|__ org.apache.hadoop.conf.Configuration
constructor: public class Configuration extends Objectimplements Iterable<Map.Entry<String,String>>, Writable 3.org.apache.hadoop.util Class ToolRunner java.lang.Object |__ org.apache.hadoop.util.ToolRunner description:
public class ToolRunner extends Object
ToolRunner can be used to run classes implementing Tool interface. It works in conjunction with GenericOptionsParser to parse the generic hadoop command line arguments and modifies the Configuration of the Tool. The application-specific options are passed along without being modified.
methods:
static int |
run(Configuration conf, Tool tool, String[] args) Runs the given Tool by Tool.run(String[]),after parsing with the given generic arguments. |
static int |
run(Tool tool, String[] args)Runs the Tool with itsConfiguration. |
4.org.apache.hadoop.util Interface Tooldescription:
public interface Tool extends Configurablemethods:
int |
run(String[] args) Execute the command with the given arguments. |
5.org.apache.hadoop.conf
Interface Configurable
constructor:
public interface Configurable methods:
Configuration |
getConf() Return the configuration used by this object. |
void |
setConf(Configuration conf)Set the configuration to be used by this object. |
6.
java.lang.Object |__ org.apache.hadoop.conf.Configureddescription:
public class Configured extends Objectimplements Configurable
constructor:
Configured() Construct a Configured. |
Configured(Configuration conf)Construct a Configured |
methods:
Configuration |
getConf() Return the configuration used by this object. |
void |
setConf(Configuration conf)Set the configuration to be used by this object. |
Code1 (Configuration里添加的resource是String类型):
import java.util.Map.Entry;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.util.ToolRunner;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.fs.Path;
public class ConfigurationPrinter extends Configured implements Tool {
static {
Configuration.addDefaultResource("config.xml");
}
@Override
public int run(String[] args) throws Exception {
Configuration conf = getConf();
for (Entry<String, String> hash: conf) {
System.out.printf("%s=%s\n", hash.getKey(), hash.getValue());
}
return 0;
}
public static void main(String[] args) throws Exception {
int exitCode = ToolRunner.run(new ConfigurationPrinter(), args);
System.exit(exitCode);
}
}
注:Configuration class提供只一种静态方法:addDefaultresource(String name), 如上述代码, 添加Resource "config.xml"为String类型时,hadoop将从classpath里查找此文件;若Resource 为Path()类型时,hadoop将从local filesystem里查找此文件: Configuration conf = new Configuration(); conf.addResource(new Path("config.xml"));
code1的执行步骤:
#将自定义的config文件config.xml放在hadoop的$HADOOP_CONF_DIR里 mv config.xml $HADOOP_HOME/etc/hadoop/
#假如我们添加的resource如下:
<!--cat $HADOOP_HOME/etc/hadoop/config.xml-->
<configuration>
<property>
<name>color</name>
<value>yellow</value>
</property>
<property>
<name>size</name>
<value>10</value>
</property>
<property>
<name>weight</name>
<value>heavy</value>
<final>true</final>
</property>
</configuration>
执行代码:
mkdir class source $HADOOP_HOME/libexec/hadoop-config.sh javac -d class ConfigurationPrinter.java jar -cvf ConfigurationPrinter.jar -C class ./ export HADOOP_CLASSPATH=ConfigurationPrinter.jar:$CLASSPATH #下面查找刚才添加的resource是否被读入 #我们在config.xml里添加了一项 <name>color</name>,执行 yarn ConfigurationPrinter|grep "color" color=yellow #可见代码是正确的
或者在commandline里指定HADOOP_CONF_DIR, 比如执行:
yarn ConfigurationPrinter --conf config.xml | grep color color=yellow
也是可以的!
Code2 (Configuration里添加的resource是Path类型):
import java.util.Map.Entry;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.util.ToolRunner;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.fs.Path;
public class ConfigurationPrinter extends Configured implements Tool {
@Override
public int run(String[] args) throws Exception {
Configuration conf = new Configuration();
conf.addResource(new Path("config.xml"));
for (Entry<String, String> hash: conf) {
System.out.printf("%s=%s\n", hash.getKey(), hash.getValue());
}
return 0;
}
public static void main(String[] args) throws Exception {
int exitCode = ToolRunner.run(new ConfigurationPrinter(), args);
System.exit(exitCode);
}
}
此时添加的resource类型是Path()类型,故hadoop将从local filesystem里查找config.xml, 不需要将config.xml放在conf/下面,只要在代码中指定config.xml在本地文件系统中的路径即可(new Path("../others/config.xml"))
运行步骤:
mkdir class source $HADOOP_HOME/libexec/hadoop-config.sh javac -d class ConfigurationPrinter.java jar -cvf ConfigurationPrinter.jar -C class ./ export HADOOP_CLASSPATH=ConfigurationPrinter.jar:$CLASSPATH #下面查找刚才添加的resource是否被读入 #我们在config.xml里添加了一项 <name>color</name>,执行 yarn ConfigurationPrinter|grep "color" color=yellow #可见代码是正确的
备注:ConfigurationParser支持set individual properties:
Generic Options
The supported generic options are:
-conf <configuration file> specify a configuration file
-D <property=value> use value for given property
-fs <local|namenode:port> specify a namenode
-jt <local|jobtracker:port> specify a job tracker
-files <comma separated list of files> specify comma separated
files to be copied to the map reduce cluster
-libjars <comma separated list of jars> specify comma separated
jar files to include in the classpath.
-archives <comma separated list of archives> specify comma
separated archives to be unarchived on the compute machines.
可以尝试:
yarn ConfigurationPrinter -d fuck=Japan | grep fuck #输出为: fuck=Japan
再次提醒:
ToolRunner can be used to run classes implementing Tool interface. It works in conjunction with GenericOptionsParser to parse the generic hadoop command line arguments and modifies the Configuration of the Tool. The application-specific options are passed along without being modified.
ToolRunner和GenericOptionsParser共同来(解析|修改) generic hadoop command line arguments (什么是generic hadoop command line arguments? 比如:yarn command [genericOptions] [commandOptions]
hadoop2.2编程:Tool, ToolRunner, GenericOptionsParser, Configuration的更多相关文章
- hadoop2.2编程:从default mapreduce program 来理解mapreduce
下面写一个default mapreduce 的程序: import org.apache.hadoop.mapreduce.Mapper; import org.apache.hadoop.mapr ...
- hadoop2.2编程:使用MapReduce编程实例(转)
原文链接:http://www.cnblogs.com/xia520pi/archive/2012/06/04/2534533.html 从网上搜到的一篇hadoop的编程实例,对于初学者真是帮助太大 ...
- hadoop2.2编程:MRUnit测试
引用地址:http://www.cnblogs.com/lucius/p/3442381.html examples: Overview This document explains how to w ...
- hadoop2.2编程:矩阵相乘简单实现
/* matrix-matrix multiplication on Hadoop A x B = C constraint: A, B, C must be of the same size I u ...
- hadoop2.2编程:MRUnit
examples: Overview This document explains how to write unit tests for your map reduce code, and test ...
- hadoop2.2编程:DFS API 操作
1. Reading data from a hadoop URL 说明:想要让java从hadoop的dfs里读取数据,则java 必须能够识别hadoop hdfs URL schema, 因此我 ...
- LPCScrypt, DFUSec : USB FLASH download, programming, and security tool, LPC-Link 2 Configuration tool, Firmware Programming
What does this tool do? The LPC18xx/43xx DFUSec utility is a Windows PC tool that provides support f ...
- hadoop2.2编程: SequenceFileWritDemo
import java.io.IOException; import java.net.URI; import org.apache.hadoop.fs.FileSystem; import org. ...
- Hadoop2.2编程:新旧API的区别
Hadoop最新版本的MapReduce Release 0.20.0的API包括了一个全新的Mapreduce JAVA API,有时候也称为上下文对象. 新的API类型上不兼容以前的API,所以, ...
随机推荐
- 严重: Exception starting filter struts2 java.lang.NullPointerException (转载)
严重: Exception starting filter struts2 java.lang.NullPointerException at com.opensymphony.xwork2.util ...
- 374. Guess Number Higher or Lower
We are playing the Guess Game. The game is as follows: 我们来玩一个游戏,规则如下: I pick a number from 1 to n. Y ...
- Head First 设计模式系列之二----备忘录模式(java版)
申明:这几天无意中关注到备忘录模式,比较陌生回家一番参考书,只在附录里记录了该模式.后来在园子里有发现了有专门写设计模式的博客,并且写的也得牛逼.附上链接 http://www.cnblogs.com ...
- 使用Class.getResource和ClassLoader.getResource方法获取文件路径
自从转投Java阵营后,一直发下Java程序的路径读取异常麻烦,因此查阅了比较多的版本内容,整合了一份自己的学习笔记.主要使用Class及通过ClassLoader来动态获取文件路径. 查阅链接如下: ...
- Newtonsoft.Json.dll反序列化JSON字符串的方法
1.直接反序列化JSON字符串 //引用序列化.反序列化JSON字符串用到的空间 using Newtonsoft.Json; using Newtonsoft.Json.Linq; //定义一个 ...
- C# 多线程基础
多线程 无论您是为具有单个处理器的计算机还是为具有多个处理器的计算机进行开发,您都希望应用程序为用户提供最好的响应性能,即使应用程序当前正在完成其 他工作.要使应用程序能够快速响应用户操作,同时在用户 ...
- HttpContext.Current多线程调用
1.在web网站的Global中,进行数据量比较大的初始化工作,而为了使用户在页面上能够及时响应,我们在Global中开启了一个线程执行该函数模块. 不过,在线程中用到HttpContext.Curr ...
- CentOS下shell显示-bash-4.1#不显示用户名路径的解决方法
CentOS下shell显示-bash-4.1$不显示用户名路径的解决方法 问题描述: CentOS下新增一个用户,登录进去之后shell脚本的信息如下: 而不是我们经常看到的username@hos ...
- 怎么预防sql注入攻击
假设sql是搜索用户A的文章,sql会是这样: select * from table where owner='A'; sql注入攻击者会修改用户名来实现攻击,例如把A 改成A' or 1='1 组 ...
- SQL语句中使用条件逻辑
select name, sal, case when sal >= 4000 then 'Good' when sal <= 2000 then 'Bad' else 'Ok' end ...