以上是,weekend110的yarn的job提交流程源码分析的复习总结

下面呢,来讲weekend110的hadoop中的序列化机制

1363157985066      13726230503  00-FD-07-A4-72-B8:CMCC       120.196.100.82        i02.c.aliimg.com                24     27     2481         24681       200

1363157995052      13826544101  5C-0E-8B-C7-F1-E0:CMCC       120.197.40.4                      4       0       264  0       200

1363157991076      13926435656  20-10-7A-28-CC-0A:CMCC       120.196.100.99                          2       4       132  1512         200

1363154400022      13926251106  5C-0E-8B-8B-B1-50:CMCC       120.197.40.4                      4       0       240  0       200

1363157993044      18211575961  94-71-AC-CD-E6-18:CMCC-EASY     120.196.100.99        iface.qiyi.com  视频网站         15         12     1527         2106         200

1363157995074      84138413         5C-0E-8B-8C-E8-20:7DaysInn 120.197.40.4   122.72.52.12            20     16     4116         1432         200

1363157993055      13560439658  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          18     15     1116         954  200

1363157995033      15920133257  5C-0E-8B-C7-BA-20:CMCC      120.197.40.4   sug.so.360.cn  信息安全         20     20     3156         2936         200

1363157983019      13719199419  68-A1-B7-03-07-B1:CMCC-EASY      120.196.100.82                          4       0       240  0       200

1363157984041      13660577991  5C-0E-8B-92-5C-20:CMCC-EASY      120.197.40.4   s19.cnzz.com  站点统计         24     9         6960         690  200

1363157973098      15013685858  5C-0E-8B-C7-F7-90:CMCC       120.197.40.4   rank.ie.sogou.com  搜索引擎         28     27         3659         3538         200

1363157986029      15989002119  E8-99-C4-4E-93-E0:CMCC-EASY      120.196.100.99        www.umeng.com    站点统计         3         3       1938         180  200

1363157992093      13560439658  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          15     9       918  4938         200

1363157986041      13480253104  5C-0E-8B-C7-FC-80:CMCC-EASY      120.197.40.4                      3       3       180  180  200

1363157984040      13602846565  5C-0E-8B-8B-B6-00:CMCC       120.197.40.4   2052.flash2-http.qq.com         综合门户         15     12     1938         2910         200

1363157995093      13922314466  00-FD-07-A2-EC-BA:CMCC      120.196.100.82        img.qfc.cn                  12     12     3008         3720         200

1363157982040      13502468823  5C-0A-5B-6A-0B-D4:CMCC-EASY    120.196.100.99        y0.ifengimg.com      综合门户         57     102  7335         110349     200

1363157986072      18320173382  84-25-DB-4F-10-1A:CMCC-EASY      120.196.100.99        input.shouji.sogou.com   搜索引擎         21     18     9531         2412         200

1363157990043      13925057413  00-1F-64-E1-E6-9A:CMCC        120.196.100.55        t3.baidu.com   搜索引擎         69     63         11058       48243       200

1363157988072      13760778710  00-FD-07-A4-7B-08:CMCC       120.196.100.82                          2       2       120  120  200

1363157985066      13726238888  00-FD-07-A4-72-B8:CMCC       120.196.100.82        i02.c.aliimg.com                24     27     2481         24681       200

1363157993055      13560436666  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          18     15     1116         954  200

手机号码                        时间戳                     Ip            网站      上行流量   下行流量   总的流量 

LongWritable的源码

/**

* Licensed to the Apache Software Foundation (ASF) under one

* or more contributor license agreements.  See the NOTICE file

* distributed with this work for additional information

* regarding copyright ownership.  The ASF licenses this file

* to you under the Apache License, Version 2.0 (the

* "License"); you may not use this file except in compliance

* with the License.  You may obtain a copy of the License at

*

*     http://www.apache.org/licenses/LICENSE-2.0

*

* Unless required by applicable law or agreed to in writing, software

* distributed under the License is distributed on an "AS IS" BASIS,

* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

* See the License for the specific language governing permissions and

* limitations under the License.

*/

package org.apache.hadoop.io;

import java.io.DataInput;

import java.io.DataOutput;

import java.io.IOException;

import org.apache.hadoop.classification.InterfaceAudience;

import org.apache.hadoop.classification.InterfaceStability;

/** A WritableComparable for longs. */

@InterfaceAudience.Public

@InterfaceStability.Stable

public class LongWritable implements WritableComparable<LongWritable> {

private long value;

public LongWritable() {}

public LongWritable(long value) { set(value); }

/** Set the value of this LongWritable. */

public void set(long value) { this.value = value; }

/** Return the value of this LongWritable. */

public long get() { return value; }

@Override

public void readFields(DataInput in) throws IOException {

value = in.readLong();

}

@Override

public void write(DataOutput out) throws IOException {

out.writeLong(value);

}

/** Returns true iff <code>o</code> is a LongWritable with the same value. */

@Override

public boolean equals(Object o) {

if (!(o instanceof LongWritable))

return false;

LongWritable other = (LongWritable)o;

return this.value == other.value;

}

@Override

public int hashCode() {

return (int)value;

}

/** Compares two LongWritables. */

@Override

public int compareTo(LongWritable o) {

long thisValue = this.value;

long thatValue = o.value;

return (thisValue<thatValue ? -1 : (thisValue==thatValue ? 0 : 1));

}

@Override

public String toString() {

return Long.toString(value);

}

/** A Comparator optimized for LongWritable. */

public static class Comparator extends WritableComparator {

public Comparator() {

super(LongWritable.class);

}

@Override

public int compare(byte[] b1, int s1, int l1,

byte[] b2, int s2, int l2) {

long thisValue = readLong(b1, s1);

long thatValue = readLong(b2, s2);

return (thisValue<thatValue ? -1 : (thisValue==thatValue ? 0 : 1));

}

}

/** A decreasing Comparator optimized for LongWritable. */

public static class DecreasingComparator extends Comparator {

@Override

public int compare(WritableComparable a, WritableComparable b) {

return -super.compare(a, b);

}

@Override

public int compare(byte[] b1, int s1, int l1, byte[] b2, int s2, int l2) {

return -super.compare(b1, s1, l1, b2, s2, l2);

}

}

static {                                       // register default comparator

WritableComparator.define(LongWritable.class, new Comparator());

}

}

WritableComparable的源码

/**

* Licensed to the Apache Software Foundation (ASF) under one

* or more contributor license agreements.  See the NOTICE file

* distributed with this work for additional information

* regarding copyright ownership.  The ASF licenses this file

* to you under the Apache License, Version 2.0 (the

* "License"); you may not use this file except in compliance

* with the License.  You may obtain a copy of the License at

*

*     http://www.apache.org/licenses/LICENSE-2.0

*

* Unless required by applicable law or agreed to in writing, software

* distributed under the License is distributed on an "AS IS" BASIS,

* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

* See the License for the specific language governing permissions and

* limitations under the License.

*/

package org.apache.hadoop.io;

import org.apache.hadoop.classification.InterfaceAudience;

import org.apache.hadoop.classification.InterfaceStability;

/**

* A {@link Writable} which is also {@link Comparable}.

*

* <p><code>WritableComparable</code>s can be compared to each other, typically

* via <code>Comparator</code>s. Any type which is to be used as a

* <code>key</code> in the Hadoop Map-Reduce framework should implement this

* interface.</p>

*

* <p>Note that <code>hashCode()</code> is frequently used in Hadoop to partition

* keys. It's important that your implementation of hashCode() returns the same

* result across different instances of the JVM. Note also that the default

* <code>hashCode()</code> implementation in <code>Object</code> does <b>not</b>

* satisfy this property.</p>

*

* <p>Example:</p>

* <p><blockquote><pre>

*     public class MyWritableComparable implements WritableComparable<MyWritableComparable> {

*       // Some data

*       private int counter;

*       private long timestamp;

*

*       public void write(DataOutput out) throws IOException {

*         out.writeInt(counter);

*         out.writeLong(timestamp);

*       }

*

*       public void readFields(DataInput in) throws IOException {

*         counter = in.readInt();

*         timestamp = in.readLong();

*       }

*

*       public int compareTo(MyWritableComparable o) {

*         int thisValue = this.value;

*         int thatValue = o.value;

*         return (thisValue &lt; thatValue ? -1 : (thisValue==thatValue ? 0 : 1));

*       }

*

*       public int hashCode() {

*         final int prime = 31;

*         int result = 1;

*         result = prime * result + counter;

*         result = prime * result + (int) (timestamp ^ (timestamp &gt;&gt;&gt; 32));

*         return result

*       }

*     }

* </pre></blockquote></p>

*/

@InterfaceAudience.Public

@InterfaceStability.Stable

public interface WritableComparable<T> extends Writable, Comparable<T> {

}

这样可以减少网络带宽,所以,为什么hadoop用到自己的序列化机制。

以上是weekend110的hadoop中的序列化机制

//将对象数据序列化到数据流中

@Override

public void write(DataOutput out) throws IOException {

// TODO Auto-generated method stub

}

序列化里,是要把数据写出去

//从数据流中反序列出对象数据

@Override

public void readFields(DataInput in) throws IOException {

// TODO Auto-generated method stub

}

反序列化,是要读入数据。

至此,FlowBean.java代码已经写完。

1363157985066      13726230503  00-FD-07-A4-72-B8:CMCC       120.196.100.82        i02.c.aliimg.com                24     27     2481         24681       200

1363157995052      13826544101  5C-0E-8B-C7-F1-E0:CMCC       120.197.40.4                      4       0       264  0       200

1363157991076        20-10-7A-28-CC-0A:CMCC       120.196.100.99                          2       4       132  1512         200

1363154400022      13926251106  5C-0E-8B-8B-B1-50:CMCC       120.197.40.4                      4       0       240  0       200

1363157993044        94-71-AC-CD-E6-18:CMCC-EASY     120.196.100.99        iface.qiyi.com  视频网站         15         12     1527         2106         200

1363157995074      84138413         5C-0E-8B-8C-E8-20:7DaysInn 120.197.40.4   122.72.52.12            20     16     4116         1432         200

1363157993055        C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          18     15     1116         954  200

1363157995033      15920133257  5C-0E-8B-C7-BA-20:CMCC      120.197.40.4   sug.so.360.cn  信息安全         20     20     3156         2936         200

1363157983019      13719199419  68-A1-B7-03-07-B1:CMCC-EASY      120.196.100.82                          4       0       240  0       200

1363157984041      13660577991  5C-0E-8B-92-5C-20:CMCC-EASY      120.197.40.4   s19.cnzz.com  站点统计         24     9         6960         690  200

1363157973098      15013685858  5C-0E-8B-C7-F7-90:CMCC       120.197.40.4   rank.ie.sogou.com  搜索引擎         28     27         3659         3538         200

1363157986029      15989002119  E8-99-C4-4E-93-E0:CMCC-EASY      120.196.100.99        www.umeng.com    站点统计         3         3       1938         180  200

1363157992093      13560439658  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          15     9       918  4938         200

1363157986041      13480253104  5C-0E-8B-C7-FC-80:CMCC-EASY      120.197.40.4                      3       3       180  180  200

1363157984040        5C-0E-8B-8B-B6-00:CMCC       120.197.40.4   2052.flash2-http.qq.com         综合门户         15     12     1938         2910         200

1363157995093      13922314466  00-FD-07-A2-EC-BA:CMCC      120.196.100.82        img.qfc.cn                  12     12     3008         3720         200

1363157982040      13502468823  5C-0A-5B-6A-0B-D4:CMCC-EASY    120.196.100.99        y0.ifengimg.com      综合门户         57     102  7335         110349     200

1363157986072        84-25-DB-4F-10-1A:CMCC-EASY      120.196.100.99        input.shouji.sogou.com   搜索引擎         21     18     9531         2412         200

1363157990043        00-1F-64-E1-E6-9A:CMCC        120.196.100.55        t3.baidu.com   搜索引擎         69     63         11058       48243       200

1363157988072      13760778710  00-FD-07-A4-7B-08:CMCC       120.196.100.82                          2       2       120  120  200

1363157985066      13726238888  00-FD-07-A4-72-B8:CMCC       120.196.100.82        i02.c.aliimg.com                24     27     2481         24681       200

1363157993055      13560436666  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          18     15     1116         954  200

1363157985066      13726230503  00-FD-07-A4-72-B8:CMCC       120.196.100.82        i02.c.aliimg.com                24     27     2481         24681       200

1363157995052      13826544101  5C-0E-8B-C7-F1-E0:CMCC       120.197.40.4                      4       0       264  0       200

1363157991076      13926435656  20-10-7A-28-CC-0A:CMCC       120.196.100.99                          2       4       132  1512         200

1363154400022      13926251106  5C-0E-8B-8B-B1-50:CMCC       120.197.40.4                      4       0       240  0       200

1363157993044      18211575961  94-71-AC-CD-E6-18:CMCC-EASY     120.196.100.99        iface.qiyi.com  视频网站         15         12     1527         2106         200

1363157995074      84138413         5C-0E-8B-8C-E8-20:7DaysInn 120.197.40.4   122.72.52.12            20     16     4116         1432         200

1363157993055      13560439658  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          18     15     1116         954  200

1363157995033      15920133257  5C-0E-8B-C7-BA-20:CMCC      120.197.40.4   sug.so.360.cn  信息安全         20     20     3156         2936         200

1363157983019      13719199419  68-A1-B7-03-07-B1:CMCC-EASY      120.196.100.82                          4       0       240  0       200

1363157984041      13660577991  5C-0E-8B-92-5C-20:CMCC-EASY      120.197.40.4   s19.cnzz.com  站点统计         24     9         6960         690  200

1363157973098      15013685858  5C-0E-8B-C7-F7-90:CMCC       120.197.40.4   rank.ie.sogou.com  搜索引擎         28     27         3659         3538         200

1363157986029      15989002119  E8-99-C4-4E-93-E0:CMCC-EASY      120.196.100.99        www.umeng.com    站点统计         3         3       1938         180  200

1363157992093      13560439658  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          15     9       918  4938         200

1363157986041      13480253104  5C-0E-8B-C7-FC-80:CMCC-EASY      120.197.40.4                      3       3       180  180  200

1363157984040      13602846565  5C-0E-8B-8B-B6-00:CMCC       120.197.40.4   2052.flash2-http.qq.com         综合门户         15     12     1938         2910         200

1363157995093      13922314466  00-FD-07-A2-EC-BA:CMCC      120.196.100.82        img.qfc.cn                  12     12              3720         200

1363157982040      13502468823  5C-0A-5B-6A-0B-D4:CMCC-EASY    120.196.100.99        y0.ifengimg.com      综合门户         57     102  7335         110349     200

1363157986072      18320173382  84-25-DB-4F-10-1A:CMCC-EASY      120.196.100.99        input.shouji.sogou.com   搜索引擎         21     18     9531         2412         200

1363157990043      13925057413  00-1F-64-E1-E6-9A:CMCC        120.196.100.55        t3.baidu.com   搜索引擎         69     63         11058       48243       200

1363157988072      13760778710  00-FD-07-A4-7B-08:CMCC       120.196.100.82                          2       2       120  120  200

1363157985066      13726238888  00-FD-07-A4-72-B8:CMCC       120.196.100.82        i02.c.aliimg.com                24     27              24681       200

1363157993055      13560436666  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          18     15     1116         954  200

1363157985066      13726230503  00-FD-07-A4-72-B8:CMCC       120.196.100.82        i02.c.aliimg.com                24     27     2481         24681       200

1363157995052      13826544101  5C-0E-8B-C7-F1-E0:CMCC       120.197.40.4                      4       0       264  0       200

1363157991076      13926435656  20-10-7A-28-CC-0A:CMCC       120.196.100.99                          2       4       132  1512         200

1363154400022      13926251106  5C-0E-8B-8B-B1-50:CMCC       120.197.40.4                      4       0       240         200

1363157993044      18211575961  94-71-AC-CD-E6-18:CMCC-EASY     120.196.100.99        iface.qiyi.com  视频网站         15         12     1527         2106         200

1363157995074      84138413         5C-0E-8B-8C-E8-20:7DaysInn 120.197.40.4   122.72.52.12            20     16     4116         1432         200

1363157993055      13560439658  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          18     15     1116         954  200

1363157995033      15920133257  5C-0E-8B-C7-BA-20:CMCC      120.197.40.4   sug.so.360.cn  信息安全         20     20     3156         2936         200

1363157983019      13719199419  68-A1-B7-03-07-B1:CMCC-EASY      120.196.100.82                          4       0       240         200

1363157984041      13660577991  5C-0E-8B-92-5C-20:CMCC-EASY      120.197.40.4   s19.cnzz.com  站点统计         24     9         6960         690  200

1363157973098      15013685858  5C-0E-8B-C7-F7-90:CMCC       120.197.40.4   rank.ie.sogou.com  搜索引擎         28     27         3659         3538         200

1363157986029      15989002119  E8-99-C4-4E-93-E0:CMCC-EASY      120.196.100.99        www.umeng.com    站点统计         3         3       1938         180  200

1363157992093      13560439658  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          15     9       918  4938         200

1363157986041      13480253104  5C-0E-8B-C7-FC-80:CMCC-EASY      120.197.40.4                      3       3       180  180  200

1363157984040      13602846565  5C-0E-8B-8B-B6-00:CMCC       120.197.40.4   2052.flash2-http.qq.com         综合门户         15     12     1938         2910         200

1363157995093      13922314466  00-FD-07-A2-EC-BA:CMCC      120.196.100.82        img.qfc.cn                  12     12     3008         3720         200

1363157982040      13502468823  5C-0A-5B-6A-0B-D4:CMCC-EASY    120.196.100.99        y0.ifengimg.com      综合门户         57     102  7335         110349     200

1363157986072      18320173382  84-25-DB-4F-10-1A:CMCC-EASY      120.196.100.99        input.shouji.sogou.com   搜索引擎         21     18     9531         2412         200

1363157990043      13925057413  00-1F-64-E1-E6-9A:CMCC        120.196.100.55        t3.baidu.com   搜索引擎         69     63         11058       48243       200

1363157988072      13760778710  00-FD-07-A4-7B-08:CMCC       120.196.100.82                          2       2       120  120  200

1363157985066      13726238888  00-FD-07-A4-72-B8:CMCC       120.196.100.82        i02.c.aliimg.com                24     27     2481         24681       200

1363157993055      13560436666  C4-17-FE-BA-DE-D9:CMCC      120.196.100.99                          18     15     1116           200

[hadoop@weekend110 ~]$ /home/hadoop/app/hadoop-2.4.1/bin/hadoop jar flow.jar cn.itcast.hadoop.mr.flowsum.FlowSumRunner /flow/data /flow/output

以上是weekend110的流量求和mr程序开发

1 weekend110的复习 + hadoop中的序列化机制 + 流量求和mr程序开发的更多相关文章

  1. 一脸懵逼学习Hadoop中的序列化机制——流量求和统计MapReduce的程序开发案例——流量求和统计排序

    一:序列化概念 序列化(Serialization)是指把结构化对象转化为字节流.反序列化(Deserialization)是序列化的逆过程.即把字节流转回结构化对象.Java序列化(java.io. ...

  2. hadoop中的序列化与Writable接口

    本文地址:http://www.cnblogs.com/archimedes/p/hadoop-writable-interface.html,转载请注明源地址. 简介 序列化和反序列化就是结构化对象 ...

  3. hadoop中的序列化

    此文已由作者肖凡授权网易云社区发布. 欢迎访问网易云社区,了解更多网易技术产品运营经验. 最近在学习hadoop,发现hadoop的序列化过程和jdk的序列化有很大的区别,下面就来说说这两者的区别都有 ...

  4. hadoop中的序列化与Writable类

    本文地址:http://www.cnblogs.com/archimedes/p/hadoop-writable-class.html,转载请注明源地址. hadoop中自带的org.apache.h ...

  5. 基于HBase Hadoop 分布式集群环境下的MapReduce程序开发

    HBase分布式集群环境搭建成功后,连续4.5天实验客户端Map/Reduce程序开发,这方面的代码网上多得是,写个测试代码非常容易,可是真正运行起来可说是历经挫折.下面就是我最终调通并让程序在集群上 ...

  6. Hadoop中的RPC机制

    1.  RPC——远程过程调用协议,它是一种通过网络从远程计算机程序上请求服务,而不需要了解底层网络技术的协议.RPC协议假定某些传输协议的存在,如TCP或UDP,为通信程序之间携带信息数据.在OSI ...

  7. 微信小程序中的bindTap事件(微信小程序开发QQ群:604788754)

    bindTap对应的绑定事件, 第一个:wx.navigateTo wx.navigateTo({ url:"../content/content" }) 第二个:wx.redir ...

  8. Python中的序列化以及pickle和json模块介绍

    Python中的序列化指的是在程序运行期间,变量都是在内存中保存着的,如果我们想保留一些运行中的变量值,就可以使用序列化操作把变量内容从内存保存到磁盘中,在Python中这个操作叫pickling,等 ...

  9. 3 weekend110的shuffle机制 + mr程序的组件全貌

    前面,讲到了hadoop的序列化机制,mr程序开发,自定义排序,自定义分组. 有多少个reduce的并发任务数可以控制,但有多少个map的并发任务数还没 缓存,分组,排序,转发,这些都是mr的shuf ...

随机推荐

  1. Java基础巩固--正则表达式

    本篇文章是学习尚学堂的关于正则表达式的视频教程时,所做的笔记.供广大编程爱好者学习之用,也留给日后自己复习使用! 1.为什么要有正则表达式? 正则表达式可以方便的对数据进行匹配,可以进行更加复杂的字符 ...

  2. sed工具使用

    sed命令使用形式 1.sed命令从管道中读取数据处理 command | sed ' edit command' 通过管道把一个命令的标准输出读入到sed的标准输入,sed就起到了过滤作用 2.se ...

  3. markdown与textile之间互相转换

    markdown与textile之间互相转换 redmine中默认使用的是textile那么从别的地方复制过来的markdown格式的内容需要进行转换 找到一款工具叫做pandoc http://jo ...

  4. web开发工具IDE

    1.NetBeans 2.Zend Studio 3.JetBrains WebStorm 4.JetBrains PhpStorm 5.Koala 6.Ionic Lab 7.sublime 8.N ...

  5. Python httpsqs封装类

    ''' httpsqs队列封装 @author xp_go@qq.com a = HttpsqsClient('192.168.0.218','1218','httpsqsmmall.com') pr ...

  6. CCTV评论员评论步行者与奇才的比赛

    步行者客场迎战主场作战的奇才,奇才的战士可能由于过度兴奋或是过度紧张身体僵硬,本来能打进的球都失掉了.反而,由于步行者取得了两位数的领先,越大心情越放松,打出了过去很少见的流畅局面. CCTV评论员就 ...

  7. asp.net mvc将html编译

    从数据库查询出来的值,如果包含html标签并且通过MVC绑定页面的话,那么他会通过浏览器编译为字符串显示,所以我们有得在从新的转一次: HtmlString hh = new HtmlString(M ...

  8. 转:如何在Linux上提高文本的搜索效率

    原文来自于:http://www.geekfan.net/6881/ 对于系统管理员或程序员来说,当需要在复杂配置的目录中或者在大型源码树中搜寻特定的文本或模式时,grep类型的工具大概是最受欢迎的. ...

  9. java 容器类大集结

    这个世界是程序员的世界,归根到底是数据的世界,要统治这个世界,首先要学会征服数据. 没有最好的,只有最合适的,如何在不同的环境先选择最优的存储的结构呢?且看下文分解: 以下内容部分来自网络,参考: h ...

  10. [原博客] POI系列(4)

    正规.严谨.精妙. -POI BZOJ 1531 : [POI2005]Bank notes 裸的背包,可以二进制拆分一下.一个物品比如说有n个,可以拆成 1,2,4,8,16...个. OJ上没有样 ...