org.apache.hadoop.fs-BufferedFSInputStream

封装了FSInputStream

 package org.apache.hadoop.fs;

 import java.io.BufferedInputStream;

 import java.io.IOException;

 /**

  * A class optimizes reading from FSInputStream by bufferring

  */

 //通过缓存优化FSInputStream读取

 public class BufferedFSInputStream extends BufferedInputStream

 implements Seekable, PositionedReadable {

 //两个接口在前面刚看过了，功能为...

   /**

    * Creates a <code>BufferedFSInputStream</code>

    * with the specified buffer size,

    * and saves its  argument, the input stream

    * <code>in</code>, for later use.  An internal

    * buffer array of length  <code>size</code>

    * is created and stored in <code>buf</code>.

    *

    * @param   in     the underlying input stream.

    * @param   size   the buffer size.

    * @exception IllegalArgumentException if size <= 0.

    */

   public BufferedFSInputStream(FSInputStream in, int size) {

     super(in, size);

   }

 //通过跟踪父类代码知道对接了输入流“管道”，初始化了一个大小为size的buffer

   public long getPos() throws IOException {

     return ((FSInputStream)in).getPos()-(count-pos);

   }

 //返回现在的偏移量

   public long skip(long n) throws IOException {

     if (n <= 0) {

       return 0;

     }

     seek(getPos()+n);

     return n;

   }

 //跳过n长度后得到现偏移量

   public void seek(long pos) throws IOException {

     if( pos<0 ) {

       return;

     }

     // optimize: check if the pos is in the buffer

     long end = ((FSInputStream)in).getPos();

     long start = end - count;

     if( pos>=start && pos<end) {

       this.pos = (int)(pos-start);

       return;

     }

     // invalidate buffer

     this.pos = 0;

     this.count = 0;

     ((FSInputStream)in).seek(pos);

   }

 //实现了Seekable的seek方法

   public boolean seekToNewSource(long targetPos) throws IOException {

     pos = 0;

     count = 0;

     return ((FSInputStream)in).seekToNewSource(targetPos);

   }

 //.....

   public int read(long position, byte[] buffer, int offset, int length) throws IOException {

     return ((FSInputStream)in).read(position, buffer, offset, length) ;

   }

   public void readFully(long position, byte[] buffer, int offset, int length) throws IOException {

     ((FSInputStream)in).readFully(position, buffer, offset, length);

   }

   public void readFully(long position, byte[] buffer) throws IOException {

     ((FSInputStream)in).readFully(position, buffer);

   }

 }

org.apache.hadoop.fs-BufferedFSInputStream的更多相关文章

用java运行Hadoop程序报错:org.apache.hadoop.fs.LocalFileSystem cannot be cast to org.apache.
用java运行Hadoop例程报错:org.apache.hadoop.fs.LocalFileSystem cannot be cast to org.apache.所写代码如下: package ...
spark-shell报错：Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/fs/FSDataInputStream
环境: openSUSE42.2 hadoop2.6.0-cdh5.10.0 spark1.6.0-cdh5.10.0 按照网上的spark安装教程安装完之后,启动spark-shell,出现如下报错 ...
报错：Exception in thread "main" java.lang.NoClassDefFoundError: Lorg/apache/hadoop/fs/FileSystem
报错现象: Exception in thread "main" java.lang.NoClassDefFoundError: Lorg/apache/hadoop/fs/Fil ...
Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/fs/CanUnbuffer
在执行spark on hive 的时候在 sql.show()处报错 : Exception in thread "main" java.lang.NoClassDefFoun ...
hive启动时报错 java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: ${system:java.io.tmpdir%7D/$%7Bsystem:user.name%7D at org.apache.hadoop.fs.Path.initialize
错误提示信息如下错误信息如下 [root@node1 bin]# ./hive Logging initialized -bin/lib/hive-common-.jar!/hive-log4j.p ...
org.apache.hadoop.fs.FsUrlStreamHandlerFactory 在哪个jar包
org.apache.hadoop.fs.FsUrlStreamHandlerFactory在org.apache.hadoop类中,org.apache.hadoop在hadoop安装目录下.
sparkOnYarn报错org.apache.hadoop.fs.FSDataInputStream
Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/fs/FSDataInpu ...
Hadoop 3.1.2报错：xception in thread "main" org.apache.hadoop.fs.UnsupportedFileSystemException: No FileSystem for scheme "hdfs"
报错内容如下: Exception in thread "main" org.apache.hadoop.fs.UnsupportedFileSystemException: No ...
你遇到了吗？Caused by: org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.fs.FileAlreadyExistsException)
我在使用 Structured Streaming 的 ForeachWriter,写 HDFS 文件时,出现了这个异常这个异常出现的原因是HDFS作为一个分布式文件系统,支持多线程读,但是不支持多 ...
错误Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/fs/FSDataInputStream排查思路
spark1(默认CDH自带版本)不存在这个问题,主要是升级了spark2(CDHparcel升级)版本安装后需要依赖到spark1的旧配置去读取hadoop集群的依赖包. 1./etc/spark2 ...

随机推荐

JSON 省市数据包括港澳
data: [{ name: "北京", cities: ["西城", "东城", "崇文", "宣武&quo ...
【恒天云技术分享系列10】OpenStack块存储技术
原文:http://www.hengtianyun.com/download-show-id-101.html 块存储,简单来说就是提供了块设备存储的接口.用户需要把块存储卷附加到虚拟机(或者裸机)上 ...
五、python使用模块
if __name__=='__main__':用法: 当我们在命令行运行模块文件时,Python解释器把一个特殊变量__name__置为__main__,而如果在其他地方导入该hello模块时,if ...
HDU4861:Couple doubi（费马小定理）
题意: 给出k个球和质数p,对每个球以公式val(i)=1^i+2^i+...+(p-1)^i (mod p)计算出它的价值,然后两个人轮流拿,最后拿到的球的总价值大的获胜,问我们先手是否获胜. 我们 ...
redis的lists类型
List是一个链表结构 , 主要功能是push . pop .获取一个范围的所有值等等 , 操作中key理解为链表的名字 . redis 的 list类型其实就是一个每个子元素都是string类型的双 ...
Spark的应用程序
Spark的应用程序,分为两部分:Spark driver 和 Spark executor.
linux中vi编辑器
vi编辑器,通常称之为vi,是一种广泛存在于各种UNIX和Linux系统中的文本编辑程序.它的功能十分强大,但是命令繁多,不容易掌握,它可以执行输出.删除.查找.替换.块操作等众多文本操作,而且用户 ...
ASP.NET Web Form和MVC中防止F5刷新引起的重复提交问题
转载 http://www.cnblogs.com/hiteddy/archive/2012/03/29/Prevent_Resubmit_When_Refresh_Reload_In_ASP_NET ...
HDU 3265 Posters (线段树+扫描线)(面积并)
题目链接:http://acm.hdu.edu.cn/showproblem.php?pid=3265 给你n个中间被挖空了一个矩形的中空矩形,让你求他们的面积并. 其实一个中空矩形可以分成4个小的矩 ...
Python 3.2: 使用pymysql连接Mysql
在python 3.2 中连接MYSQL的方式有很多种,例如使用mysqldb,pymysql.本文主要介绍使用Pymysql连接MYSQL的步骤 1 安装pymysql · ...

org.apache.hadoop.fs-BufferedFSInputStream

org.apache.hadoop.fs-BufferedFSInputStream的更多相关文章

随机推荐

热门专题