【Zookeeper】源码分析之持久化--FileSnap

一、前言

　　前篇博文已经分析了FileTxnLog的源码，现在接着分析持久化中的FileSnap，其主要提供了快照相应的接口。

二、SnapShot源码分析

　　SnapShot是FileTxnLog的父类，接口类型，其方法如下　　

public interface SnapShot {

    /**

     * deserialize a data tree from the last valid snapshot and

     * return the last zxid that was deserialized

     * @param dt the datatree to be deserialized into

     * @param sessions the sessions to be deserialized into

     * @return the last zxid that was deserialized from the snapshot

     * @throws IOException

     */

    // 反序列化

    long deserialize(DataTree dt, Map<Long, Integer> sessions)

        throws IOException;

    /**

     * persist the datatree and the sessions into a persistence storage

     * @param dt the datatree to be serialized

     * @param sessions

     * @throws IOException

     */

    // 序列化

    void serialize(DataTree dt, Map<Long, Integer> sessions,

            File name)

        throws IOException;

    /**

     * find the most recent snapshot file

     * @return the most recent snapshot file

     * @throws IOException

     */

    // 查找最新的snapshot文件

    File findMostRecentSnapshot() throws IOException;

    /**

     * free resources from this snapshot immediately

     * @throws IOException

     */

    // 释放资源

    void close() throws IOException;

}

　　说明：可以看到SnapShot只定义了四个方法，反序列化、序列化、查找最新的snapshot文件、释放资源。

三、FileSnap源码分析

　　FileSnap实现了SnapShot接口，主要用作存储、序列化、反序列化、访问相应snapshot文件。

　　3.1 类的属性　

public class FileSnap implements SnapShot {

    // snapshot目录文件

    File snapDir;

    // 是否已经关闭标识

    private volatile boolean close = false;

    // 版本号

    private static final int VERSION=2;

    // database id

    private static final long dbId=-1;

    // Logger

    private static final Logger LOG = LoggerFactory.getLogger(FileSnap.class);

    // snapshot文件的魔数(类似class文件的魔数)

    public final static int SNAP_MAGIC

        = ByteBuffer.wrap("ZKSN".getBytes()).getInt();

}

　　说明：FileSnap主要的属性包含了是否已经关闭标识。

　　3.2 类的核心函数

　　1. deserialize函数

　　函数签名如下：

　　public long deserialize(DataTree dt, Map<Long, Integer> sessions)，是对SnapShot的deserialize函数的实现。其源码如下　　

    public long deserialize(DataTree dt, Map<Long, Integer> sessions)

            throws IOException {

        // we run through 100 snapshots (not all of them)

        // if we cannot get it running within 100 snapshots

        // we should  give up

        // 查找100个合法的snapshot文件

        List<File> snapList = findNValidSnapshots(100);

        if (snapList.size() == 0) { // 无snapshot文件，直接返回

            return -1L;

        }

        //

        File snap = null;

        // 默认为不合法

        boolean foundValid = false;

        for (int i = 0; i < snapList.size(); i++) { // 遍历snapList

            snap = snapList.get(i);

            // 输入流

            InputStream snapIS = null;

            CheckedInputStream crcIn = null;

            try {

                LOG.info("Reading snapshot " + snap);

                // 读取指定的snapshot文件

                snapIS = new BufferedInputStream(new FileInputStream(snap));

                // 验证

                crcIn = new CheckedInputStream(snapIS, new Adler32());

                InputArchive ia = BinaryInputArchive.getArchive(crcIn);

                // 反序列化

                deserialize(dt,sessions, ia);

                // 获取验证的值Checksum

                long checkSum = crcIn.getChecksum().getValue();

                // 从文件中读取val值

                long val = ia.readLong("val");

                if (val != checkSum) { // 比较验证，不相等，抛出异常

                    throw new IOException("CRC corruption in snapshot :  " + snap);

                }

                // 合法

                foundValid = true;

                // 跳出循环

                break;

            } catch(IOException e) {

                LOG.warn("problem reading snap file " + snap, e);

            } finally { // 关闭流

                if (snapIS != null)

                    snapIS.close();

                if (crcIn != null)

                    crcIn.close();

            }

        }

        if (!foundValid) { // 遍历所有文件都未验证成功

            throw new IOException("Not able to find valid snapshots in " + snapDir);

        }

        // 从文件名中解析出zxid

        dt.lastProcessedZxid = Util.getZxidFromName(snap.getName(), "snapshot");

        return dt.lastProcessedZxid;

    }

　　说明：deserialize主要用作反序列化，并将反序列化结果保存至dt和sessions中。其大致步骤如下

　　① 获取100个合法的snapshot文件，并且snapshot文件已经通过zxid进行降序排序，进入②

　　② 遍历100个snapshot文件，从zxid最大的开始，读取该文件，并创建相应的InputArchive，进入③

　　③ 调用deserialize(dt,sessions, ia)函数完成反序列化操作，进入④

　　④ 验证从文件中读取的Checksum是否与新生的Checksum相等，若不等，则抛出异常，否则，进入⑤

　　⑤ 跳出循环并关闭相应的输入流，并从文件名中解析出相应的zxid返回。

　　⑥ 在遍历100个snapshot文件后仍然无法找到通过验证的文件，则抛出异常。

　　在deserialize函数中，会调用findNValidSnapshots以及同名的deserialize(dt,sessions, ia)函数，findNValidSnapshots函数源码如下　　

    private List<File> findNValidSnapshots(int n) throws IOException {

        // 按照zxid对snapshot文件进行降序排序

        List<File> files = Util.sortDataDir(snapDir.listFiles(),"snapshot", false);

        int count = 0;

        List<File> list = new ArrayList<File>();

        for (File f : files) { // 遍历snapshot文件

            // we should catch the exceptions

            // from the valid snapshot and continue

            // until we find a valid one

            try {

                // 验证文件是否合法，在写snapshot文件时服务器宕机

                // 此时的snapshot文件非法;非snapshot文件也非法

                if (Util.isValidSnapshot(f)) {

                    // 合法则添加

                    list.add(f);

                    // 计数器加一

                    count++;

                    if (count == n) { // 等于n则跳出循环

                        break;

                    }

                }

            } catch (IOException e) {

                LOG.info("invalid snapshot " + f, e);

            }

        }

        return list;

    }

　　说明：该函数主要是查找N个合法的snapshot文件并进行降序排序后返回，Util的isValidSnapshot函数主要是从文件名和文件的结尾符号是否是"/"来判断snapshot文件是否合法。其源码如下　

    public static boolean isValidSnapshot(File f) throws IOException {

        // 文件为空或者非snapshot文件，则返回false

        if (f==null || Util.getZxidFromName(f.getName(), "snapshot") == -1)

            return false;

        // Check for a valid snapshot

        // 随机访问文件

        RandomAccessFile raf = new RandomAccessFile(f, "r");

        try {

            // including the header and the last / bytes

            // the snapshot should be atleast 10 bytes

            if (raf.length() < 10) { // 文件大小小于10个字节，返回false

                return false;

            }

            // 移动至倒数第五个字节

            raf.seek(raf.length() - 5);

            byte bytes[] = new byte[5];

            int readlen = 0;

            int l;

            while(readlen < 5 &&

                  (l = raf.read(bytes, readlen, bytes.length - readlen)) >= 0) { // 将最后五个字节存入bytes中

                readlen += l;

            }

            if (readlen != bytes.length) {

                LOG.info("Invalid snapshot " + f

                        + " too short, len = " + readlen);

                return false;

            }

            ByteBuffer bb = ByteBuffer.wrap(bytes);

            int len = bb.getInt();

            byte b = bb.get();

            if (len != 1 || b != '/') { // 最后字符不为"/",不合法

                LOG.info("Invalid snapshot " + f + " len = " + len

                        + " byte = " + (b & 0xff));

                return false;

            }

        } finally {

            raf.close();

        }

        return true;

    }

　　deserialize(dt,sessions, ia)函数的源码如下　　

    public void deserialize(DataTree dt, Map<Long, Integer> sessions,

            InputArchive ia) throws IOException {

        FileHeader header = new FileHeader();

        // 反序列化至header

        header.deserialize(ia, "fileheader");

        if (header.getMagic() != SNAP_MAGIC) { // 验证魔数是否相等

            throw new IOException("mismatching magic headers "

                    + header.getMagic() +

                    " !=  " + FileSnap.SNAP_MAGIC);

        }

        // 反序列化至dt、sessions

        SerializeUtils.deserializeSnapshot(dt,ia,sessions);

    }

　　说明：该函数主要作用反序列化，并将反序列化结果保存至header和sessions中。其中会验证header的魔数是否相等。

　　2. serialize函数　

　　函数签名如下：protected void serialize(DataTree dt,Map<Long, Integer> sessions, OutputArchive oa, FileHeader header) throws IOException

    protected void serialize(DataTree dt,Map<Long, Integer> sessions,

            OutputArchive oa, FileHeader header) throws IOException {

        // this is really a programmatic error and not something that can

        // happen at runtime

        if(header==null) // 文件头为null

            throw new IllegalStateException(

                    "Snapshot's not open for writing: uninitialized header");

        // 将header序列化

        header.serialize(oa, "fileheader");

        // 将dt、sessions序列化

        SerializeUtils.serializeSnapshot(dt,oa,sessions);

    }

　　说明：该函数主要用于序列化dt、sessions和header，其中，首先会检查header是否为空，然后依次序列化header，sessions和dt。

　　3. serialize函数

　　函数签名如下：public synchronized void serialize(DataTree dt, Map<Long, Integer> sessions, File snapShot) throws IOException　　

    public synchronized void serialize(DataTree dt, Map<Long, Integer> sessions, File snapShot)

            throws IOException {

        if (!close) { // 未关闭

            // 输出流

            OutputStream sessOS = new BufferedOutputStream(new FileOutputStream(snapShot));

            CheckedOutputStream crcOut = new CheckedOutputStream(sessOS, new Adler32());

            //CheckedOutputStream cout = new CheckedOutputStream()

            OutputArchive oa = BinaryOutputArchive.getArchive(crcOut);

            // 新生文件头

            FileHeader header = new FileHeader(SNAP_MAGIC, VERSION, dbId);

            // 序列化dt、sessions、header

            serialize(dt,sessions,oa, header);

            // 获取验证的值

            long val = crcOut.getChecksum().getValue();

            // 写入值

            oa.writeLong(val, "val");

            // 写入"/"

            oa.writeString("/", "path");

            // 强制刷新

            sessOS.flush();

            crcOut.close();

            sessOS.close();

        }

    }

　　说明：该函数用于将header、sessions、dt序列化至本地snapshot文件中，并且在最后会写入"/"字符。该方法是同步的，即是线程安全的。

四、总结

　　FileSnap源码相对较简单，其主要是用于操作snapshot文件，也谢谢各位园友的观看~　　

【Zookeeper】源码分析之持久化--FileSnap的更多相关文章

【Zookeeper】源码分析之持久化（二）之FileSnap
一.前言前篇博文已经分析了FileTxnLog的源码,现在接着分析持久化中的FileSnap,其主要提供了快照相应的接口. 二.SnapShot源码分析 SnapShot是FileTxnLog的父类 ...
zookeeper源码分析之五服务端(集群leader)处理请求流程
leader的实现类为LeaderZooKeeperServer,它间接继承自标准ZookeeperServer.它规定了请求到达leader时需要经历的路径: PrepRequestProcesso ...
zookeeper源码分析之四服务端(单机)处理请求流程
上文: zookeeper源码分析之一服务端启动过程中,我们介绍了zookeeper服务器的启动过程,其中单机是ZookeeperServer启动,集群使用QuorumPeer启动,那么这次我们分析 ...
zookeeper源码分析之三客户端发送请求流程
znode 可以被监控,包括这个目录节点中存储的数据的修改,子节点目录的变化等,一旦变化可以通知设置监控的客户端,这个功能是zookeeper对于应用最重要的特性,通过这个特性可以实现的功能包括配置的 ...
Zookeeper 源码分析-启动
Zookeeper 源码分析-启动博客分类: Zookeeper 本文主要介绍了zookeeper启动的过程运行zkServer.sh start命令可以启动zookeeper.入口的main ...
【Zookeeper】源码分析之持久化--FileTxnLog
一.前言前一篇已经分析了序列化,这篇接着分析Zookeeper的持久化过程源码,持久化对于数据的存储至关重要,下面进行详细分析. 二.持久化总体框架持久化的类主要在包org.apache.zook ...
【Zookeeper】源码分析之持久化--FileTxnSnapLog
一.前言前面分析了FileSnap,接着继续分析FileTxnSnapLog源码,其封装了TxnLog和SnapShot,其在持久化过程中是一个帮助类. 二.FileTxnSnapLog源码分析 2 ...
【Zookeeper】源码分析之持久化（三）之FileTxnSnapLog
一.前言前面分析了FileSnap,接着继续分析FileTxnSnapLog源码,其封装了TxnLog和SnapShot,其在持久化过程中是一个帮助类. 二.FileTxnSnapLog源码分析 2 ...
【Zookeeper】源码分析之持久化（一）之FileTxnLog
一.前言前一篇已经分析了序列化,这篇接着分析Zookeeper的持久化过程源码,持久化对于数据的存储至关重要,下面进行详细分析. 二.持久化总体框架持久化的类主要在包org.apache.zook ...

随机推荐

JS复选框选中
Web前端之复选框选中属性熟悉web前端开发的人都知道,判断复选框是否选中是经常做的事情,判断的方法很多,但是开发过程中常常忽略了这些方法的兼容性,而是实现效果就好了.博主之前用户不少方法,经常 ...
webBrowser 参数设置
//禁用脚本错误等类似的窗口信息 this.webBrowser1.ScriptErrorsSuppressed = true; //禁用右键菜单 this.webBrowser1.IsWebBrow ...
Spring IOC之Classpath扫描和管理的组件
在前面的大部分例子我们使用XML去指明配置数据去定义在Spring容器中的每一个BeanDefinition.上一节我们展示了如何在代码层注解的方式来提供大量的配置信息.即使在这些例子中,但是,基础 ...
CSS3自适配手机屏幕[转]
<!DOCTYPE html> <html> <head> <meta http-equiv="Content-Type" content ...
《Programming Hive》读书笔记（两）Hive基础知识
<Programming Hive>读书笔记(两)Hive基础知识 :第一遍读是浏览.建立知识索引,由于有些知识不一定能用到,知道就好.感兴趣的部分能够多研究. 以后用的时候再具体看.并结 ...
HTTP 报文中的 Header 字段进行身份验证
[小技巧][ASP.Net MVC Hack] 使用 HTTP 报文中的 Header 字段进行身份验证在一些 Web 系统中,身份验证是依靠硬件证书进行的:在电脑上插入 USB 证书,浏览器插件读 ...
Bootstrap 模态框(也可以说的弹出层)
最近在尝试使用bootstrap的模态框使用模态框主要要引入一下几个js和css: bootstrap.css jquery.1.9.1.js(这个可以灵活选择) bootstrap.js html ...
CompareValues标签对Model中的属性进行验证
在Asp.Net MVC中实现CompareValues标签对Model中的属性进行验证在Asp.Net MVC中可以用继承ValidationAttribute的方式,自定制实现Model两个 ...
iOS基础 - 类扩展
一.类扩展(class extension,匿名分类) 1.格式 @interface 类名 () { // 成员变量... } // 方法声明... @end 2.作用 1> 写在.m文件中 ...
c# in deep 之对Linq表达式范围变量限制问题的一些解决办法
linq表达式的标准形式为from...where...select,其中from后面跟的就是范围变量.linq中范围变量需要是泛型的集合,假如我们想对ArrayList或Object[]进行处理,l ...

【Zookeeper】源码分析之持久化--FileSnap

【Zookeeper】源码分析之持久化--FileSnap的更多相关文章

随机推荐

热门专题