flink-connector-kafka consumer checkpoint源码分析

转发请注明原创地址：http://www.cnblogs.com/dongxiao-yang/p/7700600.html

《flink-connector-kafka consumer的topic分区分配源码》一文提到了在flink-connector-kafka的consumer初始化的时候有三种offset提交模式：KAFKA_PERIODIC，DISABLED和ON_CHECKPOINTS。

其中ON_CHECKPOINTS表示在flink做完checkpoint后主动向kafka提交offset的方法，本文主要分析一下flink-connector-kafka在源码如何使用checkpoint机制实现offset的恢复和提交。

flink conusmer的实现基类FlinkKafkaConsumerBase定义如下，这个类实现了了与checkpoin相关的三个接口CheckpointedFunction，CheckpointedRestoring<HashMap<KafkaTopicPartition, Long>>，CheckpointListener。根据官网文档，CheckpointedRestoring的restoreState()方法已经被CheckpointedFunction的initializeState取代，所以重点关注三个方法实现

1initializeState() 实例初始化或者recover的时候调用

2snapshotState() 每次创建checkpoint的时候调用

3 notifyCheckpointComplete() 每次checkpoint结束的时候调用

public abstract class FlinkKafkaConsumerBase<T> extends RichParallelSourceFunction<T> implements

        CheckpointListener,

        ResultTypeQueryable<T>,

        CheckpointedFunction,

        CheckpointedRestoring<HashMap<KafkaTopicPartition, Long>> {

initializeState

    @Override

    public final void initializeState(FunctionInitializationContext context) throws Exception {

        // we might have been restored via restoreState() which restores from legacy operator state

        if (!restored) {

            restored = context.isRestored();

        }

        OperatorStateStore stateStore = context.getOperatorStateStore();

        offsetsStateForCheckpoint = stateStore.getSerializableListState(DefaultOperatorStateBackend.DEFAULT_OPERATOR_STATE_NAME);

        if (context.isRestored()) {

            if (restoredState == null) {

                restoredState = new HashMap<>();

                for (Tuple2<KafkaTopicPartition, Long> kafkaOffset : offsetsStateForCheckpoint.get()) {

                    restoredState.put(kafkaOffset.f0, kafkaOffset.f1);

                }

                LOG.info("Setting restore state in the FlinkKafkaConsumer.");

                if (LOG.isDebugEnabled()) {

                    LOG.debug("Using the following offsets: {}", restoredState);

                }

            }

        } else {

            LOG.info("No restore state for FlinkKafkaConsumer.");

        }

    }

这个方法的逻辑比较简单，在task恢复的时候从stateStore中序列化出来之前存储的ListState<Tuple2<KafkaTopicPartition, Long>> 状态数据，并放到restoredState这个变量，用于下面open方法直接恢复对应的分区和offset起始值。

snapshotState

    @Override

    public final void snapshotState(FunctionSnapshotContext context) throws Exception {

        if (!running) {

            LOG.debug("snapshotState() called on closed source");

        } else {

            offsetsStateForCheckpoint.clear();

            final AbstractFetcher<?, ?> fetcher = this.kafkaFetcher;

            if (fetcher == null) {

                // the fetcher has not yet been initialized, which means we need to return the

                // originally restored offsets or the assigned partitions

                for (Map.Entry<KafkaTopicPartition, Long> subscribedPartition : subscribedPartitionsToStartOffsets.entrySet()) {

                    offsetsStateForCheckpoint.add(Tuple2.of(subscribedPartition.getKey(), subscribedPartition.getValue()));

                }

                if (offsetCommitMode == OffsetCommitMode.ON_CHECKPOINTS) {

                    // the map cannot be asynchronously updated, because only one checkpoint call can happen

                    // on this function at a time: either snapshotState() or notifyCheckpointComplete()

                    pendingOffsetsToCommit.put(context.getCheckpointId(), restoredState);

                }

            } else {

                HashMap<KafkaTopicPartition, Long> currentOffsets = fetcher.snapshotCurrentState();

                if (offsetCommitMode == OffsetCommitMode.ON_CHECKPOINTS) {

                    // the map cannot be asynchronously updated, because only one checkpoint call can happen

                    // on this function at a time: either snapshotState() or notifyCheckpointComplete()

                    pendingOffsetsToCommit.put(context.getCheckpointId(), currentOffsets);

                }

                for (Map.Entry<KafkaTopicPartition, Long> kafkaTopicPartitionLongEntry : currentOffsets.entrySet()) {

                    offsetsStateForCheckpoint.add(

                            Tuple2.of(kafkaTopicPartitionLongEntry.getKey(), kafkaTopicPartitionLongEntry.getValue()));

                }

            }

            if (offsetCommitMode == OffsetCommitMode.ON_CHECKPOINTS) {

                // truncate the map of pending offsets to commit, to prevent infinite growth

                while (pendingOffsetsToCommit.size() > MAX_NUM_PENDING_CHECKPOINTS) {

                    pendingOffsetsToCommit.remove(0);

                }

            }

        }

    }

snapshot方法创建checkpoint的做法是把当前的KafkaTopicPartition和目前消费到的offset值不断存放到offsetsStateForCheckpoint这个state对象里，然后把当前的checkpointid和对应的offset存到pendingOffsetsToCommit这个linkmap。当前offset的获取分两个情况，初始化的时候（if (fetcher == null) {...}）和fetcher已经初始化成功，初始化的时候从restoredState获取，正常运行中获取fetcher.snapshotCurrentState()。

notifyCheckpointComplete

public final void notifyCheckpointComplete(long checkpointId) throws Exception {

        if (!running) {

            LOG.debug("notifyCheckpointComplete() called on closed source");

            return;

        }

        final AbstractFetcher<?, ?> fetcher = this.kafkaFetcher;

        if (fetcher == null) {

            LOG.debug("notifyCheckpointComplete() called on uninitialized source");

            return;

        }

        if (offsetCommitMode == OffsetCommitMode.ON_CHECKPOINTS) {

            // only one commit operation must be in progress

            if (LOG.isDebugEnabled()) {

                LOG.debug("Committing offsets to Kafka/ZooKeeper for checkpoint " + checkpointId);

            }

            try {

                final int posInMap = pendingOffsetsToCommit.indexOf(checkpointId);

                if (posInMap == -1) {

                    LOG.warn("Received confirmation for unknown checkpoint id {}", checkpointId);

                    return;

                }

                @SuppressWarnings("unchecked")

                HashMap<KafkaTopicPartition, Long> offsets =

                    (HashMap<KafkaTopicPartition, Long>) pendingOffsetsToCommit.remove(posInMap);

                // remove older checkpoints in map

                for (int i = 0; i < posInMap; i++) {

                    pendingOffsetsToCommit.remove(0);

                }

                if (offsets == null || offsets.size() == 0) {

                    LOG.debug("Checkpoint state was empty.");

                    return;

                }

                fetcher.commitInternalOffsetsToKafka(offsets, offsetCommitCallback);

            } catch (Exception e) {

                if (running) {

                    throw e;

                }

                // else ignore exception if we are no longer running

            }

        }

    }

notifyCheckpointComplete主要是在checkpoint结束后在ON_CHECKPOINTS的情况下向kafka集群commit offset，方法调用时会拿到已经完成的checkpointid，从前文的pendingOffsetsToCommit列表里找到对应的offset。如果判断索引不存在，则直接退出。否则，移除该索引对应的快照信息，然后将小于当前索引（较旧的）的快照信息也一并移除（这一点我之前解释过，因为所有的检查点都是按时间递增有序的）。最后将当前完成的检查点对应的消息的偏移量进行commit，也即commitOffsets。只不过这里该方法被定义为抽象方法，因为Kafka不同版本的API差别的原因，由适配不同版本的consumer各自实现，目前kafka09和010实现都是在Kafka09Fetcher内实现的commitInternalOffsetsToKafka方法。

参考文档：

http://blog.csdn.net/yanghua_kobe/article/details/51503885

flink-connector-kafka consumer checkpoint源码分析的更多相关文章

flink checkpoint 源码分析（二）
转发请注明原创地址http://www.cnblogs.com/dongxiao-yang/p/8260370.html flink checkpoint 源码分析 (一)一文主要讲述了在JobMan ...
Flink源码阅读（二）——checkpoint源码分析
前言在Flink原理——容错机制一文中,已对checkpoint的机制有了较为基础的介绍,本文着重从源码方面去分析checkpoint的过程.当然本文只是分析做checkpoint的调度过程,只是尽 ...
Kafka 探险 - 生产者源码分析: 核心组件
这个 Kafka 的专题,我会从系统整体架构,设计到代码落地.和大家一起杠源码,学技巧,涨知识.希望大家持续关注一起见证成长! 我相信:技术的道路,十年如一日!十年磨一剑! 往期文章 Kafka 探险 ...
高吞吐量的分布式发布订阅消息系统Kafka之Producer源码分析
引言 Kafka是一款很棒的消息系统,今天我们就来深入了解一下它的实现细节,首先关注Producer这一方. 要使用kafka首先要实例化一个KafkaProducer,需要有brokerIP.序列化 ...
Kafka 0.8源码分析—ZookeeperConsumerConnector
1.HighLevelApi High Level Api是多线程的应用程序,以Topic的Partition数量为中心.消费的规则如下: 一个partition只能被同一个ConsumersGrou ...
flink checkpoint 源码分析（一）
转发请注明原创地址http://www.cnblogs.com/dongxiao-yang/p/8029356.html checkpoint是Flink Fault Tolerance机制的重要构成 ...
flink1.7 checkpoint源码分析
初始化state类 //org.apache.flink.streaming.runtime.tasks.StreamTask#initializeState initializeState(); p ...
Flink命令行提交job (源码分析)
这篇文章主要介绍从命令行到任务在Driver端运行的过程通过flink run 命令提交jar包运行程序以yarn 模式提交任务命令类似于: flink run -m yarn-cluster X ...
Kafka#4：存储设计分布式设计源码分析
https://sites.google.com/a/mammatustech.com/mammatusmain/kafka-architecture/4-kafka-detailed-archite ...

随机推荐

PKUSC2018训练日程(4.18~5.30)
(总计:共66题) 4.18~4.25:19题 4.26~5.2:17题 5.3~5.9: 6题 5.10~5.16: 6题 5.17~5.23: 9题 5.24~5.30: 9题 4.18 [BZO ...
UVA 1514 Piece it together （二分图匹配）
[题目链接] Link [题目大意] 给你一些由一块黑块和两块白块组成的L形拼图,问你是否能够拼成给出的图 [题解] 我们将所有的黑块拆点,拆分为纵向和横向,和周围的白块连边, 如果能够得到完美匹配, ...
显示图案 Exercise06_06
import java.util.Scanner; /** * @author 冰樱梦 * 时间:2018年下半年 * 题目:显示图案 * 输入一个数 5 1 2 1 3 2 1 4 3 2 1 5 ...
[转]从此爱上iOS Autolayout
原文地址这篇不是autolayout教程,只是autolayout动员文章和经验之谈,在本文第五节友情链接和推荐中,我将附上足够大家熟练使用autolayout的教程.这篇文章两个月前就想写下来,但 ...
Ubuntu 16.04下将ISO镜像制作成U盘启动的工具-UNetbootin（UltraISO的替代工具）
说明: 1.在Windows下制作ISO镜像的U盘启动工具有很多,但是在Linux平台下估计就只有UNetbootin这个工具最好用了,效果和Windows下的制作方法差不多,但是这个工具只能针对Li ...
Redis Exception: Exceeded timeout of 00:00:03
Redis Exception: Exceeded timeout of 00:00:03 居然是重启了网管, 把网络禁用重启就好了. 服最终更新: 原来是架构湿设置为每分钟只能读取6 ...
JavaScript中的模块化之AMD和CMD
前言: 为什么我们需要模块化开发,模块化开发的好处有哪些? 首先我们先说一下非模块化的开发方式带来的弊端. 非模块化开发中会导致一些问题的出现,变量和函数命名可能相同,会造成变量污染和冲突,并且出错时 ...
Jenkins持续集成实战总结
原文:https://my.oschina.net/CandyDesire/blog/341331#comment-list 持续集成什么是持续集成随着软件开发复杂度的不断提高,团队开发成员间如何 ...
[SpringMVC+redis]自定义aop注解实现控制器访问次数限制
原文:http://www.cnblogs.com/xiaoyangjia/p/3762150.html?utm_source=tuicool 我们需要根据IP去限制用户单位时间的访问次数,防止刷手机 ...
python 输出所有大小写字母, range()以及列表切片
所以在写的时候,只要把它们的ASCII列出,并转化成字符型chr 即可. print [chr(i) for i in range(65,91)]#所有大写字母 print [chr(i) for i ...

flink-connector-kafka consumer checkpoint源码分析

flink-connector-kafka consumer checkpoint源码分析的更多相关文章

随机推荐

热门专题