Rebalance from the Consumer's Perspective

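The Kafka Java client relies heavily on RequestFuture when sending requests, so this class is described first.

RequestFuture has a member field listeners, which is a collection of RequestFutureListener objects. Calling complete triggers the onSuccess method of each registered listener.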

public void complete(T value) {
    try {
        if (value instanceof RuntimeException)
            throw new IllegalArgumentException("The argument to complete can not be an instance of RuntimeException");
        if (!result.compareAndSet(INCOMPLETE_SENTINEL, value))
            throw new IllegalStateException("Invalid attempt to complete a request future which is already complete");
        fireSuccess();
    } finally {
        completedLatch.countDown();
    }
}

private void fireSuccess() {
    T value = value();
    while (true) {
        RequestFutureListener<T> listener = listeners.poll();
        if (listener == null)
            break;
        listener.onSuccess(value);
    }
}
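The behaviour of complete and fireSuccess can be tried out in isolation. What follows is a minimal sketch, not Kafka's code: MiniFuture and MiniFutureDemo are hypothetical names, java.util.function.Consumer stands in for RequestFutureListener, and only the success path is modeled. The sketch also assumes that a listener added after completion should be fired right away.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Consumer;

// Hypothetical, simplified stand-in for RequestFuture (success path only).
class MiniFuture<T> {
    private static final Object INCOMPLETE = new Object();
    private final AtomicReference<Object> result = new AtomicReference<>(INCOMPLETE);
    private final ConcurrentLinkedQueue<Consumer<T>> listeners = new ConcurrentLinkedQueue<>();

    boolean isDone() {
        return result.get() != INCOMPLETE;
    }

    // Registers a listener; if the future is already complete, fire it immediately.
    void addListener(Consumer<T> listener) {
        listeners.add(listener);
        if (isDone())
            fireSuccess();
    }

    // Only the first call may swap the sentinel for a value; later calls fail.
    void complete(T value) {
        if (!result.compareAndSet(INCOMPLETE, value))
            throw new IllegalStateException("already complete");
        fireSuccess();
    }

    // Drains the queue so that every listener is invoked at most once.
    @SuppressWarnings("unchecked")
    private void fireSuccess() {
        T value = (T) result.get();
        Consumer<T> listener;
        while ((listener = listeners.poll()) != null)
            listener.accept(value);
    }
}

class MiniFutureDemo {
    static List<String> run() {
        List<String> seen = new ArrayList<>();
        MiniFuture<String> future = new MiniFuture<>();
        future.addListener(v -> seen.add("first:" + v));
        future.complete("ok");
        future.addListener(v -> seen.add("late:" + v));
        return seen;
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```

Polling listeners off a queue, rather than iterating over a list, means a listener cannot be fired twice even if completion and registration race with each other.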

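The compose and chain methods deserve attention. Both add a listener to the current RequestFuture, and that listener's onSuccess in turn drives another RequestFuture: compose creates and returns a new future, while chain forwards the result to a future passed in by the caller.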

public <S> RequestFuture<S> compose(final RequestFutureAdapter<T, S> adapter) {
    // create a new RequestFuture object
    final RequestFuture<S> adapted = new RequestFuture<>();
    // add a listener to the old RequestFuture
    addListener(new RequestFutureListener<T>() {
        @Override
        public void onSuccess(T value) {
            adapter.onSuccess(value, adapted);
        }
        @Override
        public void onFailure(RuntimeException e) {
            adapter.onFailure(e, adapted);
        }
    });
    // return the new RequestFuture object
    return adapted;
}

public void chain(final RequestFuture<T> future) {
    // add a listener to the current RequestFuture
    addListener(new RequestFutureListener<T>() {
        @Override
        public void onSuccess(T value) {
            future.complete(value);
        }
        @Override
        public void onFailure(RuntimeException e) {
            future.raise(e);
        }
    });
}
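To see how compose and chain differ in practice, here is a small self-contained sketch. TinyFuture, TinyListener and ComposeChainDemo are hypothetical names that only imitate the shape of RequestFuture. The adapter is modeled as a BiConsumer, and on failure the sketch raises the error on the adapted future directly instead of delegating to an adapter's onFailure. It is single-threaded and is not meant as a faithful copy of Kafka's implementation.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BiConsumer;

// Hypothetical listener type imitating RequestFutureListener.
interface TinyListener<T> {
    void onSuccess(T value);
    void onFailure(RuntimeException e);
}

// Hypothetical, single-threaded future imitating the shape of RequestFuture.
class TinyFuture<T> {
    private final List<TinyListener<T>> listeners = new ArrayList<>();
    private boolean done;
    private T value;
    private RuntimeException error;

    void addListener(TinyListener<T> listener) {
        if (!done) {
            listeners.add(listener);
        } else if (error == null) {
            listener.onSuccess(value);
        } else {
            listener.onFailure(error);
        }
    }

    void complete(T v) { finish(v, null); }

    void raise(RuntimeException e) { finish(null, e); }

    private void finish(T v, RuntimeException e) {
        if (done)
            throw new IllegalStateException("already complete");
        done = true;
        value = v;
        error = e;
        for (TinyListener<T> listener : listeners) {
            if (e == null) listener.onSuccess(v); else listener.onFailure(e);
        }
        listeners.clear();
    }

    boolean succeeded() { return done && error == null; }

    T value() { return value; }

    // compose: adapt a T result into an S result and hand back a NEW future.
    <S> TinyFuture<S> compose(final BiConsumer<T, TinyFuture<S>> adapter) {
        final TinyFuture<S> adapted = new TinyFuture<>();
        addListener(new TinyListener<T>() {
            @Override
            public void onSuccess(T v) { adapter.accept(v, adapted); }
            @Override
            public void onFailure(RuntimeException e) { adapted.raise(e); }
        });
        return adapted;
    }

    // chain: forward this future's outcome into an EXISTING future.
    void chain(final TinyFuture<T> target) {
        addListener(new TinyListener<T>() {
            @Override
            public void onSuccess(T v) { target.complete(v); }
            @Override
            public void onFailure(RuntimeException e) { target.raise(e); }
        });
    }
}

class ComposeChainDemo {
    static int run() {
        TinyFuture<String> raw = new TinyFuture<>();
        // compose turns the raw String result into an Integer result
        TinyFuture<Integer> parsed =
                raw.<Integer>compose((s, out) -> out.complete(Integer.parseInt(s)));
        // chain pushes the parsed result into a future created elsewhere
        TinyFuture<Integer> outer = new TinyFuture<>();
        parsed.chain(outer);
        raw.complete("42");
        return outer.value();
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```

Completing raw is enough to complete outer: the value travels raw -> parsed (via compose) -> outer (via chain), which is the same direction the responses travel in the rebalance code discussed in this article.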

The entry point of a rebalance is ConsumerCoordinator#poll.

The client first decides whether it needs to rejoin the group, that is, whether to rebalance:

// ConsumerCoordinator#needRejoin
public boolean needRejoin() {
    if (!subscriptions.partitionsAutoAssigned())
        return false;
    // the partition count of a subscribed topic has changed
    // we need to rejoin if we performed the assignment and metadata has changed
    if (assignmentSnapshot != null && !assignmentSnapshot.equals(metadataSnapshot))
        return true;
    // the set of subscribed topics has changed
    // we need to join if our subscription has changed since the last join
    if (joinedSubscription != null && !joinedSubscription.equals(subscriptions.subscription()))
        return true;
    // a consumer joined or left the group; the heartbeat thread has set rejoinNeeded = true
    return super.needRejoin();
}

The consumer then starts the rebalance:

// AbstractCoordinator#joinGroupIfNeeded
void joinGroupIfNeeded() {
    while (needRejoin() || rejoinIncomplete()) {
        ensureCoordinatorReady();
        if (needsJoinPrepare) {
            // invoke the ConsumerRebalanceListener supplied by the user
            onJoinPrepare(generation.generationId, generation.memberId);
            needsJoinPrepare = false;
        }
        // send the join group request
        RequestFuture<ByteBuffer> future = initiateJoinGroup();
        client.poll(future);
        if (future.succeeded()) {
            onJoinComplete(generation.generationId, generation.memberId, generation.protocol, future.value());
            resetJoinGroupFuture();
            needsJoinPrepare = true;
        } else {
            resetJoinGroupFuture();
            RuntimeException exception = future.exception();
            if (exception instanceof UnknownMemberIdException ||
                    exception instanceof RebalanceInProgressException ||
                    exception instanceof IllegalGenerationException)
                continue;
            else if (!future.isRetriable())
                throw exception;
            time.sleep(retryBackoffMs);
        }
    }
}
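The retry policy inside the loop above has three branches: some errors cause an immediate rejoin, retriable errors cause a backoff before the next attempt, and everything else is rethrown. The following sketch replays that decision logic against a scripted list of outcomes. JoinLoopSketch, its Outcome enum and JoinLoopDemo are hypothetical names introduced only for illustration; no Kafka classes are involved, and the backoff is counted rather than slept.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;

class JoinLoopSketch {
    // Hypothetical outcomes of a single join attempt.
    enum Outcome { SUCCESS, REJOIN_IMMEDIATELY, RETRIABLE, FATAL }

    int attempts = 0;
    int backoffs = 0;

    // Consumes scripted outcomes until one succeeds; an exhausted script counts as success.
    void joinUntilStable(Deque<Outcome> script) {
        while (true) {
            attempts++;
            Outcome outcome = script.isEmpty() ? Outcome.SUCCESS : script.poll();
            switch (outcome) {
                case SUCCESS:
                    return;
                case REJOIN_IMMEDIATELY:
                    // mirrors the `continue` branch: retry without sleeping
                    continue;
                case RETRIABLE:
                    // mirrors time.sleep(retryBackoffMs); counted here instead of slept
                    backoffs++;
                    break;
                default:
                    // mirrors `throw exception` for non-retriable errors
                    throw new IllegalStateException("fatal join error");
            }
        }
    }
}

class JoinLoopDemo {
    static String run() {
        JoinLoopSketch sketch = new JoinLoopSketch();
        Deque<JoinLoopSketch.Outcome> script = new ArrayDeque<>(Arrays.asList(
                JoinLoopSketch.Outcome.REJOIN_IMMEDIATELY,
                JoinLoopSketch.Outcome.RETRIABLE,
                JoinLoopSketch.Outcome.SUCCESS));
        sketch.joinUntilStable(script);
        return sketch.attempts + "/" + sketch.backoffs;
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```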

AbstractCoordinator#initiateJoinGroup

private synchronized RequestFuture<ByteBuffer> initiateJoinGroup() {
    if (joinFuture == null) {
        disableHeartbeatThread();
        state = MemberState.REBALANCING;
        joinFuture = sendJoinGroupRequest();
        joinFuture.addListener(new RequestFutureListener<ByteBuffer>() {
            @Override
            public void onSuccess(ByteBuffer value) {
                // handle join completion in the callback so that the callback will be invoked
                // even if the consumer is woken up before finishing the rebalance
                synchronized (AbstractCoordinator.this) {
                    log.info("Successfully joined group with generation {}", generation.generationId);
                    state = MemberState.STABLE;
                    rejoinNeeded = false;
                    if (heartbeatThread != null)
                        heartbeatThread.enable();
                }
            }
            @Override
            public void onFailure(RuntimeException e) {
                // we handle failures below after the request finishes. if the join completes
                // after having been woken up, the exception is ignored and we will rejoin
                synchronized (AbstractCoordinator.this) {
                    state = MemberState.UNJOINED;
                }
            }
        });
    }
    return joinFuture;
}

AbstractCoordinator#sendJoinGroupRequest

private RequestFuture<ByteBuffer> sendJoinGroupRequest() {
    if (coordinatorUnknown())
        return RequestFuture.coordinatorNotAvailable();
    // send a join group request to the coordinator
    log.info("(Re-)joining group");
    JoinGroupRequest.Builder requestBuilder = new JoinGroupRequest.Builder(
            groupId,
            this.sessionTimeoutMs,
            this.generation.memberId,
            protocolType(),
            metadata()).setRebalanceTimeout(this.rebalanceTimeoutMs);
    log.debug("Sending JoinGroup ({}) to coordinator {}", requestBuilder, this.coordinator);
    return client.send(coordinator, requestBuilder)
            .compose(new JoinGroupResponseHandler());
}

The key line is client.send(coordinator, requestBuilder).compose(new JoinGroupResponseHandler()). It adds a listener to the original RequestFuture (the one returned by send) and returns a new RequestFuture.

ConsumerNetworkClient#send

public RequestFuture<ClientResponse> send(Node node, AbstractRequest.Builder<?> requestBuilder) {
    long now = time.milliseconds();
    // use a RequestFutureCompletionHandler as the callback
    RequestFutureCompletionHandler completionHandler = new RequestFutureCompletionHandler();
    ClientRequest clientRequest = client.newClientRequest(node.idString(), requestBuilder, now, true,
            completionHandler);
    unsent.put(node, clientRequest);
    // wakeup the client in case it is blocking in poll so that we can send the queued request
    client.wakeup();
    return completionHandler.future;
}
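Note that send only queues the request and returns a future that is still incomplete; nothing completes it until the client is polled. The sketch below illustrates that split under simplifying assumptions: QueuedClient and QueuedClientDemo are hypothetical names, java.util.concurrent.CompletableFuture replaces RequestFuture, requests and responses are plain strings, and poll fabricates a response instead of doing network I/O.

```java
import java.util.ArrayDeque;
import java.util.Queue;
import java.util.concurrent.CompletableFuture;

// Hypothetical stand-in for ConsumerNetworkClient; no real network I/O happens here.
class QueuedClient {
    // A queued request together with the future that will carry its response.
    private static final class Pending {
        final String request;
        final CompletableFuture<String> future = new CompletableFuture<>();

        Pending(String request) {
            this.request = request;
        }
    }

    private final Queue<Pending> unsent = new ArrayDeque<>();

    // send() only enqueues the request and hands back an incomplete future.
    CompletableFuture<String> send(String request) {
        Pending pending = new Pending(request);
        unsent.add(pending);
        return pending.future;
    }

    // poll() drains the queue and completes each future with a fabricated response.
    void poll() {
        Pending pending;
        while ((pending = unsent.poll()) != null)
            pending.future.complete("response-to-" + pending.request);
    }
}

class QueuedClientDemo {
    static String run() {
        QueuedClient client = new QueuedClient();
        CompletableFuture<String> future = client.send("JoinGroup");
        boolean doneBeforePoll = future.isDone();
        client.poll();
        return doneBeforePoll + "/" + future.getNow("pending");
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```

This is why joinGroupIfNeeded calls client.poll(future) right after initiateJoinGroup: the future cannot make progress on its own.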

JoinGroupResponseHandler#handle

public void handle(JoinGroupResponse joinResponse, RequestFuture<ByteBuffer> future) {
    Errors error = joinResponse.error();
    if (error == Errors.NONE) {
        log.debug("Received successful JoinGroup response: {}", joinResponse);
        sensors.joinLatency.record(response.requestLatencyMs());
        synchronized (AbstractCoordinator.this) {
            if (state != MemberState.REBALANCING) {
                // if the consumer was woken up before a rebalance completes, we may have already left
                // the group. In this case, we do not want to continue with the sync group.
                future.raise(new UnjoinedGroupException());
            } else {
                AbstractCoordinator.this.generation = new Generation(joinResponse.generationId(),
                        joinResponse.memberId(), joinResponse.groupProtocol());
                if (joinResponse.isLeader()) {
                    onJoinLeader(joinResponse).chain(future);
                } else {
                    onJoinFollower().chain(future);
                }
            }
        }
    } else if (error == Errors.COORDINATOR_LOAD_IN_PROGRESS) {
        log.debug("Attempt to join group rejected since coordinator {} is loading the group.", coordinator());
        // backoff and retry
        future.raise(error);
    } else if (error == Errors.UNKNOWN_MEMBER_ID) {
        // reset the member id and retry immediately
        resetGeneration();
        log.debug("Attempt to join group failed due to unknown member id.");
        future.raise(Errors.UNKNOWN_MEMBER_ID);
    } else if (error == Errors.COORDINATOR_NOT_AVAILABLE
            || error == Errors.NOT_COORDINATOR) {
        // re-discover the coordinator and retry with backoff
        markCoordinatorUnknown();
        log.debug("Attempt to join group failed due to obsolete coordinator information: {}", error.message());
        future.raise(error);
    } else if (error == Errors.INCONSISTENT_GROUP_PROTOCOL
            || error == Errors.INVALID_SESSION_TIMEOUT
            || error == Errors.INVALID_GROUP_ID) {
        // log the error and re-throw the exception
        log.error("Attempt to join group failed due to fatal error: {}", error.message());
        future.raise(error);
    } else if (error == Errors.GROUP_AUTHORIZATION_FAILED) {
        future.raise(new GroupAuthorizationException(groupId));
    } else {
        // unexpected error, throw the exception
        future.raise(new KafkaException("Unexpected error in join group response: " + error.message()));
    }
}

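Once the response arrives, the execution flow is RequestFutureCompletionHandler -> JoinGroupResponseHandler#handle. On success, the handler sends the SyncGroup request through onJoinLeader or onJoinFollower: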

private RequestFuture<ByteBuffer> onJoinLeader(JoinGroupResponse joinResponse) {
    try {
        // perform the leader synchronization and send back the assignment for the group
        Map<String, ByteBuffer> groupAssignment = performAssignment(joinResponse.leaderId(), joinResponse.groupProtocol(),
                joinResponse.members());
        SyncGroupRequest.Builder requestBuilder =
                new SyncGroupRequest.Builder(groupId, generation.generationId, generation.memberId, groupAssignment);
        log.debug("Sending leader SyncGroup to coordinator {}: {}", this.coordinator, requestBuilder);
        return sendSyncGroupRequest(requestBuilder);
    } catch (RuntimeException e) {
        return RequestFuture.failure(e);
    }
}

private RequestFuture<ByteBuffer> onJoinFollower() {
    // send follower's sync group with an empty assignment
    SyncGroupRequest.Builder requestBuilder =
            new SyncGroupRequest.Builder(groupId, generation.generationId, generation.memberId,
                    Collections.<String, ByteBuffer>emptyMap());
    log.debug("Sending follower SyncGroup to coordinator {}: {}", this.coordinator, requestBuilder);
    return sendSyncGroupRequest(requestBuilder);
}

private RequestFuture<ByteBuffer> sendSyncGroupRequest(SyncGroupRequest.Builder requestBuilder) {
    if (coordinatorUnknown())
        return RequestFuture.coordinatorNotAvailable();
    return client.send(coordinator, requestBuilder)
            .compose(new SyncGroupResponseHandler());
}

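In this way RequestFuture links JoinGroupResponseHandler and SyncGroupResponseHandler together: the future returned by sendSyncGroupRequest is chained into the future that JoinGroupResponseHandler received, so the SyncGroup result completes the original join future.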

private class SyncGroupResponseHandler extends CoordinatorResponseHandler<SyncGroupResponse, ByteBuffer> {
    @Override
    public void handle(SyncGroupResponse syncResponse,
                       RequestFuture<ByteBuffer> future) {
        Errors error = syncResponse.error();
        if (error == Errors.NONE) {
            sensors.syncLatency.record(response.requestLatencyMs());
            future.complete(syncResponse.memberAssignment());
        } else {
            requestRejoin();
            if (error == Errors.GROUP_AUTHORIZATION_FAILED) {
                future.raise(new GroupAuthorizationException(groupId));
            } else if (error == Errors.REBALANCE_IN_PROGRESS) {
                log.debug("SyncGroup failed because the group began another rebalance");
                future.raise(error);
            } else if (error == Errors.UNKNOWN_MEMBER_ID
                    || error == Errors.ILLEGAL_GENERATION) {
                log.debug("SyncGroup failed: {}", error.message());
                resetGeneration();
                future.raise(error);
            } else if (error == Errors.COORDINATOR_NOT_AVAILABLE
                    || error == Errors.NOT_COORDINATOR) {
                log.debug("SyncGroup failed: {}", error.message());
                markCoordinatorUnknown();
                future.raise(error);
            } else {
                future.raise(new KafkaException("Unexpected error from SyncGroup: " + error.message()));
            }
        }
    }
}
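For readers more familiar with the JDK's own futures, the two-stage JoinGroup -> SyncGroup flow can be approximated with CompletableFuture.thenCompose. This is only an analogy under stated assumptions: JoinSyncPipeline and JoinSyncDemo are hypothetical names, responses are plain strings, and the two response futures are completed by hand rather than by a network client. Kafka's client uses its own RequestFuture, not CompletableFuture.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;

// Hypothetical two-stage pipeline: the second request is only sent
// once the first response has been handled.
class JoinSyncPipeline {
    final List<String> log = new ArrayList<>();
    // Completed by hand in the demo; a real client would complete them from poll().
    final CompletableFuture<String> joinResponse = new CompletableFuture<>();
    final CompletableFuture<String> syncResponse = new CompletableFuture<>();

    CompletableFuture<String> start() {
        log.add("send JoinGroup");
        return joinResponse
                // plays the role of JoinGroupResponseHandler: handle, then send SyncGroup
                .thenCompose(join -> {
                    log.add("handle " + join + ", send SyncGroup");
                    return syncResponse;
                })
                // plays the role of SyncGroupResponseHandler: extract the assignment
                .thenApply(sync -> {
                    log.add("handle " + sync);
                    return "assignment-from-" + sync;
                });
    }
}

class JoinSyncDemo {
    static List<String> run() {
        JoinSyncPipeline pipeline = new JoinSyncPipeline();
        CompletableFuture<String> result = pipeline.start();
        pipeline.joinResponse.complete("JoinGroupResponse");
        pipeline.log.add("done after join: " + result.isDone());
        pipeline.syncResponse.complete("SyncGroupResponse");
        pipeline.log.add("result: " + result.getNow("pending"));
        return pipeline.log;
    }

    public static void main(String[] args) {
        for (String line : run())
            System.out.println(line);
    }
}
```

The caller holds a single future for the whole exchange, and it completes only after the second response has been handled. That matches what joinGroupIfNeeded observes through joinFuture.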

The last listener to run in the rebalance process is the one registered in initiateJoinGroup:

joinFuture.addListener(new RequestFutureListener<ByteBuffer>() {
    @Override
    public void onSuccess(ByteBuffer value) {
        // handle join completion in the callback so that the callback will be invoked
        // even if the consumer is woken up before finishing the rebalance
        synchronized (AbstractCoordinator.this) {
            log.info("Successfully joined group with generation {}", generation.generationId);
            state = MemberState.STABLE;
            rejoinNeeded = false;
            if (heartbeatThread != null)
                heartbeatThread.enable();
        }
    }
    @Override
    public void onFailure(RuntimeException e) {
        // we handle failures below after the request finishes. if the join completes
        // after having been woken up, the exception is ignored and we will rejoin
        synchronized (AbstractCoordinator.this) {
            state = MemberState.UNJOINED;
        }
    }
});
