JStorm and Storm Source Code Analysis (2) -- Task Assignment (assignment)

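The main job of mk-assignments is to produce the mapping from each Executor to a node+port, that is, to assign every Executor to a particular port on a particular node, and to carry out the corresponding scheduling. The annotated code is as follows: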

 ;; nimbus is the nimbus-data object; :scratch-topology-id is the id of the Topology that needs rescheduling
 (defnk mk-assignments [nimbus :scratch-topology-id nil]
   ;; Pull conf, storm-cluster-state and inimbus out of nimbus-data and bind them as locals
   (let [conf (:conf nimbus)
         storm-cluster-state (:storm-cluster-state nimbus)
         ^INimbus inimbus (:inimbus nimbus)
         ;; Read the ids of all active Topologies from ZooKeeper
         topology-ids (.active-storms storm-cluster-state)
         ;; For each of those ids, call read-topology-details to obtain the topology-details
         ;; from nimbus-data, and collect the results into a <topology-id, topology-details> map
         topologies (into {} (for [tid topology-ids]
                               {tid (read-topology-details nimbus tid)}))
         ;; Wrap the <topology-id, topology-details> map in a Topologies object
         topologies (Topologies. topologies)
         ;; Read the ids of all Topologies that already have an assignment
         assigned-topology-ids (.assignments storm-cluster-state nil)
         existing-assignments (into {} (for [tid assigned-topology-ids]
           ;; For a Topology that already has an assignment but must be rescheduled (the one named by
           ;; scratch-topology-id), ignore its previous assignment, so that every slot it occupied
           ;; is treated as free and can be scheduled again
           (when (or (nil? scratch-topology-id) (not= tid scratch-topology-id))
               {tid (.assignment-info storm-cluster-state tid nil)})))
         ;; Call compute-new-topology->executor->node+port to compute the new schedule for all
         ;; Topologies; it returns topology->executor->node+port
         topology->executor->node+port (compute-new-topology->executor->node+port
                                        nimbus
                                        existing-assignments
                                        topologies
                                        scratch-topology-id)
         ;; Current system time, in seconds
         now-secs (current-time-secs)
         ;; Call basic-supervisor-details-map to read every SupervisorInfo from ZooKeeper
         ;; and convert them into a <supervisor-id, SupervisorDetails> map
         basic-supervisor-details-map (basic-supervisor-details-map storm-cluster-state)
         ;; Process each entry of topology->executor->node+port, adding start times and so on
         ;; to build the final assignments; the result is a <topology-id, Assignment> map
         new-assignments (into {} (for [[topology-id executor->node+port] topology->executor->node+port
             ;; Look up the existing assignment of this Topology by topology-id
             :let [existing-assignment (get existing-assignments topology-id)
                   ;; Extract the set of all nodes from executor->node+port
                   all-nodes (->> executor->node+port vals (map first) set)
                   ;; Look up the hostname of each node in all-nodes and build a <node, hostname> map
                   node->host (->> all-nodes
                                   (mapcat (fn [node]
                                             (if-let [host (.getHostName inimbus basic-supervisor-details-map node)]
                                               [[node host]])))
                                   (into {}))
                   ;; Merge the <node, host> map of the existing assignment with the <node, hostname> map
                   ;; just computed. When the same node appears in both, the newly computed hostname wins
                   all-node->host (merge (:node->host existing-assignment) node->host)
                   ;; Call changed-executors to compare executor->node+port with the mapping stored in
                   ;; existing-assignment and find every Executor that has been reassigned
                   reassign-executors (changed-executors (:executor->node+port existing-assignment) executor->node+port)
                   ;; Merge the executor->start-time-secs of the existing assignment with
                   ;; <executor, now-secs> entries for every reassigned Executor,
                   ;; giving the up-to-date <executor, start-time-secs> map
                   start-times (merge (:executor->start-time-secs existing-assignment)
                                     (into {}
                                           (for [id reassign-executors]
                                             [id now-secs])))]]
        ;; Create the Assignment object. Its arguments are the Topology's root directory on the Nimbus server,
        ;; the <node, host> map, the new executor->node+port mapping and the new <executor, start-time-secs> map
        {topology-id (Assignment.
                      (master-stormdist-root conf topology-id)
                      (select-keys all-node->host all-nodes)
                      executor->node+port
                      start-times)}))]
     ;; For each entry of the new <topology-id, assignment> map, check whether the new assignment
     ;; differs from the one currently in effect. If nothing changed, just log a message;
     ;; otherwise update the assignment stored in ZooKeeper for that Topology
     (doseq [[topology-id assignment] new-assignments
             :let [existing-assignment (get existing-assignments topology-id)
                   topology-details (.getById topologies topology-id)]]
       (if (= existing-assignment assignment)
         (log-debug "Assignment for " topology-id " hasn't changed")
         (do
           (log-message "Setting new assignment for topology id " topology-id ": " (pr-str assignment))
           (.set-assignment! storm-cluster-state topology-id assignment))))
     ;; For each entry of new-assignments, compute the newly added slots and convert them
     ;; into WorkerSlot objects, producing a <topology-id, worker-slots> map,
     ;; then call inimbus's assignSlots method to assign the slots
     (->> new-assignments
           (map (fn [[topology-id assignment]]
             (let [existing-assignment (get existing-assignments topology-id)]
               [topology-id (map to-worker-slot (newly-added-slots existing-assignment assignment))])))
           (into {})
           (.assignSlots inimbus topologies))))
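
As a minimal sketch of the start-time and host bookkeeping in the listing above, the following uses plain Clojure maps rather than Storm's Assignment records. `changed-executors*` and `merge-start-times` are hypothetical helpers written for illustration (they are not Storm's own functions), and the executor ids, node names and timestamps are made up.

```clojure
;; Hypothetical stand-in for Storm's changed-executors: returns the executors
;; whose [node port] is new or differs from the previous mapping.
(defn changed-executors*
  [old-e->np new-e->np]
  (for [[executor node+port] new-e->np
        :when (not= node+port (get old-e->np executor))]
    executor))

;; Reassigned executors get now-secs; every other executor keeps its old start time.
(defn merge-start-times
  [old-start-times reassigned now-secs]
  (merge old-start-times
         (into {} (for [id reassigned] [id now-secs]))))

;; Made-up sample data: executor [2 2] moves to node-b, executor [3 3] is new.
(def old-e->np {[1 1] ["node-a" 6700], [2 2] ["node-a" 6701]})
(def new-e->np {[1 1] ["node-a" 6700], [2 2] ["node-b" 6700], [3 3] ["node-b" 6700]})
(def old-start-times {[1 1] 100, [2 2] 100})

(def reassigned (set (changed-executors* old-e->np new-e->np)))
(def start-times (merge-start-times old-start-times reassigned 200))

;; merge is right-biased, so a freshly looked-up hostname replaces a stale one;
;; select-keys then drops nodes that no longer host any executor.
(def all-nodes (->> new-e->np vals (map first) set))
(def node->host
  (select-keys (merge {"node-a" "stale-a", "node-c" "host-c"}
                      {"node-a" "host-a", "node-b" "host-b"})
               all-nodes))
```

Here `reassigned` holds `[2 2]` and `[3 3]`, only those two executors receive the new timestamp, and `node->host` keeps just the nodes that appear in the new mapping.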

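During this process, if none of the Executors on a slot has timed out but the ZooKeeper heartbeat of its Supervisor has, the slot is still considered valid and tasks can still be assigned to it. In the worst case, the Executors assigned to that slot time out, and the slot is then skipped in the next round of assignment.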

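The detailed flow of mk-assignments is as follows:

1. Read all active Topologies from ZooKeeper.

2. Read the current assignments from ZooKeeper to get the ids of all Topologies that already have resources assigned.

3. Compute new assignments for the Topologies:

3.1 Call compute-topology->executors to get the executors of every Topology that already has an assignment.

3.2 Call update-all-heartbeats to refresh the heartbeats of each Topology.

3.3 Call compute-topology->alive-executors to filter topology->executors, keeping only the executors that are alive.

3.4 Call compute-supervisor->dead-ports to find the dead ports.

3.5 Call compute-topology->scheduler-assignment to convert the assignments stored in ZooKeeper into SchedulerAssignment objects.

3.6 Use missing-assignment-topologies to find the Topologies that need to be reassigned.

3.7 Call all-scheduling-slots to get the available slots on all Supervisor nodes.

3.8 Call read-all-supervisor-details to get the SupervisorDetails of every Supervisor node.

3.9 Build the backtype.storm.scheduler.Cluster object.

3.10 Call scheduler.schedule to schedule all Topologies.

3.11 Call compute-topology->executor->node+port to convert the SchedulerAssignment objects back into assignments, and log the reassignments.

4. Merge the executor->start-time-secs of the existing assignment with <executor, now-secs> entries for every reassigned Executor to get the up-to-date <executor, start-time-secs> map, fill in start-times and the other fields, and obtain new-assignments.

5. Call set-assignment! to write the new assignment results to ZooKeeper.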
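
Steps 3.3 and 3.4 can be sketched on plain data as below. This is a simplified illustration, not Storm's implementation: `alive-executors` and `supervisor->dead-ports` are hypothetical helpers, the timeout rule is reduced to a single heartbeat-age check, and all ids, ports and timestamps are made up.

```clojure
;; Keeps the executors whose last heartbeat is at most timeout-secs old (assumed rule).
(defn alive-executors
  [executor->last-beat now-secs timeout-secs]
  (set (for [[executor last-beat] executor->last-beat
             :when (<= (- now-secs last-beat) timeout-secs)]
         executor)))

;; Groups, per node, the ports that host at least one executor that is not alive.
(defn supervisor->dead-ports
  [executor->node+port alive]
  (reduce (fn [acc [executor [node port]]]
            (if (contains? alive executor)
              acc
              (update acc node (fnil conj #{}) port)))
          {}
          executor->node+port))

;; Made-up sample data: executor [2 2] last reported 40 seconds ago.
(def executor->last-beat {[1 1] 95, [2 2] 60, [3 3] 98})
(def executor->node+port {[1 1] ["node-a" 6700]
                          [2 2] ["node-a" 6701]
                          [3 3] ["node-b" 6700]})

(def alive (alive-executors executor->last-beat 100 30))
(def dead-ports (supervisor->dead-ports executor->node+port alive))
```

With a 30-second timeout only `[2 2]` is considered dead, so port 6701 on `node-a` is the single dead port. A port is marked dead because of its executors' heartbeats, not because of the Supervisor's own heartbeat, which matches the slot-validity rule described earlier.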

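mk-assignments performs a new round of task scheduling for every Topology in the cluster. It first examines the resources held by the Topologies that are already running, determining whether they are healthy and whether they need to be reassigned; then, based on the resources currently available, it assigns tasks to newly submitted Topologies. mk-assignments writes the resulting assignment information to ZooKeeper (only assignments that changed are rewritten); each Supervisor periodically checks this information and acts on it accordingly.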
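
The "write only what changed" behavior can be sketched as follows. An atom stands in for ZooKeeper here, `publish-assignments!` is a hypothetical helper rather than part of Storm, and the topology ids and assignments are made up.

```clojure
;; Writes to store only the assignments that differ from the existing ones
;; and returns the set of topology ids that were written.
(defn publish-assignments!
  [store existing new-assignments]
  (let [changed (into {}
                      (remove (fn [[topology-id assignment]]
                                (= assignment (get existing topology-id)))
                              new-assignments))]
    (swap! store merge changed)
    (set (keys changed))))

;; Made-up sample data: topo-1 is unchanged, topo-2 is newly scheduled.
(def existing {"topo-1" {:executor->node+port {[1 1] ["node-a" 6700]}}})
(def new-assignments
  {"topo-1" {:executor->node+port {[1 1] ["node-a" 6700]}}
   "topo-2" {:executor->node+port {[1 1] ["node-b" 6700]}}})

(def store (atom existing))
(def written (publish-assignments! store existing new-assignments))
```

Only `topo-2` is written, because the assignment of `topo-1` compares equal to the stored one. Value equality on the whole assignment is what lets mk-assignments skip the ZooKeeper write for Topologies whose schedule did not move.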

Note: these notes were compiled while studying Li Ming's 《Storm源码分析》 and Chen Minmin's 《Storm技术内幕与大数据实现》.