Akka源码分析-CircuitBreaker（熔断器）

　　熔断器，在很多技术栈中都会出现的一种技术。它是在分布式系统中提供一个稳定的阻止嵌套失败的机制。

　　该怎么理解呢？简单来说，在分布式环境中，如果某个计算节点出现问题，很容易出现失败的逆向传到或整个系统的雪崩。什么意思呢？比如某个服务按照顺序依次调用了其他的三个服务，分别为A/B/C。如果B服务由于某种原因，响应变慢了，本来100毫秒就完成了，现在是1秒。此时A就会等待B服务的时间也就变成了1秒，那么就意味着会有很多的A服务调用在等待，如果并发量非常大，很容易就会造成A服务所在的节点出现问题，也就是说，B的问题传递给了A。熔断器就是用来解决这个问题的。

　　A服务和B服务之间有个熔断器，A通过熔断器调用B服务，熔断器会根据某种算法判断B服务是否正常，如果B不正常，则A调用的时候会立即失败，而不用再等待1秒的时间，同时也不会去调用B服务。这样节约了A服务判断失败的时间，也减少了B服务的压力。等B服务正常的时候，A就可以正常调用了。

　　关于熔断器的原理和使用，这里不再啰嗦，读者可参考引用的第一遍博客。我这里只分析在akka中如何实现熔断器。

　　按照惯例，我们还是从akka的官方demo入手分析。

class DangerousActor extends Actor with ActorLogging {

  import context.dispatcher

  val breaker =

    new CircuitBreaker(

      context.system.scheduler,

      maxFailures = 5,

      callTimeout = 10.seconds,

      resetTimeout = 1.minute).onOpen(notifyMeOnOpen())

  def notifyMeOnOpen(): Unit =

    log.warning("My CircuitBreaker is now open, and will not close for one minute")

  //#circuit-breaker-initialization

  //#circuit-breaker-usage

  def dangerousCall: String = "This really isn't that dangerous of a call after all"

  def receive = {

    case "is my middle name" ⇒

      breaker.withCircuitBreaker(Future(dangerousCall)) pipeTo sender()

    case "block for me" ⇒

      sender() ! breaker.withSyncCircuitBreaker(dangerousCall)

  }

  //#circuit-breaker-usage

　　首先来看CircuitBreaker的创建过程，它有四个参数，我们着重分析后面三个：最大失败次数、调用超时时间、重置的超时时间。

/**

* Provides circuit breaker functionality to provide stability when working with "dangerous" operations, e.g. calls to

* remote systems

* Transitions through three states:

* - In *Closed* state, calls pass through until the `maxFailures` count is reached. This causes the circuit breaker

* to open. Both exceptions and calls exceeding `callTimeout` are considered failures.

* - In *Open* state, calls fail-fast with an exception. After `resetTimeout`, circuit breaker transitions to

* half-open state.

* - In *Half-Open* state, the first call will be allowed through, if it succeeds the circuit breaker will reset to

* closed state. If it fails, the circuit breaker will re-open to open state. All calls beyond the first that

* execute while the first is running will fail-fast with an exception.

* @param scheduler Reference to Akka scheduler

* @param maxFailures Maximum number of failures before opening the circuit

* @param callTimeout [[scala.concurrent.duration.FiniteDuration]] of time after which to consider a call a failure

* @param resetTimeout [[scala.concurrent.duration.FiniteDuration]] of time after which to attempt to close the circuit

* @param executor [[scala.concurrent.ExecutionContext]] used for execution of state transition listeners

class CircuitBreaker(

scheduler: Scheduler,

maxFailures: Int,

callTimeout: FiniteDuration,

val resetTimeout: FiniteDuration,

maxResetTimeout: FiniteDuration,

exponentialBackoffFactor: Double)(implicit executor: ExecutionContext) extends AbstractCircuitBreaker

　　使用就比较简单了，就是把要执行的代码传给withCircuitBreaker或withSyncCircuitBreaker。

  /**

   * Wraps invocations of asynchronous calls that need to be protected

   *

   * @param body Call needing protected

   * @return [[scala.concurrent.Future]] containing the call result or a

   *   `scala.concurrent.TimeoutException` if the call timed out

   *

   */

  def withCircuitBreaker[T](body: ⇒ Future[T]): Future[T] = currentState.invoke(body, CircuitBreaker.exceptionAsFailure)

　　withSyncCircuitBreaker的参数是一个body，也就是函数的传名调用，你可以把它理解成一个函数指针吧。代码很简单，就是调用了currentState.invoke。那currentState是什么呢？还记得熔断器的三种状态么？开放、关闭、半开。很显然第一次调用的时候，应该是关闭状态，body的代码会被正常执行。

  /**

   * Helper method for accessing underlying state via Unsafe

   *

   * @return Reference to current state

   */

  @inline

  private[this] def currentState: State =

    Unsafe.instance.getObjectVolatile(this, AbstractCircuitBreaker.stateOffset).asInstanceOf[State]

　　currentState的类型是State。

　　State有三种实现：Closed、HalfOpen、Open。

   /**

     * Implementation of invoke, which simply attempts the call

     *

     * @param body Implementation of the call that needs protected

     * @return Future containing result of protected call

     */

    override def invoke[T](body: ⇒ Future[T], defineFailureFn: Try[T] ⇒ Boolean): Future[T] =

      callThrough(body, defineFailureFn)

　先来看Closed的invoke实现，很简单，调用callThrough，从名字来看应该就是直接调用body这个函数。

/**

     * Shared implementation of call across all states.  Thrown exception or execution of the call beyond the allowed

     * call timeout is counted as a failed call, otherwise a successful call

     *

     * @param body Implementation of the call

     * @param defineFailureFn function that define what should be consider failure and thus increase failure count

     * @return Future containing the result of the call

     */

    def callThrough[T](body: ⇒ Future[T], defineFailureFn: Try[T] ⇒ Boolean): Future[T] = {

      def materialize[U](value: ⇒ Future[U]): Future[U] = try value catch { case NonFatal(t) ⇒ Future.failed(t) }

      if (callTimeout == Duration.Zero) {

        val start = System.nanoTime()

        val f = materialize(body)

        f.onComplete {

          case s: Success[_] ⇒

            notifyCallSuccessListeners(start)

            callSucceeds()

          case Failure(ex) ⇒

            notifyCallFailureListeners(start)

            callFails()

        }

        f

      } else {

        val start = System.nanoTime()

        val p = Promise[T]()

        implicit val ec = sameThreadExecutionContext

        p.future.onComplete { fResult ⇒

          if (defineFailureFn(fResult)) {

            callFails()

          } else {

            notifyCallSuccessListeners(start)

            callSucceeds()

          }

        }

        val timeout = scheduler.scheduleOnce(callTimeout) {

          if (p tryFailure timeoutEx) {

            notifyCallTimeoutListeners(start)

          }

        }

        materialize(body).onComplete {

          case Success(result) ⇒

            p.trySuccess(result)

            timeout.cancel

          case Failure(ex) ⇒

            if (p.tryFailure(ex)) {

              notifyCallFailureListeners(start)

            }

            timeout.cancel

        }

        p.future

      }

    }

　　callThrough分两种情况，一种是body有超时时间，另一种是没有超时时间（会一直等待body调用成功）。我们只分析设置了超时时间的情况，跟没有超时时间的实现也差不多，无非是多了一个promise。

　　首先创建了一个Promise，然后设置了Promise的失败、成功的处理逻辑。失败调用callFails，成功调用callSucceeds。

　　materialize这个函数比较简单，就是直接调用body方法，处理了非致命异常。body调用成功，则设置promise状态为成功；否则设置Promise为失败。那如果body一直执行，超时了呢？

　　那就是scheduler.scheduleOnce这段代码发挥作用的时候了，它起了一个timer，到达时间后设置Promise为失败，然后调用notifyCallTimeoutListeners。但一定要注意，即使调用超时，body也会一直等待返回结果直至退出的。这跟future的机制有关。

　　简单起见，我们首先来看执行成功且没有超时的情况。根据源码逻辑应该执行notifyCallSuccessListeners(start)、 callSucceeds()。notifyCallTimeoutListeners不再分析，这是用来回调开发者定义的函数的。

 /**

     * On successful call, the failure count is reset to 0

     *

     * @return

     */

    override def callSucceeds(): Unit = set(0)

　　Closed的callSucceeds实现的是不是有点简单了，就是调用了set。set是干啥的呢？来看看Closed定义。

private object Closed extends AtomicInteger with State

　　Closed不仅是一个State，还是一个AtomicInteger。简单来说它还是一个线程安全的计数器。当调用成功时，就会把当前的值设置成0，那如果失败呢，应该是++吧。

 /**

     * On failed call, the failure count is incremented.  The count is checked against the configured maxFailures, and

     * the breaker is tripped if we have reached maxFailures.

     *

     * @return

     */

    override def callFails(): Unit = if (incrementAndGet() == maxFailures) tripBreaker(Closed)

　　来看看callFails，它就是把当前技术加1，然后判断是否到达最大失败次数，如果到达，则调用tripBreaker。

/**

   * Trips breaker to an open state.  This is valid from Closed or Half-Open states.

   *

   * @param fromState State we're coming from (Closed or Half-Open)

   */

  private def tripBreaker(fromState: State): Unit = transition(fromState, Open)

/**

   * Implements consistent transition between states. Throws IllegalStateException if an invalid transition is attempted.

   *

   * @param fromState State being transitioning from

   * @param toState State being transitioning from

   */

  private def transition(fromState: State, toState: State): Unit = {

    if (swapState(fromState, toState))

      toState.enter()

    // else some other thread already swapped state

  }

　　其实就是到达最大失败次数后，转换当前状态，到达Open，然后调用enter方法。

 /**

     * On entering this state, schedule an attempted reset via [[akka.actor.Scheduler]] and store the entry time to

     * calculate remaining time before attempted reset.

     *

     * @return

     */

    override def _enter(): Unit = {

      set(System.nanoTime())

      scheduler.scheduleOnce(currentResetTimeout) {

        attemptReset()

      }

      val nextResetTimeout = currentResetTimeout * exponentialBackoffFactor match {

        case f: FiniteDuration ⇒ f

        case _                 ⇒ currentResetTimeout

      }

      if (nextResetTimeout < maxResetTimeout)

        swapResetTimeout(currentResetTimeout, nextResetTimeout)

    }

　　上面是Open的enter实现，上来就调用set。set是啥？

 private object Open extends AtomicLong with State

　　从定义来看，set是AtomicLong里面的方法。其实就是设置进入Open状态的时间。然后启动一个timer，设置重置的时间。那如果达到指定时间，attemptReset做了什么呢？

/**

   * Attempts to reset breaker by transitioning to a half-open state.  This is valid from an Open state only.

   *

   */

  private def attemptReset(): Unit = transition(Open, HalfOpen)

　　额，其实就是到达了半开状态。我们来看看如果在开放状态，失败次数超过了指定次数，到达Open时，invoke的逻辑是怎样的。

/**

     * Fail-fast on any invocation

     *

     * @param body Implementation of the call that needs protected

     * @return Future containing result of protected call

     */

    override def invoke[T](body: ⇒ Future[T], defineFailureFn: Try[T] ⇒ Boolean): Future[T] = {

      notifyCallBreakerOpenListeners()

      Future.failed(new CircuitBreakerOpenException(remainingDuration()))

    }

　　invoke逻辑有点简单啊，就是通知了listener，然后直接设置返回值为失败了。根本没有调用body的代码。也就是说在Open状态，所有的调用都会直接失败，一直到达重置时间转入HalfOpen状态未止。

/**

     * On entry, guard should be reset for that first call to get in

     *

     * @return

     */

    override def _enter(): Unit = set(true)

　　进入半开状态逻辑还是调用set，只不过这里的值是true，那想必HalfOpen还是一个AtomicBoolean了。

/**

     * Allows a single call through, during which all other callers fail-fast.  If the call fails, the breaker reopens.

     * If the call succeeds the breaker closes.

     *

     * @param body Implementation of the call that needs protected

     * @return Future containing result of protected call

     */

    override def invoke[T](body: ⇒ Future[T], defineFailureFn: Try[T] ⇒ Boolean): Future[T] =

      if (compareAndSet(true, false))

        callThrough(body, defineFailureFn)

      else {

        notifyCallBreakerOpenListeners()

        Future.failed[T](new CircuitBreakerOpenException(0.seconds))

      }

　　那我们来看看invoke。它首先判断当前值是不是true，如果是true则赋值为false，很显然只有在第一次调用的时候改值才是true。也就是说从Open状态到HalfOpen状态时，只有第一次invoke会执行callThrough，也就是会执行body，如果是第二次则会直接失败。

　　我们知道callThrough在body执行成功时调用callSucceeds，失败时调用callFails。我们来看看HalfOpen这两个函数的实现。

    /**

     * Reset breaker on successful call.

     *

     * @return

     */

    override def callSucceeds(): Unit = resetBreaker()

    /**

     * Reopen breaker on failed call.

     *

     * @return

     */

    override def callFails(): Unit = tripBreaker(HalfOpen)

  /**

   * Resets breaker to a closed state.  This is valid from an Half-Open state only.

   *

   */

  private def resetBreaker(): Unit = transition(HalfOpen, Closed)

　　首先来看成功的情况，根据源码调用逻辑，它就是把状态从HalfOpen转换到了Closed状态。Closed状态我们已经分析过了。那失败如何处理呢？

  /**

   * Trips breaker to an open state.  This is valid from Closed or Half-Open states.

   *

   * @param fromState State we're coming from (Closed or Half-Open)

   */

  private def tripBreaker(fromState: State): Unit = transition(fromState, Open)

　　失败时又一次转到了Open状态，这个状态也分析过了。

　　至此akka的熔断器逻辑分析完毕。有没有非常简单？简单来说就是，刚开始熔断器处于Closed状态，用户的逻辑会正常执行，如果失败次数和超时次数超过指定次数，就会进入Open状态；Open状态不会执行用户逻辑，会直接失败，等到指定时间后，进入HalfOpen状态；HalfOpen状态第一次调用成功，就进入Closed状态，如果调用失败则重新进入Open状态，再次等待指定时间。

　　其实官网的一个图就可以解释清楚了。有人问withSyncCircuitBreaker跟withCircuitBreaker有啥区别，其实吧，区别不大，一个是同步的，一个是异步的。异步是通过Await.result来转化成同步的。

/**

   * Wraps invocations of synchronous calls that need to be protected

   *

   * Calls are run in caller's thread. Because of the synchronous nature of

   * this call the  `scala.concurrent.TimeoutException` will only be thrown

   * after the body has completed.

   *

   * Throws java.util.concurrent.TimeoutException if the call timed out.

   *

   * @param body Call needing protected

   * @param defineFailureFn function that define what should be consider failure and thus increase failure count

   * @return The result of the call

   */

  def withSyncCircuitBreaker[T](body: ⇒ T, defineFailureFn: Try[T] ⇒ Boolean): T =

    Await.result(

      withCircuitBreaker(

        try Future.successful(body) catch { case NonFatal(t) ⇒ Future.failed(t) },

        defineFailureFn),

      callTimeout)

　　上面代码可以从根本上解释两者的区别，我就不再过多分析了。

防雪崩利器：熔断器 Hystrix 的原理与使用

Akka源码分析-CircuitBreaker（熔断器）的更多相关文章

Akka源码分析-Akka Typed
对不起,akka typed 我是不准备进行源码分析的,首先这个库的API还没有release,所以会may change,也就意味着其概念和设计包括API都会修改,基本就没有再深入分析源码的意义了. ...
Akka源码分析-Akka-Streams-概念入门
今天我们来讲解akka-streams,这应该算akka框架下实现的一个很高级的工具.之前在学习akka streams的时候,我是觉得云里雾里的,感觉非常复杂,而且又难学,不过随着对akka源码的深 ...
Akka源码分析-Cluster-Metrics
一个应用软件维护的后期一定是要做监控,akka也不例外,它提供了集群模式下的度量扩展插件. 其实如果读者读过前面的系列文章的话,应该是能够自己写一个这样的监控工具的.简单来说就是创建一个actor,它 ...
Akka源码分析-Cluster-Distributed Publish Subscribe in Cluster
在ClusterClient源码分析中,我们知道,他是依托于“Distributed Publish Subscribe in Cluster”来实现消息的转发的,那本文就来分析一下Pub/Sub是如 ...
Akka源码分析-Cluster-Singleton
akka Cluster基本实现原理已经分析过,其实它就是在remote基础上添加了gossip协议,同步各个节点信息,使集群内各节点能够识别.在Cluster中可能会有一个特殊的节点,叫做单例节点. ...
Akka源码分析-Persistence
在学习akka过程中,我们了解了它的监督机制,会发现actor非常可靠,可以自动的恢复.但akka框架只会简单的创建新的actor,然后调用对应的生命周期函数,如果actor有状态需要回复,我们需要h ...
Akka源码分析-local-DeathWatch
生命周期监控,也就是死亡监控,是akka编程中常用的机制.比如我们有了某个actor的ActorRef之后,希望在该actor死亡之后收到响应的消息,此时我们就可以使用watch函数达到这一目的. c ...
Akka源码分析-Cluster-ActorSystem
前面几篇博客,我们依次介绍了local和remote的一些内容,其实再分析cluster就会简单很多,后面关于cluster的源码分析,能够省略的地方,就不再贴源码而是一句话带过了,如果有不理解的地方 ...
Akka源码分析-Serialization
今天我们来谈一下akka的序列化框架,其实序列化.反序列化是一个老生常谈的问题,那么我们为什么还要研究一下akka的序列化框架呢?不就是使用哪种序列化.反序列化方法的区别么?其实刚开始的时候我也是这么 ...

随机推荐

com.sun.xxx.utils不存在问题的解决
com.sun.org.apache.xml.internal.security.utils does not exist问题的解决在网上找个很多的答案,但我的问题没有解决,睡一晚上后,被我误打误撞 ...
解决idea中启动tomcat出现控制台乱码问题
尝试了很多方法,最后终于解决了,现在提供给大家一个我认为最简单也最有效的方案. 1.修改配置文件找到idea的安装目录,在bin文件夹下找到以下两个文件,用记事本或者其他软件打开: 然后两个文件中都 ...
转 asterisk app命令中文翻译
常规指令 Authenticate:鉴别用户 VMAuthenticate:根据“voicemail.conf”鉴别用户 Curl:接受外接URLs的修复.支持POSTing DUNDiLookup: ...
Edmonds 开花算法
Edmonds 开花算法 input: 图G,匹配M,未饱和点u idea: 查找从 u 開始的 M-交错路径.对每一个顶点记录父亲节点. 发现花朵.则收缩. 维护 S 和 T.S 表示沿着已经饱和的 ...
转: 将Eclipse代码导入到AndroidStudio的两种方式
评注: 讲解的非常之详细转自: http://www.cnblogs.com/ct2011/p/4183553.html 说到使用AndroidStudio,除了新建的项目,我们都会面临的问题 ...
CentOS 6.x Radius
CentOS 6.x Radius 一. 实现环境: 1.系统:CentOS release 6.6 (Final) 2.需要软件包: 1) freeradius-2.1.12-6.e16.x ...
SDUTOJ 2475 Power Strings
<pre class="cpp" name="code">#include<iostream> #include<stdio.h& ...
【PostgreSQL】PostgreSQL操作-psql基本命令
在阅读的过程中有不论什么问题,欢迎一起交流邮箱:1494713801@qq.com QQ:1494713801 一.建立数据库连接 ---------------- 接入PostgreSQL数 ...
优化 html 标签为何能用HTML/CSS解决的问题就不要使用JS？
优化 html 标签 2018年05月11日 08:56:24 阅读数:19 有些人写页面会走向一个极端,几乎页面所有的标签都用div,究其原因,用div有很多好处,一个是div没有默认样式,不会有m ...
STL源代码剖析——STL算法之set集合算法
前言本节介绍set集合的相关算法,各自是并集set_union,差集set_difference,交集set_intersection 和对称差集set_symmetric_difference.这 ...

Akka源码分析-CircuitBreaker（熔断器）

Akka源码分析-CircuitBreaker（熔断器）的更多相关文章

随机推荐

热门专题