// :: WARN RandomBlockReplicationPolicy: Expecting  replicas with only  peer/s.
// :: WARN BlockManager: Block input-- replicated to only peer(s) instead of peers
// :: ERROR Executor: Exception in task 0.0 in stage 113711.0 (TID )
java.lang.AssertionError: assertion failed
at scala.Predef$.assert(Predef.scala:)
at org.apache.spark.storage.BlockInfo.checkInvariants(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfo.readerCount_$eq(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$$$anonfun$apply$.apply(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$$$anonfun$apply$.apply(BlockInfoManager.scala:)
at scala.Option.foreach(Option.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$.apply(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$.apply(BlockInfoManager.scala:)
at scala.collection.Iterator$class.foreach(Iterator.scala:)
at scala.collection.AbstractIterator.foreach(Iterator.scala:)
at org.apache.spark.storage.BlockInfoManager.releaseAllLocksForTask(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockManager.releaseAllLocksForTask(BlockManager.scala:)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:)
at java.lang.Thread.run(Thread.java:)
// :: WARN TaskSetManager: Lost task 0.0 in stage 113711.0 (TID , localhost, executor driver): java.lang.AssertionError: assertion failed
at scala.Predef$.assert(Predef.scala:)
at org.apache.spark.storage.BlockInfo.checkInvariants(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfo.readerCount_$eq(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$$$anonfun$apply$.apply(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$$$anonfun$apply$.apply(BlockInfoManager.scala:)
at scala.Option.foreach(Option.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$.apply(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$.apply(BlockInfoManager.scala:)
at scala.collection.Iterator$class.foreach(Iterator.scala:)
at scala.collection.AbstractIterator.foreach(Iterator.scala:)
at org.apache.spark.storage.BlockInfoManager.releaseAllLocksForTask(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockManager.releaseAllLocksForTask(BlockManager.scala:)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:)
at java.lang.Thread.run(Thread.java:) // :: ERROR TaskSetManager: Task in stage 113711.0 failed times; aborting job
// :: ERROR JobScheduler: Error running job streaming job ms.
org.apache.spark.SparkException: An exception was raised by Python:
Traceback (most recent call last):
File "/home/admin/agent/spark/python/lib/pyspark.zip/pyspark/streaming/util.py", line , in call
r = self.func(t, *rdds)
File "/home/admin/agent/spark/python/lib/pyspark.zip/pyspark/streaming/dstream.py", line , in takeAndPrint
taken = rdd.take(num + )
File "/home/admin/agent/spark/python/lib/pyspark.zip/pyspark/rdd.py", line , in take
res = self.context.runJob(self, takeUpToNumLeft, p)
File "/home/admin/agent/spark/python/lib/pyspark.zip/pyspark/context.py", line , in runJob
port = self._jvm.PythonRDD.runJob(self._jsc.sc(), mappedRDD._jrdd, partitions)
File "/home/admin/agent/spark/python/lib/py4j-0.10.4-src.zip/py4j/java_gateway.py", line , in __call__
answer, self.gateway_client, self.target_id, self.name)
File "/home/admin/agent/spark/python/lib/py4j-0.10.4-src.zip/py4j/protocol.py", line , in get_return_value
format(target_id, ".", name), value)
py4j.protocol.Py4JJavaError: An error occurred while calling z:org.apache.spark.api.python.PythonRDD.runJob.
: org.apache.spark.SparkException: Job aborted due to stage failure: Task in stage 113711.0 failed times, most recent failure: Lost task 0.0 in stage 113711.0 (TID , localhost, executor driver): java.lang.AssertionError: assertion failed
at scala.Predef$.assert(Predef.scala:)
at org.apache.spark.storage.BlockInfo.checkInvariants(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfo.readerCount_$eq(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$$$anonfun$apply$.apply(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$$$anonfun$apply$.apply(BlockInfoManager.scala:)
at scala.Option.foreach(Option.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$.apply(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockInfoManager$$anonfun$releaseAllLocksForTask$.apply(BlockInfoManager.scala:)
at scala.collection.Iterator$class.foreach(Iterator.scala:)
at scala.collection.AbstractIterator.foreach(Iterator.scala:)
at org.apache.spark.storage.BlockInfoManager.releaseAllLocksForTask(BlockInfoManager.scala:)
at org.apache.spark.storage.BlockManager.releaseAllLocksForTask(BlockManager.scala:)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:)
at java.lang.Thread.run(Thread.java:)
  
Driver stacktrace:
at org.apache.spark.scheduler.DAGScheduler.org$apache$spark$scheduler$DAGScheduler$$failJobAndIndependentStages(DAGScheduler.scala:)
at org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$.apply(DAGScheduler.scala:)
at org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$.apply(DAGScheduler.scala:)
at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:)
at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:)
at org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:)
at org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$.apply(DAGScheduler.scala:)
at org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$.apply(DAGScheduler.scala:)
at scala.Option.foreach(Option.scala:)
at org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:)
at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.doOnReceive(DAGScheduler.scala:)
at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:)
at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:)
at org.apache.spark.util.EventLoop$$anon$.run(EventLoop.scala:)
at org.apache.spark.scheduler.DAGScheduler.runJob(DAGScheduler.scala:)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:)
at org.apache.spark.api.python.PythonRDD$.runJob(PythonRDD.scala:)
at org.apache.spark.api.python.PythonRDD.runJob(PythonRDD.scala)
at sun.reflect.GeneratedMethodAccessor55.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:)
at java.lang.reflect.Method.invoke(Method.java:)
at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:)
at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:)
at py4j.Gateway.invoke(Gateway.java:)
at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:)
at py4j.commands.CallCommand.execute(CallCommand.java:)
at py4j.GatewayConnection.run(GatewayConnection.java:)
at java.lang.Thread.run(Thread.java:)

排查原因1:

  1. 【不是】由于代码中checkpoint目录为本地导致,搭建了hdfs,将checkpoint移到hdfs,发现还是运行一天左右就挂掉,报错如上。

  2. 待续

请大虾们指点。

filebeat+kafka+SparkStreaming程序报错及解决办法的更多相关文章

  1. 【Runtime Error】打开Matlib7.0运行程序报错的解决办法

    1.在C盘建立一个文件夹temp,存放临时文件: 2.右键我的电脑-属性-高级系统设置-环境变量-系统变量,将TEMP.TMP的值改成C:\temp: 3.还是在第2步那里,新建变量,变量名称为BLA ...

  2. 未在本地计算机上注册“microsoft.ACE.oledb.12.0”提供程序报错的解决办法

    https://www.jb51.net/article/157457.htm 下载32位版本安装即可 Microsoft Access Database Engine Redistributable ...

  3. Base64 报错 的解决办法 (Base-64 字符数组或字符串的长度无效。, 输入的不是有效的 Base-64 字符串,因为它包含非 Base-64 字符、两个以上的填充字符,或者填充字符间包含非法字符。)

    Base64 报错 的解决办法, 报错如下:1. FormatException: The input is not a valid Base-64 string as it contains a n ...

  4. Springboot数据库连接池报错的解决办法

    Springboot数据库连接池报错的解决办法 这个异常通常在Linux服务器上会发生,原因是Linux系统会主动断开一个长时间没有通信的连接 那么我们的问题就是:数据库连接池长时间处于间歇状态,导致 ...

  5. Loadrunner参数化连接oracle、mysql数据源报错及解决办法

    Loadrunner参数化连接oracle.mysql数据源报错及解决办法 (本人系统是Win7 64,  两位小伙伴因为是默认安装lr,安装在 最终参数化的时候,出现连接字符串无法自动加载出来: 最 ...

  6. PHP empty函数报错的解决办法

    PHP empty函数在检测一个非变量情况下报错的解决办法. PHP开发时,当你使用empty检查一个函数返回的结果时会报错:Fatal error: Can't use function retur ...

  7. eclipse中的js文件报错的解决办法

    在使用别人的项目的时候,导入到eclipse中发现js文件报错,解决办法是关闭eclipse的js校验功能. 三个步骤: 1. 右键点击项目->properties->Validation ...

  8. VM装mac10.9教程+报错信息解决办法

    VM装mac10.9教程+报错信息解决办法 教程1: 教你在Vmware 10下安装苹果Mac10.9系统 地址:http://tieba.baidu.com/p/2847457021 教程2: VM ...

  9. Oracle数据库误删文件导致rman备份报错RMAN-06169解决办法

    Oracle数据库误删文件导致rman备份报错RMAN-06169解决办法 可能是误删文件导致在使用rman备份时候出现以下提示 RMAN-06169: could not read file hea ...

随机推荐

  1. 〖Linux〗Ubuntu13.10 安装qt开发环境

    sudo apt-get install qtcreator libqt4-dev libqt4-dbg libqt4-gui libqt4-sql qt4-dev-tools qt4-doc qt4 ...

  2. 《微赢微信公众平台系统5月14最新破解高级运营版+水果机+邀请函+微汽车+微食品+用户CRM》

    <微赢微信公众平台系统5月14最新破解高级运营版+水果机+邀请函+微汽车+微食品+用户CRM> 此版本号眼下是淘宝卖600RMB的,其他VIP源代码论坛也都还没有公布.咱们这里全然免费分享 ...

  3. ActiveMQ基本介绍

    1.ActiveMQ服务器工作模型       通过ActiveMQ消息服务交换消息.消息生产者将消息发送至消息服务,消息消费者则从消息服务接收这些消息.这些消息传送操作是使用一组实现 ActiveM ...

  4. urlretrieve 如何给文件下载设置下载进度?

    #python #xiaodeng #如何给文件下载设置下载进度? import urllib def callbackinfo(down,block,size): ''' 回调函数: down:已经 ...

  5. 商业规则引擎IBM WebSphere ILog JRules概述,开发基础教程

    Ilog Jrules开发基础教程有7篇,地址规则引擎Ilog Jrules开发基础教程[连载1]-- 概述篇 概述篇 规则引擎是一种嵌套在应用程序中的组件,它实现了将业务规则从应用程序代码中分离出来 ...

  6. java多线程学习--java.util.concurrent

    CountDownLatch,api 文档:http://docs.oracle.com/javase/7/docs/api/java/util/concurrent/CountDownLatch.h ...

  7. hello oc

    printf("Hello C\n"); //OC可以采用C语言的输出方式 printf("The number is %d\n",100);//%d 输出数字 ...

  8. 【转】一个对 Dijkstra 的采访视频

    一个对 Dijkstra 的采访视频 (也可以访问 YouTube 或者从源地址下载 MPEG1,300M) 之前在微博上推荐了一个对 Dijkstra 的采访视频,看了两遍之后觉得实在很好,所以再正 ...

  9. Web - TCP与UDP的差别

    是否面向连接:TCP面向连接.UDP面向非连接. 传输可靠性:TCP可靠.UDP不可靠. 应用场合:TCP经常使用于传输大量数据,UDP经常使用于传输少量数据. 速度:TCP传输速度较慢,而UDP速度 ...

  10. ArcMap导入数据到ArcSDE报000597或者000224的错误

    这两天碰到不同用户提出的不同的问题,可是分析之后发现导致该问题的解决办法是同一个原因. -------------------------------------------------------- ...