three supported reliability levels: * End-to-end * Store on failure * Best effort
https://github.com/cloudera/flume/blob/master/flume-docs/src/docs/UserGuide/Introduction
| === Reliability | |
| Reliability, the ability to continue delivering events in the face of | |
| failures without losing data, is a vital feature of Flume. Large | |
| distributed systems can and do suffer partial failures in many ways - | |
| physical hardware can fail, resources such as network bandwidth or | |
| memory can become scarce, or software can crash or run slowly. Flume | |
| emphasizes fault-tolerance as a core design principle and keeps | |
| running and collecting data even when many components have failed. | |
| Flume can guarantee that all data received by an agent node will | |
| eventually make it to the collector at the end of its flow as long as | |
| the agent node keeps running. That is, data can be *reliably* | |
| delivered to its eventual destination. | |
| However, reliable delivery can be very resource intensive and is often | |
| a stronger guarantee than some data sources require. Therefore, Flume | |
| allows the user to specify, on a per-flow basis, the level of | |
| reliability required. There are three supported reliability levels: | |
| * End-to-end | |
| * Store on failure | |
| * Best effort | |
| .A Note About Reliability | |
| ****************** | |
| Although Flume is extremely tolerant to machine, network, and software | |
| failures, there is never any such thing as '100% reliability'. If all | |
| the machines in a Flume installation were irrevocably destroyed in | |
| some terrible data center incident, all copies of Flume's data would | |
| be lost and there would be no way to recover them. Therefore all of | |
| Flume's reliability levels make guarantees about data delivery 'until | |
| some maximum number of failures have occurred'. Flume's failure modes | |
| - in terms of what can fail and what will keep running if they do - | |
| are described in detail later in this guide. | |
| ****************** | |
| The *end-to-end* reliability level guarantees that once Flume accepts | |
| an event, that event will make it to the endpoint - as long as the | |
| agent that accepted the event remains live long enough. The first | |
| thing the agent does in this setting is write the event to disk in a | |
| ''write-ahead log'' (WAL) so that, if the agent crashes and restarts, | |
| knowledge of the event is not lost. After the event has successfully | |
| made its way to the end of its flow, an acknowledgment is sent back to | |
| the originating agent so that it knows it no longer needs to store the | |
| event on disk. This reliability level can withstand any number of | |
| failures downstream of the initial agent. | |
| The *store on failure* reliability level causes nodes to only require | |
| an acknowledgement from the node one hop downstream. If the sending | |
| node detects a failure, it will store data on its local disk until the | |
| downstream node is repaired, or an alternate downstream destination | |
| can be selected. While this is effective, data can be lost if a | |
| compound or silent failure occurs. | |
| The *best-effort* reliability level sends data to the next hop with no | |
| attempts to confirm or retry delivery. If nodes fail, any data that | |
| they were in the process of transmitting or receiving can be | |
| lost. This is the weakest reliability level, but also the most | |
| lightweight. |
| === Reliability | |
| Reliability, the ability to continue delivering events in the face of | |
| failures without losing data, is a vital feature of Flume. Large | |
| distributed systems can and do suffer partial failures in many ways - | |
| physical hardware can fail, resources such as network bandwidth or | |
| memory can become scarce, or software can crash or run slowly. Flume | |
| emphasizes fault-tolerance as a core design principle and keeps | |
| running and collecting data even when many components have failed. | |
| Flume can guarantee that all data received by an agent node will | |
| eventually make it to the collector at the end of its flow as long as | |
| the agent node keeps running. That is, data can be *reliably* | |
| delivered to its eventual destination. | |
| However, reliable delivery can be very resource intensive and is often | |
| a stronger guarantee than some data sources require. Therefore, Flume | |
| allows the user to specify, on a per-flow basis, the level of | |
| reliability required. There are three supported reliability levels: | |
| * End-to-end | |
| * Store on failure | |
| * Best effort | |
| .A Note About Reliability | |
| ****************** | |
| Although Flume is extremely tolerant to machine, network, and software | |
| failures, there is never any such thing as '100% reliability'. If all | |
| the machines in a Flume installation were irrevocably destroyed in | |
| some terrible data center incident, all copies of Flume's data would | |
| be lost and there would be no way to recover them. Therefore all of | |
| Flume's reliability levels make guarantees about data delivery 'until | |
| some maximum number of failures have occurred'. Flume's failure modes | |
| - in terms of what can fail and what will keep running if they do - | |
| are described in detail later in this guide. | |
| ****************** | |
| The *end-to-end* reliability level guarantees that once Flume accepts | |
| an event, that event will make it to the endpoint - as long as the | |
| agent that accepted the event remains live long enough. The first | |
| thing the agent does in this setting is write the event to disk in a | |
| ''write-ahead log'' (WAL) so that, if the agent crashes and restarts, | |
| knowledge of the event is not lost. After the event has successfully | |
| made its way to the end of its flow, an acknowledgment is sent back to | |
| the originating agent so that it knows it no longer needs to store the | |
| event on disk. This reliability level can withstand any number of | |
| failures downstream of the initial agent. | |
| The *store on failure* reliability level causes nodes to only require | |
| an acknowledgement from the node one hop downstream. If the sending | |
| node detects a failure, it will store data on its local disk until the | |
| downstream node is repaired, or an alternate downstream destination | |
| can be selected. While this is effective, data can be lost if a | |
| compound or silent failure occurs. | |
| The *best-effort* reliability level sends data to the next hop with no | |
| attempts to confirm or retry delivery. If nodes fail, any data that | |
| they were in the process of transmitting or receiving can be | |
| lost. This is the weakest reliability level, but also the most | |
| lightweight. |
three supported reliability levels: * End-to-end * Store on failure * Best effort的更多相关文章
- SignalR Supported Platforms -摘自网络
SignalR is supported under a variety of server and client configurations. In addition, each transpor ...
- store操作
store.remove(rs); store.sync({ success: function (e, opt) { this.store.commitChanges(); }, failure: ...
- extjs 解决使用store.sync()方法更新item有时不触发后台action的问题
问题描述: extjs 解决使用store.sync()方法更新item有时不触发后台action,不出发后台action的原因是item的字段值没有变化 解决方法: item.setDirty(tr ...
- PMP用语集
AC actual cost 实际成本 ACWP actual cost of work performed 已完工作实际成本 BAC budget at completion 完工预算 BCWP b ...
- Solaris10安装配置LDAP(iPlanet Directory Server )
Solaris10安装光盘自带了iPlanet Directory Server安装包,系统管理员可以利用iPlanet Directory Server在Solaris系统创建一个LDAP Serv ...
- memory ordering 内存排序
Memory ordering - Wikipedia https://en.wikipedia.org/wiki/Memory_ordering https://zh.wikipedia.org/w ...
- Web测试介绍2一 安全测试
安全测试是在IT软件产品的生命周期中,特别是产品开发基本完成到发布阶段,对产品进行检验以验证产品符合安全需求定义和产品质量标准的过程. 主要安全需求包括: (i) 认证 Authent ...
- Python 3.6.0的sqlite3模块无法执行VACUUM语句
Python 3.6.0的sqlite3模块存在一个bug(见issue 29003),无法执行VACUUM语句. 一执行就出现异常: Traceback (most recent call last ...
- Flume1.5.0的安装、部署、简单应用(含伪分布式、与hadoop2.2.0、hbase0.96的案例)
目录: 一.什么是Flume? 1)flume的特点 2)flume的可靠性 3)flume的可恢复性 4)flume 的 一些核心概念 二.flume的官方网站在哪里? 三.在哪里下载? 四.如何安 ...
随机推荐
- 转 手把手教你最简单的开源项目托管GitHub入门教程
传送门 自从google code关闭了下载服务了之后,GitHub作为了目前最好用的免费开源项目托管站点,众多开源项目都托管在github,其中不乏著名的播放器MPC-HC. 不习惯于英文的朋友,难 ...
- Windows下php环境变量的配置
1.找到php的路径比如"E:\php_env\PHP". 2.需要保证该目录下php.ini的配置是正确的,如果是刚下载的php包,则可能需要修改相应的一些配置:将php目 ...
- Jmeter骚操作—文件上传、下载
最近很多同学都在问jmeter上传.下载文件的脚本怎么做,要压测上传.下载文件的功能,脚本怎么做,网上查了都说的很含糊,这次呢,咱们就好好的把jmeter的上传下载文件好好缕缕,都整明白了,怎么个过程 ...
- make makefile cmake qmake都是什么,有什么区别
原文:https://www.zhihu.com/question/27455963 作者:玟清链接:https://www.zhihu.com/question/27455963/answer/36 ...
- springboot idea激活指定profile
多Profile文件 配置文件编写的时,可以是application-{profile}.properties/yml,默认使用application.properties的配置: 激活指定profi ...
- 笔记-迎难而上之Java基础进阶6
import java.io.*; public class InputStreamDemo{ public static void main(String[] args) throws IOExce ...
- Android-TextView属性ellipsize多行失效的解决思路
多余文字显示省略号的常规做法 android:ellipsize="end" //省略号显示在末尾 android:ellipsize="middle" //省 ...
- [反汇编练习] 160个CrackMe之030
[反汇编练习] 160个CrackMe之030. 本系列文章的目的是从一个没有任何经验的新手的角度(其实就是我自己),一步步尝试将160个CrackMe全部破解,如果可以,通过任何方式写出一个类似于注 ...
- 转: 如何选CDN:互联网大直播时代的CDN选择指南
from: http://www.chnvideo.com/blog-classic-cdn.html SRS 编码器 如何选CDN:互联网大直播时代的CDN选择指南 CDN是一个服务型的公司, ...
- Frotinet60D IP映射 開放指定端口是實際應用
有多個外網IP的環境中,在某一部電腦上,如何使用多個外網IP中的某一個?且為使用的Bitcomet開放TCP,TUP特定端口? 1.在本機windows防火墻中"輸入" 和&quo ...