Log system architecture
0. 技术选型参考
1. Collector
Keywords: Collector, Processor
| 名称 | Beats | Fluentd-bit |
|---|---|---|
| Introduction | Beats are a collector and processor of lightweight (resource efficient, no dependencies, small) and open source log shippers that act as agents installed on the different servers in your infrastructure for collecting logs or metrics. | Fluent Bit was born to address the need for a high performance and optimized tool that can collect and process data from any input source, unify that data and deliver it to multiple destinations. |
| Owner | Elastic | Treasure Data |
| Open Source | True | True |
| Github Stars | 5742 | 608 |
| License | Apache License v2.0 | Apache License v2.0 |
| Scope | Containers / Servers / K8S | Containers / Servers / K8S |
| Language | Go | C |
| Memory | ~10MB | ~500KB |
| Performance | High | High |
| Dependencies | Zero dependencies, unless some special plugin requires them. | Zero dependencies, unless some special plugin requires them. |
| Category | Auditbeat,Filebeat,Heartbeat,Metricbeat,Packetbeat,Winlogbeat | NaN |
| Configuration | File(.yml)/Cmd | File(custom file extension and syntax)/Cmd |
| Essence | Collector & Processor | Collector & Processor |
| Input/Module | File, Docker, Syslog, Nginx, Mysql, Postgresql, etc | File,CPU, Disk, Docker, Syslog, etc |
| Output | Elasticsearch, Logstash, Kafka, Redis, File, Console | ES, File, Kafka, etc |
1.1 Filebeat 架构图
- Ingest Node - A es plugin which pre-process documents before the actual document indexing happen and replace for Logstash. The ingest node intercepts bulk and index requests, it applies transformations, and it then passes the documents back to the index or bulk APIs. Define a pipeline(Processors) that specifies a series of processors, then register the pipeline id in Filebeat configuration file.
- Kafka - Prevent loss of data and manage logging output speed.
1.2 Fluent bit 架构图
| Name | Description | Samples |
|---|---|---|
| Input | Entry point of data. Implemented through Input Plugins, this interface allows to gather or receive data. | Samples |
| Parser | Parsers allow to convert unstructured data gathered from the Input interface into a structured one. Parsers are optional and depends on Input plugins. | Prospector and processors in Filebeat |
| Filter | The filtering mechanism allows to alter the data ingested by the Input plugins. Filters are implemented as plugins. | Prospector and processors in Filebeat |
| Buffer | By default, the data ingested by the Input plugins, resides in memory until is routed and delivered to an Output interface. | |
| Routing | Data ingested by an Input interface is tagged, that means that a Tag is assigned and this one is used to determinate where the data should be routed based on a match rule. | |
| Output | An output defines a destination for the data. Destinations are handled by output plugins. Note that thanks to the Routing interface, the data can be delivered to multiple destinations. | Samples |
2. Log Transporter
Keywords: Collector, Processor, Aggregator
| 名称 | Logstah | Fluentd |
|---|---|---|
| Introduction | Logstash is an open source, server-side data processing pipeline that ingests data from a multitude of sources simultaneously, transforms it, and then sends it to your stash. | Fluentd is an open source data collector, which lets you unify the data. |
| Owner | Elastic | Treasure Data |
| Open Source | True | True |
| Github Stars | 9105 | 6489 |
| License | Apache License v2.0 | Apache License v2.0 |
| Scope | Containers / Servers / K8S | Containers / Servers / K8S |
| Language | JRuby(JVM) | Ruby & C |
| Memory | 200MB+ | ~40MB |
| Performance | Middle | High |
| Dependencies | JVM | Ruby Gem |
| Configuration | File(custom file extension and syntax)/Cmd | File(custom file extension and syntax)/Cmd |
| Essence | Collector, Processor, Aggregator | CCollector, Processor, Aggregator |
| Input/Module | Limited only by your imagination(Serilog) | Limited only by your imagination(Nlog) |
| Output | Limited only by your imagination | Limited only by your imagination |
Further Reading: Fluentd vs. Logstash: A Comparison of Log Collectors
3. 初步总结
| 比较 | Beats + Logstash | Fluentd bit + Fluentd | 说明 |
|---|---|---|---|
| 功能实现 | √ | √ | 基本一致 |
| 安装与配置简易性 | √ | ||
| 内存占用 | √ | JVM 特性使然 | |
| 可靠性 | √ | √ | 前者使用 registry file + redis 实现可靠性,后者使用内置 buffering 实现可靠性 |
| 可扩展性 | √ | √ | 插件生态和可扩展性基本一致。后者为分布型插件管理 |
| 趋势 | √ | ELK -> EFK | |
| 其他 | √ | √ | 前者更倾向于使用 go & java 技术栈,后者有 docker, k8s 官方 log driver 类型和案例支持 |
Tips: 任一层级都可以自由替换.
4. Visualizer
Keywords: Query, Analyze, Monitor
| 名称 | Kibana | Grafana |
|---|---|---|
| Introduction | Kibana is an open source data visualization plugin for Elasticsearch. | Data visualization & Monitoring with support for Graphite, InfluxDB, Prometheus, Elasticsearch and many more databases.The leading open source software for time series analytics. |
| Owner | Elastic | Grafana |
| Open Source | True | True |
| Github Stars | 9k+ | 22k+ |
| License | Apache License v2.0 | Apache License v2.0 |
| Scope | ElasticSearch only | ElasticSearch, InfluxDB, PostgreSQL etc |
| Language | Javascript | Go & Typescript |
| Configuration | File(.yml)/Cmd | File(custom file extension and syntax)/Cmd |
| Simple Query | Lucene syntax and filter components | filter components.Different from each other data source |
| Full-Text Query | Yes | No |
| Security | Plugins or libraries | Integration |
| Notification | Plugins or libraries | Integration |
| Advantages | Log, ES | Multiple data source, APM, Timeseries |
Working together.
5. Log Storage and Analyzer
Keywords:Storage, ES, Postgresql, Zombodb, Arangodb
5.1 ElasticSearch
- 同时支持单文档的对象搜索+模糊搜索+全文搜索
- Skywalking 官方支持存储媒介
- 作为流行 Output 支持绝大部分 Log 相关系统
- 天生分布式
- 一键设置过期窗口,索引重建
- ……
- 占用资源较多,对存储介质要求高
- 运维成本更高
- 持久化
- 安全性 - Search Guard
- ……
6. 总结
- Sinks(Log sinks, Beats, Fluentd-bit) -> Storages(ElasticSearch, Postgresql,Zombodb etc).
- Collctors(Beats, Fluentd-bit) -> Kafka -> Fluentd -> Storages(ElasticSearch, Postgresql,Zombodb etc).
7. 扩展
Log system architecture的更多相关文章
- Heterogeneous System Architecture
https://en.wikipedia.org/wiki/Heterogeneous_System_Architecture Steps performed when offloading calc ...
- WikiMedia system architecture
w 前端 服务端 后端
- Crazyflie 2.0 System Architecture
Crazyflie 2.0架构包含两个微控制器: A NRF51, Cortex-M0, 用于实现无线通信和电源管理: (1)按键开关逻辑(ON/OFF logic) (2)控制给其它系统供电(STM ...
- Linux System Log Collection、Log Integration、Log Analysis System Building Learning
目录 . 为什么要构建日志系统 . 通用日志系统的总体架构 . 日志系统的元数据来源:data source . 日志系统的子安全域日志收集系统:client Agent . 日志系统的中心日志整合系 ...
- 学习:Log中'main', 'system', 'radio', 'events'
在Android中不同的log写到不同的设备中,共有/dev/log/system, /dev/log/main, /dev/log/radion, /dev/log/events四中类型.其中默认L ...
- 分布式学习材料Distributed System Prerequisite List
接下的内容按几个大类来列:1. 文件系统a. GFS – The Google File Systemb. HDFS1) The Hadoop Distributed File System2) Th ...
- 100 open source Big Data architecture papers for data professionals
zhuan :https://www.linkedin.com/pulse/100-open-source-big-data-architecture-papers-anil-madan Big Da ...
- Sharing The Application Tier File System in Oracle E-Business Suite Release 12.2
The most current version of this document can be obtained in My Oracle Support Knowledge Document 13 ...
- Game Engine Architecture 9
[Game Engine Architecture 9] 1.Formatted Output with OutputDebugString() int VDebugPrintF(const char ...
随机推荐
- Pycharm:使用笔记
1.快捷键笔记 CTRL+D:复制当前行 CTRL+/:注释选中行 CTRL+ALT+L:自动format,自动进行标准格式化 ALT + 7:查看当前文件的类和方法概览 Ctrl + Y 删除 ...
- layer.js 中弹框显示不全的问题
在使用 layer.js 做弹框的时候,遇到在浏览器缩小时,弹框显示不全的问题,如下: 这是不行的,因为我们有的时候想缩小浏览器视窗,但是一旦缩小到一定程度,就会把弹窗的关闭按钮遮住一部分,并且主体弹 ...
- springboot @Value获取值为空,解决办法
在spring中,常常使用 @Value("${property}") 从application.properties中取值,需要注意两点 使用 @Value 的类不能使用 new ...
- HTTPS数据传输过程简介
HTTPS数据传输过程 1.客户端发起HTTPS连接握手 2.服务端收到HTTPS握手连接请求,与客户建立握手过程,和TCP的三次握手类似,并发送一系列的加密算法组合给客户端,与客户端协商加密算法组合 ...
- mysql修改密码方法
1. 修改密码有三种方法:1.1 ---->用mysqladmin修改密码格式:mysqladmin -u用户名 -p旧密码 password 新密码 例子:# mysqladmin -uroo ...
- Linux 子网掩码计算, 二进制十进制互相转换
看下边例子 192.168.0.1/24 192.168.0.1/32 192.168.0.1/28 上边24,32,28对应的掩码都是什么,怎么计算的 24,32,28,对应的就是多少个二进制的1 ...
- 真机调试adb:wait for device 解决方案
1.adb logcat 命令的时候,cmd总是提示adb server did't ACK. 分析一下,明显adb server没有开启成功,服务启动失败一般都是端口绑定失败,所以我们只 ...
- (转)C# Windows服务 弹出消息提醒框
出处:http://blog.csdn.net/donghui6116773/article/details/53467069 服务(Service)对于大家来说一定不会陌生,它是Windows 操作 ...
- 关于TP5.0搜索后分页
使用$Model->where($where)->paginate($page,$count);的时候如果点击下一页会出现条件重置问题,经过测试可以使用Db::name('data') - ...
- Java15-java语法基础(十四)抽象类
Java15-java语法基础(十四)抽象类 一.抽象类的作用 三个类都有"执行任务"的方法,分别在这三个类中进行定义,因此需要重复编写代码,降低了程序开发效率,且增加了程序出现错 ...