Log system architecture

0. 技术选型参考

1. Collector

Keywords: Collector, Processor

名称	Beats	Fluentd-bit
Introduction	Beats are a collector and processor of lightweight (resource efficient, no dependencies, small) and open source log shippers that act as agents installed on the different servers in your infrastructure for collecting logs or metrics.	Fluent Bit was born to address the need for a high performance and optimized tool that can collect and process data from any input source, unify that data and deliver it to multiple destinations.
Owner	Elastic	Treasure Data
Open Source	True	True
Github Stars	5742	608
License	Apache License v2.0	Apache License v2.0
Scope	Containers / Servers / K8S	Containers / Servers / K8S
Language	Go	C
Memory	~10MB	~500KB
Performance	High	High
Dependencies	Zero dependencies, unless some special plugin requires them.	Zero dependencies, unless some special plugin requires them.
Category	Auditbeat,Filebeat,Heartbeat,Metricbeat，Packetbeat，Winlogbeat	NaN
Configuration	File(.yml)/Cmd	File(custom file extension and syntax)/Cmd
Essence	Collector & Processor	Collector & Processor
Input/Module	File, Docker, Syslog, Nginx, Mysql, Postgresql, etc	File,CPU, Disk, Docker, Syslog, etc
Output	Elasticsearch, Logstash, Kafka, Redis, File, Console	ES, File, Kafka, etc

1.1 Filebeat 架构图

Ingest Node - A es plugin which pre-process documents before the actual document indexing happen and replace for Logstash. The ingest node intercepts bulk and index requests, it applies transformations, and it then passes the documents back to the index or bulk APIs. Define a pipeline(Processors) that specifies a series of processors, then register the pipeline id in Filebeat configuration file.
Kafka - Prevent loss of data and manage logging output speed.

1.2 Fluent bit 架构图

Name	Description	Samples
Input	Entry point of data. Implemented through Input Plugins, this interface allows to gather or receive data.	Samples
Parser	Parsers allow to convert unstructured data gathered from the Input interface into a structured one. Parsers are optional and depends on Input plugins.	Prospector and processors in Filebeat
Filter	The filtering mechanism allows to alter the data ingested by the Input plugins. Filters are implemented as plugins.	Prospector and processors in Filebeat
Buffer	By default, the data ingested by the Input plugins, resides in memory until is routed and delivered to an Output interface.
Routing	Data ingested by an Input interface is tagged, that means that a Tag is assigned and this one is used to determinate where the data should be routed based on a match rule.
Output	An output defines a destination for the data. Destinations are handled by output plugins. Note that thanks to the Routing interface, the data can be delivered to multiple destinations.	Samples

2. Log Transporter

Keywords: Collector, Processor, Aggregator

名称	Logstah	Fluentd
Introduction	Logstash is an open source, server-side data processing pipeline that ingests data from a multitude of sources simultaneously, transforms it, and then sends it to your stash.	Fluentd is an open source data collector, which lets you unify the data.
Owner	Elastic	Treasure Data
Open Source	True	True
Github Stars	9105	6489
License	Apache License v2.0	Apache License v2.0
Scope	Containers / Servers / K8S	Containers / Servers / K8S
Language	JRuby（JVM）	Ruby & C
Memory	200MB+	~40MB
Performance	Middle	High
Dependencies	JVM	Ruby Gem
Configuration	File(custom file extension and syntax)/Cmd	File(custom file extension and syntax)/Cmd
Essence	Collector, Processor, Aggregator	CCollector, Processor, Aggregator
Input/Module	Limited only by your imagination（Serilog）	Limited only by your imagination（Nlog）
Output	Limited only by your imagination	Limited only by your imagination

Further Reading: Fluentd vs. Logstash: A Comparison of Log Collectors

3. 初步总结

比较	Beats + Logstash	Fluentd bit + Fluentd	说明
功能实现	√	√	基本一致
安装与配置简易性	√
内存占用		√	JVM 特性使然
可靠性	√	√	前者使用 registry file + redis 实现可靠性，后者使用内置 buffering 实现可靠性
可扩展性	√	√	插件生态和可扩展性基本一致。后者为分布型插件管理
趋势		√	ELK -> EFK
其他	√	√	前者更倾向于使用 go & java 技术栈，后者有 docker, k8s 官方 log driver 类型和案例支持

Tips: 任一层级都可以自由替换.

4. Visualizer

Keywords: Query, Analyze, Monitor

名称	Kibana	Grafana
Introduction	Kibana is an open source data visualization plugin for Elasticsearch.	Data visualization & Monitoring with support for Graphite, InfluxDB, Prometheus, Elasticsearch and many more databases.The leading open source software for time series analytics.
Owner	Elastic	Grafana
Open Source	True	True
Github Stars	9k+	22k+
License	Apache License v2.0	Apache License v2.0
Scope	ElasticSearch only	ElasticSearch, InfluxDB, PostgreSQL etc
Language	Javascript	Go & Typescript
Configuration	File(.yml)/Cmd	File(custom file extension and syntax)/Cmd
Simple Query	Lucene syntax and filter components	filter components.Different from each other data source
Full-Text Query	Yes	No
Security	Plugins or libraries	Integration
Notification	Plugins or libraries	Integration
Advantages	Log, ES	Multiple data source, APM, Timeseries

Working together.

5. Log Storage and Analyzer

Keywords:Storage, ES, Postgresql, Zombodb, Arangodb

5.1 ElasticSearch

同时支持单文档的对象搜索+模糊搜索+全文搜索
Skywalking 官方支持存储媒介
作为流行 Output 支持绝大部分 Log 相关系统
天生分布式
一键设置过期窗口，索引重建
……

占用资源较多，对存储介质要求高
运维成本更高
持久化
安全性 - Search Guard
……

6. 总结

Sinks(Log sinks, Beats, Fluentd-bit) -> Storages(ElasticSearch, Postgresql,Zombodb etc).
Collctors(Beats, Fluentd-bit) -> Kafka -> Fluentd -> Storages(ElasticSearch, Postgresql,Zombodb etc).

7. 扩展

Log system architecture的更多相关文章

Heterogeneous System Architecture
https://en.wikipedia.org/wiki/Heterogeneous_System_Architecture Steps performed when offloading calc ...
WikiMedia system architecture
w 前端服务端后端
Crazyflie 2.0 System Architecture
Crazyflie 2.0架构包含两个微控制器: A NRF51, Cortex-M0, 用于实现无线通信和电源管理: (1)按键开关逻辑(ON/OFF logic) (2)控制给其它系统供电(STM ...
Linux System Log Collection、Log Integration、Log Analysis System Building Learning
目录 . 为什么要构建日志系统 . 通用日志系统的总体架构 . 日志系统的元数据来源:data source . 日志系统的子安全域日志收集系统:client Agent . 日志系统的中心日志整合系 ...
学习：Log中'main', 'system', 'radio', 'events'
在Android中不同的log写到不同的设备中,共有/dev/log/system, /dev/log/main, /dev/log/radion, /dev/log/events四中类型.其中默认L ...
分布式学习材料Distributed System Prerequisite List
接下的内容按几个大类来列:1. 文件系统a. GFS – The Google File Systemb. HDFS1) The Hadoop Distributed File System2) Th ...
100 open source Big Data architecture papers for data professionals
zhuan :https://www.linkedin.com/pulse/100-open-source-big-data-architecture-papers-anil-madan Big Da ...
Sharing The Application Tier File System in Oracle E-Business Suite Release 12.2
The most current version of this document can be obtained in My Oracle Support Knowledge Document 13 ...
Game Engine Architecture 9
[Game Engine Architecture 9] 1.Formatted Output with OutputDebugString() int VDebugPrintF(const char ...

随机推荐

RocketMQ入门(Filter)_5
RocketMQ中存储的消息对于消费者来说,并不完全都是他们需要的,因此需要对消息进行过滤. 订阅Topic主题 ,选择Tags都是我们简单的过滤.Topic是大分类,Tags是二级分类. Rocke ...
tp5框架中jquery+ajax分页
jaxa分页,点击按钮直接替换数据, //php代码$page=Request::instance()->param("page"); $page = empty($page ...
spring-AOP之通知和顾问
通知和顾问都是切面的实现形式,其中通知可以完成对目标对象方法简单的织入功能. 而顾问包装了通知,可以让我们对通知实现更加精细化的管理,让我们可以指定具体的切入点. 通知分为前置通知,环绕通知及后置通知 ...
EntityFrameworkCore DBFirst
需要引用如下nuget包 Microsoft.EntityFrameworkCore Microsoft.EntityFrameworkCore.SqlServer Microsoft.EntityF ...
django内置分页功能扩展
实现自定制页码数类型class myPaginator(Paginator): def __init__(self,curr_page,per_page_num,*args,**kwargs): se ...
ABAP 省市县级联搜索帮助
在展示ABAP代码之前,先建立自建表ZCHENH006,表中包含两个关键字段 BELNR(地区编码),SDESC(地区描述). 编码规则参考:身份证前六位地区编码规则,可参考我另外一篇Blog导入系统 ...
cdnbest配置强制ssl跳转
如何配置强制ssl跳转 1. 登陆用户站点,点击下图图标: 2. 如下图添加证书和开启强制ssl即可 hsts解释和作用: 国际互联网工程组织IETF正在推行一种新的Web安全协议HTTP Stric ...
c++ 面试题(算法类)
1,从无序的数据流中找到其中位数:(用大根堆和小根堆来实现) float getMidimum(vector<int>& nums) { priority_queue<int ...
373. Find K Pairs with Smallest Sums 找出求和和最小的k组数
［抄题］: You are given two integer arrays nums1 and nums2 sorted in ascending order and an integer k. D ...
什么是XML？
XML被设计用来传输和存储数据. HTML被设计用来显示数据. 什么是XML? XML指可扩展标记语言(EXtensible Markup Language) XML是一种标记语言,很类似HTML X ...

Log system architecture

0. 技术选型参考

1. Collector

1.1 Filebeat 架构图

1.2 Fluent bit 架构图

2. Log Transporter

3. 初步总结

4. Visualizer

5. Log Storage and Analyzer

5.1 ElasticSearch

6. 总结

7. 扩展

Log system architecture的更多相关文章

随机推荐

热门专题