FLume监控文件夹，将数据发送给Kafka以及HDFS的配置文件详解

详细配置文件flume-conf.properties如下：

############################################

#  producer config

###########################################

#agent section

producer.sources = s

producer.channels = c c1

producer.sinks = r r1

#source section

#producer.sources.s.type = exec

#producer.sources.s.command = tail -f -n+1 /usr/local/test.log

producer.sources.s.type = spooldir

producer.sources.s.spoolDir = /usr/local/testlog

producer.sources.s.fileHeader = true

producer.sources.s.batchSize = 100

producer.sources.s.channels = c c1

# Each sink's type must be defined

producer.sinks.r.type = org.apache.flume.plugins.KafkaSink

producer.sinks.r.metadata.broker.list=127.0.0.1:9092

producer.sinks.r.partition.key=0

producer.sinks.r.partitioner.class=org.apache.flume.plugins.SinglePartition

producer.sinks.r.serializer.class=kafka.serializer.StringEncoder

producer.sinks.r.request.required.acks=0

producer.sinks.r.max.message.size=1000000

producer.sinks.r.producer.type=sync

producer.sinks.r.custom.encoding=UTF-8

producer.sinks.r.custom.topic.name=topcar

#store in HDFS

producer.sinks.r1.type = hdfs

producer.sinks.r1.channel = c1

producer.sinks.r1.hdfs.path=hdfs://node2:9000/user/flume/events/%Y-%m-%d-%H

producer.sinks.r1.hdfs.filePrefix=events-

#producer.sinks.r1.hdfs.fileSuffix = .log #设定后缀

producer.sinks.r1.hdfs.round = true

producer.sinks.r1.hdfs.roundValue = 10

producer.sinks.r1.hdfs.roundUnit = minute

#--文件格式:默认SequenceFile，可选 DataStream \ CompressedStream

producer.sinks.r1.hdfs.fileType=DataStream

#--Format for sequence file records. “Text” or “Writable”

producer.sinks.r1.hdfs.writeFormat=Text

producer.sinks.r1.hdfs.rollInterval=0

#--触发roll操作的文件大小in bytes (0: never roll based on file size)

producer.sinks.r1.hdfs.rollSize=128000000

#--在roll操作之前写入文件的事件数量(0 = never roll based on number of events)

producer.sinks.r1.hdfs.rollCount=0

producer.sinks.r1.hdfs.idleTimeout=60

#--使用local time来替换转移字符 (而不是使用event header的timestamp)

producer.sinks.r1.hdfs.useLocalTimeStamp = true

producer.channels.c1.type = memory

producer.channels.c1.capacity = 1000

producer.channels.c1.transactionCapacity=1000

producer.channels.c1.keep-alive=30

#Specify the channel the sink should use

producer.sinks.r.channel = c

# Each channel's type is defined.

producer.channels.c.type = memory

producer.channels.c.capacity = 1000

############################################

#   consumer config

###########################################

consumer.sources = s

consumer.channels = c

consumer.sinks = r

consumer.sources.s.type = seq

consumer.sources.s.channels = c

consumer.sinks.r.type = logger

consumer.sinks.r.channel = c

consumer.channels.c.type = memory

consumer.channels.c.capacity = 100

consumer.sources.s.type = org.apache.flume.plugins.KafkaSource

consumer.sources.s.zookeeper.connect=127.0.0.1:2181

consumer.sources.s.group.id=testGroup

consumer.sources.s.zookeeper.session.timeout.ms=400

consumer.sources.s.zookeeper.sync.time.ms=200

consumer.sources.s.auto.commit.interval.ms=1000

consumer.sources.s.custom.topic.name=topcar

consumer.sources.s.custom.thread.per.consumer=4

Flume启动命令如下：

bin/flume-ng agent --conf conf --conf-file conf/flume-conf.properties --name producer -Dflume.root.logger=INFO,console

FLume监控文件夹，将数据发送给Kafka以及HDFS的配置文件详解的更多相关文章

Python 的 pyinotify 模块监控文件夹和文件的变动
官方参考: https://github.com/seb-m/pyinotify/wiki/Events-types https://github.com/seb-m/pyinotify/wiki/I ...
Storm监控文件夹变化统计文件单词数量
监控指定文件夹,读取文件(新文件动态读取)里的内容,统计单词的数量. FileSpout.java,监控文件夹,读取新文件内容 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 ...
【.Net 学习系列】-- FileSystemWatcher 监控文件夹新生成文件，并在确认文件没有被其他程序占用后将其移动到指定文件夹
监控文件夹测试程序: using System; using System.Collections.Generic; using System.IO; using System.Linq; using ...
[转帖]Linux下inotify监控文件夹状态，发生变化后触发rsync同步
Linux下inotify监控文件夹状态,发生变化后触发rsync同步 https://www.cnblogs.com/fjping0606/p/6114123.html 1.安装工具--inotif ...
1.8-1.10 大数据仓库的数据收集架构及监控日志目录日志数据，实时抽取之hdfs系统上
一.数据仓库架构二.flume收集数据存储到hdfs 文档:http://flume.apache.org/releases/content/1.9.0/FlumeUserGuide.html#hd ...
网卡配置文件详解用户管理与文件权限篇文件与目录权限软连接 tar解压命令 killall命令 linux防火墙 dns解析设置计划任务crond服务软件包安装阿里云 yum源安装
Linux系统基础优化及常用命令 Linux基础系统优化引言没有,只有一张图. Linux的网络功能相当强悍,一时之间我们无法了解所有的网络命令,在配置服务器基础环境时,先了解下网络参数设定命令. ...
Nagios监控平台搭建及配置文件详解
Nagios是一款开源的免费网络监视工具,能有效监控Windows.Linux和Unix的主机状态,交换机路由器等网络设置,打印机等.在系统或服务状态异常时发出邮件或短信报警第一时间通知网站运维人员, ...
Spring配置文件详解 – applicationContext.xml文件路径
Spring配置文件详解 – applicationContext.xml文件路径 Java编程 spring的配置文件applicationContext.xml的默 ...
如何用R来处理数据表的长宽转换（图文详解）
不多说,直接上干货! 很多地方都需用到这个知识点,比如Tableau里. 通常可以采取如python 和 r来作为数据处理的前期. Tableau学习系列之Tableau如何通过数据透视表方式读取 ...

随机推荐

8VC Venture Cup 2016 - Elimination Round F - Group Projects dp好题
F - Group Projects 题目大意:给你n个物品, 每个物品有个权值ai, 把它们分成若干组, 总消耗为每组里的最大值减最小值之和. 问你一共有多少种分组方法. 思路:感觉刚看到的时候的想 ...
牛客练习赛3 B - 贝伦卡斯泰露
链接:https://www.nowcoder.net/acm/contest/13/B来源:牛客网题目描述贝伦卡斯泰露,某种程度上也可以称为古手梨花,能够创造几率近乎为0的奇迹,通过无限轮回成 ...
在静态方法中应用spring注入的类
最近在一次项目的重构中,原项目需要在静态方法中调用service,现在需要更换框架,service需要自动注入,无法再静态方法中调用解决思路: 创建一个当前类的静态变量,创建一个方法,使用@Post ...
四、redis系列之主从复制与哨兵机制
1. 绪言在现实应用环境中,出于数据容量.容灾.性能等因素的考虑,往往不会只使用一台服务器,而是使用集群的方式.Redis 中也有类似的维持一主多从的方式提高 Redis 集群的高可用性的方案,而其 ...
STP协议树配置
STP协议树作用为了提高网络可靠性,交换网络中通常会使用冗余链路. 然而,冗余链路会给交换网络带来环路风险并导致广播风暴以及MAC地址表不稳定等问题进而会影响到用户的通信质量. 生成树协议STP( ...
iOS 11开发教程（二）编写第一个iOS 11应用
iOS 11开发教程(二)编写第一个iOS 11应用编写第一个iOS 11应用本节将以一个iOS 11应用程序为例,为开发者讲解如何使用Xcode 9.0去创建项目,以及iOS模拟器的一些功能.编 ...
1011 World Cup Betting (20)（20 point(s)）
problem With the 2010 FIFA World Cup running, football fans the world over were becoming increasingl ...
[ 原创 ] Java基础6--构造函数和抽象类的性质
构造函数的性质 // A.方法名与类名相同: // B.没有返回类型(例如return.void等):// C.不能被static.final.native.abstract和synchronized ...
Django-高级特性
分页 1.固定显示分页数目 2.点击相应分页取出对应数据具体实现: from django.utils.safestring import mark_safe class Pagination(ob ...
Linux命令学习<不断更新>
没有系统的学习过Linux命令,遇到了就学习一下,慢慢积累. 1.echo 命令,学习网站『https://linux.cn/article-3948-1.html』. echo单词有回声.共鸣的意思 ...

FLume监控文件夹，将数据发送给Kafka以及HDFS的配置文件详解

详细配置文件flume-conf.properties如下：

Flume启动命令如下：

FLume监控文件夹，将数据发送给Kafka以及HDFS的配置文件详解的更多相关文章

随机推荐

热门专题