flume+kafka

这里演示在单机fulume环境下,kafka作为source ,chanel , sink时三种情况

下面的测试都是基于下面的基本的配置文件进行修改的

a1.sources = r1

a1.sinks = k1

a1.channels = c1

# For each one of the sources, the type is defined

#agent.sources.seqGenSrc.type = seq

a1.sources.r1.type = netcat

a1.sources.r1.bind=mini1

a1.sources.r1.port=

# The channel can be defined as follows.

#agent.sources.seqGenSrc.channels = memoryChannel

a1.channels.c1.type=memory

a1.channels.c1.capacity=

a1.channels.c1.transactionCapacity =

# Each sink's type must be defined

#agent.sinks.loggerSink.type = logger

a1.sinks.k1.type = logger

#Specify the channel the sink should use

#agent.sinks.loggerSink.channel = memoryChannel

a1.sources.r1.channels = c1

a1.sinks.k1.channel = c1

# Each channel's type is defined.

#agent.channels.memoryChannel.type = memory

# In this case, it specifies the capacity of the memory channel

#agent.channels.memoryChannel.capacity =

kafka作为source时的配置和produce程序

a1.sources.r1.type = org.apache.flume.source.kafka.KafkaSource

a1.sources.r1.channels = c1

a1.sources.r1.batchSize =

a1.sources.r1.batchDurationMillis =

a1.sources.r1.kafka.bootstrap.servers = mini1:

a1.sources.r1.kafka.topics = Operator

a1.sources.r1.kafka.consumer.group.id = custom.g.id

public static void main(String[] args) throws IOException {

        Properties props = new Properties();

              props.load(TestConsumer.class.getClass().getResourceAsStream("/kafkaProduce.properties"));

        Producer<Integer, String> producer = new KafkaProducer<>(props);

        for (int i = ; i <; i++)

            producer.send(new ProducerRecord<Integer, String>("Operator", i, getRandomPhoneNum()));

        producer.close();

       // System.out.println(getRandomPhoneNum());

    }

    public static String getRandomPhoneNum(){

        String[] basePrefix=new String[]{"","","","","",""};

        return basePrefix[new Random().nextInt(basePrefix.length)]+ RandomUtils.nextInt(,);

    }

kafka作为channel时 ,topic必须是一个新的topic如果topic中存在数据那么在启动时会报错

a1.channels.c1.type = org.apache.flume.channel.kafka.KafkaChannel

a1.channels.c1.kafka.bootstrap.servers = mini1:,mini2:,mini3:

a1.channels.c1.kafka.topic = flumedat

a1.channels.c1.kafka.consumer.group.id = flume-consumer

 #修改source

a1.sources.r1.type = exec

a1.sources.r1.command = tail -F /home/hadoop/flume/test/logs/flume.dat

a1.sources.r1.channels = c1

按照官网的说明,当kafka作为channel时可以不需要sink或者source

The Kafka channel can be used for multiple scenarios:

With Flume source and sink - it provides a reliable and highly available channel for events
With Flume source and interceptor but no sink - it allows writing Flume events into a Kafka topic, for use by other apps
With Flume sink, but no source - it is a low-latency, fault tolerant way to send events from Kafka to Flume sinks such as HDFS, HBase or Solr

kafka作为sink时

a1.sources.r1.type = spooldir

a1.sources.r1.channels = c1

a1.sources.r1.spoolDir = /home/hadoop/flume/test/logs/kfksink

a1.sources.r1.deletePolicy = immediate

a1.sources.r1.fileHeader = true

a1.sinks.k1.type = org.apache.flume.sink.kafka.KafkaSink

a1.sinks.k1.kafka.topic = flumesink

a1.sinks.k1.kafka.bootstrap.servers = mini1:

a1.sinks.k1.kafka.flumeBatchSize =

a1.sinks.k1.kafka.producer.acks =

a1.sinks.k1.kafka.producer.linger.ms =

#压缩

a1.sinks.ki.kafka.producer.compression.type = snappy

此时打开kafka消费程序

        Properties props = new Properties();

        props.load(TestConsumer.class.getClass().getResourceAsStream("/kfkConsumer.properties"));

        KafkaConsumer<Integer, String> consumer = new KafkaConsumer<>(props);

        consumer.subscribe(Arrays.asList("flumesink"));

        while (true) {

            ConsumerRecords<Integer, String> records = consumer.poll();

            for (ConsumerRecord<Integer, String> record : records) {

                System.out.print("Thread : " + Thread.currentThread().getName());

                System.out.printf("  offset = %d, key = %s, value = %s, partition = %d %n", record.offset(), record.key(), record.value(), record.partition());

            }

            consumer.commitSync();

        }

    }

配置文件来源于http://flume.apache.org/FlumeUserGuide.html

flume+kafka的更多相关文章

简单测试flume+kafka+storm的集成
集成 Flume/kafka/storm 是为了收集日志文件而引入的方法,最终将日志转到storm中进行分析.storm的分析方法见后面文章,这里只讨论集成方法. 以下为具体步骤及测试方法: 1.分别 ...
【转】flume+kafka+zookeeper 日志收集平台的搭建
from:https://my.oschina.net/jastme/blog/600573 flume+kafka+zookeeper 日志收集平台的搭建收藏 jastme 发表于 10个月前阅 ...
hadoop 之 kafka 安装与 flume -> kafka 整合
62-kafka 安装 : flume 整合 kafka 一.kafka 安装 1.下载 http://kafka.apache.org/downloads.html 2. 解压 tar -zxvf ...
Flume+Kafka+Strom基于伪分布式环境的结合使用
目录: 一.Flume.Kafka.Storm是什么,如何安装? 二.Flume.Kafka.Storm如何结合使用? 1) 原理是什么? 2) Flume和Kafka的整合 3) Kafka和St ...
flume+kafka (分区实现默认单分区)
这篇文章主要是log4j+flume+kafka的内容首先从从下面的地址下载flume+kafka的插件包 https://github.com/beyondj2ee/flumeng-kafka-p ...
Flume - Kafka日志平台整合
1. Flume介绍 Flume是Cloudera提供的一个高可用的,高可靠的,分布式的海量日志采集.聚合和传输的系统,Flume支持在日志系统中定制各类数据发送方,用于收集数据:同时,Flume提供 ...
Flume+Kafka+Storm+Hbase+HDSF+Poi整合
Flume+Kafka+Storm+Hbase+HDSF+Poi整合需求: 针对一个网站,我们需要根据用户的行为记录日志信息,分析对我们有用的数据. 举例:这个网站www.hongten.com(当 ...
Flume+Kafka+Storm整合
Flume+Kafka+Storm整合 1. 需求: 有一个客户端Client可以产生日志信息,我们需要通过Flume获取日志信息,再把该日志信息放入到Kafka的一个Topic:flume-to-k ...
大数据处理框架之Strom：Flume+Kafka+Storm整合
环境虚拟机:VMware 10 Linux版本:CentOS-6.5-x86_64 客户端:Xshell4 FTP:Xftp4 jdk1.8 storm-0.9 apache-flume-1.6.0 ...
Flume+Kafka整合
脚本生产数据---->flume采集数据----->kafka消费数据------->storm集群处理数据日志文件使用log4j生成,滚动生成! 当前正在写入的文件在满足一定的数 ...

随机推荐

jquery ajax 获取 json 文件数据
[ {"name":"project1"}, {"name":"project2"}, {"name" ...
Android批量图片加载经典系列——Volley框架实现多布局的新闻列表
一.问题描述 Volley是Google 2013年发布的实现Android平台上的网络通信库,主要提供网络通信和图片下载的解决方案,比如以前从网上下载图片的步骤可能是这样的流程: 在ListAdap ...
DOS命令：列出某目录下的所有文本文件名并重定向到某文件
命令如下: >dir /b *.txt>output.txt dir无需说,/b 是只要文件名,>是重定向. 2013年11月7日13:36:57
如何使用飞秋FeiQ实现两电脑通信（或传输文件）
如何使用飞秋FeiQ实现两电脑通信(或传输文件) 1. 在两天电脑上,分别按照飞秋FeiQ 我使用的绿色飞秋2013正式版 2. 使用一根网线,将两电脑的网口连接一起 3. 设置飞秋FeiQ的端口号不 ...
jquery翻页
http://js.itivy.com/simplePagination.js/index.html#page-10 http://www.oschina.net/news/41941/7-html5 ...
unity3d GameCenter的使用
原地址:http://blog.sina.com.cn/s/blog_6b3661a901013zmh.html 因为开发的游戏需要支持GameCenter,老大把这活交给我来搞,于是俺就百度Goog ...
vim中翻页的命令
整页翻页 ctrl-f ctrl-b f就是forword b就是backward 翻半页 ctrl-d ctlr-u d=down u=up 滚一行 ctrl-e ctrl-y zz 让光标所杂 ...
hibernate中错误笔记
1.在写Student.hbm.xml 中, hibernate-mapping 中指定类和数据库对应的表字段时,不小心将property写为properties,报错: ERROR: HHH000 ...
分享阿里云SLB-负载均衡的实现基本原理架构
负载均衡技术原理浅析 https://help.aliyun.com/knowledge_detail/39444.html?spm=5176.7839438.2.6.XBbX5l 阿里定制版的LVC ...
重要:Linux下IDE--KDevelop (用来跟踪调试C++) Ubuntu下QT4开发环境的搭建及初体验
Linux下安装Qt4有两大问题,一是环境变量,二是IDE(集成开发环境).安装Qt4也有两种方法,一种是apt-get,一种是下载源码包,而后一种方法已经人证实是最有可能不好使的方法.所以我最终采 ...

flume+kafka

flume+kafka的更多相关文章

随机推荐

热门专题