StreamSets 设计Edge pipeline

edge pipeline 运行在edge 执行模式，我们可以使用 data collector UI 进行edge pipeline 设计，
设计完成之后，你可以部署对应的pipeline到edge 设备

可以设计的edge pipeline

edge 发送pipeline

edge 发送pipeline 使用特定的orgin读取edge设备上的数据，这个pipeline 可以在将数据发送到data collector 之前进行数据的处理

edge 接收pipeline

接收pipeline可以接收来自edge 设备或者 data collector pipeline的数据

orgin 组件

Dev Random Record Source
Dev Raw Data Source
Directory Edge pipelines do not support multithreaded processing.
In an edge pipeline, the Directory origin always creates a single thread to read the files even if you configure it to use multiple threads.
File Tail
In edge pipelines, the File Tail origin can read a single set of files.
If you configure multiple sets of files for the origin, the origin reads only the files configured in the first set.
HTTP Client
In edge pipelines, the HTTP Client origin does not support batch processing mode, pagination, or OAuth2 authorization.
HTTP Server Edge pipelines do not support multithreaded processing.
In an edge pipeline, the HTTP Server origin always creates a single thread to read the files even if you configure it to use multiple threads.
MQTT Subscriber Edge pipelines that use MQTT stages require using an intermediary MQTT broker.
For example, an edge sending pipeline uses an MQTT Publisher destination to write to an MQTT broker. The MQTT broker temporarily stores the data until the MQTT Subscriber origin in the edge receiving pipeline reads the data.
Sensor Reader
System Metrics
WebSocket Client
Windows Event Log

processsor 组件

Delay
Dev Identity
Expression Evaluator
Field Remover
JavaScript Evaluator In edge pipelines, the JavaScript Evaluator processor does not support the sdcFunctions scripting object.
Stream Selector

destinations 组件

CoAP Client
HTTP Client
Kafka Producer
MQTT Publisher Edge pipelines that use MQTT stages require using an intermediary MQTT broker.
For example, an edge sending pipeline uses an MQTT Publisher destination to write to an MQTT broker. The MQTT broker temporarily stores the data until the MQTT Subscriber origin in the Data Collector receiving pipeline reads the data.
Trash
WebSocket Client

错误记录处理

Discard 丢踢
The pipeline discards the record.
Write to File 写到文件
The pipeline writes error records and related details to a local directory on the edge device. Create another edge pipeline with a Directory origin to process the error records written to the file.
Write to MQTT 写到mqtt
The pipeline publishes error records and related details to a topic on an MQTT broker. Create another edge or standalone Data Collector pipeline with an MQTT Subscriber origin to process the error records published to the broker.

支持的数据格式

json
text

限制

Email and webhook notifications cannot be sent by edge pipelines.
Rules and alerts cannot be defined for edge pipelines.
Edge pipelines support a limited number of record, math, pipeline, and string functions.
Edge pipelines do not support dataflow triggers.
Edge pipelines do not support multithreaded processing.
You cannot capture snapshots for edge pipelines.

参考资料

https://streamsets.com/documentation/datacollector/latest/help/datacollector/UserGuide/Edge_Mode/EdgePipelineTypes.html#concept_c14_m4r_4bb

StreamSets 设计Edge pipeline的更多相关文章

StreamSets 相关文章
相关streamsets 文章(不按顺序) 学习视频-百度网盘 StreamSets 设计Edge pipeline StreamSets Data Collector Edge 说明 streams ...
StreamSets 管理 SDC Edge上的pipeline
可选的方式: ui (data colelctor) 发送命令 UI 主要是创建edge pipeline 的时候进行edge server 的配置默认是 http://localhost:1863 ...
StreamSets 部署 Pipelines 到 SDC Edge
可以使用如下方法: 下载edge 运行包并包含pipeline定义文件. 直接发布到edge 设备. 在data colelctor 机器配置并配置了edge server 地址(主要需要网络可访问) ...
streamsets geoip 使用
geoip 分析对于网站数据分析是很方便的安装geoip2 下载地址 https://dev.maxmind.com/geoip/geoip2/geolite2/ 配置streamsets geoi ...
如何评价一个pipeline的好坏
生物信息NGS相关软件众多. 常用的比对软件:bwa,bowtie: 去pcr重复的软件\:samtools,picard: calling variant:samtools/bcftools,gat ...
pipeline 结构设计
目录一.pipeline步骤二.案例 pipeline详解只生成一次制品不同环境部署系统集成测试指定版本部署一.pipeline步骤当团队开始设计第一个pipeline时,该如何下手呢 ...
使用Pipeline抽象业务生命周期流程
上篇关于流程引擎的文章还是快两年以前的<微服务业务生命周期流程管控引擎>,这中间各种低代码平台层出不穷,虽然有些仅仅是OA+表单的再度包装,但有些的确是在逻辑和操作单元层面进行了真正的高度 ...
Netty源码分析--创建Channel（三）
恩~,没错,其实这一篇才是真正的开始分析源码,你打我呀~. 先看一下我Netty的启动类 private void start() throws Exception { EventLoopGroup ...
Jenkins教程（四）安装BlueOcean与Maven构建
前言本文旨在使用BlueOcean实现构建可视化与使用Maven构建上一节Jenkins教程(三)添加凭据与流水线拉取Git代码拉下来的代码什么是Blue Ocean Blue Ocean 重新思 ...

随机推荐

Union、Union All、Intersect、Minus用法和区别
假设我们有一个表Student,包括以下字段与数据: [c-sharp] view plain copydrop table student; create table student ( ...
Centos75 安装 postgresql11
切换到root账户, #安装yum 源 yum install https://download.postgresql.org/pub/repos/yum/11/redhat/rhel-7-x86_6 ...
从e.getMessage()为null看Java异常机制
问题:自定义异常触发了,但是自定义的提示信息RuntimeException却没有带过来. throw new RuntimeException("不允许插入报价主项和报价子项同时重复的记录 ...
ubuntu14.04安装CUDA8.0
ubuntu安装CUDA 因为深度学习需要用到CUDA,所以写篇博客,记录下自己安装CUDA 的过程. 1 安装前的检查安装CUDA之前,首先要做一些事情,检查你的机器是否可以安装CUDA. 1.1 ...
Java堆(heap)、栈(stack)和队列的区别
Java里面Stack有两种含义: 一:数据结构 Stack,即java.util.Stack import java.util.Stack; import java.util.Iterator; i ...
System.Data.SQLite未能加载文件或程序集
1.简直是作死帝呀.不需要修改dll的名字,否则就坐等悲剧吧如果项目中有x86和x64的dll,可以建两个不同的文件夹分别存放,但是千万不要修改掉默认的dll的名字 System.Data.SQLi ...
简单实现Ubuntu16.04 + caffe2 + CUDA9.0 + cuDNN8.0
在Ubuntu16.04 CUDA9.0 cuDNN8.0的环境下安装caffe2 本博客比较简单,cuda9.0 cudnn8.0部分请看上一篇博客,其中详细讲了: 如何安装驱动安装cuda 安装 ...
UVa 11609 组队（快速幂）
https://vjudge.net/problem/UVA-11609 题意: 有n个人,选一个或多个人参加比赛,其中一名当队长,有多少种方案?如果参赛者完全相同,但队长不同,算作不同的方案. 思路 ...
spring boot2 基于百度云apiface实现人脸检测与认证1
原理介绍: 基于百度云的人脸资料库(用户上传),调用本地摄像头抓拍的图像,与百度云的用户图像做比对,实现人脸认证. 主要步骤如下: 1. 创建百度去账号 2. 在百度云控制台中创建人脸识别的应用,并记 ...
安全之路:Web渗透技术及实战案例解析(第2版)
安全之路:Web渗透技术及实战案例解析(第2版)

StreamSets 设计Edge pipeline

可以设计的edge pipeline

orgin 组件

processsor 组件

destinations 组件

错误记录处理

支持的数据格式

限制

参考资料

StreamSets 设计Edge pipeline的更多相关文章

随机推荐

热门专题