What's Wrong With Hue Oozie Editor?
本文原文出处: http://blog.csdn.net/bluishglc/article/details/47021019 严禁不论什么形式的转载,否则将托付CSDN官方维护权益!
First, let’s make the topic clear:
Comparing with providing raw Oozie workflow/coordinator xml file, what’s disadvantages to create workflow/coordinator with Hue Oozie Editor? ( The Hue Oozie Editor version discussed by this artical is HDP 2.2.4)
If no deep understanding with Hue Oozie Editor, everybody will like it at the first glance, why not? It’s so easy to use, what you see is what you get, who want to write the ugly xml file manually?
But the truth is: the Hue Oozie Editor is not so good as it looks, it’s far away to stable and powerful tool to create/manage workflows.
Here are the problems:
As core source codes for workflows, the raw xml files should add into version control. The Hue Oozie Editor have no or very weak version control ability.
If we maintain raw xml file in project, with building tools, we can configure environment related parameters, i.e. namenode, input/output data location and etc. And then we can easily build project for dev, test or production environment.
On the contrary, what if using Hue Oozie Editor? Congratulations! Please do the duplicated job again on production cluster: re-create the workflows/coordinators on the production cluster manually. Well, there’s an import/export feature in Hue Oozie Editor, but it’s only for workflow not for coordinator, and even for workflows, you still have to change all environment related parameters manually.
It can’t support some advanced features, so we have to edit raw xml file. For example: you can’t assign the expression between input-events and dataset, i.e. ${coord:current(-1)}, you can only map them directly.
Can’t import/export coordinators.
Well, at least, could we import our raw workflow file into Hue Oozie Editor?
Let’s look at how weak the current Hue Oozie Editor:
For schema version, Hue Oozie Editor only support not higher than 0.4 of workflow and not higher than 0.2 of hive-action, otherwise you can’t import your raw file.
It’s hard to believe: the property name: jobTracker and nameNode are HARD CODE! If you don’t use the two property name, again, you can’t import your raw file.
Some parameters accept embedded parameter, i.e. ${nameNode}/data/${year}/${month}, but some not, as for which accept which not? Try by yourself one by one, otherwise, you can’t import your raw file still.
Nobody hate UI design tools, but it has to be good enough. by now, I would say, building workflows above Hue Oozie Editor is unwise.
Obviously, we should choose raw xml file not Hue Oozie Editor.
But there is only one small trouble, the Hue can only start a workflow/coordinator edited by Hue Oozie Editor.Note: once a workflow/coordinator started, you can monitor & stop it from Hue even it’s described by raw xml.
First, I don’t think this is a trouble, we can start a workflow/coordinator with command line. Please do remember: normally, a workflow/coordinator is long-term running & background service, we scarcely start/stop it. So, the command line is enough for the operation and maintenance.
Besides command line, you can also start a workflow/coordinator via Oozie Restful API from remote.
What's Wrong With Hue Oozie Editor?的更多相关文章
- 【原创】大叔经验分享(49)hue访问hdfs报错/hue访问oozie editor页面卡住
hue中使用hue用户(hue admin)访问hdfs报错: Cannot access: /. Note: you are a Hue admin but not a HDFS superuser ...
- #数据技术选型#即席查询Shib+Presto,集群任务调度HUE+Oozie
郑昀 创建于2014/10/30 最后更新于2014/10/31 一)选型:Shib+Presto 应用场景:即席查询(Ad-hoc Query) 1.1.即席查询的目标 使用者是产品/运营/销售 ...
- HUE Oozie : error=2, No such file or directory采坑记录
HUE Oozie : error=2, No such file or directory采坑记录 1.错误详情 一直都是同一种方式在hue上定义workflow,不知为啥 今天定义的就是不行... ...
- Hue,Oozie运行sqoop找不到驱动解决办法
一.前言 平台:CDH 5.13.0 公司在客户那边有个项目需要部署cloudera平台,部署的时候,在这个版本的cdh中,除了基本组件,还选了sqoop2作为数据传输工具,希望能在Oozie中通过工 ...
- Spark教程——(11)Spark程序local模式执行、cluster模式执行以及Oozie/Hue执行的设置方式
本地执行Spark SQL程序: package com.fc //import common.util.{phoenixConnectMode, timeUtil} import org.apach ...
- HUE配置文件hue.ini 的liboozie和oozie模块详解(图文详解)(分HA集群)
不多说,直接上干货! 我的集群机器情况是 bigdatamaster(192.168.80.10).bigdataslave1(192.168.80.11)和bigdataslave2(192.168 ...
- 【转载】Hadoop可视化分析利器之Hue
http://qindongliang.iteye.com/blog/2212619 先来看下hue的架构图: (1)Hue是什么? Hue是一个可快速开发和调试Hadoop生态系统各种应用的一个基 ...
- Hue 之 SparkSql interpreters的配置及使用
1.环境说明: HDP 2.4 V3 sandbox hue 4.0.0 2.hue 4.0.0 编译及安装 地址:https://github.com/cloudera/hue/releases/t ...
- Hue的全局配置文件hue.ini(图文详解)
Hue版本:hue-3.9.0-cdh5.5.4 需要编译才能使用(联网) 说给大家的话:大家电脑的配置好的话,一定要安装cloudera manager.毕竟是一家人的.同时,我也亲身经历过,会有部 ...
随机推荐
- mybatis中sql标签和include标签
1.首先定义一个sql标签,一定要定义唯一id.(name,age是要查询的字段) <sql id="Base_Column_List" >name,age</s ...
- java JSON 和 Object 相互转换
1.导入 jackson jar <dependency> <groupId>com.fasterxml.jackson.core</groupId> <ar ...
- http响应的封装
响应的封装: 资源的初始化 分析请求响应信息,根据状态响应码,发送不同的状态码 浏览器根据状态信息,做出不同的执行 构建正文,也就是根据浏览器客服端的请求发送响应信息: 发送响应:code 是状态码, ...
- 洛谷 P1278 单词游戏
P1278 单词游戏 题目描述 Io和Ao在玩一个单词游戏. 他们轮流说出一个仅包含元音字母的单词,并且后一个单词的第一个字母必须与前一个单词的最后一个字母一致. 游戏可以从任何一个单词开始. 任何单 ...
- git batch
git batch 不用每次自己写了:不是特别推荐哦: git add . git commit -m "commit" git push git status
- [Gatsby] Install Gatsby and Scaffold a Blog
In this lesson, you’ll install Gatsby and the plugins that give the default starter the ability to t ...
- JDBC创建mysql连接池代码
1.底层实现类(DBConnection) package JDBC.JDBCPool.MyJDBCPool; import java.sql.Connection; import java.sql. ...
- lscript.ld 链接器脚本
sumary选项卡 lscript.ld是这个应用程序的链接器脚本. 这是实用的作为一个报告 看看内存是针相应用程序. 它也能够被编辑以改变应用程序的位置.双击Hello_Zynqàsrcà lscr ...
- yolo源码解析(1):代码逻辑
一. 整体代码逻辑 yolo中源码分为三个部分,\example,\include,以及\src文件夹下都有源代码存在. 结构如下所示 ├── examples │ ├── darknet.c(主程序 ...
- MHA+ProxySQL 读写分离高可用
文档结构如下: 1.ProxySQL说明 ProxySQL是mysql的一款中间件的产品,是灵活的mysql代理层,可以实现读写分离,支持query路由器的功能,支持动态指定sql进行缓存,支持动态加 ...