[转]Informatica vs SSIS
转自 http://blog.csdn.net/thy822/article/details/8489779
这篇文章, 我不能同意更多, 所以转在这里.
Here is my thinking after looking at both products...
First of all let me tell you what I like about informatica....
- Inbuilt tool to export mappings to Excel and Visio.... helpful to document Process.
- Reusability of sources,Targets and Mappings (In SSIS No native Reusabilty except you are using
BI xPress Package Builder)
- Inbuilt Version Control: SSIS Can give you this After Installing VSS,
SVN or other products but in Informatica it native feature to Checkout
and Checkin.
- Visibility of MetaData: you can click on any column in Pipeline and it
will show you where it originated from ... Pretty neat :)
- Scalibility Out of the box
Now let me go through some main points which will drive your purchase decision.
Trialing software and Cost:
Informatica costs you tons of money.. this is where 90% companies
drop their decision to buy it... once you buy it ...it will cost you
even more to maintain it every year so good luck with that!!! Every Dev
license will cost you money ... on the other end
Dev License of SQL Server is virtually Free ($50 per Dev Edition
License).... and Every SQL Server License comes with SSIS Server.
Trialing informatica is not easy... you wont find a place to download
it... on the other end downloading Entire BI Studio from
Microsoft is click away this itself shows me Informatica wants your all
money.... seriously why is that hard ????
Development Speed:
What I found after creating simple Work flow in Informatica takes
twice long than creating workflow in SSIS due to the fact informatica
stores Sources, targets and Mappings in seperate locations and then you
bring them all together in WorkFlow (similar as
ControlFlow in SSIS). One thing I like about this approach that
reusabily but from experience I hardly reuse Mappings (Load Customer
defined in one Workflow is rarely used in some other WorkFlow) so it
sounds like appealing feature but atleast not for me.
Informatica has too many jumps to Design WorkFlow, Session, Execute
WorkFlow and Monitor Progress all feels like I am using 3-4 different
applications... while SSIS is all in one compact window... and very
clean and Visual Debugging.
Scripting in SSIS is way more powerful than Informatica due to the
fact SSIS Script Editor is Shell of Visul Studio which gives you
BreakPoints, Watch, CallStack, Intellisense and Step By Step
Dbugging.... SSIS Scripting is C# or VB.net ... Informatica
is JAVA
Inbuilt Tasks:
SSIS has advantage of having several inbuilt task which is not
available in Informatica. But on the other end some Adapters and
Pipeline Features Impressed me in informatica including XML
Destination, XML Transformation, Upsert Capabilty, Visibility of
Metadata.
Some of my fav task in SSIS Missing in Informatica:
- ForEach Looping
- Script task
- File System task
In SSIS you can spent few hundred bucks to buy any missing
features.... e.g. SFTP/FTPS, Zip/Unzip, SecureEmail, Sharepoint,
SalesForce Adapter, Upsert Destination, XML Destination
Storage:
Informatica - Everything is stored in Central Repository (Oracle or
SQL DB)... what this means to developer ... You have nothing stored on
your machine. You have to be connected to Informatica Server all the
time. This was very frustrating coz I cant work
offline unlike SSIS.
SSIS: You can develop SSIS Packages on your local machine in
completely disconnected mode. When you ready to deploy to production it
can be stored in SQL Repository (msdb database) or can be stored on File
System.
Debugging WorkFlow and Pipeline:
SSIS has lots of ways to debug various things... using BreakPoints,
Watch Window, DataViewer . I didnt find similar features in Informatica.
Debugging WorkFlow is more difficust in Informatica compared to SSIS.
Community Support:
SSIS Community support is amzing compared to any ETL Product out
there. I didnt find a single scenario which is not blogged by community.
Se
Extensibility:
Informatica is closed system doesnt allow you to create your own
Sources and Targets and Tasks like SSIS Does... You will find lots of
Free tasks and Adapter in SSIS which can fit your custom need or you can
create your own using SSIS APIs ... but in Informatica
its impossible or atleast I never found a way to do so.
3rd Party Support:
Informatica keeps tight control what they want inside their
Products. If you need more functionality in informatica then you have to
Pay tons of Money to get few more Adapters - Called "Power Exchange Module".
With SSIS you will find plenty of 3rd party products which will fit to your custom ETL Need at low cost yet high Quality. (e.g.
TaskFactory)
Well
... Informatica was pioneer in ETL market … they came very first in the
market they dominated many fortune 500 hundred companies... But since
then MS Seriously invested
in BI Segment and current trend is MS BI is the fastest growing trend
due to many facts.
-
MS BI Attracts many Mid-Size and Smaller shops because its one stop
solution for entire BI (SQL, IS, RS and AS)... it costs several times
less than what other vendors charge.
- Feature gap is not any more issue after SQL Server 2008 Released.
-
Due to the tight budget and economy many companies going towards
reducing cost and that is another reason why MS BI is trending up.
-
MS invested significantly to prove it can scan out (multiple nodes) and
scale up (running on powerful hardware with multi processor and higer
RAM). MS Introduced Parallel
Datawarehouse and DW Fast Track to attract bigger shops.
- The biggest drawback for MS BI right is it only works on Windows Platform ...
Again dont take my word ... read this article...
http://www.informationweek.com/news/206104502
Conclusion:
MS BI is the fastest growing trend right now :) … so if you are
concerned about job market then Relax … you will find plenty of JOBS for
MS BI
[转]Informatica vs SSIS的更多相关文章
- SSIS 包部署 Package Store 后,在 IS 中可以执行,AGENT 执行却报错
可以执行 SSIS Package ,证明用 SSIS Package 的账户是可以执行成功的.SQL Server Agent 默认指定账号是 Network Service. 那么可以尝试一下将 ...
- SSIS 属性:ExecValueVariable
有些Task组件执行完成之后,会产生输出结果,称作Execution Value,例如,Execute SQL Task在执行完成之后,会返回受影响的数据行数.Task组件的Execution Val ...
- SSIS 实例——将SQL获取的信息传递到Email中
最近在为公司财务开发一个邮件通知时遇到了一个技术问题.原来我设计SSIS的是每天将ERP系统支付数据导出到财务支付平台后 Email 通知财务,然后财务到支付平台上进行支付操作.由于那个时候开发时间很 ...
- 介绍几种SSIS部署方式
介绍 如果你已经开发完一个不错的SSIS包并且能够在你的本地完美的运行,每个任务都亮起绿色的通过标志.这时为了能够让这个包处理能够在指定时间运行,你需要将其发布到一个服务器上,并做好相关配置.作为开发 ...
- informatica读取FTP文件
以下为一个完整的informatica读取ftp文件,并导入到系统中. 第一步: 通过shell脚本下载压缩包文件 /server/infa_shared/crm_prod/shell/ftpFrom ...
- SSIS 处理NULL
不同于SQL Server中NULL表示值是未知的(Unknown Value),没有数据类型,但是,在SSIS中,NULL是有数据类型的,要获取某一个NULL值,必须指定数据类型,例如,变量 Int ...
- SSIS 数据类型和类型转换
在进行ETL开发时,数据类型(Data Type)是最基础的,但也容易被忽略,楼主使用的SQL Server 版本是2012,用此博文记录,常用的SSIS数据类型和TSQL数据类型的映射.SSIS的数 ...
- SSIS Destination 组件使用Fast-Load mode出错
查看一个Package的历史Message 数据,发现 DataFlow Task 经常出错,错误信息的Description是: Description: "While reading c ...
- SSIS Design6:利用数据流
数据流利用内存来缓冲数据,并在内存中处理数据转换,由于内存的访问速度是非常快的,所以SSIS数据流转换性能是非常高效的.SSIS Engine将数据分批加载到内存中,当Data Flow将一批新的数据 ...
随机推荐
- Jenkins企业版与CloudBees
Jenkins企业版由CloudBess提供! 参考 https://www.cloudbees.com/ http://www.infoq.com/cn/news/2012/01/cloudbees ...
- uva12716 gcd
题意:给出N,1<=b<=a<=N,求满足gcd(a,b)=a xor b的pair (a,b)的个数 有个重要的结论:若gcd(a,b)=a xor b=c,那么b=a-c 如果一 ...
- linux 中php以及nginx的重启命令
关闭php killall php-fpm 注意:php关闭之前,使用ps aux|grep 'php' 查看一下php的服务及进程 php重启 /usr/local/php/sbin/php-fpm ...
- Bzoj2563 阿狸和桃子的游戏
Time Limit: 3 Sec Memory Limit: 128 MBSubmit: 701 Solved: 496 Description 阿狸和桃子正在玩一个游戏,游戏是在一个带权图G= ...
- 数据结构算法C语言实现(六)---2.4一元多项式的表示及相加
一.简述 利用链表表示稀疏多项式,并基于之前的一些操作(编程实现上还是有所不同的)组合新的操作实现一元多项式的表示及相加. 二.ADT 抽象数据类型一元多项式的定义 ADT Polyomail{ 数据 ...
- UEditor上传图片到七牛云储存(java)
我们的网站一般放在虚拟空间或者服务器上,图片如果存在本地目录,会占用很多空间和流量,还增加了负担,好的办法是把图片存放到云储存服务里面,平时用url去拿 云储存:普遍说又拍云和七牛比较好,看到七牛免费 ...
- centos7 建立虚拟目录
一.安装mysql,直接用yum安装即可,mysql在centos7.0版本中被mariadb替代了. 命令: yum install mysql-server mysql 安装好了,选择修改mysq ...
- MySQL安装问题
win8.1安装mysql5.6时遇到问题,主要出现最后一步配置卡在了start server这一步会弹窗配置时间过长. 在网上查找方式并尝试之后都不能解决问题,尝试了卸载软件,清理注册表中的mysq ...
- css3之自定义字体
使用@font-face自定义字体 我们在浏览国外的一些个人网站时,总是可以发现一些非常个性的字体,比如
- JavaWeb---总结(十三)使用Session防止表单重复提交
在平时开发中,如果网速比较慢的情况下,用户提交表单后,发现服务器半天都没有响应,那么用户可能会以为是自己没有提交表单,就会再点击提交按钮重复提交表单,我们在开发中必须防止表单重复提交. 一.表单重复提 ...