Airbnb/Apache Superset – the open source dashboards and visualization tool – first impressions and link to a demo
https://assemblinganalytics.com/post/airbnbapache-superset-first-impressions-and-link-to-a-demo/
Today I spent some time looking into Superset, the open source analytics and BI tool from Airbnb that is now being incubated at Apache. Superset competes in the same arena as Tableau and Power BI, and it is already quite mature: mature enough for business users too, though not as customizable as other solutions.
You can try Superset yourself by visiting the demo I set up at superset.assemblinganalytics.com (user and password “demo”).
Superset dashboards look like this:

From a design perspective it does not reach Tableau’s level of polish yet, but the dashboards do look beautiful and definitely much better than what business users are used to today from MS Office software. One thing to notice right from the start is that Superset aims to support as many visualizations as possible by leveraging existing open source projects (like plot.ly). This is a big differentiator versus other dashboard solutions, where visualizations are less numerous.
But the key differentiator of Superset is that it is open source and that it is built on an extensible framework (Flask) in a programming language that is becoming the de facto go-to language for data science (Python). I have run into issues with traditional enterprise software vendors many times over a missing feature xyz or even visualization xyz. In these cases, if you are lucky you get that feature in the next release 6-8 months from now; if not (which is usually the case), you just have to deal with the missing feature. One example is the lack of an Excel export in Tableau, which in my view comes from Tableau not listening to its users and wanting to push online usage rather than data export. Bottom line: I had to build something myself, but it’s a hack and right now not supported.
Being open source, Superset is also more flexible: adding features or visualizations is something you can do yourself if you can program, or sponsor if you have a budget. Talking about budget, Superset is completely free, with no limitations other than the computing power of your server. This means that adding an additional, say, 1,000 users is going to be very cheap. Cost is important because developing a data-driven organization means all your employees have access not only to advanced analytical dashboards but also to powerful tools for doing additional analysis on their own. This is one of the weak points of Tableau.
OK, so Superset is cool and might have a bright future, but is it ready for enterprise usage now? The answer is: it depends, and analytics culture plays an important role here. To highlight the pros and cons in more detail, I’ll go through a point-by-point comparison with tools I have used in my work (Tableau, Cognos and BusinessObjects).
Please remember these are first impressions of Superset.
Learning curve
Superset is much easier to get started with than Tableau, and let’s not even compare it to old stuff like Cognos or Business Objects. Superset wins hands down in this department.
Online vs desktop
Superset works on all the most widely used browsers and has been built as a web app, so it does not require any additional desktop installation for power features. As such, Superset is the only tool I have used that can be operated completely in the web browser, which is an advantage. You might think Business Objects is also used online, but the way I have seen most users work with it is to extract Excel files and then work on the desktop, which is not a fully online data exploration workflow.
Data exploration
Superset’s mission is to make it easy to explore data and find insights. This has a lot of potential because it can scale (a big part of that is the lack of per-user license costs). Below you can see how the data exploration view works; it changes depending on the chart type you want to visualize.

It’s snappy and quite responsive, but there is no refresh button, so sometimes you have to save in order to “refresh”; a bit unintuitive, but it works. The experience is not as interactive as Tableau’s and there are a lot of small details missing. Overall it works, but it needs to develop further. From what I have read, the team at Airbnb seems to be very focused on making this experience better, so I expect improvements in the future. However, it is already quite an impressive tool, and once you get into the flow you can use it to get business insights.
I plan to test this further with my own data, so stay tuned for more commentary on this one!
Adding data
As of now, uploading data in Superset is not possible; you have to load the data into a database separately. While I understand why this is the case (data management does not belong in an analytics tool), I do see some challenges with that, especially for beginners. You can, however, build a relatively small Flask app to handle this use case (uploading Excel files, .csv files, etc.). One thing I am wondering is whether the Superset navigation bar at the top can be extended with additional drop-down menus. Ideally, users would not even realize they have left Superset when doing data operations and uploads.
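A minimal sketch of the core of such an upload helper, independent of the web framework: it takes the text of an uploaded .csv file and loads it into a database table that Superset could then query. SQLite, the function name `load_csv_into_table` and the sample columns are assumptions made for illustration; a real helper would also validate types and sit behind a Flask route.

```python
import csv
import io
import sqlite3


def load_csv_into_table(csv_text, conn, table_name):
    """Create table_name from the CSV header and insert every data row.

    Hypothetical helper: all columns are stored as TEXT, and identifiers are
    double-quoted because table and column names cannot be bound as parameters.
    """
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)  # the first line holds the column names
    rows = [row for row in reader if row]  # skip blank lines

    def quote(name):
        return '"' + name.replace('"', '""') + '"'

    columns = ", ".join(f"{quote(col)} TEXT" for col in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f"CREATE TABLE {quote(table_name)} ({columns})")
    conn.executemany(
        f"INSERT INTO {quote(table_name)} VALUES ({placeholders})", rows
    )
    conn.commit()
    return len(rows)


# An in-memory database stands in for the real analytics database.
conn = sqlite3.connect(":memory:")
sample = "region,revenue\nEMEA,120\nAPAC,95\n"
inserted = load_csv_into_table(sample, conn, "uploads_sales")
```

Once the table exists in a database that Superset is connected to, it can be added as a data source like any other table.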
Visual filters
This is a big one for me and right now it’s a miss in Superset. In Tableau users can click on graphs and that can trigger additional actions (usually a filter on other graphs). Filtering features, in general, are a bit limited in Superset and I hope that will improve over time. I did some research on the roadmap, but I could not find any discussions about action triggers on charts.
Tooltips
The tooltips in Superset cannot be customized. Tableau has very powerful tooltips that can display a lot of information on various fields; this is not the case in Superset.

As you can see above, the highlighted tooltip is good, but no additional information can be added to it.
Calculations
Superset queries databases, and there is a nice SQL editor (SQL Lab) to build custom SQL, but that’s where it stops. This means that right now, if a calculation cannot be expressed in SQL, you can’t build it.
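To illustrate what it means for calculations to live in SQL, here is a minimal sketch of a derived metric written entirely in the query. It uses Python’s built-in sqlite3 module so that it is self-contained; the `orders` table, its columns and the figures are made up for illustration and have nothing to do with Superset’s own schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, visits INTEGER, purchases INTEGER)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [("EMEA", 200, 30), ("EMEA", 300, 20), ("APAC", 100, 25)],
)

# The calculated field (conversion rate) is just an expression in the SELECT list.
query = """
    SELECT region,
           ROUND(100.0 * SUM(purchases) / SUM(visits), 1) AS conversion_pct
    FROM orders
    GROUP BY region
    ORDER BY region
"""
results = conn.execute(query).fetchall()
```

Anything that fits this shape (aggregates, ratios, CASE expressions) works; logic that SQL cannot express has no home yet.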

However, Superset runs on top of Pandas, which is THE Python library for data crunching and manipulation (in many cases it can easily replace R and SAS). So I think that SQL Lab might one day expand into something more than just SQL, maybe by adding custom functions that run on top of Pandas data frames. I think this direction would make sense, so I wouldn’t be surprised to see it coming at some point.
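As a purely speculative sketch of that idea, this is what a custom function applied to a query result held in a Pandas data frame could look like. Nothing here is Superset API: the function name `add_month_over_month`, the columns and the numbers are hypothetical.

```python
import pandas as pd


def add_month_over_month(df, value_col):
    """Return a copy of df with a month-over-month percentage change column."""
    out = df.copy()
    out["mom_pct"] = (out[value_col].pct_change() * 100).round(1)
    return out


# Stand-in for the rows a SQL query would return, ordered by month.
query_result = pd.DataFrame(
    {"month": ["2017-01", "2017-02", "2017-03"], "revenue": [100.0, 120.0, 90.0]}
)
enriched = add_month_over_month(query_result, "revenue")
```

The first row has no previous month to compare against, so its `mom_pct` value is missing (NaN) rather than zero.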
Some ideas of future developments
One strong advantage of Superset, which right now does not have as many features as Tableau or even Power BI, is that it is open source. You know what else is open source? WordPress. What if Superset goes in the direction of WordPress? What if Superset becomes the WordPress of analytics CMSs? Superset could easily go in that direction, because Flask’s blueprints mechanism lets it be extended with additional web apps and plugins. There doesn’t seem to be an integration for this use case yet, but if it happens, Superset might grow beyond being a dashboard web application into a full-fledged BI and data science solution. This architecture makes sense because, by leveraging plugins, you can fit analytics to the exact needs of your organization. Compare that to running a full-fledged Business Objects solution, which is an “all in but not what you need” solution, and you’ll see the benefits.
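To make the plugin idea concrete, here is a minimal sketch of how a Flask blueprint packages a self-contained feature that a host application registers under its own URL prefix. The `wiki` blueprint, its route and the stand-in host app are hypothetical; this shows the general Flask mechanism, not an extension point that Superset itself documents.

```python
from flask import Blueprint, Flask, jsonify

# A "plugin": routes grouped in a blueprint, unaware of the host application.
wiki = Blueprint("wiki", __name__)


@wiki.route("/pages")
def list_pages():
    # Hypothetical payload; a real plugin would read from its own storage.
    return jsonify(pages=["getting-started", "kpi-definitions"])


def create_app():
    """Stand-in for the host application that mounts its plugins."""
    app = Flask(__name__)
    app.register_blueprint(wiki, url_prefix="/plugins/wiki")
    return app


app = create_app()
```

The host decides where each plugin is mounted, so the same blueprint could be shipped separately and registered by any Flask application.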
Some ideas that come to mind of additional functionality that could be built as a plugin:
- A notification system. Think of a system where admins can schedule email reports or define alert rules that send a notification when some event happens (e.g. you just converted a big customer today).
- A data management system. Think of running simple ETL tasks using a simple online data pipeline tool. I am guessing a web interface for Airbnb’s Airflow would suffice, but I have not checked this one out yet.
- A wiki
- A forum
- A certification system to rank the quality of dashboards (maybe some dashboards for executives need to be checked by finance first before they get an “updated as of…” flag, etc.)
- An integration with one of the data science notebook or workbench applications
- And much much more
I think there are so many ways this could go, which is why I am studying Superset and plan to keep an eye on it. To be completely fair, some of the above could also be done with Tableau and Power BI by leveraging their JavaScript APIs, but so far that ecosystem has not produced any quality plug-ins I can think of. Also, Superset is built from the ground up to be used in the web browser and to be extended modularly, which is a much better premise for flexibility and extensibility than closed source software.
Conclusions
Is Superset a replacement for all other BI tools out there? No. Can it grow to be a strong competitor? Yes. I do suspect Superset will always do things slightly differently though, so organizations adopting it need to have a certain culture of analytics, but there is potential. Superset can live alongside other analytics tools such as Tableau or Power BI. It is important to keep in mind that Superset scales better, as you don’t incur enormous license costs, and that it is built with data analysts in mind. Superset wants everybody in the organization to be a data scientist, at least a bit. Other tools are geared more towards power users who distribute reports and dashboards. Ultimately, it’s this difference in vision that might set Superset apart from similar tools in the enterprise world.