Tutorial: Importing and analyzing data from a Web Page using Power BI Desktop
In this tutorial, you will learn how to import a table of data from a Web page and create a report to visualize this data. As part of this process, you navigate across tables available on a web page, and apply data transformation steps to bring the table into a new shape.
In this article - Task 1: Connect to a web data source - Task 2: Shape data in the Query view - Step 1: Remove Other Columns to only display columns of interest - Step 2: Replace Values to clean up values in a selected column - Step 3: Filter values in a column - Step 4: Rename a column - Step 5: Filter null values in a column - Step 6: Rename a query - Query Steps created - Task 3: Create visualizations using the Report view - Step 1: Load the query to your report - Step 2: Create a Map visualization
Task 1: Connect to a web data source
In task 1, you import a Tournament Summary table from the UEFA European Football Championship Wikipedia page at http://en.wikipedia.org/wiki/UEFA_European_Football_Championship

Add a Wikipedia page data source
In the Getting Started dialog or in the Home ribbon tab, click Get Data.
This brings up the Get Data dialog, where you can pick from a wide range of data sources to import data into Power BI Desktop. We will select Web which is available under the All or Other group.
In the Web Content dialog box, in the URL text box, paste the Wikipedia URL (http://en.wikipedia.org/wiki/UEFA_European_Football_Championship).
Click OK.
After establishing a connection to the web page, you see a list of tables available on this Wikipedia page in the Navigator dialog. You can single-click on each of these tables to preview the data.
In the Navigator left-pane, select the Results[edit] table for the Tournament Summary results, or select the Results[edit] table and select Edit. This will allow us to reshape this table before loading it to the Report, since the data is not in the shape that we need for our analysis.

This will land a preview of the table in the Query view, where we can apply a set of transformation steps to clean up the data.

Task 2: Shape data in the subject table
Now that you have the subject table selected for your data query, you learn how to perform various data shaping and cleansing steps.
Step 1: Remove Other Columns to only display columns of interest
In this step, you remove all columns except Year and Final Winners.
In the Query Preview grid, select the Year and Final Winners columns (use CTRL + Click).
Right-click a column header in the Query Preview grid, and click Remove Other Columns to remove the unselected columns. Note that this operation is also available in the Home ribbon tab, in the Manage Columns group.

Step 2: Replace Values to clean up values in a selected column
In this step, you replace the Details suffix in the Year column. Note that this suffix is on a new line so it is not visible in the table preview. However, if you click in one of the cells with a numeric value in the Year column, you will see the full value in the detailed view.

Select the Year column.
In the Query view ribbon, click Replace Values under the Home tab or right-click the Year column, and click Replace Values to replace Details with empty text.
In the Replace Values dialog box, type Details in the Value to Find text box and leave the Replace With text box empty.
Click OK.

Step 3: Filter values in a column
In this step, you filter the Year column to display rows that do not contain “Year”.
Click the filter drop down arrow on the Year column.
In the Filter drop-down, clear the Year option.
Click OK.

Step 4: Rename a column
Now that we have cleaned up the data in the Year column, we are going to work on the Final Winner column.
Since we are only looking at the list of winners, we can rename this column to Country.
Select the Final Winner column in the Query preview.
In the Query view ribbon, under the Transform tab and Any Column group, you will find Rename.
This will make the column name editable. We will rename this column to Country.
Step 5: Filter out null values in a column
We also need to filter out null values in the Country column. In order to do this, we could use the filter menu as we saw in Step 3, or alternatively we can:
Right-click on one of the cells in the Country column that contain a null value.
Select Text Filters -> Does not Equal in the context menu.
This creates a new filter step to remove rows with null values in the Country column.
Step 6: Name a query
In this step, you name your final query Euro Cup Winners.
- In the Query Settings pane, in the Name text box, enter Euro Cup Winners.

Task 3: Create visualizations using the Report view
Now that we have converted the data into the shape that we need for our analysis, we can load the resulting table into our Report and create a few visualizations.
Step 1: Load the query to your report
In order to load the query results to Power BI Desktop and create a report, we select Close & Load from the Home ribbon.

This will trigger evaluation of the query and load of the table output to the Report. In Power BI Desktop, select the Report icon to see Power BI Desktop in Report view.

You can see the resulting table fields in the Fields pane at the right of the Report view.

Step 2: Create a Map visualization
In order to create a visualization, we can drag fields from the Field list and drop them in the Report canvas.
Drag the Country field and drop it in the Report canvas. This will create a new visualization in the Report canvas. In this case, since we have a list of countries, it will create a Map visualization.

We can easily change the type of visualization by clicking on a different icon in the Visualization pane.

3. We are going to stay with the Map visualization type to Map, We can also resize the visualization by dragging from one of the corners of the visualization up to the desired size.

4. Note that currently all the points in the map have the same size. We want to change this so that countries with more Euro Cup tournaments won are represented with a larger point in the map. In order to do thiso, we can drag the Year field in the Fields list to the Values box in the lower half of the Fields pane.

As you can see, it is very easy to customize visualizations in your report, in order to present the data in the way that you want. Power BI Desktop provides a seamless end-to-end experience from getting data from a wide range of data sources and shaping it to meet your analysis needs to visualizing this data in rich and interactive ways. Once your report is ready, you can upload it to Power BI and create dashboards based on it, which you can share with other Power BI users.
This concludes the Importing Data from the Web tutorial. You can download the completed Power BI Desktop file here.
Tutorial: Importing and analyzing data from a Web Page using Power BI Desktop的更多相关文章
- Tutorial: Facebook analytics using Power BI Desktop
In this tutorial you learn how to import and visualize data from Facebook. During the tutorial you'l ...
- [Project] Simulate HTTP Post Request to obtain data from Web Page by using Python Scrapy Framework
1. Background Though it's always difficult to give child a perfect name, parent never give up trying ...
- DEDECMS系统安全篇之移data目录到Web根目录以外听语音
http://jingyan.baidu.com/article/ad310e80aeb0971849f49e8e.html 主要三个步骤: 1./include/common.inc.php 2.还 ...
- mysql --secure-file-priv is set to NULL.Operations related to importing and exporting data are disabled
--secure-file-priv is set to NULL. Operations related to importing and exporting data are disabledmy ...
- 《Using Python to Access Web Data》 Week5 Web Services and XML 课堂笔记
Coursera课程<Using Python to Access Web Data> 密歇根大学 Week5 Web Services and XML 13.1 Data on the ...
- How To Crawl A Web Page with Scrapy and Python 3
sklearn实战-乳腺癌细胞数据挖掘(博主亲自录制视频) https://study.163.com/course/introduction.htm?courseId=1005269003& ...
- Home | eMine: Web Page Transcoding Based on Eye Tracking Project Page
Home | eMine: Web Page Transcoding Based on Eye Tracking Project Page The World Wide Web (web) has m ...
- save a web page as a single file (mht format) using Delphi code
Here's how to save a web page as a single file (mht format) using Delphi code: uses CDO_TLB, ADODB_T ...
- How a web page loads
The major web browsers load web pages in basically the same way. This process is known as parsing an ...
随机推荐
- 为知笔记 Markdown 新手指南
为知笔记 Markdown 新手指南 http://www.wiz.cn/feature-markdown.html 时序图,流程图详细流程图语法 http://adrai.github.io/flo ...
- 【spring 7】spring和Hibernate的整合:声明式事务
一.声明式事务简介 Spring 的声明式事务管理在底层是建立在 AOP 的基础之上的.其本质是对方法前后进行拦截,然后在目标方法开始之前创建或者加入一个事务,在执行完目标方法之后根据执行情况提交或者 ...
- centos7 搭建docker内运行rabbitmq,然后再镜像ha方案的完全教程,暂时一个宿主机只能运行一个docker的rabbitmq,但是集群 ha都正常
1.安装centos7.x,配置好网络2.因为docker需要比较高版本的内核,比如使用overlayfs作为默认docker文件系统要3.18,所以先升级内核到3.18以上版本,能直接过4是最佳了检 ...
- 【MVC】ASP.NET MVC中实现多个按钮提交的几种方法
有时候会遇到这种情况:在一个表单上需要多个按钮来完成不同的功能,比如一个简单的审批功能. 如果是用webform那不需要讨论,但asp.net mvc中一个表单只能提交到一个Action处理,相对比较 ...
- ASP.NET Razor 视图引擎编程参考
ASP.NET Razor 视图引擎编程参考 转载请注明出处:http://surfsky.cnblogs.com Rasor 视图引擎 http://msdn.microsoft.com/ ...
- jsp-status 404错误的解决方法汇总
接下来的解决方法实在一下情况下进行的: 1.tomcat配置是对的,能打开tomcat的主页(网址:http://localhost:8080/),如图, 但是在输入具体网址的时候,例如:http:/ ...
- 完成了server和client的框架设计
界面暂且也不搞.先把框架搭建起来.
- node中的流程控制中,co,thunkify为什么return callback()可以做到流程控制?
前言 我在学习generator ,yield ,co,thunkify的时候,有许多费解的地方,经过了许多的实践,也慢慢学会用,慢慢的理解,前一阵子有个其他项目的同事过来我们项目组学习node,发现 ...
- webform网站相关数据控件和其他
一.asp:Repeater <div class="bd"> <ul> <asp:Repeater ID="rept_slide" ...
- wcf调用oracle存储过程
public IList<ACCP_RAIN> QueryAll(string beginTime, string endTime, string type) { beginTime = ...