Tutorial: Importing and analyzing data from a Web Page using Power BI Desktop
In this tutorial, you will learn how to import a table of data from a Web page and create a report to visualize this data. As part of this process, you navigate across tables available on a web page, and apply data transformation steps to bring the table into a new shape.
In this article - Task 1: Connect to a web data source - Task 2: Shape data in the Query view - Step 1: Remove Other Columns to only display columns of interest - Step 2: Replace Values to clean up values in a selected column - Step 3: Filter values in a column - Step 4: Rename a column - Step 5: Filter null values in a column - Step 6: Rename a query - Query Steps created - Task 3: Create visualizations using the Report view - Step 1: Load the query to your report - Step 2: Create a Map visualization
Task 1: Connect to a web data source
In task 1, you import a Tournament Summary table from the UEFA European Football Championship Wikipedia page at http://en.wikipedia.org/wiki/UEFA_European_Football_Championship

Add a Wikipedia page data source
In the Getting Started dialog or in the Home ribbon tab, click Get Data.
This brings up the Get Data dialog, where you can pick from a wide range of data sources to import data into Power BI Desktop. We will select Web which is available under the All or Other group.
In the Web Content dialog box, in the URL text box, paste the Wikipedia URL (http://en.wikipedia.org/wiki/UEFA_European_Football_Championship).
Click OK.
After establishing a connection to the web page, you see a list of tables available on this Wikipedia page in the Navigator dialog. You can single-click on each of these tables to preview the data.
In the Navigator left-pane, select the Results[edit] table for the Tournament Summary results, or select the Results[edit] table and select Edit. This will allow us to reshape this table before loading it to the Report, since the data is not in the shape that we need for our analysis.

This will land a preview of the table in the Query view, where we can apply a set of transformation steps to clean up the data.

Task 2: Shape data in the subject table
Now that you have the subject table selected for your data query, you learn how to perform various data shaping and cleansing steps.
Step 1: Remove Other Columns to only display columns of interest
In this step, you remove all columns except Year and Final Winners.
In the Query Preview grid, select the Year and Final Winners columns (use CTRL + Click).
Right-click a column header in the Query Preview grid, and click Remove Other Columns to remove the unselected columns. Note that this operation is also available in the Home ribbon tab, in the Manage Columns group.

Step 2: Replace Values to clean up values in a selected column
In this step, you replace the Details suffix in the Year column. Note that this suffix is on a new line so it is not visible in the table preview. However, if you click in one of the cells with a numeric value in the Year column, you will see the full value in the detailed view.

Select the Year column.
In the Query view ribbon, click Replace Values under the Home tab or right-click the Year column, and click Replace Values to replace Details with empty text.
In the Replace Values dialog box, type Details in the Value to Find text box and leave the Replace With text box empty.
Click OK.

Step 3: Filter values in a column
In this step, you filter the Year column to display rows that do not contain “Year”.
Click the filter drop down arrow on the Year column.
In the Filter drop-down, clear the Year option.
Click OK.

Step 4: Rename a column
Now that we have cleaned up the data in the Year column, we are going to work on the Final Winner column.
Since we are only looking at the list of winners, we can rename this column to Country.
Select the Final Winner column in the Query preview.
In the Query view ribbon, under the Transform tab and Any Column group, you will find Rename.
This will make the column name editable. We will rename this column to Country.
Step 5: Filter out null values in a column
We also need to filter out null values in the Country column. In order to do this, we could use the filter menu as we saw in Step 3, or alternatively we can:
Right-click on one of the cells in the Country column that contain a null value.
Select Text Filters -> Does not Equal in the context menu.
This creates a new filter step to remove rows with null values in the Country column.
Step 6: Name a query
In this step, you name your final query Euro Cup Winners.
- In the Query Settings pane, in the Name text box, enter Euro Cup Winners.

Task 3: Create visualizations using the Report view
Now that we have converted the data into the shape that we need for our analysis, we can load the resulting table into our Report and create a few visualizations.
Step 1: Load the query to your report
In order to load the query results to Power BI Desktop and create a report, we select Close & Load from the Home ribbon.

This will trigger evaluation of the query and load of the table output to the Report. In Power BI Desktop, select the Report icon to see Power BI Desktop in Report view.

You can see the resulting table fields in the Fields pane at the right of the Report view.

Step 2: Create a Map visualization
In order to create a visualization, we can drag fields from the Field list and drop them in the Report canvas.
Drag the Country field and drop it in the Report canvas. This will create a new visualization in the Report canvas. In this case, since we have a list of countries, it will create a Map visualization.

We can easily change the type of visualization by clicking on a different icon in the Visualization pane.

3. We are going to stay with the Map visualization type to Map, We can also resize the visualization by dragging from one of the corners of the visualization up to the desired size.

4. Note that currently all the points in the map have the same size. We want to change this so that countries with more Euro Cup tournaments won are represented with a larger point in the map. In order to do thiso, we can drag the Year field in the Fields list to the Values box in the lower half of the Fields pane.

As you can see, it is very easy to customize visualizations in your report, in order to present the data in the way that you want. Power BI Desktop provides a seamless end-to-end experience from getting data from a wide range of data sources and shaping it to meet your analysis needs to visualizing this data in rich and interactive ways. Once your report is ready, you can upload it to Power BI and create dashboards based on it, which you can share with other Power BI users.
This concludes the Importing Data from the Web tutorial. You can download the completed Power BI Desktop file here.
Tutorial: Importing and analyzing data from a Web Page using Power BI Desktop的更多相关文章
- Tutorial: Facebook analytics using Power BI Desktop
In this tutorial you learn how to import and visualize data from Facebook. During the tutorial you'l ...
- [Project] Simulate HTTP Post Request to obtain data from Web Page by using Python Scrapy Framework
1. Background Though it's always difficult to give child a perfect name, parent never give up trying ...
- DEDECMS系统安全篇之移data目录到Web根目录以外听语音
http://jingyan.baidu.com/article/ad310e80aeb0971849f49e8e.html 主要三个步骤: 1./include/common.inc.php 2.还 ...
- mysql --secure-file-priv is set to NULL.Operations related to importing and exporting data are disabled
--secure-file-priv is set to NULL. Operations related to importing and exporting data are disabledmy ...
- 《Using Python to Access Web Data》 Week5 Web Services and XML 课堂笔记
Coursera课程<Using Python to Access Web Data> 密歇根大学 Week5 Web Services and XML 13.1 Data on the ...
- How To Crawl A Web Page with Scrapy and Python 3
sklearn实战-乳腺癌细胞数据挖掘(博主亲自录制视频) https://study.163.com/course/introduction.htm?courseId=1005269003& ...
- Home | eMine: Web Page Transcoding Based on Eye Tracking Project Page
Home | eMine: Web Page Transcoding Based on Eye Tracking Project Page The World Wide Web (web) has m ...
- save a web page as a single file (mht format) using Delphi code
Here's how to save a web page as a single file (mht format) using Delphi code: uses CDO_TLB, ADODB_T ...
- How a web page loads
The major web browsers load web pages in basically the same way. This process is known as parsing an ...
随机推荐
- http 302
404 not found500 internal server error 302临时重定向.指被访问的网页由于各种需求临时跳转到其它页面. yii若用户为游客状态,但controller中添加了权 ...
- 学习总结 DML数据库增删改语句
insert into score t values('111','3-105',88)--插入一行数据 insert into score(sno,cno) values('111','3-105' ...
- Winserver2008R2 .netframework4.5 asp.netmvc 访问出现的是文件列表。
Winserver2008R2 .netframework4.5 asp.netmvc 访问出现的是文件列表,服务器需要安装如下的补丁,才可正常访问. http://www.microsoft.com ...
- ios项目记录
1,如何隐藏状态栏 在基类中重载UIViewController.h中的这个方法 - (BOOL)prefersStatusBarHidden { // iOS7后,[[UIApplication s ...
- .Net性能优化时应该关注的数据
解决性能问题的时候,我往往会让客户添加下面一些计数器进行性能收集. Process object下的所有计数器: Processor object下的所有计数器: System object下的所有计 ...
- Chrome调试(debugger)总是进入paused in debugger状态
在通过Chrome浏览器进行web前端开发时,我们会经常用到Chrome自带的debugger工具,但是经常按完快捷键(F12)后,页面会进入 paused in debugger状态,需要点击右上角 ...
- DEDECMS自动编号(序号)autoindex属性
让织梦dedecms autoindex,itemindex 从0到1开始的办法! 1 2 3 [field:global name=autoindex runphp="yes"] ...
- 【linux】 静态库编译
文件如下: root@ubuntu:/home/test# ll total drwxr-xr-x root root Sep : ./ drwxr-xr-x root root Sep : ../ ...
- Cent OS yum 安装 Adobe flash player
桌面打开浏览器访问:http://get.adobe.com/cn/flashplayer/.网页会判断操作系统和浏览器并下载 Flash Player(支持Firefox浏览器). 或者直接下载: ...
- .NET如何从配置文件中获取连接字符串
一.设置配置文件 <configuration> <!--在configuration下创建一个connectionStrings--> <connectionStrin ...