Tutorial: Importing and analyzing data from a Web Page using Power BI Desktop
In this tutorial, you will learn how to import a table of data from a Web page and create a report to visualize this data. As part of this process, you navigate across tables available on a web page, and apply data transformation steps to bring the table into a new shape.
In this article - Task 1: Connect to a web data source - Task 2: Shape data in the Query view - Step 1: Remove Other Columns to only display columns of interest - Step 2: Replace Values to clean up values in a selected column - Step 3: Filter values in a column - Step 4: Rename a column - Step 5: Filter null values in a column - Step 6: Rename a query - Query Steps created - Task 3: Create visualizations using the Report view - Step 1: Load the query to your report - Step 2: Create a Map visualization
Task 1: Connect to a web data source
In task 1, you import a Tournament Summary table from the UEFA European Football Championship Wikipedia page at http://en.wikipedia.org/wiki/UEFA_European_Football_Championship

Add a Wikipedia page data source
In the Getting Started dialog or in the Home ribbon tab, click Get Data.
This brings up the Get Data dialog, where you can pick from a wide range of data sources to import data into Power BI Desktop. We will select Web which is available under the All or Other group.
In the Web Content dialog box, in the URL text box, paste the Wikipedia URL (http://en.wikipedia.org/wiki/UEFA_European_Football_Championship).
Click OK.
After establishing a connection to the web page, you see a list of tables available on this Wikipedia page in the Navigator dialog. You can single-click on each of these tables to preview the data.
In the Navigator left-pane, select the Results[edit] table for the Tournament Summary results, or select the Results[edit] table and select Edit. This will allow us to reshape this table before loading it to the Report, since the data is not in the shape that we need for our analysis.

This will land a preview of the table in the Query view, where we can apply a set of transformation steps to clean up the data.

Task 2: Shape data in the subject table
Now that you have the subject table selected for your data query, you learn how to perform various data shaping and cleansing steps.
Step 1: Remove Other Columns to only display columns of interest
In this step, you remove all columns except Year and Final Winners.
In the Query Preview grid, select the Year and Final Winners columns (use CTRL + Click).
Right-click a column header in the Query Preview grid, and click Remove Other Columns to remove the unselected columns. Note that this operation is also available in the Home ribbon tab, in the Manage Columns group.

Step 2: Replace Values to clean up values in a selected column
In this step, you replace the Details suffix in the Year column. Note that this suffix is on a new line so it is not visible in the table preview. However, if you click in one of the cells with a numeric value in the Year column, you will see the full value in the detailed view.

Select the Year column.
In the Query view ribbon, click Replace Values under the Home tab or right-click the Year column, and click Replace Values to replace Details with empty text.
In the Replace Values dialog box, type Details in the Value to Find text box and leave the Replace With text box empty.
Click OK.

Step 3: Filter values in a column
In this step, you filter the Year column to display rows that do not contain “Year”.
Click the filter drop down arrow on the Year column.
In the Filter drop-down, clear the Year option.
Click OK.

Step 4: Rename a column
Now that we have cleaned up the data in the Year column, we are going to work on the Final Winner column.
Since we are only looking at the list of winners, we can rename this column to Country.
Select the Final Winner column in the Query preview.
In the Query view ribbon, under the Transform tab and Any Column group, you will find Rename.
This will make the column name editable. We will rename this column to Country.
Step 5: Filter out null values in a column
We also need to filter out null values in the Country column. In order to do this, we could use the filter menu as we saw in Step 3, or alternatively we can:
Right-click on one of the cells in the Country column that contain a null value.
Select Text Filters -> Does not Equal in the context menu.
This creates a new filter step to remove rows with null values in the Country column.
Step 6: Name a query
In this step, you name your final query Euro Cup Winners.
- In the Query Settings pane, in the Name text box, enter Euro Cup Winners.

Task 3: Create visualizations using the Report view
Now that we have converted the data into the shape that we need for our analysis, we can load the resulting table into our Report and create a few visualizations.
Step 1: Load the query to your report
In order to load the query results to Power BI Desktop and create a report, we select Close & Load from the Home ribbon.

This will trigger evaluation of the query and load of the table output to the Report. In Power BI Desktop, select the Report icon to see Power BI Desktop in Report view.

You can see the resulting table fields in the Fields pane at the right of the Report view.

Step 2: Create a Map visualization
In order to create a visualization, we can drag fields from the Field list and drop them in the Report canvas.
Drag the Country field and drop it in the Report canvas. This will create a new visualization in the Report canvas. In this case, since we have a list of countries, it will create a Map visualization.

We can easily change the type of visualization by clicking on a different icon in the Visualization pane.

3. We are going to stay with the Map visualization type to Map, We can also resize the visualization by dragging from one of the corners of the visualization up to the desired size.

4. Note that currently all the points in the map have the same size. We want to change this so that countries with more Euro Cup tournaments won are represented with a larger point in the map. In order to do thiso, we can drag the Year field in the Fields list to the Values box in the lower half of the Fields pane.

As you can see, it is very easy to customize visualizations in your report, in order to present the data in the way that you want. Power BI Desktop provides a seamless end-to-end experience from getting data from a wide range of data sources and shaping it to meet your analysis needs to visualizing this data in rich and interactive ways. Once your report is ready, you can upload it to Power BI and create dashboards based on it, which you can share with other Power BI users.
This concludes the Importing Data from the Web tutorial. You can download the completed Power BI Desktop file here.
Tutorial: Importing and analyzing data from a Web Page using Power BI Desktop的更多相关文章
- Tutorial: Facebook analytics using Power BI Desktop
In this tutorial you learn how to import and visualize data from Facebook. During the tutorial you'l ...
- [Project] Simulate HTTP Post Request to obtain data from Web Page by using Python Scrapy Framework
1. Background Though it's always difficult to give child a perfect name, parent never give up trying ...
- DEDECMS系统安全篇之移data目录到Web根目录以外听语音
http://jingyan.baidu.com/article/ad310e80aeb0971849f49e8e.html 主要三个步骤: 1./include/common.inc.php 2.还 ...
- mysql --secure-file-priv is set to NULL.Operations related to importing and exporting data are disabled
--secure-file-priv is set to NULL. Operations related to importing and exporting data are disabledmy ...
- 《Using Python to Access Web Data》 Week5 Web Services and XML 课堂笔记
Coursera课程<Using Python to Access Web Data> 密歇根大学 Week5 Web Services and XML 13.1 Data on the ...
- How To Crawl A Web Page with Scrapy and Python 3
sklearn实战-乳腺癌细胞数据挖掘(博主亲自录制视频) https://study.163.com/course/introduction.htm?courseId=1005269003& ...
- Home | eMine: Web Page Transcoding Based on Eye Tracking Project Page
Home | eMine: Web Page Transcoding Based on Eye Tracking Project Page The World Wide Web (web) has m ...
- save a web page as a single file (mht format) using Delphi code
Here's how to save a web page as a single file (mht format) using Delphi code: uses CDO_TLB, ADODB_T ...
- How a web page loads
The major web browsers load web pages in basically the same way. This process is known as parsing an ...
随机推荐
- 洛谷P1472 奶牛家谱 Cow Pedigrees
P1472 奶牛家谱 Cow Pedigrees 102通过 193提交 题目提供者该用户不存在 标签USACO 难度普及+/提高 提交 讨论 题解 最新讨论 暂时没有讨论 题目描述 农民约翰准备 ...
- Java基础学习(学习IT企业必读的324个JAVA面试题.pdf 整理)
一.Java程序基础 javac 文件名.java 编译程序 java 类名 运行java程序 代码规范中,一下几点要注意: 包名:包名是全小写的名词,中间可以由点分 ...
- javascript 实现HashTable(哈希表)
一.javascript哈希表简介 javascript里面是没有哈希表的,一直在java,C#中有时候用到了这一种数据结构,javascript里面若没有,感觉非常不顺手.细细看来,其实javasc ...
- 571亿背后:DRC助阿里实现异地双活
571亿背后:DRC助阿里实现异地双活 赶集网SQL自动上线
- CSS 之 Opacity多浏览器透明度兼容处理
用来设定元素透明度的 Opacity 是CSS 3里的一个属性.当然现在还只有少部分浏览器支持. 不过各个浏览器都有自己的私有属性来支持,其中包括老版本的Mozilla和Safari: IE: fil ...
- C++ 学习基础一
1.预处理器指示符如果文件名用尖括号“<”和”>”括起来的,则表示该文件是工程或标准头文件,查找过程会检查预定义的目录.如果文件名用双引号括起来,则表示该文件是用户自定义的头文件,查找该文 ...
- NTP服务搭建
1:实验环境 前提条件:虚拟机环境,windows 客户端,关闭防火墙:linux 服务器关闭防火墙,关闭selinux. Cent OS :NTP服务器 IP: 192.168.80.134 /2 ...
- 017Makefile工程管理
1.为什么需要Makefile? 利用Makefile和make的合作,可以把很多很多的工作合并成一个非常简单的命令:make: make能够使整个程序的编译.链接只需要一个命令(make)就可以完成 ...
- Android IOS WebRTC 音视频开发总结(二十)-- 自由职业
咋看标题感觉与WebRTC和音视频无关,其实有着很大的关联,文章来自博客园RTC.Blacker,转载请说明出处. 背景: 一方面因为对开发人员比较了解,不喜欢约束,喜欢自由自在,所以我们向往自由职业 ...
- pect-shell中的自动交互
这是我在ST写的自动登陆ssh的脚本, 分两个文件 文件1: sssh #!/bin/bash #xql 2011/01/4#auto ssh 138.198.230.170 case $1 in & ...