xml.etree.ElementTree用于解析和构建XML文件

<?xml version="1.0"?>

<data>

    <country name="Liechtenstein">

        <rank>1</rank>

        <year>2008</year>

        <gdppc>141100</gdppc>

        <neighbor name="Austria" direction="E"/>

        <neighbor name="Switzerland" direction="W"/>

    </country>

    <country name="Singapore">

        <rank>4</rank>

        <year>2011</year>

        <gdppc>59900</gdppc>

        <neighbor name="Malaysia" direction="N"/>

    </country>

    <country name="Panama">

        <rank>68</rank>

        <year>2011</year>

        <gdppc>13600</gdppc>

        <neighbor name="Costa Rica" direction="W"/>

        <neighbor name="Colombia" direction="E"/>

    </country>

</data>

解析XML文件

parse()函数，从xml文件返回ElementTree

from xml.etree.ElementTree import parse

tree = parse('demo.xml')  //获取ElementTree

root = tree.getroot()   // 获取根元素

Element.tag 、Element.attrib、Element.text

In [6]: root.tag

Out[6]: 'data'

In [7]: root.attrib

Out[7]: {}

In [25]: root.text

Out[25]: '\n    '

for child in root 迭代获得子元素

In [8]: for child in root:

   ...:     print(child.tag, child.attrib)

   ...:

country {'name': 'Liechtenstein'}

country {'name': 'Singapore'}

country {'name': 'Panama'}

Element.get() 获得属性值

In [27]: for child in root:

    ...:     print (child.tag, child.get('name'))

    ...:

country Liechtenstein

country Singapore

country Panama

root.getchildren() 获得直接子元素

In [21]: root.getchildren()

Out[21]:

[<Element 'country' at 0x7f673581c728>,

 <Element 'country' at 0x7f673581ca98>,

 <Element 'country' at 0x7f673581cc28>]

root[0][1] 根据索引查找子元素

In [9]: root[0][1].text

Out[9]: '2008'

In [10]: root[1][0].text

Out[10]: '4'

root.find()　根据tag查找直接子元素，返回查到的第一个元素

In [13]: root.find('country').attrib

Out[13]: {'name': 'Liechtenstein'}

root.findall() 根据tag查找直接子元素，返回查到的所有元素的列表

In [16]: for country in root.findall('country'):

    ...:     print  (country.attrib)

    ...:

{'name': 'Liechtenstein'}

{'name': 'Singapore'}

{'name': 'Panama'}

root.iterfind() 根据tag查找直接子元素，返回查到的所有元素的生成器

In [22]: root.iterfind('country')

Out[22]: <generator object prepare_child.<locals>.select at 0x7f6736dccfc0>

支持的XPath语句(XML Path)

In [19]: root.findall('.//rank')  //查找任意层次元素

Out[19]:

[<Element 'rank' at 0x7f673581c8b8>,

 <Element 'rank' at 0x7f673581c6d8>,

 <Element 'rank' at 0x7f673581cc78>]

In [32]: root.findall('country/*')  //查找孙子节点元素

Out[32]:

[<Element 'rank' at 0x7f673581c8b8>,

 <Element 'year' at 0x7f673581cbd8>,

 <Element 'gdppc' at 0x7f673581c958>,

 <Element 'neighbor' at 0x7f673581c688>,

 <Element 'neighbor' at 0x7f673581cb38>,

 <Element 'rank' at 0x7f673581c6d8>,

 <Element 'year' at 0x7f673581c5e8>,

 <Element 'gdppc' at 0x7f673581c868>,

 <Element 'neighbor' at 0x7f673581cb88>,

 <Element 'rank' at 0x7f673581cc78>,

 <Element 'year' at 0x7f673581ccc8>,

 <Element 'gdppc' at 0x7f673581cd18>,

 <Element 'neighbor' at 0x7f673581cd68>,

 <Element 'neighbor' at 0x7f673581cdb8>]

In [33]: root.findall('.//rank/..')   // ..表示父元素

Out[33]:

[<Element 'country' at 0x7f673581c728>,

 <Element 'country' at 0x7f673581ca98>,

 <Element 'country' at 0x7f673581cc28>]

In [34]: root.findall('country[@name]')   // 包含name属性的country

Out[34]:

[<Element 'country' at 0x7f673581c728>,

 <Element 'country' at 0x7f673581ca98>,

 <Element 'country' at 0x7f673581cc28>]

In [35]: root.findall('country[@name="Singapore"]')   // name属性为Singapore的country

Out[35]: [<Element 'country' at 0x7f673581ca98>]

In [36]: root.findall('country[rank]')   // 孩子元素中包含rank的country

Out[36]:

[<Element 'country' at 0x7f673581c728>,

 <Element 'country' at 0x7f673581ca98>,

 <Element 'country' at 0x7f673581cc28>]

In [37]: root.findall('country[rank="68"]')   // 孩子元素中包含rank且rank元素的text为68的country

Out[37]: [<Element 'country' at 0x7f673581cc28>]

In [38]: root.findall('country[1]')     // 第一个country

Out[38]: [<Element 'country' at 0x7f673581c728>]

In [39]: root.findall('country[last()]')   // 最后一个country

Out[39]: [<Element 'country' at 0x7f673581cc28>]

In [40]: root.findall('country[last()-1]')    // 倒数第二个country

Out[40]: [<Element 'country' at 0x7f673581ca98>]

root.iter() 递归查询指定的或所有子元素　

In [29]: root.iter()

Out[29]: <_elementtree._element_iterator at 0x7f67355dd728>

In [30]: list(root.iter())

Out[30]:

[<Element 'data' at 0x7f673581c778>,

 <Element 'country' at 0x7f673581c728>,

 <Element 'rank' at 0x7f673581c8b8>,

 <Element 'year' at 0x7f673581cbd8>,

 <Element 'gdppc' at 0x7f673581c958>,

 <Element 'neighbor' at 0x7f673581c688>,

 <Element 'neighbor' at 0x7f673581cb38>,

 <Element 'country' at 0x7f673581ca98>,

 <Element 'rank' at 0x7f673581c6d8>,

 <Element 'year' at 0x7f673581c5e8>,

 <Element 'gdppc' at 0x7f673581c868>,

 <Element 'neighbor' at 0x7f673581cb88>,

 <Element 'country' at 0x7f673581cc28>,

 <Element 'rank' at 0x7f673581cc78>,

 <Element 'year' at 0x7f673581ccc8>,

 <Element 'gdppc' at 0x7f673581cd18>,

 <Element 'neighbor' at 0x7f673581cd68>,

 <Element 'neighbor' at 0x7f673581cdb8>]

In [31]: list(root.iter('rank'))

Out[31]:

[<Element 'rank' at 0x7f673581c8b8>,

 <Element 'rank' at 0x7f673581c6d8>,

 <Element 'rank' at 0x7f673581cc78>]

python模块之xml.etree.ElementTree的更多相关文章

python模块：xml.etree.ElementTree
"""Lightweight XML support for Python. XML is an inherently hierarchical data format, ...
python标准库xml.etree.ElementTree的bug
使用python生成或者解析xml的方法用的最多的可能就数python标准库xml.etree.ElementTree和lxml了,在某些环境下使用xml.etree.ElementTree更方便一些 ...
[python 学习] 使用 xml.etree.ElementTree 模块处理 XML
---恢复内容开始--- 导入数据(读文件和读字符串) 本地文件 country_data.xml <?xml version="1.0"?> <data> ...
[python 2.x] xml.etree.ElementTree module
XML 文件:xmlparse.xml <?xml version="1.0" encoding="UTF-8" standalone="no& ...
Python 标准库之 xml.etree.ElementTree
Python 标准库之 xml.etree.ElementTree Python中有多种xml处理API,常用的有xml.dom.*模块.xml.sax.*模块.xml.parser.expat模块和 ...
python 之xml.etree.ElementTree
Element类型是一种灵活的容器对象,用于在内存中存储结构化数据. ［注意］xml.etree.ElementTree模块在应对恶意结构数据时显得并不安全. 每个element对象都具有以下属性: ...
python解析xml文件之xml.etree.cElementTree和xml.etree.ElementTree区别和基本使用
1.解析速度:ElementTree在 Python 标准库中有两种实现.一种是纯 Python 实现例如 xml.etree.ElementTree ,另外一种是速度快一点的 xml.etree.c ...
python xml.etree.ElementTree模块
使用的XML文件如下:file.xml <?xml version="1.0"?> <data name="ming"> <cou ...
python 解析xml遇到xml.etree.ElementTree.ParseError: not well-formed (invalid token): line 4, column 34
在调试数字驱动用xml文件的方式时,包含读取xml文件的步骤,运行程序报错: d:\test\0629>python XmlUtil.pyTraceback (most recent call ...

随机推荐

C++解析(7)：函数重载分析
0.目录 1.重载的概念 2.C++中的函数重载 3.函数默认参数遇上函数重载 4.编译器调用重载函数的准则 5.重载与指针 6.C++和C相互调用 7.小结 1.重载的概念自然语言中的上下文--你 ...
Eclipse如何将代码变成大写/小写
代码变小写:选中要换的代码,操作Ctrl+Shift+y即可将大写变小写代码变大写:选中要换的代码,操作Ctrl+Shift+x即可将小写变大写
【BZOJ3166】ALO（主席树）
[BZOJ3166]ALO(主席树) 题面权限题qwq 资磁洛谷题解用一个$set$求出左右侧比这个数大的第$2$个数, 然后用可持久化$Trie$算一下就好啦 #include&l ...
mysql权限管理，用户管理
1 创建用户 mysql> truncate table user; //先删除所有用户 mysql> CREATE USER 'paris'@'localhost' IDENTIFIE ...
luoguP5105 不强制在线的动态快速排序
emm 可重集合没用用.直接变成不可重复集合有若干个区间每个区间形如[L,R] [L,R]计算的话,就是若干个连续奇数的和.拆位统计1的个数平衡树维护加入一个[L,R],把相交的区间合并.之后 ...
python基础----__next__和__iter__实现迭代器协议
#_*_coding:utf-8_*_ __author__ = 'Linhaifeng' class Foo: def __init__(self,x): self.x=x def __iter__ ...
opencv 获取摄像头图像
http://www.cnblogs.com/epirus/archive/2012/06/04/2535190.html #include "stdafx.h" #include ...
图像GIF格式介绍
1 图像GIF格式工作原理 GIF是用于压缩具有单调颜色和清晰细节的图像(如线状图.徽标或带文字的插图)的标准格式. GIF(Graphics InterchangeFormat)的原义是“图像互换格 ...
HDU 4584 splay
Shaolin Time Limit: 3000/1000 MS (Java/Others) Memory Limit: 65535/32768 K (Java/Others)Total Sub ...
echo 不换行
原文 http://blog.sina.com.cn/s/blog_4da051a6010184uk.html echo -n 不换行输出 $echo -n "123" $ec ...

python模块之xml.etree.ElementTree

xml.etree.ElementTree用于解析和构建XML文件

解析XML文件

parse()函数，从xml文件返回ElementTree

Element.tag 、Element.attrib、Element.text

for child in root 迭代获得子元素

Element.get() 获得属性值

root.getchildren() 获得直接子元素

root[0][1] 根据索引查找子元素

root.find() 根据tag查找直接子元素，返回查到的第一个元素

root.findall() 根据tag查找直接子元素，返回查到的所有元素的列表

root.iterfind() 根据tag查找直接子元素，返回查到的所有元素的生成器

支持的XPath语句(XML Path)

root.iter() 递归查询指定的或所有子元素

python模块之xml.etree.ElementTree的更多相关文章

随机推荐

热门专题

root.find()　根据tag查找直接子元素，返回查到的第一个元素

root.iter() 递归查询指定的或所有子元素