爪哇国新游记之三十四----Dom4j的XPath操作
Dom4j是Java访问XML的利器之一,另一个是JDom。记得当年因为粗掌握点JDomAPI但项目要求使用Dom4j还闹一阵情绪,现在看来真是没必要,只花一些时间成本就进去一个新世界绝对是值得做的一件事。更何况JDom因无人更新而停顿了。
Dom4j有两个包,一个是dom4j-1.6.1.jar,它提供基本的XML API支持,如访问节点,属性等。
还有一个是jaxen-1.1-beta-9.jar,它提供XPath支持。
关于XPath的语法,请见转载:XPath基本语法
言归正传,下面请看例程。
1.访问节点群
XML样本:
<applications>
<application name='chat'>
<mtLanguage source='ar_ar' target='en_us' />
<mtLanguage source='zh_cn' target='en_us' />
<mtLanguage source='zh_tw' target='en_us' />
<mtLanguage source='en_us' target='ar_ar' />
<mtLanguage source='en_us' target='zh_cn' />
<mtLanguage source='en_us' target='zh_tw' />
<mtLanguage source='en_us' target='fr_fr' />
<mtLanguage source='en_us' target='de_de' />
<mtLanguage source='en_us' target='it_it' />
<mtLanguage source='en_us' target='ja_jp' />
<mtLanguage source='en_us' target='ko_kr' />
<mtLanguage source='en_us' target='pt_br' />
<mtLanguage source='en_us' target='ru_ru' />
<mtLanguage source='en_us' target='es_es' />
<mtLanguage source='fr_fr' target='en_us' />
<mtLanguage source='de_de' target='en_us' />
<mtLanguage source='it_it' target='en_us' />
<mtLanguage source='ja_jp' target='en_us' />
<mtLanguage source='ko_kr' target='en_us' />
<mtLanguage source='pt_br' target='en_us' />
<mtLanguage source='ru_ru' target='en_us' />
<mtLanguage source='es_es' target='en_us' />
</application>
<application name='doc'>
<mtLanguage source='ar_ar' target='en_us' />
<mtLanguage source='zh_cn' target='en_us' />
<mtLanguage source='zh_tw' target='en_us' />
<mtLanguage source='en_us' target='ar_ar' />
<mtLanguage source='en_us' target='zh_cn' />
<mtLanguage source='en_us' target='zh_tw' />
<mtLanguage source='en_us' target='fr_fr' />
<mtLanguage source='en_us' target='de_de' />
<mtLanguage source='en_us' target='hi_in' />
<mtLanguage source='en_us' target='it_it' />
<mtLanguage source='en_us' target='ja_jp' />
<mtLanguage source='en_us' target='ko_kr' />
<mtLanguage source='en_us' target='pt_br' />
<mtLanguage source='en_us' target='ru_ru' />
<mtLanguage source='en_us' target='es_es' />
<mtLanguage source='en_us' target='ur_pk' />
<mtLanguage source='fr_fr' target='en_us' />
<mtLanguage source='de_de' target='en_us' />
<mtLanguage source='hi_in' target='en_us' />
<mtLanguage source='it_it' target='en_us' />
<mtLanguage source='ja_jp' target='en_us' />
<mtLanguage source='ko_kr' target='en_us' />
<mtLanguage source='pt_br' target='en_us' />
<mtLanguage source='ru_ru' target='en_us' />
<mtLanguage source='es_es' target='en_us' />
<mtLanguage source='ur_pk' target='en_us' />
</application>
</applications>
现在,如果我想要访问属性为chat的application节点下的所有mtLanguage子节点,XPath应该这样写:
//applications/application[@name='chat']/mtLanguage
而具体操作的Java语句是:
Document doc= DocumentHelper.parseText(responseXML);// 这个responseXML就是上面的XML样例
List<?> elms=doc.selectNodes("//applications/application[@name='chat']/mtLanguage");
System.out.println("There are "+elms.size()+" language pairs available in text translation");
for(Object obj:elms){
Element elm=(Element)obj;
System.out.println("From "+elm.attributeValue("source")+" to "+elm.attributeValue("target"));
}
执行上面语句输出如下:
There are 22 language pairs available in text translation From ar_ar to en_us From zh_cn to en_us From zh_tw to en_us From en_us to ar_ar From en_us to zh_cn From en_us to zh_tw From en_us to fr_fr From en_us to de_de From en_us to it_it From en_us to ja_jp From en_us to ko_kr From en_us to pt_br From en_us to ru_ru From en_us to es_es From fr_fr to en_us From de_de to en_us From it_it to en_us From ja_jp to en_us From ko_kr to en_us From pt_br to en_us From ru_ru to en_us From es_es to en_us
2.访问特定节点
XML样本:
<rep sts="OK" a="trep" tl="zh-CN">
<docs>
<d dt="ndoc" did="d20160223213120480009045125076363146" lang="en-US"
ctime="2016-02-23T21:31:20" mtime="2016-02-23T21:31:20" orig="1"
mime="text/x-mt-xml" wc="2">
<p pid="1" wc="2">
<s sid="1">
<t tid="1" tt="orig" wc="2">Good evening</t>
</s>
</p>
</d>
<d dt="ndoc" did="d20160223213120480009045125076363146" lang="zh-CN"
ctime="2016-02-23T21:31:20" mtime="2016-02-23T21:31:20" orig="0"
mime="text/x-mt-xml" sc="100.00" wc="1">
<p pid="1" wc="1">
<s sid="1">
<t tid="1" tt="mt" src="mt" sc="100.00" wc="1">晚上好</t>
</s>
</p>
</d>
</docs>
</rep>
如果我想得到上文中“晚上好”这段文字,XPath应该这样写
//rep/docs/d[last()]/p/s/t
对应的Java代码是:
Document doc= DocumentHelper.parseText(responseXML);
Element elm = (Element) doc.selectSingleNode("//rep/docs/d[last()]/p/s/t");
targetTxt=elm.getText();
3.取属性
XML样本:
<rep sts="OK" a="trep" tl="zh-CN">
<docs>1</docs>
</rep>
要取根节点rep的sts属性,XPath可以这样写:
//rep/@sts
而对应的Java语句是:
System.out.println("XML="+responseXML);
Document doc= DocumentHelper.parseText(responseXML);
Attribute attr = (Attribute) doc.selectSingleNode("//rep/@sts");
return attr.getText();
爪哇国新游记之三十四----Dom4j的XPath操作的更多相关文章
- 爪哇国新游记之十四----初试JDBC
import java.sql.Connection; import java.sql.DriverManager; import java.sql.PreparedStatement; import ...
- 爪哇国新游记之十九----使用Stack检查数字表达式中括号的匹配性
/** * 辅助类 * 用于记载字符和位置 * */ class CharPos{ char c; int pos; public CharPos(char c,int pos){ this.c=c; ...
- 爪哇国新游记之二十二----排序判断重复时间复杂度为2n的位图法
import java.util.ArrayList; import java.util.List; /** * 位图法 * 用于整型数组判重复,得到无重复列表 * */ public class B ...
- 爪哇国新游记之二十九----访问URL获取输入流
代码: import java.io.BufferedReader; import java.io.BufferedWriter; import java.io.FileWriter; import ...
- 爪哇国新游记之二十八----从url指定的地址下载文件到本地
package download; import java.io.File; import java.io.FileOutputStream; import java.io.InputStream; ...
- Dynamics CRM 2015/2016新特性之三十四:有了插件日志,调试插件so easy!
关注本人微信和易信公众号: 微软动态CRM专家罗勇 ,回复217或者20160330可方便获取本文,同时可以在第一间得到我发布的最新的博文信息,follow me!我的网站是 www.luoyong. ...
- 爪哇国新游记之十三----XML文件读写
/** * XML读写示例 * @author hx * */ public class XmlReaderWriter{ /** * 读取一个XML文件,返回一个雇员链表 * @param file ...
- 爪哇国新游记之七----使用ArrayList统计水果出现次数
之前学习制作了DArray,了解ArrayList就容易了. /** * 用于存储水果名及数量 * */ public class Fruit{ private String name; public ...
- 爪哇国新游记之二----用于计算三角形面积的Point类和TAngle类
这次尝试用两个类完成一个面积计算任务: Point类代表平面上的点: public class Point { private float x; private float y; public Poi ...
随机推荐
- NOIP200002税收与补贴
试题描述 每样商品的价格越低,其销量就会相应增大.现已知某种商品的成本及其在若干价位上的销量(产品不会低于成本销售),并假设相邻价位间销量的变化是线性的且在价格高于给定的最高价位后,销量以某固定数值递 ...
- Strong TLS configuration on servers
- Use certificates with at least sha-256 hash algorithms (including intermediate certificates).- Use ...
- 遍历josn的三种方式
第一种:使用for循环 js代码: function CyclingJson1() { var testJson = '[{ "name": "小强", &qu ...
- 移动Web—CSS为Retina屏幕替换更高质量的图片
来源:互联网 作者:佚名 时间:12-24 10:37:45 [大 中 小] 点评:Retian似乎是屏幕显示的一种趋势,这也是Web设计师面对的一个新挑战;移动应用程序的设计师们已经学会了如何为Re ...
- myeclipse 第一个web project
创建一个java project. 不行...js文件是javascript代码的文件.应该放在web目录下...java文件是后台管理的程序代码.放在src目录下...不同的... 那是不是把所 ...
- hibernate将本地SQL查询结果封装成对象
hibernate将本地SQL查询结果封装成对象 不知道大家有没有碰过这种情况,迫于很多情况只能用native SQL来查询(如:复杂统计等),然而使用native查询后,结果会被放到object里, ...
- empty()与remove([expr])的区别.转
jquery之empty()与remove()区别 要用到移除指定元素的时候,发现empty()与remove([expr])都可以用来实现.可仔细观察效果的话就可以发现.empty()是只移除了 ...
- Visual Studio快捷键不能使用解决办法
环境: Visual Studio 2010,windows 7 使用Visual Studio查找变量或方法时常用到[定位到]功能 但该功能的快捷键却不能使用,解决办法如下所示: 1.工具--> ...
- 基于LR的HTTP协议接口性能测试脚本实例
背景介绍 XXX项目性能测试中新增业务场景:XX设备的在线激活,因为存在多用户同时在线激活,故需进行性能测试以确认后台服务器系统在多用并发时功能是否正常,性能指标是否满足规格要求.用户使用场景为用户通 ...
- Apache查看并发及TIME_WAIT过多的解决
1.查看并发#ps -ef | grep httpd -c2.查看并发数及tpc连接状态netstat -n | awk '/^tcp/ {++S[$NF]} END {for(a in S) pri ...