The need for XML “schemas”

•Unlike any other data format, XML is totally flexible, elements can be nested in arbitrary ways
•We can start by writing the XML data -- no need for a priori design of a schema
–Think relational databases, or Java classes
•However, schemas are necessary:
–Facilitate the writing of applications that process data
–Constraint the data that is correct for a certain application
–Have a priori agreements between parties with respect to the data being exchanged
•Schema: a model of the data
–Structural definitions
–Type definitions
–Defaults
  • Introduction

DTD is the abbreviation for "Document Type Definition" to define the legal building blocks of   an XML document with a list of legal elements and attributes, which can be defined inline an XML doc or as an external reference. With the DTD, your can verify the data that you receive from the outside world is valid. Elements/attributes names in XML are case-sensitive, so DTD must be case-sensitive also!

Parsing XML Documents

•Parsers
–Validating
•Able to read DTD
•Determine whether XML document conforms to DTD 

– Valid document conforms to DTD

» Document is then well formed, by definition

» Documents can be well formed, but not valid

–Nonvalidating
•Able to read DTD
•Cannot check document against DTD for conformity
 

Example: inline example

<?xml version="1.0"?>  <? PI ?>

<!DOCTYPE note [                // defines that the root element of this document is note

<!ELEMENT note (to,from,heading,body)>  //defines that the note element contains four elements: "to,from,heading,body"

<!ELEMENT to (#PCDATA)>   // defines the to element to be of type "#PCDATA"

<!ELEMENT from (#PCDATA)>                                   Parsable character data

<!ELEMENT heading (#PCDATA)>
<!ELEMENT body (#PCDATA)>
]>

<note>           Root element
<to>Tove</to>
<from>Jani</from>
<heading>Reminder</heading>
<body>Don't forget me this weekend</body>
</note>

external example:

use <!DOCTYPE note SYSTEM "note.dtd"> to replace the inline DTD block

SYSTEM indicates its a private DTD (not for public distribution)

  • Building Block

Elements, Attributes, Entities, CDATA, PCDATA

PCDATA should not contain any characters like &, > or < which should be represented by &amp, &lt and &gt entities, respectively.

CDATA will not be parsed by a parser.

 

elements:

  1. Declaring Elements:<!ELEMENT element-name category>
  2. Empty elements: <!ELEMENT oven EMPTY>  <oven/>
  3. Elements with only parsed character data: <!ELEMENT from (#PCDATA)>
  4. can contain any combination of parable data: <!ELEMENT note ANY> replace by specific content now.
  5. Elements with one or more children: <!ELEMENT element-name (child1,child2,…)>
  6. Only one occurrence: <!ELEMENT element-name (child-name)>
  7. one or more occurrence: <!ELEMENT element-name (child-name+)>
  8. zero or more occurrence: <!ELEMENT element-name (child-name*)>
  9. zero or one occurrence: <!ELEMENT element-name (child-name?)>
  10. either or occurrence: <!ELEMENT note (to,from,header,(message|body))>
  11. mixed content: <!ELEMENT note (#PCDATA|to|from|header|message)*>

5:consecutively; 11: no specific sequence

attributes:

<!ATTLIST element-name attribute-name attribute-type attribute-value>

Declaration:     <!ATTLIST payment typeCDATA "check">

XML example:     <payment type="check" />

 

Types

Description

CDATA (strings)

The value is character data except <, >, &, ’and ”

(en1|en2|..)  enumerated

The value must be one from an enumerated list

ID tokenized most restrictive

The value is a unique id--> Uniquely identifies an element

IDREF tokenized

The value is the id of another element--> Point to element with ID

IDREFS tokenized

The value is a list of other ids  consistency to ID

NMTOKEN

The value is a valid XML name

NMTOKENS

The value is a list of valid XML names

ENTITY tokenized

The value is an entity

ENTITIES tokenized

The value is a list of entities

NOTATION

The value is a name of a notation

xml:

The value is a predefined xml value

 

value: the value of the attribute

 

Required:

<!ATTLIST person number CDATA #REQUIRED>

Valid XML:
<person number="5677" />

Invalid XML:
<person />

 

Implied:

<!ATTLIST contact fax CDATA #IMPLIED>

Valid XML:
<contact fax="555-667788" />

Valid XML:
<contact />

 

Fixed:

<!ATTLIST sender company CDATA #FIXED "Microsoft">

Valid XML:
<sender company="Microsoft" />

Invalid XML:
<sender company="W3Schools" />

 

Enumerated attribute values:

<!ATTLIST payment type (check|cash) "cash">

XML example:
<payment type="check" />
or
<payment type="cash" />

elements vs attributes:

There is no rule for when to use elements or attributes

Store data in element is better and use attribute to provide information not relevant to data.

Metadata (data about data) should be stored as attributes, and that data itself should be stored   as elements.

Entities:

internal entities: <!ENTITY entity-name "entity-value">

DTD Example:

<!ENTITY writer "Donald Duck.">

<!ENTITY copyright "Copyright W3Schools.">

XML example:

<author>&writer;&copyright;</author>

 

external entities:<!ENTITY entity-name SYSTEM "URI/URL">

DTD Example:

<!ENTITY writer SYSTEM "http://www.w3schools.com/entities.dtd">

<!ENTITY copyright SYSTEM "http://www.w3schools.com/entities.dtd">

XML example:

<author>&writer;&copyright;</author>

 
  • A General XML Validator - Errors in XML documents will stop your XML program. 

To help you check your xml files, you can syntax-check any XML file here.

 

Semantic Web

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

XML-->DTD&Schema Notes的更多相关文章

  1. JavaScripts学习日记——XML DTD Schema

    今日关键词: XML DTD Schema 1.XML 1 XML的概述 1.1 什么是XML XML全称为Extensible Markup Language,意思是可扩展的标记语言.XML语法上和 ...

  2. XML Dtd Schema

    在XML技术里,可以编写一个文档来约束一个XML文档的书写规范,这称之为XML约束. 整体比较: XML Schema符合XML语法结构. DOM.SAX等XML API很容易解析出XML Schem ...

  3. xml语法、DTD约束xml、Schema约束xml、DOM解析xml

    今日大纲 1.什么是xml.xml的作用 2.xml的语法 3.DTD约束xml 4.Schema约束xml 5.DOM解析xml 1.什么是xml.xml的作用 1.1.xml介绍 在前面学习的ht ...

  4. XML概念定义以及如何定义xml文件编写约束条件java解析xml DTD XML Schema JAXP java xml解析 dom4j 解析 xpath dom sax

    本文主要涉及:xml概念描述,xml的约束文件,dtd,xsd文件的定义使用,如何在xml中引用xsd文件,如何使用java解析xml,解析xml方式dom sax,dom4j解析xml文件 XML来 ...

  5. XML DTD跟SCHEMA约束 语法了解

    dtd语法 元素: <!Element 元素名称 数据类型|包含内容> 数据类型: #PCDATA:普通文本 使用的时候一般用()引起来 包含内容: 该元素下可以出现哪些元素, 用()引起 ...

  6. Eclipse引入自定义XML约束文件(DTD,SCHEMA)问题

    Eclipse引入自定义XML约束文件(DTD,SCHEMA)问题 1:说明 使用Eclipse 编写xml文件的约束文件的,包括DTD约束文件,Schema约束文件的时候, 我们也需要接受eclip ...

  7. XML约束——Schema约束

    XML Schema 也是一种用于定义和描述 XML 文档结构与内容的模式语言,其出现是为了克服 DTD 的局限性 XML Schema VS DTD: •XML Schema符合XML语法结构. • ...

  8. XML的Schema约束

    XSD文档至少要包含:schema根元素和XML模式命名空间的定义.元素定义.需要注意的是XSD中必须定义一个且只能定义一个schema根元素,根元素中包括模式的约束,XML模式命名空间的定义,其他命 ...

  9. XML和Schema

    2017-11-03 19:33:56 XML:Extensible Markup Language,也就是可扩展标记语言.XML工具使处理和转化信息变得十分容易和方便. XML和HTML格式是古老的 ...

  10. XML——DTD

    body, table{font-family: 微软雅黑; font-size: 10pt} table{border-collapse: collapse; border: solid gray; ...

随机推荐

  1. [WPF系列]-数据邦定之DataTemplate 使用 DataTrigger 来应用属性值

    使用 DataTrigger 来应用属性值 当前表示不会告诉我们某个 Task 是家庭任务还是办公室任务.记住 Task 对象拥有类型为 TaskType 的 TaskType 属性,该类型是一个枚举 ...

  2. The Engine Document of JustWeEngine

    JustWeEngine - Android FrameWork An easy open source Android Native Game FrameWork. Github Game core ...

  3. cf之路,1,Codeforces Round #345 (Div. 2)

     cf之路,1,Codeforces Round #345 (Div. 2) ps:昨天第一次参加cf比赛,比赛之前为了熟悉下cf比赛题目的难度.所以做了round#345连试试水的深浅.....   ...

  4. Session跟SessionFactory的线程安全与非安全

    SessionFactory负责创建session,SessionFactory是线程安全的,多个并发线程可以同时访问一个 SessionFactory 并从中获取Session实例. (Sessio ...

  5. Java8并发教程:Threads和Executors

    来之:ImportNew 欢迎阅读我的Java8并发教程的第一部分.这份指南将会以简单易懂的代码示例来教给你如何在Java8中进行并发编程.这是一系列教程中的第一部分.在接下来的15分钟,你将会学会如 ...

  6. C#.NET 大型通用信息化系统集成快速开发平台 4.1 版本 - .NET商业化成品成熟各种数据权限的需求对应例子代码

    还是我上次提出的那个问题问题:假设一个订单表,1.角色A可以看自己的2.角色B可以看工作组的3.角色C可以看金额是1000元以下的(自定义条件是否可行?如果可以,请详细说明)4.角色D可以看整个部门的 ...

  7. Windows 8 下离线安装。net Framework 3.5

    Windows 8 下安装.net Framework 3.5 1)可以将直接双击ISO (或放入光盘/U盘)(安装文件在F盘) 2)使用管理员权限运行命令行程序 3)dism.exe /online ...

  8. MySQL数据库备份--mysqldump用法

    导出要用到MySQL的mysqldump工具,基本用法是:   shell> mysqldump [OPTIONS] database [tables] 如果你不给定任何表,整个数据库将被导出. ...

  9. I2C 基础原理详解

    今天来学习下I2C通信~ I2C(Inter-Intergrated Circuit)指的是 IC(Intergrated Circuit)之间的(Inter) 通信方式.如上图所以有很多的周边设备都 ...

  10. CREATE TABLE 表名 AS SELECT 语句

    1.新表不存在复制表结构即数据到新表 ? 1 2 create table new_table select * from old_talbe; 这种方法会将old_table中所有的内容都拷贝过来, ...