learnxinyminutes-docs/xml.html.markdown

130 lines
3.5 KiB
Markdown
Raw Normal View History

2014-03-14 20:37:48 +04:00
---
language: xml
filename: learnxml.xml
contributors:
- ["João Farias", "https://github.com/JoaoGFarias"]
- ["Rachel Stiyer", "https://github.com/rstiyer"]
2014-03-14 20:37:48 +04:00
---
XML is a markup language designed to store and transport data.
Unlike HTML, XML does not specify how to display or to format data, it just carries it.
* XML Syntax
2014-05-12 03:26:22 +04:00
```xml
<!-- Comments in XML are like this -->
<?xml version="1.0" encoding="UTF-8"?>
<bookstore>
<book category="COOKING">
<title lang="en">Everyday Italian</title>
<author>Giada De Laurentiis</author>
<year>2005</year>
<price>30.00</price>
</book>
<book category="CHILDREN">
<title lang="en">Harry Potter</title>
<author>J K. Rowling</author>
<year>2005</year>
<price>29.99</price>
</book>
<book category="WEB">
<title lang="en">Learning XML</title>
<author>Erik T. Ray</author>
<year>2003</year>
<price>39.95</price>
</book>
</bookstore>
<!-- Above is a typical XML file.
2014-06-01 01:13:23 +04:00
It starts with a declaration, informing some metadata (optional).
2015-10-08 06:11:24 +03:00
XML uses a tree structure. Above, the root node is 'bookstore', which has
2015-10-09 18:05:21 +03:00
three child nodes, all 'books'. Those nodes have more child nodes (or
children), and so on...
2015-10-08 06:11:24 +03:00
2015-10-09 18:05:21 +03:00
Nodes are created using open/close tags, and children are just nodes between
the open and close tags.-->
2015-10-09 18:05:21 +03:00
<!-- XML carries two kinds of data:
1 - Attributes -> That's metadata about a node.
Usually, the XML parser uses this information to store the data properly.
It is characterized by appearing with the format name="value" within the opening
tag.
2 - Elements -> That's pure data.
2014-06-01 01:13:23 +04:00
That's what the parser will retrieve from the XML file.
Elements appear between the open and close tags. -->
2015-10-08 06:11:24 +03:00
<!-- Below, an element with two attributes -->
<file type="gif" id="4293">computer.gif</file>
```
* Well-Formated Document x Validation
An XML document is well-formatted if it is syntactically correct.
2014-06-01 01:13:23 +04:00
However, it is possible to inject more constraints in the document,
using document definitions, such as DTD and XML Schema.
An XML document which follows a document definition is called valid,
in regards to that document.
With this tool, you can check the XML data outside the application logic.
2014-05-12 03:26:22 +04:00
```xml
2015-10-08 06:11:24 +03:00
<!-- Below, you can see an simplified version of bookstore document,
with the addition of DTD definition.-->
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE note SYSTEM "Bookstore.dtd">
<bookstore>
<book category="COOKING">
<title>Everyday Italian</title>
<price>30.00</price>
</book>
</bookstore>
<!-- This DTD could be something like:-->
<!DOCTYPE note
[
<!ELEMENT bookstore (book+)>
<!ELEMENT book (title,price)>
<!ATTLIST book category CDATA "Literature">
<!ELEMENT title (#PCDATA)>
<!ELEMENT price (#PCDATA)>
]>
<!-- The DTD starts with a declaration.
Following, the root node is declared, requiring 1 or more child nodes 'book'.
2014-06-01 01:13:23 +04:00
Each 'book' should contain exactly one 'title' and 'price' and an attribute
called 'category', with "Literature" as its default value.
The 'title' and 'price' nodes contain a parsed character data.-->
<!-- The DTD could be declared inside the XML file itself.-->
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE note
[
<!ELEMENT bookstore (book+)>
<!ELEMENT book (title,price)>
<!ATTLIST book category CDATA "Literature">
<!ELEMENT title (#PCDATA)>
<!ELEMENT price (#PCDATA)>
]>
<bookstore>
<book category="COOKING">
<title>Everyday Italian</title>
<price>30.00</price>
</book>
</bookstore>
2015-03-12 03:38:42 +03:00
```