Specific Node-Type Interfaces

Name	Type	Read-only	2.0	3.0
Attributes
doctype	DocumentType
documentElement	Element
documentURI	DOMString
domConfig	DOMConfiguration
implementation	DOMImplementation
inputEncoding	DOMString
strictErrorChecking	boolean
xmlEncoding	DOMString
xmlStandalone	boolean
xmlVersion	DOMString
Methods
adoptNode	Node
createAttribute	Attr
createAttributeNS	Attr
createCDATASection	CDATASection
createComment	Comment
createDocumentFragment	DocumentFragment
createElement	Element
createElementNS	Element
createEntityReference	EntityReference
createProcessingInstruction	ProcessingInstruction
createTextNode	Text
getElementById	Element
getElementsByTagName	NodeList
getElementsByTagNameNS	NodeList
importNode	Node
normalizeDocument	void
renameNode	Node

The various create...( ) methods are important for applications that wish to modify the structure of a document that was previously parsed. Note that nodes created using one Document instance may only be inserted into the document tree belonging to the Document that created them. DOM Level 2 provided a new importNode( ) method that allows a node, and possibly its children, to be essentially copied from one document to another. DOM Level 3 introduced the adoptNode( ) method that actually moves an entire node subtree from one document to another.

Besides the various node-creation methods, some methods can locate specific XML elements or lists of elements. The methods getElementsByTagName( ) and getElementsByTagNameNS() return a list of all XML elements with the name, and possibly namespace, specified. The getElementById( ) method returns the single element with the given ID attribute.

DOM Level 3 also introduced several attributes that are useful when an application wishes to reconstruct an XML document to its original, pre-parsing format. The inputEncoding, xmlEncoding, and xmlStandalone attributes preserve information about the values of the XML declaration from the original document as well as the character encoding of the document before it was parsed (and converted to Unicode).

One of the major additions to DOM in Level 3 was the inclusion of document validation support within the DOM tree itself. The normalizeDocument( ) method provides the developer with a mechanism for essentially "re-parsing" the XML document from the DOM tree in memory. Various parameters available through the domConfig attribute control how this normalization will occur. It is also possible to change the target version of XML by modifying the xmlVersion attribute before normalization. This will cause the DOM to enforce the XML name construction rules associated with the selected XML version. See Chapter 21 for more information about the differences between XML Versions 1.0 and 1.1.

DocumentFragment

Applications that allow real-time editing of XML documents sometimes need to temporarily park document nodes outside the hierarchy of the parsed document. A visual editor that wants to provide clipboard functionality is one example. When the time comes to implement the cut function, it is possible to move the cut nodes temporarily to a DocumentFragment node without deleting them, rather than having to leave them in place within the live document. Then, when they need to be pasted back into the document, they can be reinserted using a method such as Node.appendChild( ) . The DocumentFragment interface, derived from Node, has no interface-specific attributes or methods.

Element

Element nodes are the most frequently encountered node type in a typical XML document. These nodes are parents for the Text, Comment, EntityReference, ProcessingInstruction, CDATASection, and child Element nodes that comprise the document's body. They also allow access to the Attr objects that contain the element's attributes. Table 19-11 shows all attributes and methods supported by the Element interface.

Table 19-11. The Element interface, derived from Node

Name	Type	Read-only	2.0	3.0
Attributes
schemaTypeInfo	TypeInfo
tagName	DOMString
Methods
getAttribute	DOMString
getAttributeNode	Attr
getAttributeNodeNS	Attr
getAttributeNS	DOMString
getElementsByTagName	NodeList
getElementsByTagNameNS	NodeList
hasAttribute	boolean
hasAttributeNS	boolean
removeAttribute	void
removeAttributeNode	Attr
removeAttributeNS	Attr
setAttribute	void
setAttributeNode	Attr
setAttributeNodeNS	Attr
setAttributeNS	Attr
setIdAttribute	void
setIdAttributeNode	void
setIdAttributeNS	void

Attr

Since XML attributes may contain either text values or entity references, the DOM stores element attribute values as Node subtrees. The following XML fragment shows an element with two attributes:

<!ENTITY bookcase_pic SYSTEM "bookcase.gif" NDATA gif>
<!ELEMENT picture EMPTY>
<!ATTLIST picture
   src ENTITY #REQUIRED
   alt CDATA #IMPLIED>
. . .
<picture src="bookcase_pic" alt="3/4 view of bookcase"/>

The first attribute contains a reference to an unparsed entity; the second contains a simple string. Since the DOM framework stores element attributes as instances of the Attr interface, a few parsers make the contents of attributes available as actual subtrees of Node objects. In this example, the src attribute would contain an EntityReference object instance. Note that the nodeValue of the Attr node gives the flattened text value from the Attr node's children. Table 19-12 shows the attributes and methods supported by the Attr interface.

Table 19-12. The Attr interface, derived from Node

Name	Type	Read-only	2.0	3.0
Attributes
specified	boolean
isId	boolean
name	DOMString
value	DOMString
ownerElement	Element
schemaTypeInfo	TypeInfo

Besides the attribute name and value, the Attr interface exposes the specified flag that indicates whether this particular attribute instance was included explicitly in the XML document or inherited from the !ATTLIST declaration of the DTD. There is also a back pointer to the Element node that owns this attribute object.

CharacterData

Several types of data within a DOM node tree represent blocks of character data that do not include markup. CharacterData is an abstract interface that supports common text-manipulation methods, which are used by the concrete interfaces Comment, Text, and CDATASection. Table 19-13 shows the attributes and methods supported by the CharacterData interface.

Table 19-13. The CharacterData interface, derived from Node

Name	Type	Read-only	DOM 2.0
Attributes
data	DOMString
length	unsigned long
Methods
appendData	void
deleteData	void
insertData	void
replaceData	void

Table 19-14. The Text interface, derived from CharacterData

Name	Type	Read-only	2.0	3.0
Attributes
isElementContentWhitespace	boolean
wholeText	DOMString
Methods
replaceWholeText	Text
splitText	Text

The splitText method provides a way to split a single Text node into two nodes at a given point. This split would be useful if an editing application wished to insert additional markup nodes into an existing island of character data. After the split, it is possible to insert additional nodes into the resulting gap.

Another useful addition (introduced in Level 3) is the wholeText attribute. This attribute returns all of the text contained in the selected Text node, as well as any adjacent Text nodes, in document order. Prior to Level 3, it was necessary to enumerate all children of a given node and concatenate them manually to get the entire text contained within a node.

CDATASection

CDATA sections provide a simplified way to include characters that would normally be considered markup in an XML document. These sections are stored within a DOM document tree as CDATASection nodes. The CDATASection interface, derived from Text, has no interface-specific attributes or methods.