Java API for XML Processing (JAXP) Tutorial

Chapter 4
Extensible Stylesheet Language Transformations

The Extensible Stylesheet Language Transformations (XSLT) standard defines mechanisms for addressing XML data (XPath) and for specifying transformations on the data in order to convert it into other forms. JAXP includes an interpreting implementation of XSLT.

In this chapter, you will write out a Document Object Model as an XML file, and you will see how to generate a DOM from an arbitrary data file in order to convert it to XML. Finally, you will convert XML data into a different form, learning about the XPath addressing mechanism along the way.

Introducing XSL, XSLT, and XPath
JAXP Transformation Packages
XSLT Sample Programs
How XPath Works
XPath Expressions
XSLT/XPath Data Model
Templates and Contexts
Basic XPath Addressing
Basic XPath Expressions
Combining Index Addresses
Wild Cards
Extended-Path Addressing
XPath Data Types and Operators
String-Value of an Element
XPath Functions
Node-Set Functions
Positional Functions
String Functions
Boolean Functions
Numeric Functions
Conversion Functions
Namespace Functions
Summary
Writing Out a DOM as an XML File
Reading the XML
Creating a Transformer
Running the TransformationApp01 Sample
Writing Out a Subtree of the DOM
Running the TransformationApp02 Sample
Generating XML from an Arbitrary Data Structure
Creating a Simple File
Creating a Simple Parser
Running the AddressBookReader01 Sample
Creating a Parser that Generates SAX Events
Using the Parser as a SAXSource
Running the TransformationApp03 Sample
Transforming XML Data with XSLT
Defining a Simple Document Type
Creating a Test Document
Writing an XSLT Transform
Processing the Basic Structure Elements
Process the <TITLE> Element
Process Headings
Generate a Runtime Message
Writing the Basic Program
Running the Stylizer Sample
Trimming the Whitespace
Running the Stylizer Sample with Trimmed Whitespace
Removing the Last Whitespace
Running the Stylizer Sample with All Whitespace Trimmed
Processing the Remaining Structure Elements
Modify <PARA> Handling
Process <LIST> and <ITEM> Elements
Ordering Templates in a Stylesheet
Process <NOTE> Elements
Running the Stylizer Sample With LIST and NOTE Elements Defined
Process Inline (Content) Elements
Running the Stylizer Sample With Inline Elements Defined
Printing the HTML
What Else Can XSLT Do?
The Trouble with Variables

Introducing XSL, XSLT, and XPath

The Extensible Stylesheet Language (XSL) has three major subcomponents:

XSL-FO

The Formatting Objects standard. By far the largest subcomponent, this standard gives mechanisms for describing font sizes, page layouts, and other aspects of object rendering. This subcomponent is not covered by JAXP, nor is it included in this tutorial.

XSLT

This is the transformation language, which lets you define a transformation from XML into some other format. For example, you might use XSLT to produce HTML or a different XML structure. You could even use it to produce plain text or to put the information in some other document format. (And as you will see in Generating XML from an Arbitrary Data Structure, a clever application can press it into service to manipulate non-XML data as well.)

XPath

At bottom, XSLT is a language that lets you specify what sorts of things to do when a particular element is encountered. But to write a program for different parts of an XML data structure, you need to specify the part of the structure you are talking about at any given time. XPath is that specification language. It is an addressing mechanism that lets you specify a path to an element so that, for example, <article><title> can be distinguished from <person><title>. In that way, you can describe different kinds of translations for the different <title> elements.
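The addressing idea can be tried directly from Java with the javax.xml.xpath API. The following is a minimal sketch, not code from the tutorial: the class name TitlePaths and the sample document are hypothetical, chosen only to show how two paths tell the two title elements apart.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

class TitlePaths {
    // Hypothetical sample document, not taken from the tutorial.
    static final String XML =
        "<library>"
      + "<article><title>XSLT Basics</title></article>"
      + "<person><title>Dr.</title></person>"
      + "</library>";

    // Evaluates an XPath expression against XML and returns the
    // string-value of the first node it selects.
    static String select(String expression) throws Exception {
        XPath xpath = XPathFactory.newInstance().newXPath();
        return xpath.evaluate(expression, new InputSource(new StringReader(XML)));
    }

    public static void main(String[] args) throws Exception {
        System.out.println(select("/library/article/title")); // prints "XSLT Basics"
        System.out.println(select("/library/person/title"));  // prints "Dr."
    }
}
```

Both elements are named title, but the path leading to each one is different, which is what lets a stylesheet treat them differently.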

The remainder of this section describes the packages that make up the JAXP Transformation APIs.

JAXP Transformation Packages

Here is a description of the packages that make up the JAXP Transformation APIs:

javax.xml.transform

This package defines the factory class you use to get a Transformer object. You then configure the transformer with input (source) and output (result) objects, and invoke its transform() method to make the transformation happen. The source and result objects are created using classes from one of the other three packages.

javax.xml.transform.dom

Defines the DOMSource and DOMResult classes, which let you use a DOM as an input to or output from a transformation.

javax.xml.transform.sax

Defines the SAXSource and SAXResult classes, which let you use a SAX event generator as input to a transformation, or deliver SAX events as output to a SAX event processor.

javax.xml.transform.stream

Defines the StreamSource and StreamResult classes, which let you use an I/O stream as an input to or output from a transformation.
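As a rough illustration of how these packages fit together, the sketch below obtains a Transformer from the factory, wraps a DOM in a DOMSource, and writes it to a StreamResult. The class name IdentityTransformSketch and the input string are assumptions made for this example. No stylesheet is supplied, so the transformer simply copies its input to its output.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

class IdentityTransformSketch {
    // Parses the XML text into a DOM, then serializes the DOM back to text.
    static String serialize(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
            .newDocumentBuilder()
            .parse(new InputSource(new StringReader(xml)));

        // javax.xml.transform: the factory hands out a Transformer.
        // Without a stylesheet argument it performs an identity transformation.
        Transformer transformer = TransformerFactory.newInstance().newTransformer();
        transformer.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");

        // javax.xml.transform.dom supplies the source,
        // javax.xml.transform.stream supplies the result.
        StringWriter out = new StringWriter();
        transformer.transform(new DOMSource(doc), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        // Prints the element back as markup.
        System.out.println(serialize("<greeting>hello</greeting>"));
    }
}
```

Swapping DOMSource for a SAXSource or StreamSource, or StreamResult for a DOMResult or SAXResult, changes where the data comes from or goes to without changing the call to transform().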

XSLT Sample Programs

Unlike the sample programs for the other chapters in this tutorial, the sample programs used in this chapter are not included in the install-dir/jaxp-1_4_2-release-date/samples directory provided with the JAXP 1.4.2 Reference Implementation. However, the XSLT samples are available separately as a ZIP file download.

How XPath Works

The XPath specification is the foundation for a variety of specifications, including XSLT and linking/addressing specifications such as XPointer. So an understanding of XPath is fundamental to a lot of advanced XML usage. This section provides an introduction to XPath in the context of XSLT.

XPath Expressions

In general, an XPath expression specifies a pattern that selects a set of XML nodes. XSLT templates then use those patterns when applying transformations. (XPointer, on the other hand, adds mechanisms for defining a point or a range so that XPath expressions can be used for addressing.)

The nodes in an XPath expression refer to more than just elements. They also refer to text and attributes, among other things. In fact, the XPath specification defines an abstract document model that defines seven kinds of nodes:

Root
Element
Text
Attribute
Comment
Processing instruction
Namespace

The root element of the XML data is modeled by an element node. The XPath root node contains the document's root element as well as other information relating to the document.

XSLT/XPath Data Model

Like the Document Object Model (DOM), the XSLT/XPath data model consists of a tree containing a variety of nodes. Under any given element node, there are text nodes, attribute nodes, element nodes, comment nodes, and processing instruction nodes.

In this abstract model, syntactic distinctions disappear, and you are left with a normalized view of the data. In a text node, for example, it makes no difference whether the text was defined in a CDATA section or whether it included entity references. The text node will consist of normalized data, as it exists after all parsing is complete. So the text will contain a < character, whether or not an entity reference such as &lt; or a CDATA section was used to include it. (Similarly, the text will contain an & character, whether it was delivered using &amp; or it was in a CDATA section.)
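The normalization described here can be observed with a small experiment. In this sketch (the class name NormalizedTextSketch and the two input strings are made up for illustration), the same text is supplied once with entity references and once in a CDATA section, and XPath reports the same string-value for both.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

class NormalizedTextSketch {
    // Returns the string-value of the root <p> element as XPath sees it.
    static String textOf(String xml) throws Exception {
        XPath xpath = XPathFactory.newInstance().newXPath();
        return xpath.evaluate("/p", new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) throws Exception {
        String viaEntities = textOf("<p>a &lt; b &amp; c</p>");
        String viaCdata = textOf("<p><![CDATA[a < b & c]]></p>");

        System.out.println(viaEntities);                  // prints "a < b & c"
        System.out.println(viaEntities.equals(viaCdata)); // prints "true"
    }
}
```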

In this section, we will deal mostly with element nodes and text nodes. For the other addressing mechanisms, see the XPath specification.

Templates and Contexts

An XSLT template is a set of formatting instructions that apply to the nodes selected by an XPath expression. In a stylesheet, a template is written as an xsl:template element whose match attribute holds that XPath expression.
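To make the idea concrete, here is a small, hypothetical stylesheet embedded in a Java program. The ITEM and LIST element names, the bracketed output format, and the class name TemplateSketch are assumptions for this sketch only. The template's match attribute carries the XPath pattern, and the template body says what to emit for each matching node.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

class TemplateSketch {
    // A one-template stylesheet: every ITEM element is written as "[text]".
    static final String STYLESHEET =
        "<xsl:stylesheet version='1.0'"
      + " xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
      + "<xsl:output method='text'/>"
      + "<xsl:template match='ITEM'>"
      + "[<xsl:value-of select='.'/>]"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    // Applies the stylesheet to the given XML text and returns the output.
    static String apply(String xml) throws Exception {
        Transformer transformer = TransformerFactory.newInstance()
            .newTransformer(new StreamSource(new StringReader(STYLESHEET)));
        StringWriter out = new StringWriter();
        transformer.transform(
            new StreamSource(new StringReader(xml)), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        // prints "[a][b]"
        System.out.println(apply("<LIST><ITEM>a</ITEM><ITEM>b</ITEM></LIST>"));
    }
}
```

The LIST element has no template of its own here, so the processor's built-in rules simply descend into its children, where the ITEM template takes over.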

Wild card	Meaning
`*`	Matches any element node (not attributes or text).
`node()`	Matches any node of any kind: element node, text node, attribute node, processing instruction node, namespace node, or comment node.
`@*`	Matches any attribute node.
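The three wild cards can be compared by counting what each one selects under the same element. This is a sketch with an invented document and class name (WildCardSketch). Note that attributes are not children in the XPath model, so `node()` used on the default child axis does not return them; they are reached through `@*`.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

class WildCardSketch {
    // Hypothetical document: PARA has two attributes and three child nodes
    // (a text node, a B element, and a comment).
    static final String XML =
        "<PARA type='x' id='p1'>text<B>bold</B><!--note--></PARA>";

    // Counts the nodes selected by the given XPath expression.
    static int count(String expression) throws Exception {
        XPath xpath = XPathFactory.newInstance().newXPath();
        Double result = (Double) xpath.evaluate(
            "count(" + expression + ")",
            new InputSource(new StringReader(XML)),
            XPathConstants.NUMBER);
        return result.intValue();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(count("/PARA/*"));      // prints 1 (the B element)
        System.out.println(count("/PARA/node()")); // prints 3 (text, B, comment)
        System.out.println(count("/PARA/@*"));     // prints 2 (type and id)
    }
}
```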

Operator	Meaning
`|`	Alternative. For example, `PARA|LIST` selects all `PARA` and `LIST` elements.
`or`, `and`	Returns the or/and of two Boolean values.
`=`, `!=`	Equal or not equal, for Booleans, strings, and numbers.
`<`, `>`, `<=`, `>=`	Less than, greater than, less than or equal to, greater than or equal to, for numbers.
`+`, `-`, `*`, `div`, `mod`	Add, subtract, multiply, floating-point divide, and modulus (remainder) operations (e.g., 6 mod 4 = 2).
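A few of these operators can be exercised from Java as follows. The document, the class name OperatorSketch, and the helper method names are assumptions made for this sketch; the expressions themselves use only the operators listed in the table.

```java
import java.io.StringReader;
import javax.xml.namespace.QName;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

class OperatorSketch {
    // Hypothetical document with two PARA elements, one LIST, and one NOTE.
    static final String XML = "<DOC><PARA/><LIST/><PARA/><NOTE/></DOC>";

    // Evaluates an expression against XML, asking for the given result type.
    static Object eval(String expression, QName type) throws Exception {
        return XPathFactory.newInstance().newXPath()
            .evaluate(expression, new InputSource(new StringReader(XML)), type);
    }

    static double number(String expression) throws Exception {
        return (Double) eval(expression, XPathConstants.NUMBER);
    }

    static boolean bool(String expression) throws Exception {
        return (Boolean) eval(expression, XPathConstants.BOOLEAN);
    }

    public static void main(String[] args) throws Exception {
        System.out.println(number("6 mod 4"));              // prints 2.0
        System.out.println(number("7 div 2"));              // prints 3.5
        System.out.println(number("count(//PARA|//LIST)")); // prints 3.0
        System.out.println(
            bool("count(//PARA) >= 2 and count(//NOTE) != 0")); // prints true
    }
}
```

Because `div` is a floating-point divide, `7 div 2` yields 3.5 rather than a truncated integer.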
