Copyright ©2001 W3C® (MIT, INRIA, Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply.
This specification defines the XML Pointer Language (XPointer), the language
to be used as the basis for a fragment identifier for any URI reference that
locates a resource whose Internet media type is one of text/xml
, application/xml
, text/xml-external-parsed-entity
,
or application/xml-external-parsed-entity
.
XPointer, which is based on the XML Path Language (XPath), supports addressing into the internal structures of XML documents. It allows for examination of a hierarchical document structure and choice of its internal parts based on various properties, such as element types, attribute values, character content, and relative position.
This is a W3C Last Call Working Draft for review by W3C members and other interested parties.
The XML Linking Working Group invites comment on this specification, especially on the new xmlns XPointer scheme which was added to the previous Candidate Recommentation version. The Last Call period begins 8 January 2001 and ends 29 January 2001.
It is a draft document and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use W3C Working Drafts as reference material or to cite them as other than "work in progress." Comments on this document should be sent to the public mailing list www-xml-linking-comments@w3.org (archive).
This document is intended to be taken in conjunction with the IETF XML
Media Types Internet Draft [IETF I-D XMT], in order for
that document to officially designate XPointer as the fragment identifier
language for resources whose type is one of text/xml
, application/xml
, text/xml-external-parsed-entity
,
or application/xml-external-parsed-entity
. However, because of the
timing problems associated with publishing two related documents on separate
tracks, currently that document refers to this one only non-normatively. It
is anticipated that the Internet Draft will be updated to make a normative
reference when XPointer reaches the Recommendation stage.
XPointer is affected by a technology patent held by Sun Microsystems. The legal terms and conditions offered by Sun to XPointer implementors can be found in the archives of the public comments list.
For background on this work, please see the XML Activity Statement. For information about the XLink language with which XPointer may be used, see http://www.w3.org/TR/xlink. For information about the requirements that informed development of this specification, see http://www.w3.org/TR/NOTE-xptr-req. A list of current W3C working drafts can be found at http://www.w3.org/TR.
1 Introduction
1.1 Origin and Goals
1.2 Notation and Document Conventions
2 XPointer Terms and Concepts
3 XPointer Processing and Conformance
3.1 Processing Dependencies
3.2 String Conformance
3.3 Application Conformance
3.4 Classes of XPointer Errors
4 XPointer Model and Language
4.1 Character Escaping
4.1.1 Escaping Contexts
4.1.2 URI Reference Encoding and Escaping
4.1.3 Examples of Escaping
4.2 Forms of XPointer
4.2.1 Full XPointers
4.2.2 Bare Names
4.2.3 Child Sequences
4.3 Schemes
5 XPointer Extensions to XPath
5.1 XPointer Additions to XPath Terms and Concepts
5.2 Evaluation Context Initialization
5.2.1 Namespace Initialization
5.3 The point and range Location Types
5.3.1 Definition of Point Location
5.3.2 Definition of Range Location
5.3.3 Covering Ranges for All Location Types
5.3.4 Tests for point and range Locations
5.3.5 Document Order
5.4 XPointer Functions
5.4.1 range-to Function
5.4.2 string-range() Function
5.4.3 Additional Range-Related Functions
5.4.3.1 range Function
5.4.3.2 range-inside Function
5.4.3.3 start-point Function
5.4.3.4 end-point Function
5.4.4 here Function
5.4.5 origin Function
5.5 Root Node Children
A References
A.1 Normative References
A.2 Non-Normative References
B Working Group Members (Non-Normative)
This specification defines the XML Pointer Language (XPointer), the language
to be used as the basis for a fragment identifier for any URI reference that
locates a resource whose Internet media type is one of text/xml
, application/xml
, text/xml-external-parsed-entity
,
or application/xml-external-parsed-entity
[IETF RFC 2376].
This specification does not constrain the syntax or semantics of URI references
to resources of other media types, although it provides extension facilities
that may be used with other types.
XPointer supports addressing into the internal structures of XML documents. It allows for examination of a document's hierarchical structure and choice of its internal parts based on various properties, such as element types, attribute values, character content, and relative position. In particular, it provides for specific reference to elements, character strings, and other parts of XML documents, whether or not they bear an explicit ID attribute.
The structures located with XPointer can be used as link targets or for any other application-specific purpose. This specification does not constrain what uses an application may make of locations identified by XPointers. In particular, implementation of traversal to a resource is not constrained by this specification, and whether user "traversal" is the purpose of an XPointer at all is application-dependent. A formatted-text browser traversal might scroll to and highlight the designated location; a structure-oriented graphical viewer or a document-relationship display might do traversal in quite a different way; and a search application, parser, archival system, or expert agent might use XPointers for other purposes entirely. (The construction of linking elements in XML documents that associate arbitrary resources, including XML documents and portions thereof, is defined in a related specification, [XLink].)
XPointer is built on top of the XML Path Language [XPath], which is an expression language underlying the XSL Transformations (XSLT) language. XPointer's extensions to XPath allow it to:
Address points and ranges as well as whole nodes
Locate information by string matching
Use addressing expressions in URI references as fragment identifiers (after suitable escaping)
Addressing into the internal structures of DTDs and the XML declaration is outside the scope of this specification.
Note:
XPointer is intended to be the basis of fragment identifiers only for resources
whose media type is one of text/xml
, application/xml
, text/xml-external-parsed-entity
,
or application/xml-external-parsed-entity
. It is expected that many applications of
XML will define their own media types in order to indicate the particular
usage of XML they define. (Note that a recent Internet Draft [IETF I-D XMT]
suggests the use of a naming convention for specialized media types based
on XML.) XPointer is expected to be useful as a fragment identifier language
for the generic XML aspects of those media types. Schemes (discussed in 4.3 Schemes) can be used to provide specialized addressing into the content
of resources of those media types.
In addition to XPath, the following standards have been especially influential in the development of this specification:
HTML [HTML]: This system popularized an important location specifier type, the URL (now URI).
HyTime [ISO/IEC 10744]: This ISO standard defines location specifier types for all kinds of data.
Text Encoding Initiative Guidelines [TEI]: This application provides a formal syntax for "extended pointers," locators for structured markup that underlie the initial design for XPointer.
The addressing components of many other hypertext systems have also informed the design of XPointer, especially [Dexter], [OHS], [FRESS], [MicroCosm], and [Intermedia].
See the XPointer Requirements Document [XPREQ] for a thorough explanation of requirements for the design of XPointer.
The formal grammar for XPointer is given using a simple Extended Backus-Naur
Form (EBNF) notation, as described in the XML Recommendation [XML].
The circumflex (^
) metacharacter used in this notation (to denote
the complement of a set of characters) is not to be confused with the circumflex
used to escape parenthesis characters in an XPointer.
The prototypes for XPointer functions are given using the same notation used in the [XPath] Recommendation.
This specification contains some explanatory text on the XPath language; however, such text is non-normative. In the case of conflicts, [XPath] is the normative version.
Some special terms are defined here in order to clarify their relationship to similar terms used in the technologies on which XPointer is based. Additional XPointer-specific terms are defined in the flow of the text. Refer to [XPath], [DOM2], [XIS], and [IETF RFC 2396] for definitions of other technical terms used in this specification.
A position in XML information. This notion comes from the DOM Level 2 [DOM2] specification's notion of positions; this specification refers to DOM positions by the term "point" to avoid confusion with XPath positions.
An identification of all the XML information between a pair of end points. This notion comes from the DOM Level 2 [DOM2] specification.
A generalization of XPath's node that includes points and range in addition to nodes.
An ordered list of locations, such as produced by an XPointer expression. This corresponds to the node-set that is produced by XPath expressions, except for the generalization to include points and ranges.
A location-set that consists of a single location. A point is always a
singleton. A range is always a singleton as well, even if a different XPointer
could have produced multiple individual components encompassed by the range.
For example, for a list
element with ID "list37" that
contains three item
element children, the following XPointer produces
a location-set consisting of three locations (the item
children in
that list
):
xpointer(id('list37')/item) |
By contrast, the following XPointer returns a singleton location-set:
xpointer(id('list37')/item[1]/range-to(following-sibling::item[2])) |
The portion of an XML resource that is identified by an XPointer. For example, the whole resource being referred to is an XML document, but a sub-resource might be a particular element inside the document. Following a link to such a sub-resource might result, for example, in highlighting that element or scrolling to that point in the document.
This section details processing and conformance requirements on XPointers, the fragment identifiers that are based on them, and the applications that create and process them.
[Definition: The key words must, must not, required, shall, shall not, should, should not, recommended, may, and optional in this specification are to be interpreted as described in [IETF RFC 2119].]
XPointer processing depends on [IETF RFC 2396] (as updated by [IETF RFC 2732]) processing. Fully conformant XPointer processing depends on [XPath] processing.
A string conforms to the XPointer specification if it adheres to the syntactic requirements and validity constraints imposed by this specification (including [XPath], whose requirements are imposed by reference). This specification does not require that the string, in association with a URI, actually point to a resource or sub-resource that exists at any given moment.
This specification defines two levels of conformance:
[Definition: Minimal conformance entails handling series of XPointer parts, routing XPointer parts with particular schemes to an application that can evaluate those parts, and skipping over schemes that are not understood], as defined in 4.3 Schemes, as well as interpretation of bare-name XPointers, as prescribed in 4.2.2 Bare Names.
[Definition: Full conformance
entails (in addition to the items comprising minimal conformance) interpretation
of child-sequence XPointers and full-form XPointer parts that use the xpointer
and xmlns
schemes, as prescribed by this specification.]
For applications that recognize and interpret XPointers in URI references
that locate resources whose Internet media type is one of text/xml
, application/xml
, text/xml-external-parsed-entity
,
or application/xml-external-parsed-entity
, full conformance
is required. The minimal conformance level is defined for the convenience
of XML-based Internet media types [IETF I-D XMT] that
define fragment identifier languages based on XPointer.
The use of XPointers can give rise to several kinds of errors:
[Definition: A string that does not match the syntax specified in this document has a syntax error, and applications must not attempt to evaluate it as an XPointer.]
[Definition: If a syntactically correct XPointer, suitably escaped, is appended to a URI that identifies no resource, or a resource that is not well-formed XML, the XPointer has a resource error.]
[Definition: If a syntactically correct XPointer, suitably escaped, fails as discussed in 4.3 Schemes, the XPointer has a sub-resource error.] Note that XPath allows expressions that return empty node-sets as their results and does not regard this situation as an error. Because the XPointer language is intended as a specification of document locations rather than a broader query language, an empty result is an error.
This specification does not constrain how applications deal with resource errors and sub-resource errors.
XPath expressions work with a data set that is derived from the elements and other markup constructs of an XML document. The XPointer model augments this data set. Both XPointers and XPath expressions operate by selecting portions of such data sets, often by their structural relationship to other parts (for example, the parent of a node with a certain ID value). XPointers use iterative selections, each operating on what is found by the prior one.
Selection of portions of the information hierarchy is done through axes, predicates, and functions. An axis defines a sequence of candidates that might be located; predicates then test for various criteria relative to such portions; and functions generate new candidates or perform various other tasks. For example, one can select certain elements from among the siblings of some previously located element, based on whether those sibling elements have an attribute with a certain value, or are of a certain type such as "footnote"; or select the point location immediately preceding a certain "para".
The XPointer language is designed to be used in the context of URI references, which require encoding and escaping of certain characters. XPointers (in URI references) also often appear in XML documents, which impose some escaping requirements of their own when the document encoding limits the repertoire that can be used directly. Finally, because some characters are significant to XPointer processing, escaping is needed to use these characters in their ordinary sense. The following sections describe the escaping necessary for these contexts.
The following contexts require various types of escaping to be applied to XPointers.
When an XPointer is created that addresses into an XML resource, it might
contain characters that are significant in the XPointer syntax itself. Parentheses
are significant to the XPointer language, so unbalanced parentheses must be escaped. Because the circumflex (^
)
is used to escape such parentheses, normal circumflexes must
be escaped. Thus, occurrences of the circumflex are escaped with a second
circumflex (^^
), and any occurrences of unbalanced parentheses
are escaped with a circumflex, as described in 4.2 Forms of XPointer.
[Definition: An internationalized
URI reference, or IURI [IURI], is a URI reference that
directly uses Unicode characters [Unicode].] IURI references
allow a superset of the characters of fully escaped URI references, but both must have normal occurrences of the percent sign (%
)
escaped because it is the character used for escaping in URIs and IURIs. Thus,
when an XPointer is inserted into a URI reference or IURI reference, any occurrences
of percent signs (%
) are replaced with %25
.
If an XPointer appears in a URI reference or IURI reference in an XML document, any characters not expressible in the encoding used for the XML document must be escaped as character references, and any characters that are significant to XML processing must be escaped as character references or as predefined entity references. This escaping is removed when the XML document is parsed.
When an IURI reference containing an XPointer (perhaps originating in an XML document) is ultimately converted to a fully escaped URI reference that is suitable for resolution, any remaining characters not allowed in URI references must be escaped using the process described in 4.1.2 URI Reference Encoding and Escaping. This process can safely be applied repeatedly to fully escaped URI references in XML documents.
Square brackets ([
and ]
) should
be replaced with %5B
and %5D
respectively, for compatibility
with applications that have not implemented the [IETF RFC 2732] update
to [IETF RFC 2396].
When resolving an XPointer, the encodings and escapings described in these sections must be undone in the reverse order. If the result does not conform to the syntactic rules for XPointers in this specification, a syntax error results.
The set of characters for XPointers is necessarily the same as for XML, namely [Unicode]. However, some Unicode characters are disallowed from URI references. Thus, the disallowed characters in XPointer-containing URI references must ultimately be encoded and escaped in order for URI resolver software to be able to handle them.
The disallowed characters include all non-ASCII characters, plus the excluded characters listed in Section 2.4 of [IETF RFC 2396], except for the number sign (#) and percent sign (%) and the square bracket characters re-allowed in [IETF RFC 2732]. Disallowed characters are escaped as follows:
Each disallowed character is converted to UTF-8 [IETF RFC 2279] as one or more bytes.
Any bytes corresponding to a disallowed character are escaped with
the URI escaping mechanism (that is, converted to %
HH,
where HH is the hexadecimal notation of the byte value).
The original character is replaced by the resulting character sequence.
The following table shows the progressive escaping of an XPointer containing circumflexes, double quotation marks, and spaces that is intended to appear in an XML document.
Context | Notation | |
---|---|---|
Initial | The desired XPointer as it was
initially created:
|
|
A. XPointer | With the circumflex escaped, as required
by this specification:
|
|
B. IURI reference | Same as A (no percent sign found that
needs escaping):
|
|
C. XML document | Double quotation marks escaped using
XML's predefined " entity reference (assuming that the
XPointer appears in an attribute value delimited by double quotation marks):
| |
D. URI reference | With occurrences of the double
quotatation marks (%22 ), spaces (%20 ), and circumflexes
(%5E ) escaped:
|
The following table shows the progressive escaping of an XPointer containing
accented characters. The XPointer is intended to appear in an XML document
encoded in US-ASCII
, which does not allow the letter "é"
to appear directly.
Context | Notation | |
---|---|---|
Initial | The desired XPointer as it was
initially created:
| |
A. XPointer | Same (no circumflex or unbalanced parenthesis found that needs escaping):
|
|
B. IURI reference | Same as A (no percent sign found that
needs escaping):
| |
C. XML document | Represented in the US-ASCII encoding; accented
letters are escaped with XML character references:
|
|
D. URI reference | With occurrences of the letter "é"
(%C3%A9 ) escaped:
|
This specification defines one full form of XPointer addressing and two shorthand forms. The shorthand forms are defined as abbreviations of the full form, although applications need not convert an abbreviated form into the full form before evaluation. All error conditions apply as if the full form of the XPointer were specified.
The full form is described in 4.2.1 Full XPointers. The two short forms are bare names (Name, described in 4.2.2 Bare Names) and child sequences (ChildSeq, described in 4.2.3 Child Sequences). Thus, XPointer offers the following overall options.
[1] | XPointer | ::= | Name |
| ChildSeq | |||
| FullXPtr |
The internal structure of an XPointer is as follows. Any XPointer whose evaluation returns anything other than a non-empty location-set must signal a sub-resource error.
[2] | ChildSeq | ::= | Name? ('/' [1-9] [0-9]*
)+ | |
[3] | FullXPtr | ::= | XPtrPart (S? XPtrPart)* | |
[4] | XPtrPart | ::= | 'xpointer' '(' XPtrExpr
')' | |
| 'xmlns' '(' XPtrNsDecl? ')' | ||||
| Scheme '(' SchemeSpecificExpr
')' | ||||
[5] | Scheme | ::= | NCName | |
[6] | SchemeSpecificExpr | ::= | StringWithBalancedParens | |
[7] | StringWithBalancedParens | ::= | [^()]* ('(' StringWithBalancedParens
')' [^()]*)* | [VC: Parenthesis escaping] |
[8] | XPtrExpr | ::= | Expr | [VC: Parenthesis escaping] |
[9] | XPtrNsDecl | ::= | NCName S? '=' S? XPtrNsURI | |
[10] | XPtrNsURI | ::= | Char* | [VC: Parenthesis escaping] |
[VC: Namespace Name] |
Validity constraint: Parenthesis escaping
The end of an
XPointer part is signaled by the right parenthesis ")
"
character that is balanced with the left parenthesis "(
"
character that began the part. Although the production disallows unbalanced
parenthesis characters, they may occur. However,
if one does occur, even within literals, it must
be escaped with a circumflex (^
) character preceding it. If the
expression contains any literal occurrences of the circumflex, each must be escaped with an additional circumflex (that
is, ^^
). Any other use of a circumflex results in a syntax error.
Validity constraint: Namespace Name
The value of an XPtrNsURI must be a URI reference suitable for a namespace name.
Name, S, and Char are as defined in the XML Recommendation [XML]; NCName is as defined in the Namespaces in XML Recommendation [XML-Names]; and Expr is as defined in the XPath Recommendation [XPath], with the XPointer extensions defined in this specification.
The full form of addressing consists of one or more [Definition: XPointer parts; each starts with a scheme
name and is followed by a parenthesized expression, and multiple parts are
optionally separated by whitespace.] When the scheme is xpointer
(see 4.3 Schemes for more information), the associated expression
provides access to nodes in an XML document's data set (except for nodes representing
CDATA sections and entities) and access to arbitrary non-node locations.
A bare name stands for the same name provided as the argument of a location
step using the id()
function. For example, the first XPointer
below is the bare-name equivalent of the second, which uses the full XPointer
form:
intro xpointer(id("intro")) |
The bare-name shorthand is provided for two reasons:
To encourage use of IDs, which are the form of addressing most likely to survive document change. (See [XPREQ] and [RLocs] for more information on creating robust addresses into resources.)
To provide an analog of the HTML fragment identifier behavior for resources with XML media types that use a markup language similar to that of HTML. (Note that the XHTML specification [XHTML] defines some guidelines for anchors that help the bare-name shorthand to work in concert with the HTML fragment identifier in cases of content negotiation.)
A child sequence locates an element by stepwise navigation using a sequence
of integers separated by slashes (/). Each integer n locates the nth
child element of the previously located element, equivalent to an XPath location
step of the form *[
n]
. The first integer
in the sequence refers to either the document element, in which case the integer
will be 1, or one of the top-level elements in an external parsed entity.
A child sequence can optionally begin with a name before the integers;
the name locates an element as described for bare names in 4.2.2 Bare Names.
For example, the following two XPointers would locate the same XML information,
assuming intro
is the ID attribute value of the element that
is the fifth child of the second child of the document element:
intro/14/3 /1/2/5/14/3 |
Each XPtrPart begins with a Scheme
that identifies the particular notation used for that XPtrPart.
This specification defines only two schemes, xpointer
and xmlns
,
and reserves all others when the media type of the referenced resource is one of text/xml
, application/xml
, text/xml-external-parsed-entity
,
or application/xml-external-parsed-entity
.
However, the scheme mechanism provides a general framework for extensibility
that can be used for future versions of XPointer, or for other media types
that wish to adopt all or part of this specification in defining their own
fragment identifier languages. It is recommended
that if XML-based media types governed by [IETF I-D XMT]
define any XPointer-style scheme names, the names beginning with "xpointer"
and "xmlns" be avoided until they can be formally reserved.
For example, an XML-based vocabulary that is registered to have the image/4Dgrafix+xml
media type might choose to adopt the XPointer scheme mechanism and define
its own 4Dgrafix-xml
scheme instead of, or in addition to, the xpointer
scheme. It is particularly useful for XML-based media types to incorporate
XPointer by reference so that, in the case of content negotiation, a client
that does not support the specific XML-based media type can fall back on the xpointer
scheme if one is provided. The provision for full
conformance is defined in 3.3 Application Conformance to this
end. Alternatively, XML-based media types can incorporate XPointer's notion
of minimal conformance so that the
scheme framework can be used, even without interpretation of the xpointer
or xmlns
scheme.
Minimal conformance requires the following handling. When multiple XPtrParts are provided, they must be evaluated in left-to-right order. If a part being evaluated fails for one of the following reasons, the next is evaluated:
An unknown scheme
A scheme that is not applicable to the media type of the resource
A scheme that does not locate any sub-resource present in the resource
Note that an XPointer part that uses xmlns
scheme never returns
a sub-resource and thus always fails. However, its evaluation has a potential
effect on XPointer parts to its right; see 5.2.1 Namespace Initialization for
more information.
If the scheme being interpreted is xpointer
:
The string argument in a string-range
function is not found in the string-value of the location, or the third or
fourth argument of the function indicates a string that is beyond the beginning
or end of the document
The point returned by the start-point
function
is of type attribute or namespace
The XPointer application must consume a failed XPointer part and attempt to evaluate the next one, if any. The result of the first XPointer part whose evaluation succeeds is taken to be the sub-resource located by the XPointer as a whole. If all the parts fail, the result for the XPointer as a whole has a sub-resource error. If a syntax error is detected in any part or in the construction of the XPointer as a whole, evaluation stops and additional parts are not consumed.
For example, the following XPointer locates the element with an ID attribute whose value is "chap1":
xpointer(id("chap1")) |
However, in the absence of the DTD, it is not typically possible to determine
if an attribute is of type ID. An XPointer like the following, with
two parts, will have its first part fail if no DTD is available, but will
have its second part succeed if the desired attribute's name
is id
; note that the intent of the second XPointer part is just
an approximation of the first.
xpointer(id("chap1"))xpointer(//*[@id="chap1"]) |
XPointer extends XPath by adding the following:
A generalization of the XPath concepts of nodes, node types, and node-sets to the XPointer concepts of locations, location types, and location-sets, which subsume nodes, points, and ranges.
Two new location types, point
and range
,
corresponding to DOM positions and ranges, that can appear in location-set
results; also tests (akin to node tests) for these location types.
Rules for establishing the XPath evaluation context.
The functions string-range
and range-to
,
which return the range location type for selections that are not single XML
nodes.
The functions here
and origin
,
to provide for addressing relative to the location of an XPointer expression
itself, and to the point of origin for hypertext traversal when XPointers
are used in that (very common) application domain.
The functions start-point
and end-point
,
to address the beginning and ending locations which bound another location
such as a node or range.
Like [XSLT], XPointer allows the root node to have multiple child elements, to allow XPointers to address into arbitrary external parsed entities as well as well-formed documents.
XPath provides for locating any subset of the nodes in an XML document. XPath functionality, such as filtering an axis output by predicate, is generally defined in terms of operations on nodes and node-sets.
XPointer has a requirement to identify document portions that are not nodes in this sense. One example of such a non-node region is an arbitrary user selection indicated by a drag between two points in a document. The two points might have different sets of ancestors in the hierarchy, or the region might form only a small part of a node. For example, a range could be a single letter or could extend from the middle of one paragraph to the middle of the next, thus containing only part of the relevant paragraphs and text nodes. Even though such locations are not nodes, XPointer needs to be able to apply XPath operations to them as well as to nodes.
To accomplish this, XPointer defines location as a generalization of XPath's node. Every location is either a point, a range, or an XPath node. Thus, XPointer also defines location-set as a generalization of XPath's node-set. All locations generated by XPath constructs are nodes; XPointer constructs can also generate points and ranges.
Note:
The order of characters displayed on a computer screen might not reflect their order in the underlying XML document, for example, when a portion of a right-to-left language such as Arabic is embedded in a left-to-right language such as French. For XPointers that identify ranges of strings, the document order is used, not the display order. Thus, an XPointer for a single range might be displayed non-contiguously, and conversely a user selection of an apparent single range might correspond to multiple non-contiguous XPointer ranges in the underlying document.
An XPointer is evaluated to yield an object of type location-set. This evaluation is carried out within a context identical to the XPath evaluation context, except for the generalization of nodes to locations. XPointer applications must initialize the evaluation context as described in this section before evaluating an XPtrExpr.
An XPointer evaluation context contains the following information:
A location (the context location), initialized to the root node of an XML document. When the XPointer is a fragment identifier of a URI reference, the document is the one identified by the URI portion. If an application uses XPointers in another context than as a URI reference's fragment identifier, this specification does not constrain how the applicable document is chosen.
A non-zero context position, initialized to 1.
A non-zero context size, initialized to 1. (At the start, the only location in the current location list is the context location.)
A set of variable bindings. No means for initializing these is defined for XPointer applications. Thus, the set of variable bindings used when evaluating an XPointer is empty, and use of a variable reference in an XPointer results in a syntax error.
A library of functions. Only functions defined in XPath or XPointer can be used in XPointers. An XPointer that uses other functions results in a syntax error.
A set of namespace declarations, described in 5.2.1 Namespace Initialization.
For any XPointer part that uses the xpointer
scheme, the evaluation
context of that part must be initialized
to a set of namespace declarations consisting of a declaration of the xml
prefix, bound to the URI http://www.w3.org/XML/1998/namespace
,
plus any namespace declarations specified by xmlns
XPointer parts
appearing to its left. Each xmlns
part defines a namespace declaration
as a prefix (NCName) and namespace URI
(XPtrNsURI). In the event that two or more xmlns
parts specify the same prefix, the rightmost one is used. Any xmlns
parts attempting to override the xml
prefix must
be ignored.
As an example, assume the following target XML resource:
<doc> <x:a xmlns:x="http://example.com/foo"> <x:a xmlns:x="http://example.org/bar">This element and its parent are in different namespaces.</x:a> </x:a> </doc> |
The evaluation of the following XPointer appearing in a non-XML document
(or in an XML document with no declaration of the namespace prefix x
)
will result in a sub-resource error; it cannot be known which a
element
is desired because no namespace declarations are in scope:
xpointer(//x:a) |
The following avoids this evaluation error and ensures that the outer a
element is addressed:
xmlns(x=http://example.com/foo) xpointer(//x:a) |
The following (printed on two lines for convenience) allows for accurate
addressing of the inner a
element; note that the prefix used inside
the XPointer need not correspond to the prefix (if any) actually used in the
target resource:
xmlns(x=http://example.com/foo) xmlns(y=http://example.org/bar) xpointer(//x:a/y:a) |
point
and range
Location TypesTo address non-node locations, XPointer defines two new location types, point
and range
, that can appear in location-sets and can be operated
on by XPath node tests and predicates. Locations of the point
and range
type represent positions and ranges as in DOM Level
2 [DOM2]. This section defines the point
and range
types and their characteristics required for XPath interoperability.
Note:
Unlike DOM Level 2, which is based on UTF-16 units, XPath and XPointer are based on UCS characters. So while the concepts of points and ranges are based on the DOM 2 notions of positions and ranges, there are differences in detail. For example, a sequence which in DOM counts as two characters might count in XPointer as one character.
Points and ranges can be used as XPointer context locations. This allows
the XPointer []
operator to be used to select from sets of ranges.
Also, a point as a context location, when followed by a range-to
function, selects a range.
[Definition: A location of type point is defined by a node, called the container node, and a non-negative integer, called the index.] It can represent the location preceding any individual character, or preceding or following any node in the data set constructed from an XML document. The self axis of a point is the point itself. The parent axis of a point is a location set containing a single location, the container node. Two points are identical if they have the same container node and index.
Note:
This specification does not constrain the implementation of points; applications need not actually represent points using data structures consisting of a node and an index.
Also note that, while some nodes containing points have explicit boundaries (such as element start-tags and end-tags), the boundaries of text nodes are implicit. Applications that present a graphical user interface for the selection or rendering of points and ranges need to take into consideration the fact that some seemingly identical points, such as the points just inside and just outside the closing boundary of a text node inside an element, are in fact distinguished.
[Definition: When the container node of a point is of a node type that can have child nodes (that is, when the container node is an element node or a root node), then the index is an index into the child nodes; such a point is called a node-point.] The index of a node-point must be greater than or equal to zero and less than or equal to the number of child nodes of the container. An index of zero indicates the point before any child nodes, and a non-zero index n indicates the point immediately after the nth child node.
Note:
The zero-based counting of node-points differs from the one-based counting
of string-range
and other XPointer and XPath functions.
[Definition: When the container node of a point is of a node type that cannot have child nodes (such as text nodes, comments, and processing instructions), then the index is an index into the characters of the string-value of the node; such a point is called a character-point.] The index of a character-point must be greater than or equal to zero and less than or equal to the length of the string-value of the node. An index of zero indicates a point immediately before the first character of the string-value, and a non-zero index n indicates the point immediately after the nth character of the string-value.
A point location does not have an expanded-name.
The string-value of a point location is empty.
The axes of a point location are defined as follows:
The child, descendant, preceding-sibling, following-sibling, attribute, and namespace axes are empty.
The self axis contains the point itself.
The parent axis contains the node-point's container node.
The ancestor axis contains the node-point's container node and its ancestors.
A node-point's siblings are the children of the container node that are before or after the node-point.
A location of type range is defined by two points, a start point and an end point. A range represents all of the XML structure and content between the start point and end point. This is distinct from any list of nodes and/or characters, in part because some nodes might be only partly included. The start and end points of a range must be in the same document. The start point must not appear after the end point in document order (see 5.3.5 Document Order).
[Definition: A range whose start and end points are equal is a collapsed range.]
If the container node of one point of a range is a node of a type other than element, text, or root, the container node of the other point of the range must be the same node. For example, it is allowed to specify a range from the start of a processing instruction to the end of an element (encompassing both those nodes, but still a singleton), but not to specify a range from text inside a processing instruction to text outside it.
A range location does not have an expanded-name.
The string-value of a range location is defined as follows. If the points are both character-points and the container nodes of the points are the same, then the string-value consists of the characters between the two points. Otherwise, the string-value consists of the characters that are in text nodes and that are between the two points.
The axes of a range location are identical to the axes of its start point. For example, the parent axis of a range returns the parent of the start point of the range.
Note:
The start-point
and end-point
functions can be used to navigate with respect to the respective boundaries
of a range location.
[Definition: A covering range is a range that wholly encompasses a location. For every location, a covering range is defined as follows:]
For a range location, the covering range is identical to the range.
For an attribute or namespace location, the container node of the start point and end point of the covering range is the attribute or namespace location; the index of the start point of the covering range is 0; and the index of the end point of the covering range is the length of the string-value of the attribute or namespace location.
For the rootFor the root location, the container node of the start point and end point of the covering range is the root node; the index of the start point of the covering range is 0; and the index of the end point of the covering range is the number of children of the root location.
For a point location, the start and end points of the covering range are the point itself.
For any other kind of location, the container node of the start point and end point of the covering range is the parent of the location; the index of the start point of the covering range is the number of preceding sibling nodes of the location; and the index of the end point is one greater than the index of the start point.
point
and range
LocationsXPointer extends the XPath production for NodeType
by adding items for the point
and range
location
types. That production becomes as follows:
[11] | NodeType | ::= | 'comment' |
| 'text' | |||
| 'processing-instruction' | |||
| 'node' | |||
| 'point' | |||
| 'range' |
This definition allows NodeTests
to select locations of type point
and range
from
a location-set that might include locations of all three types.
XPointer extends XPath's concept of document order to cover point and range locations.
For any point, there is an immediately preceding node defined as follows (except that there is no point defined preceding or following the root):
For a node-point with a non-zero index, the immediately preceding node is the node immediately preceding the point (that is, the node that is the nth child of the container node, where n is the index).
For a node-point with a zero index, the container node is also the immediately preceding node unless it has any attribute or namespace nodes. If the container node does have attribute or namespace nodes, then the immediately preceding node is the last of those nodes.
For a character-point the immediately preceding node is the container node of the character-point.
For any point, the immediately following node is the node that is immediately after the immediately preceding node (as just defined).
A node is before a point if the node is before or equal in document order to the immediately preceding node of the point; otherwise, the node is after the point.
The relative order of a node and a range is determined by the relative order of the node and the start point of the range.
Document order for a point is defined as follows:
If the immediately preceding nodes of the two points are the same, then either the points are the same or they are both character-points with the same container node; in the latter case, the order of the indices determines the document order.
Otherwise, the document order of the immediately preceding nodes of the points determines the document order of the points.
One range location is before another range location if and only if
the start point of the one range is before the start point of the other range, or
the start point of the one range is the same as the start point of the other range, and the end point of the one range is before the end point of the other range.
XPointer adds the following functions to those in XPath; these functions must be provided by XPointer applications.
range-to
Functionlocation-set range-to(location-set )
For each location in the context, range-to
returns
a range. The start of the range is the start-point of the context location,
and the end of the range is the end-point of the location found by evaluating
the expression argument with respect to
that context location.
The change made to the XPath syntax to support the range-to
construct corresponds to a single addition to the Step
production of the [XPath] specification. The original
production is as follows:
[4] Step ::= AxisSpecifier NodeTest Predicate* | AbbreviatedStep |
The XPointer version is as follows:
[4xptr] Step ::= AxisSpecifier NodeTest Predicate* | AbbreviatedStep | 'range-to' '(' Expr ')' Predicate* |
This change is a single exception for the range-to
function. It is not a generic change and is not extensible to other functions.
The modified production expresses that a range computation must be made for
each of the nodes in the current node list.
As an example of using the range-to
function, the
following XPointer locates the range from the start-point for the element
with ID "chap1" to the end-point for the element with ID "chap2".
xpointer(id("chap1")/range-to(id("chap2"))) |
As another example, imagine a document that uses empty elements (such as <REVST/>
for revision start and <REVEND/>
for revision end) to mark
the boundaries of edits. The following XPointer would select, for each revision,
a range starting at the beginning of the REVST
element and ending
at the end of the next REVEND
element:
xpointer(descendant::REVST/range-to(following::REVEND[1])) |
string-range
() Functionlocation-set string-range(location-set , string, number?, number? )
For each location in the location-set argument, string-range
returns a set of [Definition: string
ranges, substrings in a string.] Specifically, the string-value of the location is
searched for substrings that match the string argument, and the
resulting location-set will contain a range location for each non-overlapping
match. An empty string is considered to match before each character of the
string-value and after the final character. Whitespace in a string is matched
literally, with no normalization except that provided by XML for line ends
and attribute values. The third argument gives the position of the first character
to be in the resulting range, relative to the start of the match. The default
value is 1, which makes the range start immediately before the first character
of the matched string. The fourth argument gives the number of characters
in the range; the default is that the range extends to the end of the matched
string.
Element boundaries, as well as entire embedded nodes such as processing instructions and comments, are ignored as specified by the definition of string-value in [XPath].
If the string argument is not found in the string-value of the location, or if the third or fourth argument indicates a position that is beyond the beginning or end of the document, the XPointer part in which the function appears fails.
The points of the range-locations in the returned location-set will all be character points.
For example, the following expression returns a range that selects the
17th occurrence of the string "Thomas Pynchon" occurring in a title
element:
string-range(//title,"Thomas Pynchon")[17] |
As another example, the following expression returns a collapsed range
whose points immediately precede the letter "P" (8 from the start
of the string) in the third occurrence of the string "Thomas Pynchon"
in a P
element:
string-range(//P,"Thomas Pynchon",8,0)[3] |
Alternatively this could be specified as follows:
string-range(string-range(//P,"Thomas Pynchon")[3],"P",1,0) |
String-values are "views" into only the string content of
a document; they do not retain the structural context of any non-text nodes
interspersed with the text. Because the string-range
function operates on a string-value, markup that intervenes in the middle
of a string does not prevent a match. (Note that for this reason, a string-range
match is a range describing the relevant substring of the string-value, not
necessarily a contiguous string in a single text node in the document.) For
example, if the 17th occurrence of "Thomas Pynchon" had some
inline markup in it as follows, it would not change the string returned by
the XPointer application:
Thomas <em>Pyn</em>chon |
The following expression selects the fifth exclamation mark in any text node in the document and the character immediately following it:
string-range(/,"!",1,2)[5] |
Although these examples locate ranges in elements, string-range
is useful for locating ranges that are wholly enclosed in other node types
as well, such as attributes, processing instructions, and comments.
The following functions are related to ranges.
range
Functionlocation-set range(location-set )
The range
function returns ranges covering the locations
in the argument location-set. For each location x in the argument
location-set, a range location representing the covering range of x
is added to the result location-set.
range-inside
Functionlocation-set range-inside(location-set )
The range-inside
function returns ranges covering
the contents of the locations in the argument location-set. For each location x
in the argument location-set, a range location is added to the result location-set.
If x is a range location or a point, then x is added
to the result location-set. If x is not a range location, then x
is used as the container node of the start and end points of the range location
to be added; the index of the start point of the range is zero; if the end
point is a character point then its index is the length of the string-value
of x, and otherwise is the number of children of x.
start-point
Functionlocation-set start-point(location-set )
For each location x in the argument location-set, start-point
adds a location of type point to the result location-set. That point represents
the start point of location x and is determined by the following
rules:
If x is of type point, the start point is x.
If x is of type range, the start point is the start point of x.
If x is of type root, element, text, comment, or processing instruction, the container node of the start point is x and the index is 0.
If x is of type attribute or namespace, the XPointer part in which the function appears fails.
end-point
Functionlocation-set end-point(location-set )
For each location x in the argument location-set, end-point
adds a location of type point to the result location-set. That point represents
the end point of location x and is determined by the following
rules:
If x is of type point, the resulting point is x.
If x is of type range, the resulting point is the end point of x.
If x is of type root or element, the container node of the resulting point is x and the index is the number of children of x.
If x is of type text, comment, or processing instruction, the container node of the resulting point is x and the index is the length of the string-value of x.
If x is of type attribute or namespace, the XPointer part in which the function appears fails.
here
Functionlocation-set here()
The here
function returns a location-set with a single
member. There are two possibilties for the location returned:
If the XPointer being evaluated appears in a text node inside an element node, the location returned is the element node.
Otherwise, the location returned is the node that directly contains the XPointer being evaluated.
In the following example, the here
function appears
inside an XPointer that is in an attribute node. The XPointer as a whole,
then, returns the slide
element just preceding the slide
element that most directly contains the attribute node in question.
<button xlink:type="simple" xlink:href="#xpointer(here()/ancestor::slide[1]/preceding::slide[1])"> Previous </button> |
Note:
The type of the node in which the here
function appears
is likely to be text
, attribute
, or processing-instruction
.
The returned location for an XPointer appearing in element content does not
have a node type of element
because the XPointer is in a text
node that is itself inside an element.
If the resource in which the XPointer appears is not XML, the XPointer
part in which the here
function appears fails.
origin
Functionlocation-set origin()
The origin
function enables addressing relative to
third-party and inbound links such as defined in XLink.
This allows XPointers to be used in applications to express relative locations
when links do not reside directly at one of their endpoints. The function
returns a location-set with a single member, which locates the element from
which a user or program initiated traversal of the link. The origin()
function thus provides a meaningful location-set only if the XPointer is being
processed by application software in response to a traversal instruction from
an XML document. (See [XLink] for information about traversal.)
It is a resource error to use origin
in the fragment identifier portion of a URI reference where a URI is also
provided and identifies a containing resource different from the resource
from which traversal was initiated, or in a situation where traversal is not
occurring.
[XML] requires well-formed documents to contain a single element at the top level. Thus, the XPath data model of a well-formed document will have a root node with a single child node of type element. In order to allow XPointer to be used to address locations in arbitrary external parsed entities, along with well-formed documents, XPointer extends the XPath data model to allow the root node to have any sequence of nodes as children that would be possible of an element node. This extension is identical to the one made by [XSLT]. Thus, the root node may contain child nodes of type text, and any number of child nodes of type element.
The first working drafts of this specification were developed in the XML Working Group, whose members are listed in [XML]. The work was completed in the XML Linking Working Group, with the following members active at the completion of this specification:
The editors wish to acknowledge substantial contributions from Tim Bray, who previously served as co-editor and co-chair. We would also like to acknowledge substantial contributions from James Clark, especially on the integration with XPath. We would like to thank Gavin Nicol and Martin Dürst for help with passages related to internationalization. Finally, we would like to thank the XML Linking Interest Group and Working Group for their support and input.