Notes:
All code on this page presupposes that the following has been executed:
require 'rexml/document'
For convenience, examples on this page use
REXML::Document.new
, not REXML::Element.new
. This
is completely valid, because REXML::Document is a subclass of
REXML::Element.
The context for an element is a hash of processing directives that influence the way XML is read, stored, and written. The context entries are:
:respect_whitespace
: controls treatment of whitespace.
:compress_whitespace
: determines whether whitespace is
compressed.
:ignore_whitespace_nodes
: determines whether whitespace-only
nodes are to be ignored.
:raw
: controls treatment of special characters and entities.
The default context for a new element is {}
. You can set the
context at element-creation time:
d = REXML::Document.new('', {compress_whitespace: :all, raw: :all}) d.context # => {:compress_whitespace=>:all, :raw=>:all}
You can reset the entire context by assigning a new hash:
d.context = {ignore_whitespace_nodes: :all} d.context # => {:ignore_whitespace_nodes=>:all}
Or you can create or modify an individual entry:
d.context[:raw] = :all d.context # => {:ignore_whitespace_nodes=>:all, :raw=>:all}
:respect_whitespace
¶ ↑Affects: REXML::Element.new
,
REXML::Element.text=
.
By default, all parsed whitespace is respected (that is, stored whitespace not compressed):
xml_string = '<root><foo>a b</foo> <bar>c d</bar> <baz>e f</baz></root>' d = REXML::Document.new(xml_string) d.to_s # => "<root><foo>a b</foo> <bar>c d</bar> <baz>e f</baz></root>"
Use :respect_whitespace
with an array of element names to
specify the elements that are to have their whitespace respected;
other elements’ whitespace, and whitespace between elements, will be
compressed.
In this example: foo
and baz
will have their
whitespace respected; bar
and the space between elements will
have their whitespace compressed:
d = REXML::Document.new(xml_string, {respect_whitespace: ['foo', 'baz']}) d.to_s # => "<root><foo>a b</foo> <bar>c d</bar> <baz>e f</baz></root>" bar = d.root[2] # => <bar> ... </> bar.text = 'X Y' d.to_s # => "<root><foo>a b</foo> <bar>X Y</bar> <baz>e f</baz></root>"
:compress_whitespace
¶ ↑Affects: REXML::Element.new
,
REXML::Element.text=
.
Use compress_whitespace: :all
to compress whitespace both
within and between elements:
xml_string = '<root><foo>a b</foo> <bar>c d</bar> <baz>e f</baz></root>' d = REXML::Document.new(xml_string, {compress_whitespace: :all}) d.to_s # => "<root><foo>a b</foo> <bar>c d</bar> <baz>e f</baz></root>"
Use :compress_whitespace
with an array of element names to
compress whitespace in those elements, but not in other elements nor
between elements.
In this example, foo
and baz
will have their
whitespace compressed; bar
and the space between elements will
not:
d = REXML::Document.new(xml_string, {compress_whitespace: ['foo', 'baz']}) d.to_s # => "<root><foo>a b</foo> <bar>c d</bar> <baz>e f</baz></root>" foo = d.root[0] # => <foo> ... </> foo.text= 'X Y' d.to_s # => "<root><foo>X Y</foo> <bar>c d</bar> <baz>e f</baz></root>"
:ignore_whitespace_nodes
¶ ↑Affects: REXML::Element.new
.
Use ignore_whitespace_nodes: :all
to omit all whitespace-only
elements.
In this example, bar
has a text node, while nodes
foo
and baz
do not:
xml_string = '<root><foo> </foo><bar> BAR </bar><baz> </baz></root>' d = REXML::Document.new(xml_string, {ignore_whitespace_nodes: :all}) d.to_s # => "<root><foo> FOO </foo><bar/><baz> BAZ </baz></root>" root = d.root # => <root> ... </> foo = root[0] # => <foo/> bar = root[1] # => <bar> ... </> baz = root[2] # => <baz/> foo.first.class # => NilClass bar.first.class # => REXML::Text baz.first.class # => NilClass
Use :ignore_whitespace_nodes
with an array of element names to
specify the elements that are to have whitespace nodes ignored.
In this example, bar
and baz
have text nodes,
while node foo
does not.
xml_string = '<root><foo> </foo><bar> BAR </bar><baz> </baz></root>' d = REXML::Document.new(xml_string, {ignore_whitespace_nodes: ['foo']}) d.to_s # => "<root><foo/><bar> BAR </bar><baz> </baz></root>" root = d.root # => <root> ... </> foo = root[0] # => <foo/> bar = root[1] # => <bar> ... </> baz = root[2] # => <baz> ... </> foo.first.class # => NilClass bar.first.class # => REXML::Text baz.first.class # => REXML::Text
:raw
¶ ↑Affects: Element.text=
, Element.add_text
,
Text.to_s
.
Parsing of a
elements is not affected by raw
:
xml_string = '<root><a>0 < 1</a><b>1 > 0</b></root>' d = REXML::Document.new(xml_string, {:raw => ['a']}) d.root.to_s # => "<root><a>0 < 1</a><b>1 > 0</b></root>" a, b = *d.root.elements a.to_s # => "<a>0 < 1</a>" b.to_s # => "<b>1 > 0</b>"
But Element#text= is affected:
a.text = '0 < 1' b.text = '1 > 0' a.to_s # => "<a>0 < 1</a>" b.to_s # => "<b>1 &gt; 0</b>"
As is Element.add_text:
a.add_text(' so 1 > 0') b.add_text(' so 0 < 1') a.to_s # => "<a>0 < 1 so 1 > 0</a>" b.to_s # => "<b>1 &gt; 0 so 0 &lt; 1</b>"