Archive of UserLand's first discussion group, started October 5, 1998.

XML != binary data

Author:Stephen Butler
Posted:6/28/1999; 12:48:04 PM
Topic:www.xml-rpc.com
Msg #:7869 (In response to 947)
Prev/Next:7868 / 7870

The XML-RPC spec says that a "string" element can contain any characters, while the XML spec says that the only non-printable characters allowed are tab, carriage return, linefeed, and certain 16-bit Unicode characters.

The XML spec defines "Character Data" as

[^<&]* - ([^<&]* ']]>' [^<&]*)

but I believe this production assumes the global restriction on characters. Whaddaya think?

Steve Butler

From the XML-RPC spec, Q&A section: (http://frontier.userland.com/tree$2.8.2.1#update1) "Any characters are allowed in a string except < and &, which are encoded as < and &. A string can be used to encode binary data."

From the XML spec, section 2.2: (http://www.w3.org/TR/1998/REC-xml-19980210.html#charsets) "A parsed entity contains text, a sequence of characters, which may represent markup or character data. A character is an atomic unit of text as specified by ISO/IEC 10646 [ISO/IEC 10646]. Legal characters are tab, carriage return, line feed, and the legal graphic characters of Unicode and ISO/IEC 10646"




This page was archived on 6/13/2001; 4:51:05 PM.

© Copyright 1998-2001 UserLand Software, Inc.