<!DOCTYPE html
  PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en-US" lang="en-US"><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8" /><title>Internationalization and Localization Markup Requirements</title><style type="text/css">

</style><link rel="stylesheet" href="local.css" type="text/css" /><link rel="stylesheet" type="text/css" href="http://www.w3.org/StyleSheets/TR/W3C-WD.css" /></head><body><div style="text-align:center;"><p>[ <a href="#contents">contents</a> ]</p></div><div class="head"><p><a href="http://www.w3.org/"><img src="http://www.w3.org/Icons/w3c_home" alt="W3C" height="48" width="72" /></a></p>
<h1><a name="title" id="title"></a>Internationalization and Localization Markup Requirements</h1>
<h2><a name="w3c-doctype" id="w3c-doctype"></a>W3C Working Draft 18 May 2006</h2><dl><dt>This version:</dt><dd>
			<a href="http://www.w3.org/TR/2006/WD-itsreq-20060518/">http://www.w3.org/TR/2006/WD-itsreq-20060518/</a>
		</dd><dt>Latest version:</dt><dd>
			<a href="http://www.w3.org/TR/itsreq/">http://www.w3.org/TR/itsreq/</a>
		</dd><dt>Previous version:</dt><dd><a href="http://www.w3.org/TR/2005/WD-itsreq-20051122/">http://www.w3.org/TR/2005/WD-itsreq-20051122/</a></dd><dt>Editor:</dt><dd>Yves Savourel, ENLASO Corporation</dd></dl><p class="copyright"><a href="http://www.w3.org/Consortium/Legal/ipr-notice#Copyright">Copyright</a> © 2006 <a href="http://www.w3.org/"><acronym title="World Wide Web Consortium">W3C</acronym></a><sup>®</sup> (<a href="http://www.csail.mit.edu/"><acronym title="Massachusetts Institute of Technology">MIT</acronym></a>, <a href="http://www.ercim.org/"><acronym title="European Research Consortium for Informatics and Mathematics">ERCIM</acronym></a>, <a href="http://www.keio.ac.jp/">Keio</a>), All Rights Reserved. W3C <a href="http://www.w3.org/Consortium/Legal/ipr-notice#Legal_Disclaimer">liability</a>, <a href="http://www.w3.org/Consortium/Legal/ipr-notice#W3C_Trademarks">trademark</a> and <a href="http://www.w3.org/Consortium/Legal/copyright-documents">document use</a> rules apply.</p></div><hr /><div>
<h2><a name="abstract" id="abstract"></a>Abstract</h2><p>When creating schemas (XML Schema, DTD, etc.), it is important to include constructs that meet the needs of content authors dealing with international audiences, and address the needs of the localization community. This document provides a list of key requirements to achieve such a goal. It will be used to provide a framework and direction for a detailed solution proposal (or set of proposals) to be developed later.</p></div><div>
<h2><a name="status" id="status"></a>Status of this Document</h2><p>
				<em>This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the <a href="http://www.w3.org/TR/">W3C technical reports index</a> at http://www.w3.org/TR/.</em>
			</p><p>This document defines requirements for a set of solutions that would address the main challenges and issues of internationalizing and localizing XML documents. The solutions are expected to include several aspects: a specialized vocabulary that XML users can include in their own documents, a set of guidelines to apply when using existing XML technologies, and a range of possible mechanisms for applying those. </p><p>This document was developed by the
<a href="http://www.w3.org/International/its/">Internationalization Tag Set (ITS) Working Group</a>, part of the <a href="http://www.w3.org/International/Activity">W3C Internationalization Activity</a>. A complete <a href="#revisionlog">list of changes</a> to this document is available.</p><p>Feedback about the content of this document is encouraged. Send your comments to
<a href="mailto:www-i18n-comments@w3.org?subject=[Comment on itsreq WD]">www-i18n-comments@w3.org</a>. Use "[Comment on itsreq WD]" in the subject line of your email. The <a href="http://lists.w3.org/Archives/Public/www-i18n-comments/">archives</a> for this list are publicly
available.</p><p>Publication as a Working Draft does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.</p><p>This document was produced by a group operating under the <a href="http://www.w3.org/Consortium/Patent-Policy-20040205/">5 February 2004 W3C Patent Policy</a>. The group does not expect this document to become 
a W3C Recommendation. This document is informative only. W3C maintains a <a href="http://www.w3.org/2004/01/pp-impl/37139/status">public list of any patent disclosures</a> made in connection with the
    deliverables of the group; that page also includes 
    instructions for disclosing a patent.
  An individual who has actual knowledge of a patent which the
  individual believes contains <a href="http://www.w3.org/Consortium/Patent-Policy-20040205/#def-essential">Essential Claim(s)</a> must disclose the
  information in accordance with <a href="http://www.w3.org/Consortium/Patent-Policy-20040205/#sec-Disclosure">section 6 of the W3C Patent Policy</a>.</p></div><div class="toc">
<h2><a name="contents" id="contents"></a>Table of Contents</h2><div class="toc"><div class="toc1">1 <a href="#intro">Introduction</a><div class="toc2">1.1 <a href="#intro_background">Background</a></div>
<div class="toc2">1.2 <a href="#intro_who">Who Should Read This</a></div>
<div class="toc2">1.3 <a href="#intro_overview">Overview</a></div>
<div class="toc2">1.4 <a href="#intro_definitions">Key Definitions</a></div>
</div>
<div class="toc1">2 <a href="#scenarios">Usage Scenarios</a><div class="toc2">2.1 <a href="#scenarios_content">Content Authoring</a></div>
<div class="toc2">2.2 <a href="#scenarios_terminology">Terminology Creation and Translation</a></div>
<div class="toc2">2.3 <a href="#scenarios_software">Software Development</a></div>
</div>
<div class="toc1">3 <a href="#req">Requirements</a><div class="toc2">3.1 <a href="#constraints">Indicator of Constraints</a></div>
<div class="toc2">3.2 <a href="#span">Span-Like Element</a></div>
<div class="toc2">3.3 <a href="#cdata">CDATA Section</a></div>
<div class="toc2">3.4 <a href="#uid">Unique Identifier</a></div>
<div class="toc2">3.5 <a href="#entities">Handling of Entities</a></div>
<div class="toc2">3.6 <a href="#langlocale">Identifying Language/Locale</a></div>
<div class="toc2">3.7 <a href="#termid">Identifying Terms</a></div>
<div class="toc2">3.8 <a href="#mapping">Purpose Specification/Mapping</a></div>
<div class="toc2">3.9 <a href="#contstyle">Content Style</a></div>
<div class="toc2">3.10 <a href="#linkedtext">Link to Internal/External Text</a></div>
<div class="toc2">3.11 <a href="#bidi">Bidirectional Text Support</a></div>
<div class="toc2">3.12 <a href="#transinfo">Indicator of Translatability</a></div>
<div class="toc2">3.13 <a href="#metrics">Metrics Count</a></div>
<div class="toc2">3.14 <a href="#impact">Limited Impact</a></div>
<div class="toc2">3.15 <a href="#transattr">Attributes and Translatable Text</a></div>
<div class="toc2">3.16 <a href="#naming">Naming Scheme</a></div>
<div class="toc2">3.17 <a href="#locnotes">Localization Notes</a></div>
<div class="toc2">3.18 <a href="#whitespaces">Handling of White-Spaces</a></div>
<div class="toc2">3.19 <a href="#multilang">Multilingual Documents</a></div>
<div class="toc2">3.20 <a href="#annomark">Annotation Markup</a></div>
<div class="toc2">3.21 <a href="#datetime">Identifying Date and Time</a></div>
<div class="toc2">3.22 <a href="#nestedelems">Nested Elements</a></div>
<div class="toc2">3.23 <a href="#lingml">Linguistic Markup</a></div>
<div class="toc2">3.24 <a href="#variables">Variables</a></div>
<div class="toc2">3.25 <a href="#elemseg">Elements and Segmentation</a></div>
<div class="toc2">3.26 <a href="#objects">Associated Objects</a></div>
</div>
</div>
<h3><a name="appendices" id="appendices"></a>Appendices</h3><div class="toc1">A <a href="#sec-bibliography">References</a> (Non-Normative)</div>
<div class="toc1">B <a href="#revisionlog">Revision Log</a> (Non-Normative)</div>
<div class="toc1">C <a href="#acknowledgements">Acknowledgements</a> (Non-Normative)</div>
</div><hr /><div class="body"><div class="div1">
<h2><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="intro" id="intro"></a>1 Introduction</h2><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="intro_background" id="intro_background"></a>1.1 Background</h3><p>Content or software that is authored in one language (i.e. source language) is often made available in additional languages. This is done through a process called localization, where the original material is translated and adapted to the target audience.</p><p>From the viewpoints of feasibility, cost, and efficiency, it is important that the original material should be suitable for localization. This is achieved by proper design and development, and the corresponding process is referred to as internationalization.</p><p>The increasing usage of XML as a medium for documentation-related content (e.g. <a title="The&#xA;DocBook Document Type" href="#docbookspec">[DocBook]</a>, being a format for writing structured documentation, well suited to computer hardware and software manuals) and software-related content (e.g. the eXtensible User Interface Language (XUL)) provides growing challenges and opportunities in the domain of XML internationalization and localization.</p></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="intro_who" id="intro_who"></a>1.2 Who Should Read This</h3><p>The target audience of this document includes the following categories: </p><ul><li><p>Designers of content-related formats</p></li><li><p>Developers of schemas in various formats</p></li><li><p>Developers of XML authoring tools</p></li><li><p>Authors of XML content</p></li><li><p>Developers of localization tools</p></li><li><p>Localizers involved with XML</p></li><li><p>Developers of Internet specifications at the World Wide Web Consortium and related bodies</p></li></ul></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="intro_overview" id="intro_overview"></a>1.3 Overview</h3><p>This document describes requirements for a list of guidelines and a set of recommended approaches to developing schemas which address issues related to international use of document formats and localization of XML content.</p><p>Regardless of the final form and syntax such approaches ultimately take, it is possible to envision their usage at different levels:</p><ol class="depth1"><li><p>In a document instance, grouped in a single location, to associate information with multiple parts of the document using some kind of linking or addressing mechanism. Such usage would be similar to the <code>style</code> element in an HTML document.</p></li><li><p>In a document instance, within the content, at the location where the information applies. This usage would be similar to the <code>style</code> attribute in an HTML document.</p></li><li><p>In schemas, along with the definition of an element or an attribute, to provide data categories for internationalization and localization.</p></li></ol><p>Such approaches are not meant to describe the configuration settings of localization tools for XML content. However, it is expected that the tools will be able to infer such properties from the information provided by the ITS implementations. 
For example, the tools should be able to build a list of all nodes that are to be translated in a given document using the ITS information in the document itself and in its corresponding schema(s) or DTD.</p><p>Most of the requirements listed here are addressed in "Internationalization Tag Set (ITS)" <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or in "Best Practices for XML Internationalization" <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>. Some may be addressed in later versions of these documents.</p></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="intro_definitions" id="intro_definitions"></a>1.4 Key Definitions</h3><p>When used in this document, the following terms have the meaning described here:</p><dl><dt class="label">Internationalization</dt><dd><p>
								[<a name="t001" id="t001" title="i18n">Definition</a>: 
									<span class="new-term">Internationalization</span> is the design and development of a product, application or document content that enables easy localization for target audiences that vary in culture, region, or language.] (Definition based on W3C Internationalization Activity FAQ <a title="Localization&#xA;vs. Internationalization" href="#geo-i18n-l10n">[i18n l10n]</a>)</p></dd><dt class="label">Localization</dt><dd><p>
								[<a name="t002" id="t002" title="l10n">Definition</a>: 
									<span class="new-term">Localization</span> refers to the adaptation of a product, application or document content to meet the language, cultural and other requirements of a specific target market (a "locale").] (Definition based on W3C Internationalization Activity FAQ <a title="Localization&#xA;vs. Internationalization" href="#geo-i18n-l10n">[i18n l10n]</a>)</p></dd><dt class="label">Schema</dt><dd><p>
								[<a name="t003" id="t003" title="schema">Definition</a>: The term <span class="new-term">schema(s)</span> refers to any schema language (e.g. DTD, XML Schema, etc). The term "XML Schema" is used when referring to XML Schema.]
							</p></dd></dl></div></div><div class="div1">
<h2><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios" id="scenarios"></a>2 Usage Scenarios</h2><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios_content" id="scenarios_content"></a>2.1 Content Authoring</h3><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios_content_desc" id="scenarios_content_desc"></a>2.1.1 Description</h4><p>As an author develops content that is meant to be localized, he or she may need to label specific parts of the text for various purposes, such as:</p><ul><li><p>terms that should either not be translated or be translated using a pre-existing terminology list</p></li><li><p>sections of the document that should remain in the source language</p></li><li><p>acronyms or specific terminology that require an explanatory note for the translator</p></li><li><p>reusable text</p></li></ul><p>In other cases, the original text itself may need to be labeled for specific information required for correct rendering, such as ruby text in Japanese <a title="What is&#xA;Ruby?" href="#whatisruby">[Ruby]</a>, or bidirectional overrides in Arabic <a title="What you&#xA;need to know about the bidi algorithm and inline markup" href="#bidiinfo">[Bidi]</a>.</p><p>The use of a standardized set of tags allows authoring systems to provide a common solution for these special markers across all XML documents. This, in turn, increases the feasibility of a simple interface for performing the labeling task.</p><p>For example, the author selects a portion of text that should not be translated and clicks a button to mark it up as "do not translate" with standardized markup. The availability of such an interface allows the author to give translators better working context with minimal effort.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios_content_stake" id="scenarios_content_stake"></a>2.1.2 Stakeholders</h4><p>This scenario is relevant to:</p><ul><li><p>The technical writers developing content (especially content to be localized)</p></li><li><p>The developers of authoring systems</p></li><li><p>The localizers and the translators</p></li></ul></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios_terminology" id="scenarios_terminology"></a>2.2 Terminology Creation and Translation</h3><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios_terminology_desc" id="scenarios_terminology_desc"></a>2.2.1 Description</h4><p>During the development of documentation material, it is common practice to scan the content of the documents to create a list of frequently used terms. </p><p>This list is used, for example, to provide a consistent terminology across the different parts of the documentation. It is also used as the base for translation glossaries.</p><p>During the terminology creation phase the insertion of special markers to delimit terms within the source material helps the user to identify the proposed entries and view them within their context.</p><p>The same markup can be used at later stages in the translation process, to help the translators match the source terms with their agreed-upon translations.</p><p>The use of a common set of markers allows for better re-usability of the information across the different steps of the localization process and across the various tools used to facilitate it.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios_terminology_stake" id="scenarios_terminology_stake"></a>2.2.2 Stakeholders</h4><p>This scenario is relevant to:</p><ul><li><p>The technical writers developing content (especially content to be localized)</p></li><li><p>The authors or the terminologists that create the glossaries</p></li><li><p>The people working on quality management/assurance</p></li><li><p>The translators</p></li></ul></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios_software" id="scenarios_software"></a>2.3 Software Development</h3><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios_software_desc" id="scenarios_software_desc"></a>2.3.1 Description</h4><p>Software-related material is now often stored in XML repositories. Examples include UI resources and message files, source code comments used to generate documentation, or even temporary XML storage generated from proprietary formats for the duration of the localization.</p><p>A software developer often needs to provide localization-related information along with the resources that will be translated. For instance, he or she may need to indicate that a string has a maximum length because the program processes it using a fixed-length buffer.</p><p>Using a common set of tags in the XML documents to carry such information across the different tools used during the localization process gives the developer better control. He or she can influence how the resources are modified, and ultimately prevent bugs or incorrect translations from being introduced.</p><p>Localizers also often need to add their own information to the resource material. They do this to complement what the developer has already set, or to add their own instructions.</p><p>In all these cases, a common set of tags allows the localization providers to develop reusable verification tools to ensure that the translated material meets the requirements set by the developers. It also helps the different parties exchange information in context.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="scenarios_software_stake" id="scenarios_software_stake"></a>2.3.2 Stakeholders</h4><p>This scenario is relevant to:</p><ul><li><p>The software developers that create the resources</p></li><li><p>The localization engineers that prepare the resources for translation</p></li><li><p>The translators modifying the data</p></li></ul></div></div></div><div class="div1">
<h2><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="req" id="req"></a>3 Requirements</h2><div class="note"><p><span class="note-head">Note: </span>Several of the following requirements are illustrated with XML code samples using yet-to-be-defined ITS elements and attributes. Their names are completely arbitrary and are not intended to represent the appearance of the actual solution. The solution also may or may not be implemented as a namespace. These elements and attributes are represented with a <strong>strong emphasis</strong> in the examples.</p></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="constraints" id="constraints"></a>3.1 Indicator of Constraints</h3><p>This requirement might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>.</p><p>[R001] <em>It should be possible to associate one or more constraints to specific content.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="constraints_desc" id="constraints_desc"></a>3.1.1 Challenges</h4><p>Translatable data may come with various constraints in the way they can be modified. For example, the content of the following <code>string</code> element must accommodate the length restriction imposed by the small display panel where it is used:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e354" id="d2e354"></a>Example 1: Length restriction</div><div class="exampleInner"><pre>&lt;!-- LED display has only 16 characters --&gt;
&lt;string id="s123"&gt;Printing...&lt;/string&gt;</pre></div></div><p>In this case a standard method should be used for indicating the dimensions of the container so that localization tools can automatically recognize them and, when possible, enforce the constraint during translation.</p><p>Examples of constraints are:</p><ul><li><p>Container size (e.g. maximum length, etc.)</p></li><li><p>Text allowed in a limited set of characters (e.g. translatable paths or filenames)</p></li></ul><p>These constraints may need to be defined at the schema level or they may need to be defined for specific instances of an element.</p><p>In some cases, the constraint may be applicable only for a given context or a given tool.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="constraints_notes" id="constraints_notes"></a>3.1.2 Notes</h4><p>XSD (XML Schema Part 2: Datatypes Second Edition) provides a mechanism to define "Constraining Facets" (<a title="XML Schema&#xA;Part 2: Datatypes Second Edition" href="#xsd">[XSD]</a>, section 4.3) that may provide some solution for this requirement at the schema level. At the instance level, Schematron <a title="Schematron - A Language for Making Assertions about Patterns Found in XML Documents." href="#schematron">[Schematron]</a> could be used for the same purpose.</p><p>Sometimes the constraint may need to be expressed using units different from the unit used in the document. For example, the maximum length of a string may need to be expressed in bytes, pixels, or display cells instead of characters. The constraint may therefore need several accompanying parameters (e.g. the encoding to use, or the font and point-size information).</p></div></div><div class="div2">
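<p>As an illustration of the constraining-facet approach mentioned in the notes on constraints, a schema-level length limit might be sketched as follows. The type name <code>DisplayString</code> is an assumption made for this sketch:</p><div class="exampleOuter"><div class="exampleHeader">Sketch: Schema-level length constraint using an XSD facet</div><div class="exampleInner"><pre>&lt;xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema"&gt;
  &lt;!-- DisplayString is a hypothetical type for text shown on a 16-character display --&gt;
  &lt;xs:simpleType name="DisplayString"&gt;
    &lt;xs:restriction base="xs:string"&gt;
      &lt;xs:maxLength value="16"/&gt;
    &lt;/xs:restriction&gt;
  &lt;/xs:simpleType&gt;
&lt;/xs:schema&gt;</pre></div></div><p>The <code>maxLength</code> facet counts characters for string types, so it cannot by itself express a limit in bytes, pixels, or display cells.</p>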
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="span" id="span"></a>3.2 Span-Like Element</h3><p>[R002] <em>A span-like element is required to allow authors to mark sections of text that may have special properties, from a localization and internationalization point of view.</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="span_desc" id="span_desc"></a>3.2.1 Challenges</h4><p>Given a section of XML text, there is often insufficient information in the original markup to determine exactly how the contents should be dealt with from a localization and internationalization point of view. Adding span-like elements to the markup at the authoring stage would allow this information to be passed on to localization processes (whether human or machine-assisted).</p><p>For example, span-like elements could be used to mark sections of text that need to be translated by a domain expert (as with source code fragments) or to mark those that need special terminology in order to be properly translated. In particular, a span-like element can help translation tools determine where to apply sentence breaks and can assist metrics-calculating algorithms.</p><p>A span-like element is also extremely useful for marking language information in source files, which translation tools can use to determine which translation process to apply to each section of text (e.g. a Latin quotation in a section of English text is often intended to be left in Latin in the translated version of the English text). Other uses are foreseen within the scope of the ITS.</p><p>One example would be the following sentence, which contains some source code that we would like to treat specially during translation:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e404" id="d2e404"></a>Example 2: Text with portion of source code</div><p>
							<code>The Java statement System.out.println("Hello World!"); prints the text "Hello World!" to standard output.</code>
						</p></div><p>Here, we would like to put a span-like element around the source code fragment to indicate that it is not standard text for translation and should be translated by someone familiar with the Java programming language. Also, translation tools that perform sentence segmentation should treat the exclamation points in this sample text carefully.</p><p>While the <code>code</code> tag in XHTML could be used to mark up this text (in an XHTML document), it is often not specific enough for translators: it does not tell the translator what sort of source code is contained inside the tag, nor does it mark which portions of the code contents are translatable.</p><p>One possible use of a span-like element is the following:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e421" id="d2e421"></a>Example 3: Text with marked-up source code</div><p>
							<code>The Java statement &lt;code&gt;</code>
							<strong>
								<code>&lt;span trans="no"&gt;</code>
							</strong>
							<code>System.out.println("</code>
							<strong>
								<code>&lt;/span&gt;</code>
							</strong>
							<code>Hello World!</code>
							<strong>
								<code>&lt;span trans="no"&gt;</code>
							</strong>
							<code>");</code>
							<strong>
								<code>&lt;/span&gt;</code>
							</strong>
							<code>&lt;/code&gt; prints the text "Hello World!" to standard output.</code>
						</p></div><p>An alternative to this sort of construction would be to put the translatable text in a separate document, and then refer to that using  some form of linking mechanism, for example:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e467" id="d2e467"></a>Example 4: Source code with entity reference</div><p>
							<code>&lt;code&gt;System.out.println("&amp;java.code.example.text;");&lt;/code&gt;</code>
						</p></div><p>Another example is shown below, where we have a piece of text that contains a file name which should also not be translated:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e477" id="d2e477"></a>Example 5: Text with non-translatable file name</div><p>
							<code>The file /etc/passwd is a local source of information about users' accounts.</code>
						</p></div><p>In this case, the filename <code>/etc/passwd</code> should not be translated, and we would like to add markup to indicate this.</p><p>These examples show that we aim to shift some of the responsibility for distinguishing translatable from non-translatable content from the translation tool author to the content author, or at the very least to recommend that content authors separate the translatable and non-translatable portions of text more clearly.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="span_notes" id="span_notes"></a>3.2.2 Notes</h4><p>This requirement is related to some other requirements, namely:</p><ul><li><p>
								<a class="section-ref" href="#constraints">Section 3.1: Indicator of Constraints</a>
							</p></li><li><p>
								<a class="section-ref" href="#termid">Section 3.7: Identifying Terms</a>
							</p></li><li><p>
								<a class="section-ref" href="#mapping">Section 3.8: Purpose Specification/Mapping</a>
							</p></li><li><p>
								<a class="section-ref" href="#linkedtext">Section 3.10: Link to Internal/External Text</a>
							</p></li><li><p>
								<a class="section-ref" href="#bidi">Section 3.11: Bidirectional Text Support</a>
							</p></li><li><p>
								<a class="section-ref" href="#transinfo">Section 3.12: Indicator of Translatability</a>
							</p></li><li><p>
								<a class="section-ref" href="#metrics">Section 3.13: Metrics Count</a>
							</p></li><li><p>
								<a class="section-ref" href="#naming">Section 3.16: Naming Scheme</a>
							</p></li><li><p>
								<a class="section-ref" href="#locnotes">Section 3.17: Localization Notes</a>
							</p></li><li><p>
								<a class="section-ref" href="#annomark">Section 3.20: Annotation Markup</a>
							</p></li></ul><p>For the requirement <a class="section-ref" href="#mapping">Section 3.8: Purpose Specification/Mapping</a>, we need to ensure that any related semantics in the target schema are also sufficient for translation: that is for example, saying that a <code>programlisting</code> element in <a title="The&#xA;DocBook Document Type" href="#docbookspec">[DocBook]</a> is related to a <code>code</code> element in XHTML is interesting, but neither will help the translator determine which contents of <code>code</code> or <code>programlisting</code> are actually translatable.</p><p>A span-like element could be used in cases like these where specific text properties are identified.</p></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="cdata" id="cdata"></a>3.3 CDATA Section</h3><p>[R003] <em>Provisions must be taken to ensure that CDATA sections do not impair the localization process.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="cdata_desc" id="cdata_desc"></a>3.3.1 Challenges</h4><p>Translators and other document consumers often cannot tell how the contents of a given CDATA section are intended to be used.</p><p>The use of CDATA sections in translatable XML files is discouraged, as they prevent any elements in a proposed XML internationalization tag set from being used to mark up the localizable components of that section of text, although the entire CDATA section could be wrapped in additional tags.</p><p>In addition, numeric character references and entity references are not supported within CDATA sections, which can lead to loss of data if the document is converted from one encoding to another in which some characters in the CDATA sections are not supported.</p></div><div class="div3">
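<p>As a minimal sketch of the problem, consider the fragment below. The element names <code>help</code> and <code>para</code> and the file name <code>setup.exe</code> are hypothetical and serve only as illustration. In the first <code>help</code> element the text is wrapped in a CDATA section, so the <code>b</code> tags are plain character data that tools cannot address; in the second, the same text is ordinary element content that inline markup could qualify.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: CDATA section versus marked-up content</div><div class="exampleInner"><pre>&lt;!-- Inside CDATA, the b tags are literal text, not elements. --&gt;
&lt;help&gt;&lt;![CDATA[Run &lt;b&gt;setup.exe&lt;/b&gt; to install the product.]]&gt;&lt;/help&gt;

&lt;!-- Without CDATA, the file name sits in an element that markup can address. --&gt;
&lt;help&gt;&lt;para&gt;Run &lt;b&gt;setup.exe&lt;/b&gt; to install the product.&lt;/para&gt;&lt;/help&gt;</pre></div></div>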
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="cdata_notes" id="cdata_notes"></a>3.3.2 Notes</h4><p>There is a temptation to use CDATA sections in XML files to escape sections of text that contain characters which would otherwise be interpreted as XML characters.</p><p>A commonly employed example of this has been seen where document authors attempt to easily produce an "XML version" of an input file by inserting CDATA sections around text which contains HTML markup.</p><p>Since the contents of these escaped sections cannot be marked up, they must be examined manually to determine which parts of the content contain translatable text, non-translatable text, etc. For tools authors, there is often no way to determine the original format of the text inside the CDATA section (e.g. was it HTML, RTF, a base64-encoded OpenOffice.org document etc.)</p><p>These considerations can result in bottle-necks in translation processes while these manual steps are performed.</p></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="uid" id="uid"></a>3.4 Unique Identifier</h3><p>[R004] <em>It should be possible to attach a unique identifier to any localizable item. This identifier should be unique within a document set, but should be identical across all translations of the same item.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="uid_desc" id="uid_desc"></a>3.4.1 Challenges</h4><p>In order to most effectively reuse translated text where content is reused (either across update versions or across deliverables) it is necessary to have a unique and persistent identifier associated with the element.</p><p>This identifier allows the translation tools to correctly track an item from one version or location to the next. After one is sure that this is the same item, the content can be examined for changes, and if no change has taken place the potential for reuse of the previous translation is very high.</p><p>Change analysis constitutes an extremely powerful productivity tool for translation when compared to the typical source matching (a.k.a. translation memory) techniques, which simply look for similar source text in the database without, most of the time, being able to tell whether the context of its use is the same.</p><p>This change analysis technique has been possible with user-interface messages in the past, but the introduction of structured XML (and SGML) documents will allow for its use in documents also.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="uid_notes" id="uid_notes"></a>3.4.2 Notes</h4><p>The <code>xml:id</code> attribute <a title="xml:id Version&#xA;1.0" href="#xmlid">[XML ID]</a> may be a means to carry the unique identifier. Note however, that <code>xml:id</code> is unique within a document, not necessarily within a set of documents.</p></div></div><div class="div2">
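<p>The following sketch shows what a shared identifier could look like, assuming that <code>xml:id</code> carries it. The <code>para</code> element and the identifier value <code>install-step-1</code> are hypothetical. The English source and its German translation carry the same identifier, so a tool could pair the two items and later check whether the source has changed.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: Same identifier in source and translation</div><div class="exampleInner"><pre>&lt;!-- English source document --&gt;
&lt;para xml:id="install-step-1"&gt;Insert the disc into the drive.&lt;/para&gt;

&lt;!-- German translation of the same item, kept in a separate document --&gt;
&lt;para xml:id="install-step-1"&gt;Legen Sie die Disc in das Laufwerk ein.&lt;/para&gt;</pre></div></div>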
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="entities" id="entities"></a>3.5 Handling of Entities</h3><p>[R005] <em>XML applications which combine contents from various modules/entities need to adhere to certain guidelines in order to ensure that the XML application itself and the contents can be localized easily.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="entities_desc" id="entities_desc"></a>3.5.1 Challenges</h4><p>XML applications (i.e. a combination of DTD/XSD, style-sheets, XML instances) often make use of so-called general entities (<a title="Extensible Markup&#xA;Language (XML) 1.0 (Third Edition)" href="#xml10spec">[XML 1.0]</a>, section 4). Various types of entities exist, for example:</p><ol class="depth1"><li><p>Character entity. The entity defines a single Unicode character. Example:
 <code>&lt;!ENTITY aacute "á"&gt;</code>
							</p></li><li><p>A short element-free text. The entity defines a short text that contains only text (no element or other XML constructs). This is for instance an entity for a product name. Example: <code>&lt;!ENTITY productName "pictoMagic for Windows"&gt;</code>
							</p></li><li><p>A longer text with one or more elements. The entity defines a piece of boiler-plate text such as a copyright paragraph. Example: <code>&lt;!ENTITY copyrightInfo "&lt;a href='copyright.htm'&gt;Copyright&lt;/a&gt; 2005 W3C."&gt;</code>
							</p></li></ol><p>Two aspects of entities are of particular importance with regard to internationalization and localization: entities are defined, and entities are used. For example, the snippet:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e661" id="d2e661"></a>Example 6: Entity declaration</div><p>
							<code>&lt;!ENTITY productName "pictoMagic for Windows"&gt;</code>
						</p></div><p>defines an entity called "productName", and the snippet</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e671" id="d2e671"></a>Example 7: Entity reference</div><p>
							<code>The latest version of &amp;productName; features many enhancements.</code>
						</p></div><p>references/uses the entity.</p><p>If internationalization and localization are not addressed for entity-related work several issues may arise:</p><ol class="depth1"><li><p>Entity reference cannot be resolved. Example: the definition is not available to the XML processor.</p></li><li><p>Entity definition does not fit with the surrounding context language-wise. Example: The context in "<code>Das Produkt &amp;productName; ist mit vielen Erweiterungen ausgestattet worden</code>" is German whereas the definition of the entity may be in English.</p></li><li><p>Entity definition does not fit with the surrounding context grammar-wise. Example: The syntax in "<code>The &amp;objectName; has been disabled.</code>" will work, in English, only if the value for &amp;objectName; is singular. If it is plural, "<code>has</code>" must be changed. In other languages "<code>The</code>" and "<code>disabled</code>" may also have to be adjusted.</p></li><li><p>In addition, even if the entity itself is translated there may be significant grammatical problems for inflected languages for nouns. The translation will inevitably follow the case of the original. For example, if the original is genitive, the translation is genitive as well (of course this requires that the original language and the translation language have a concept for "genitive").</p></li></ol><p>Since entities affect the content of the document, and XSLT processors and other kinds of XML processors act on the content, various processing-related issues may arise. An XSLT style sheet for example, which is sensitive to content contributed by an entity, may fail to work as expected (e.g. may not be able to generate the <code>alt</code> attribute for HTML pages).</p></div><div class="div3">
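<p>For illustration, the definition and the use of an entity can be combined in one small document with an internal DTD subset. This is only a sketch; the root element name <code>para</code> is hypothetical. It shows why the two aspects are hard to keep in step: the replacement text is declared in one place, often out of the translator's sight, and is inserted into running text in another.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: Entity definition and use in one document</div><div class="exampleInner"><pre>&lt;?xml version="1.0" encoding="UTF-8"?&gt;
&lt;!DOCTYPE para [
  &lt;!-- Entity definition: a translator may never see this declaration. --&gt;
  &lt;!ENTITY productName "pictoMagic for Windows"&gt;
]&gt;
&lt;!-- Entity use: the XML processor inserts the replacement text here. --&gt;
&lt;para&gt;The latest version of &amp;productName; features many enhancements.&lt;/para&gt;</pre></div></div>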
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="entities_notes" id="entities_notes"></a>3.5.2 Notes</h4><p>Ideally, the solution which the WG will produce will be applicable not only with regard to entities but also in the realm of XInclude <a title="XML&#xA;Inclusions (XInclude) Version 1.0" href="#xinclude">[XInclude]</a> or even fragments (<a title="XML&#xA;Fragment Interchange" href="#xmlfrag">[XFI]</a>, appendix B).</p><p>Note that character entity references (e.g. <code>&amp;aacute;</code>) and numeric character references (NCRs, e.g. <code>&amp;#x00E1;</code>) are different things. This requirement addresses character entity references, as well as all user defined entities.</p></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="langlocale" id="langlocale"></a>3.6 Identifying Language/Locale</h3><p>[R006] <em>Any document at its beginning should declare a language/locale that is applied to both main content and external content stored separately. While the language/locale may be declared for the whole document, when an element or a text span is in a different language/locale from the document-level language, it should be labeled appropriately. Therefore, DTD/Schema should allow any elements to have a language/locale specifying attribute. The language/locale declaration should use industry standard approaches.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="langlocale_desc" id="langlocale_desc"></a>3.6.1 Challenges</h4><p>Identifying languages (such as French and Spanish) and locales (such as Canadian French and Ecuadorian Spanish) is very important in rendering and processing document text and content properly since they provide specifications of language-dependent properties, such as hyphenation, text wrapping rules, color usage, fonts, spell checking, quotation marks and other punctuation, etc.</p><p>In order to simplify the parsing process by documentation and localization tools, there should be a declaration of a language/locale that is applied to the whole document as well as externalized content. This should be done as a document-level property. Meanwhile, as a document may contain content in multiple languages/locales, parts of the document need a language/locale attribute of their own. Such a local language/locale specification should be declared on an element or a span of text.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="langlocale_notes" id="langlocale_notes"></a>3.6.2 Notes</h4><p>Currently there are several different standards for language/locale 
specifications, such as <a title="Tags for the&#xA;Identification of Languages" href="#rfc1766">[RFC 1766]</a> and <a title="Tags for the&#xA;Identification of Languages" href="#rfc3066">[RFC 3066]</a>.
XML 1.0 prescribes a language identification attribute <code>xml:lang</code>
      (<a title="Extensible Markup&#xA;Language (XML) 1.0 (Third Edition)" href="#xml10spec">[XML 1.0]</a>, section 2.12, and <a title="XML 1.0 Third Edition&#xA;Specification Errata" href="#xml10spec_errata">[XML 1.0 Errata]</a>, E01).
There is also a technical standard from Unicode regarding the locale data markup language <a title="Locale Data Markup&#xA;Language (LDML)" href="#ldml">[LDML]</a>. 
ITS should carefully review these existing industry standards and clearly define 
what is a language/locale and its purpose in order to successfully meet this 
requirement.</p></div></div><div class="div2">
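<p>A minimal sketch of such a declaration, using the <code>xml:lang</code> attribute of XML 1.0, is shown below. The element names <code>doc</code>, <code>para</code> and <code>span</code> are hypothetical. The language is declared once on the root element and overridden locally for a span of text in a different language/locale.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: Document-level and local language declaration</div><div class="exampleInner"><pre>&lt;!-- The document-level language/locale is US English. --&gt;
&lt;doc xml:lang="en-US"&gt;
  &lt;!-- The span overrides the inherited value with Canadian French. --&gt;
  &lt;para&gt;The phrase &lt;span xml:lang="fr-CA"&gt;fin de semaine&lt;/span&gt; means "weekend".&lt;/para&gt;
&lt;/doc&gt;</pre></div></div>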
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="termid" id="termid"></a>3.7 Identifying Terms</h3><p>[R007] <em>It should be possible to identify terms inside an element or a span and to provide data for terminology management and index generation. Terms should be either associated with attributes for related term information or linked to external terminology data.</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="termid_desc" id="termid_desc"></a>3.7.1 Challenges</h4><p>The capability of specifying terms within the source content is important for terminology management that is beneficial to translation/localization quality. Terms to be identified include any domain-specific words and abbreviations for which translators need additional information in order to find appropriate concepts in their target languages. Term identification also facilitates the creation of glossaries and allows validation of terminology usage in the source and target documents.</p><p>Meanwhile, identified terms could be used for indexing that may require some language specific information. For example, Japanese words are sorted not by script characters, but by phonetic characters. Therefore when a Japanese index item is created, it should be accompanied by a phonetic string, called Yomigana.</p><p>As a result, terms may require various attributes, such as part of speech, gender, number, term types, definitions, notes on usage, etc. To avoid repeating such a large amount of attribute data within a document, it should be possible for identified terms to link to externalized attribute data, such as glossary documents and terminology databases.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="termid_notes" id="termid_notes"></a>3.7.2 Notes</h4><p>For more details, please see <a href="http://lists.w3.org/Archives/Public/public-i18n-its/2005JanMar/0069.html">discussions on term links at OASIS/XLIFF</a>.</p><p>The OSCAR/TBX working group is currently working on drafting the 
TBX-Link specification <a title="TermBase eXchange&#xA;Link (TBX Link) 1.0 Specification" href="#tbxlink">[TBX-Link]</a>.</p><p>This requirement is related to <a class="section-ref" href="#scenarios_terminology">Section 2.2: Terminology Creation and Translation</a>.</p></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="mapping" id="mapping"></a>3.8 Purpose Specification/Mapping</h3><p>[R008] <em>Currently, it does not appear realistic to expect that all XML vocabularies tag localization-relevant information identically (e.g. all use the "term" tag for terms). One way to take care of diverse localization-relevant markup in localization environments is a mapping mechanism which maps localization-relevant markup onto a canonical representation (such as the
Internationalization Tag Set).</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="mapping_desc" id="mapping_desc"></a>3.8.1 Challenges</h4><p>From a localization point of view, many XML vocabularies include markup which requires special attention, since the markup is associated with a specific type of content. Examples:</p><ul><li><p>elements which are associated with embedded/binary graphics</p></li><li><p>elements which are associated with specific text styles (e.g. underline and bold)</p></li><li><p>elements which are associated with linking (e.g. <code>a</code> in HTML)</p></li><li><p>elements which are associated with lists</p></li><li><p>elements which are associated with tables</p></li><li><p>elements which are associated with generated content (e.g. an element that fires a query to a database in order to pull in the data for a product catalogue)</p></li></ul><p>Here are some reasons why this type of markup may require special attention:</p><ul><li><p>the localization tool may be able to render specific text styles in a standard way (e.g. increased font weight for bold)</p></li><li><p>embedded binary images may have to follow a specific workflow</p></li><li><p>content generation queries may have to be adapted</p></li></ul><p>Since it is hardly imaginable that all content developers will be able to work with the same elements and attributes for this specific type of content, ITS should include markup which allows people to specify the purpose of specific elements.</p><p>Challenges arise for example from the fact that the 'source/original' vocabularies may vary widely with regard to the representation they choose for a specific data category (e.g. their markup related to graphics; see the <a href="http://lists.w3.org/Archives/Public/public-i18n-its/2005AprJun/0096.html">longer discussion of this</a>).</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="mapping_notes" id="mapping_notes"></a>3.8.2 Notes</h4><p>This requirement is related to the requirement <a class="section-ref" href="#impact">Section 3.14: Limited Impact</a>.</p><p>For the specific case of linking, an existing approach is worth examining: <a title="Link recognition&#xA;for the XHTML Family" href="#hlink">[HLink]</a>.</p><p>The approach may be used to support term identification. Suppose that an original document has the following:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e865" id="d2e865"></a>Example 8: Markup to map</div><p>
							<code>You can define multiple computation IDs for one company in the &lt;index sortstr="currency restatement"&gt;Currency Restatement&lt;/index&gt; program.</code>
						</p></div><p>If you want the <code>index</code> element to serve as an ITS "term", you could use the following mapping:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e878" id="d2e878"></a>Example 9: Mapping</div><div class="exampleInner"><pre>&lt;purposeSpec&gt;
 &lt;servesPurpose origVoc="index" its="term"/&gt;
&lt;/purposeSpec&gt;</pre></div></div><p>One question to answer is: How can existing attributes (e.g. <code>sortstr</code> in the sample above) be carried over, or how can new attributes (like <code>partOfSpeech</code>, <code>termType</code>) be introduced?</p></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="contstyle" id="contstyle"></a>3.9 Content Style</h3><p>This requirement might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>.</p><p>[R009] <em>It must be possible to specify content styles in a document in order to better qualify the contents for different linguistic purposes, such as localization.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="contstyle_desc" id="contstyle_desc"></a>3.9.1 Challenges</h4><p>Depending on target languages, source content could be translated with several different styles. A few examples are as follows:</p><ul><li><p>Italian uses an informal style for software help content and a formal style for user guides.</p></li><li><p>Japanese uses a polite style (<span xml:lang="ja" lang="ja">です・ます調</span> [Desu/masu] tone) for user guides and a formal style (<span xml:lang="ja" lang="ja">だ・である調</span> [Da/dearu] tone) for academic and legal content.</p></li></ul></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="contstyle_notes" id="contstyle_notes"></a>3.9.2 Notes</h4><p>Content styles and tones in a target language vary mostly depending on target audience (general users, academic experts, etc) and content’s domain (IT, legal, medical, etc). While the source language may be unaffected by such aspects, target content may need to use a specific content style.</p><p>Information about content styles is critical for the reusability of translations. For example, certain content from a user’s guide in Italian may not be appropriate for reuse in online help content, while the corresponding English content has no such issue.</p></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="linkedtext" id="linkedtext"></a>3.10 Link to Internal/External Text</h3><p>This requirement might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>.</p><p>The latest drafted text for this requirement can be found <a href="http://esw.w3.org/topic/its0504ReqLinkedText">here</a>.</p></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="bidi" id="bidi"></a>3.11 Bidirectional Text Support</h3><p>[R011] <em>Markup should be available to support the needs of bidirectional scripts.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="bidi_desc" id="bidi_desc"></a>3.11.1 Challenges</h4><p>Generally, the Unicode bidirectional algorithm orders mixed-script text appropriately for scripts such as Arabic and Hebrew. Sometimes, however, additional help is needed. For example, in the following phrase the 'W3C' and the comma should appear on the left side of the quotation. This cannot be achieved using the bidirectional algorithm alone.</p><p>The title says "פעילות הבינאום, W3C" in Hebrew.</p><p>The desired effect can be achieved using Unicode control characters, but this is not recommended (see the W3C Note and Unicode Technical Report Unicode in XML &amp; Other Markup Languages). Markup is needed to establish the default directionality of a document, and to change that where appropriate by creating nested embedding levels.</p><p>Markup is also applicable to disable the effects of the bidirectional algorithm for a specified range of text.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="bidi_notes" id="bidi_notes"></a>3.11.2 Notes</h4><p>See the following GEO documents for background:</p><ul><li><p><cite>Authoring Techniques for XHTML &amp; HTML Internationalization: Handling Bidirectional Text 1.0</cite> <a title="Authoring&#xA;Techniques for XHTML &amp; HTML Internationalization: Handling Bidirectional&#xA;Text 1.0" href="#biditech">[Bidi Technique]</a></p></li><li><p><cite>What you need to know about the bidi algorithm and inline markup.</cite> <a title="What you&#xA;need to know about the bidi algorithm and inline markup" href="#bidiinfo">[Bidi]</a></p></li></ul><p>It may be sensible, when considering implementation approaches, to follow the lead of the XHTML 2.0 specification <a title="XHTML™ 2.0" href="#bidixhtml2">[Bidi XHTML2]</a></p></div></div><div class="div2">
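<p>As a sketch of one possible approach, the phrase from the Challenges section could be marked up with the <code>dir</code> attribute that XHTML already provides. The choice of <code>p</code> and <code>span</code> as carrier elements is an assumption made for illustration; an ITS solution might use different markup. The paragraph sets the base direction to left-to-right, and the span creates a nested right-to-left embedding for the quotation, so that 'W3C' and the comma are ordered as part of the Hebrew text.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: Nested embedding level with the dir attribute</div><div class="exampleInner"><pre>&lt;!-- Characters are stored in logical (typing) order. --&gt;
&lt;p dir="ltr"&gt;The title says "&lt;span dir="rtl"&gt;פעילות הבינאום, W3C&lt;/span&gt;" in Hebrew.&lt;/p&gt;</pre></div></div>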
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="transinfo" id="transinfo"></a>3.12 Indicator of Translatability</h3><p>[R012] <em>Methods must exist to specify which parts of a document are to be 
translated and which are not.</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="transinfo_desc" id="transinfo_desc"></a>3.12.1 Challenges</h4><p>The content of XML documents can usually be seen as either generally translatable (e.g. an XHTML file), or generally not translatable (e.g. an <a title="Scalable Vector&#xA;Graphics (SVG) 1.1 Specification" href="#svgspec">[SVG]</a> file). A mechanism should exist to identify the parts of the document that are exceptions to the rule.</p><p>The mechanism should also allow for the specification of exceptions within exceptions. For example, within the elements of an <a title="Scalable Vector&#xA;Graphics (SVG) 1.1 Specification" href="#svgspec">[SVG]</a> document, which are generally not translatable, it should allow one to specify that <code>text</code> is to be translated, but also that some occurrences of the <code>text</code> element (e.g. with an attribute <code>translate="no"</code>) are not to be translated.</p><p>The mechanism should be able to map existing elements that already carry implicitly or explicitly the translatability information. 
Here are some examples of this:</p><ul><li><p>The <code>trademark</code> element in <a title="The&#xA;DocBook Document Type" href="#docbookspec">[DocBook]</a> may be an indicator of non-translatable content.</p></li><li><p>The <code>text</code> element in <a title="Scalable Vector&#xA;Graphics (SVG) 1.1 Specification" href="#svgspec">[SVG]</a> indicates translatable content.</p></li><li><p>The <code>translate</code> attribute in <a title="OASIS&#xA;Darwin Information Typing Architecture (DITA) Language Specification&#xA;v1.0" href="#ditaspec">[DITA]</a> is used to flag translatability.</p></li></ul><p>The mechanism should provide a way to delimit a portion of the content if such a mechanism does not exist in the original vocabulary (so that parts of the content can be marked as translatable or not).</p><p>The methods used to identify the translatable parts of a document should be usable by localization tools for both:</p><ul><li><p>Processing the document directly.</p></li><li><p>Generating localization properties settings files that can be used on all documents of the same document type.</p></li></ul></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="transinfo_notes" id="transinfo_notes"></a>3.12.2 Notes</h4><p>Part of this requirement is related to the "<a class="section-ref" href="#span">Section 3.2: Span-Like Element</a>" requirement.</p><p>Another part is related to the "<a class="section-ref" href="#mapping">Section 3.8: Purpose Specification/Mapping</a>" requirement.</p><p>There is a relationship between indicating the parts of the content that are to be translated and the parts of the content that are to be included in "<a class="section-ref" href="#metrics">Section 3.13: Metrics Count</a>".</p><p>Indicators of translatability may be used to help translation tools create localization properties files (i.e. tool settings describing how to handle a given type of document from the viewpoint of localization). They can also be used to complement the localization properties by adding information in document instances.</p><p>The information about the parts of a document that are translatable is not limited to localization. Such information can be used in other contexts. For instance, when implementing accessibility features, it can be used to identify content that needs to be processed differently from the rest of the document.</p></div></div><div class="div2">
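<p>The "exceptions within exceptions" case described in the Challenges section could look like the sketch below. The <code>translate</code> attribute is hypothetical here: it is not part of SVG 1.1 and only stands in for whatever indicator a solution might define. The label strings are invented for illustration. In a generally non-translatable SVG file, the first <code>text</code> element is translatable by rule, while the second is explicitly excluded.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: Exception within an exception in SVG</div><div class="exampleInner"><pre>&lt;svg xmlns="http://www.w3.org/2000/svg" width="200" height="80"&gt;
  &lt;!-- Exception to the rule: the content of text is translatable. --&gt;
  &lt;text x="10" y="30"&gt;Press the green button&lt;/text&gt;
  &lt;!-- Exception within the exception: this text must stay as it is. --&gt;
  &lt;text x="10" y="60" translate="no"&gt;pictoMagic&lt;/text&gt;
&lt;/svg&gt;</pre></div></div>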
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="metrics" id="metrics"></a>3.13 Metrics Count</h3><p>This requirement might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>.</p><p>The latest drafted text for this requirement can be found <a href="http://esw.w3.org/topic/its0505WordCount">here</a>.</p></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="impact" id="impact"></a>3.14 Limited Impact</h3><p>[R014] <em>All solutions proposed should be designed to have as little impact as possible on the tree structure of the original document and on the content models in the original schema.</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="impact_desc" id="impact_desc"></a>3.14.1 Challenges</h4><p>Inserting elements or attributes of a different namespace in an XML document can have side effects on various processing aspects. For example, the inserted nodes may:</p><ul><li><p>Break the XPath expressions already in use to access part of the document.</p></li><li><p>Interfere with <code>xsl:value-of</code> for extracting information.</p></li><li><p>Interfere with numbering and other aspects of styling the original document.</p></li></ul><p>Solutions for any of the ITS requirements must take into account these potential drawbacks and offer implementations that have limited impact on the original document and on the content models in the original schema.</p><p>For instance:</p><ul><li><p>Use attributes whenever possible (they have a lesser impact than elements). For example:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1127" id="d2e1127"></a>Example 10: Using an extra attribute</div><div class="exampleInner"><pre>&lt;table <strong>translate="no"&gt;</strong>
 &lt;tr&gt;...
&lt;/table&gt;</pre></div></div><p>is better than:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1137" id="d2e1137"></a>Example 11: Using an extra element</div><div class="exampleInner"><pre><strong>&lt;notrans&gt;</strong>
 &lt;table&gt;
  &lt;tr&gt;...
 &lt;/table&gt;
<strong>&lt;/notrans&gt;</strong></pre></div></div></li><li><p>Use data categories that already exist in the original markup by either mapping them to ITS concepts (see "<a class="section-ref" href="#mapping">Section 3.8: Purpose Specification/Mapping</a>") or by using them to carry ITS attributes. For example:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1151" id="d2e1151"></a>Example 12: Mapping concepts</div><div class="exampleInner"><pre><strong>&lt;info&gt;
 &lt;mapping target='quote' its='notrans'/&gt;
&lt;/info&gt;</strong>
...
&lt;para&gt;The motto of Québec is:
 &lt;quote&gt;"je me souviens"&lt;/quote&gt;.&lt;/para&gt;</pre></div></div></li><li><p>Group general ITS information in branches that are placed in locations where they have a minimal impact:</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1161" id="d2e1161"></a>Example 13: Information placement</div><div class="exampleInner"><pre>&lt;doc&gt;
 <strong>&lt;info&gt;
 ...
 &lt;/info&gt;</strong>
 &lt;header&gt;...
 &lt;body&gt;...</pre></div></div></li></ul></div><div class="div3">
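<p>As a rough illustration of the first side effect listed above, consider a hypothetical stylesheet that selects table rows with the XPath expression <code>/doc/table/tr</code>. The <code>doc</code> root element and the expression itself are assumptions made for this sketch; <code>notrans</code> is the wrapper element from Example 11.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: an inserted element changes what an existing XPath expression selects</div><div class="exampleInner"><pre>&lt;!-- Attribute approach: /doc/table/tr still selects the row. --&gt;
&lt;doc&gt;
 &lt;table translate="no"&gt;
  &lt;tr&gt;...&lt;/tr&gt;
 &lt;/table&gt;
&lt;/doc&gt;

&lt;!-- Element approach: table is no longer a child of doc, so
     /doc/table/tr selects nothing. The expression would have to
     be rewritten as /doc/notrans/table/tr. --&gt;
&lt;doc&gt;
 &lt;notrans&gt;
  &lt;table&gt;
   &lt;tr&gt;...&lt;/tr&gt;
  &lt;/table&gt;
 &lt;/notrans&gt;
&lt;/doc&gt;</pre></div></div>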
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="impact_notes" id="impact_notes"></a>3.14.2 Notes</h4><p>One possible solution, which has yet to be discussed, is for ITS to encompass not only a tag set but also a specification of processing steps for documents. One such step could be the separation of the document into namespace-specific sections. This would limit the side effects mentioned above.</p><p>The Namespace Routing Language <a title="Namespace Routing Language&#xA;(NRL)" href="#nrl">[NRL]</a> could be used for this purpose. The "Part 4: Namespace-based Validation Dispatching Language — NVDL" <a title="Document Schema Definition&#xA;Languages (DSDL) — Part 4: Namespace-based Validation Dispatching Language —&#xA;NVDL" href="#nvdl">[NVDL]</a> of the ISO/IEC 19757 proposal "Document Schema Definition Languages (DSDL)" <a title="ISO/IEC&#xA;19757 - DSDL, Document Schema Definition Languages" href="#dsdl">[DSDL]</a> relies mainly on NRL. The following example NRL document can be applied to XML documents with markup from the XHTML namespace and a fictitious ITS namespace. With the NRL document, the XML documents are validated only against the XHTML schema "<code>xhtml.rng</code>":</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1185" id="d2e1185"></a>Example 14: Using NRL with XHTML and ITS</div><div class="exampleInner"><pre>&lt;rules startMode="root"
 xmlns="http://www.thaiopensource.com/validate/nrl"&gt;
 &lt;mode name="root"&gt;
  &lt;namespace ns="http://www.w3.org/1999/xhtml"&gt;
   &lt;validate schema="xhtml.rng" useMode="xhtml"/&gt;
  &lt;/namespace&gt;
 &lt;/mode&gt;
 &lt;mode name="xhtml"&gt;
  &lt;namespace ns="http://www.example.org/its"&gt;
   &lt;unwrap/&gt;
  &lt;/namespace&gt;
  &lt;namespace ns="http://www.w3.org/1999/xhtml"&gt;
   &lt;attach/&gt;
  &lt;/namespace&gt;
 &lt;/mode&gt;
&lt;/rules&gt;</pre></div></div></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="transattr" id="transattr"></a>3.15 Attributes and Translatable Text</h3><p>[R015] <em>Provisions must be taken to ensure that attributes with translatable values do not impair the localization process.</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="transattr_desc" id="transattr_desc"></a>3.15.1 Challenges</h4><p>If translatable text is provided as an attribute value rather than element content, the following problems may arise:</p><ul><li><p>It is difficult to apply meta-information such as no-translate flags or designer's notes to the text of the attribute value (except when using mechanisms such as XPath or XPointer).</p></li><li><p>The difficulty of attaching unique identifiers to translatable attribute text makes it more complicated to use ID-based leveraging tools.</p></li><li><p>Translatable attributes can create problems when they are prepared for localization because they can occur within the content of a translatable element, breaking it into different parts, and possibly altering the sentence structure.</p></li><li><p>The language identification mechanism (i.e. xml:lang) applies to the content of the element where it is declared, including its attribute values. If the text of an attribute is in a different language than the text of the element content, one cannot set the language for both correctly.</p></li><li><p>In some languages, bidirectional markers may be needed to provide a correct display. Tags cannot be used within an attribute value. One can use Unicode control characters instead, but this is not recommended (see the W3C Note and Unicode Technical Report Unicode in XML &amp; Other Markup Languages).</p></li></ul><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1219" id="d2e1219"></a>Example 15: </div><p>In this example the no-translate flag applies to the content of the element, but not to the title text. The title text may benefit from ID-based leveraging, but has no ID. 
The xml:lang attribute, after translation, will only be relevant for the element content, not the title text.</p><div class="exampleInner"><pre>&lt;extract id="0517.1447" translate="no" xml:lang="en" 
 title="Ambiguous linguistic construct."&gt;The man hit the boy 
with the stick in the bathroom.&lt;/extract&gt;</pre></div></div><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1224" id="d2e1224"></a>Example 16: </div><p>In this example part of the alt-text value should be left untranslated (the name of the picture), but it is difficult to see how that would be expressed so that a machine translation tool would exhibit the correct behavior.</p><div class="exampleInner"><pre>&lt;image id="0517.1716" 
 alt-text="Catalog number 123: The Fish Wife"
 source="fishwife.png" /&gt;</pre></div></div><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1229" id="d2e1229"></a>Example 17: </div><p>In this example many translation tools would see the value of the alt attribute as embedded inside the sentence where the image is inserted, making the translation difficult.</p><div class="exampleInner"><pre>&lt;para&gt;Click the button  
&lt;image source="startnow.png" alt="Start Now!" /&gt; to register
now.&lt;/para&gt;</pre></div><p>Segmentation:</p><div class="exampleInner"><pre>"Click the button [code]Start Now![code] to register now."</pre></div></div></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="transattr_notes" id="transattr_notes"></a>3.15.2 Notes</h4><p>Whenever possible, a schema should ensure that translatable text is stored in elements rather than attributes.</p></div></div><div class="div2">
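<p>As a minimal sketch of this note, Example 15 could be reworked so that the title text becomes element content. The child element names <code>title</code> and <code>text</code> and the second <code>id</code> value are hypothetical and chosen only for illustration. Each piece of text can then carry its own identifier, translatability flag and language.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: translatable text moved from an attribute to an element</div><div class="exampleInner"><pre>&lt;extract id="0517.1447"&gt;
 &lt;!-- The title is now a node of its own, so it can have an ID,
      a translate flag and an xml:lang value. --&gt;
 &lt;title id="0517.1447.title" translate="yes" xml:lang="en"&gt;Ambiguous
  linguistic construct.&lt;/title&gt;
 &lt;text translate="no" xml:lang="en"&gt;The man hit the boy
  with the stick in the bathroom.&lt;/text&gt;
&lt;/extract&gt;</pre></div></div>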
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="naming" id="naming"></a>3.16 Naming Scheme</h3><p>[R016] <em>It should be possible for translation tools to rely on a predictable list of element and attribute names for a given type of document.</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="naming_desc" id="naming_desc"></a>3.16.1 Challenges</h4><p>Using documents where elements or attributes do not follow a predictable naming pattern may cause problems for translation tools. This is especially true if not all parts of the document are to be translated. In that case the rules to distinguish the translatable nodes from the non-translatable ones would be difficult to specify.</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1256" id="d2e1256"></a>Example 18: </div><p>The XML excerpt illustrates a naming scheme that is not conducive to localization because the list of elements cannot be easily codified through translation rules.</p><div class="exampleInner"><pre>&lt;Ok&gt;OK&lt;/Ok&gt; 
&lt;Cancel&gt;Cancel&lt;/Cancel&gt;
&lt;Message001&gt;Cannot open the file.&lt;/Message001&gt;</pre></div></div></div></div><div class="div2">
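<p>By contrast, the following sketch shows the same strings under a more predictable naming scheme. The <code>string</code> element and its <code>id</code> attribute are hypothetical names, not taken from any particular vocabulary. A tool would need only one rule here, for instance "the content of every <code>string</code> element is translatable", no matter how many messages are added later.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: a naming scheme with a fixed element name</div><div class="exampleInner"><pre>&lt;!-- The varying part of the name moves into an attribute value;
     the element name stays the same for every string. --&gt;
&lt;string id="Ok"&gt;OK&lt;/string&gt;
&lt;string id="Cancel"&gt;Cancel&lt;/string&gt;
&lt;string id="Message001"&gt;Cannot open the file.&lt;/string&gt;</pre></div></div>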
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="locnotes" id="locnotes"></a>3.17 Localization Notes</h3><p>[R017] <em>A method must exist for authors to communicate information to localizers about a particular item of content.</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="locnotes_desc" id="locnotes_desc"></a>3.17.1 Challenges</h4><p>To help the translator achieve a correct translation, authors may need to provide information about the text that they have written. For example, the author may want to:</p><ul><li><p>tell the translator how to translate part of the content</p></li><li><p>expand on the meaning or contextual usage of a particular element, such as what a variable refers to or how a string will be used on the UI</p></li><li><p>clarify ambiguity and show relationships between items sufficiently to allow correct translation (e.g. in many languages it is impossible to translate the word 'enabled' in isolation without knowing the gender, number and case of the thing it refers to.)</p></li><li><p>explain why text is not translated, point to text reuse, or describe the use of conditional text</p></li><li><p>indicate why a piece of text is emphasized (important, sarcastic, etc.)</p></li></ul><p>This can help translators avoid mistakes or avoid spending time searching for information. Two types of informative notes are needed:</p><ol class="depth1"><li><p>An alert contains information that the translator MUST read before translating a piece of text. Example: an instruction to the translator to leave parts of the text in the source language.</p></li><li><p>A description provides useful background information that the translator will refer to only if they wish. Example: a clarification of ambiguity in the source text.</p></li></ol><p>The relationship between a note and the data to which it pertains should be unambiguous.</p></div></div><div class="div2">
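<p>The two note types described above could be sketched as follows. All element and attribute names here (<code>messages</code>, <code>msg</code>, <code>locnote</code>, <code>ref</code>, <code>type</code>) and the product name are hypothetical and serve only to show one possible shape. Each note points at the <code>id</code> of the item it applies to, so that the relationship between note and data stays unambiguous.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: an alert and a description attached to specific items</div><div class="exampleInner"><pre>&lt;messages&gt;
 &lt;msg id="m1"&gt;Welcome to ProductX Setup.&lt;/msg&gt;
 &lt;!-- Alert: must be read before translating m1. --&gt;
 &lt;locnote ref="m1" type="alert"&gt;Leave "ProductX" in the
  source language.&lt;/locnote&gt;

 &lt;msg id="m2"&gt;Enabled&lt;/msg&gt;
 &lt;!-- Description: optional background for m2. --&gt;
 &lt;locnote ref="m2" type="description"&gt;Status label shown next
  to a single printer in the device list.&lt;/locnote&gt;
&lt;/messages&gt;</pre></div></div>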
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="whitespaces" id="whitespaces"></a>3.18 Handling of White-Spaces</h3><p>This requirement might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>.</p><p>The latest drafted text for this requirement can be found <a href="http://esw.w3.org/topic/its0505ReqWhiteSpaces">here</a>.</p></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="multilang" id="multilang"></a>3.19 Multilingual Documents</h3><p>[R019] <em>Careful consideration must be given when designing XML documents that include the same content in multiple languages.</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="multilang_desc" id="multilang_desc"></a>3.19.1 Challenges</h4><p>Creating a document with content in multiple languages is easily done with XML, as shown below.</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1328" id="d2e1328"></a>Example 19: </div><p>Same text stored in multiple languages in the same document:</p><div class="exampleInner"><pre>&lt;para&gt;  
 &lt;text xml:lang="en"&gt;My hovercraft is full of eels.&lt;/text&gt;
 &lt;text xml:lang="fr"&gt;Mon aéroglisseur est plein d'anguilles.&lt;/text&gt;
 &lt;text xml:lang="hu"&gt;Légpárnás hajóm tele van angolnákkal.&lt;/text&gt;
 &lt;text xml:lang="ja"&gt;私のホバークラフトは鰻で一杯です。&lt;/text&gt;
 &lt;text xml:lang="pl"&gt;Mój poduszkowiec jest pełen węgorzy.&lt;/text&gt;
 &lt;text xml:lang="es"&gt;Mi aerodeslizador está lleno de anguilas.&lt;/text&gt;
 &lt;text xml:lang="zh-HK"&gt;我隻氣墊船裝滿晒鱔.&lt;/text&gt;
 &lt;text xml:lang="zh-TW"&gt;我的氣墊船充滿了鱔魚 [我的气垫船充满了鳝鱼]&lt;/text&gt;
&lt;/para&gt;</pre></div></div><p>However, one must be careful with this kind of document because, from the viewpoint of localization, it may be difficult to handle for translation.</p><p>For example, when source material is provided in a multilingual document and the different translations should go in the same document, it will be difficult to do concurrent translation in all languages as the translators are likely to be different for each language. This means the document will have to be broken down into separate pairs of languages (the source and one target) and reconstructed later on, adding time, cost and an opportunity for errors.</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="multilang_notes" id="multilang_notes"></a>3.19.2 Notes</h4><p>Some XML documents are designed for multilingual functions and can be used as is without problems. Examples of such formats are XLIFF and TMX.</p><p>Note that multilingual documents where the different languages are for different content (e.g. a citation in German within a document in Spanish) do not have the challenges described here.</p></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="annomark" id="annomark"></a>3.20 Annotation Markup</h3><p>[R020] <em>There must be a way to support markup of text annotations of the 'ruby' type.</em>
				</p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="annomark_desc" id="annomark_desc"></a>3.20.1 Challenges</h4><p>XHTML 1.1 contains a [Ruby Annotation] module that provides markup for phonetic or semantic annotation of text such as is common in Far Eastern scripts for Japanese and Chinese. (Ruby is known as <em>furigana</em> in Japan.) A standard mechanism should be proposed to support such annotations.</p><p>This annotation mechanism should not be limited to use for Japanese and Chinese.</p><p>To support Far Eastern text usage, a single annotation text for a given base text is most common. Occasionally, however, two annotations per base text are appropriate.</p><p>The Ruby Annotation specification also divides its markup into simple and complex forms, allowing a choice for implementation support. We should probably also allow for this, although we should investigate whether the division is drawn appropriately by the Ruby Annotation specification. For example, we could envisage a simple ruby model, a model that allows two annotations, and another that allows for table-like groupings of elements in a single or double annotation approach.</p><p>As per the Ruby Annotation specification, a fallback mechanism (i.e. the equivalent of &lt;ruby-parenthesis&gt;) should also be specified.</p></div><div class="div3">
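<p>For orientation, simple ruby markup with the fallback parentheses looks roughly as follows in the XHTML Ruby Annotation module. The base text and annotation are sample values chosen for this sketch. A user agent that does not support ruby ignores the unknown elements and shows their content in sequence, that is, the base text followed by the annotation in parentheses.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: simple ruby with <code>rp</code> fallback</div><div class="exampleInner"><pre>&lt;ruby&gt;
 &lt;rb&gt;WWW&lt;/rb&gt;                &lt;!-- base text --&gt;
 &lt;rp&gt;(&lt;/rp&gt;                  &lt;!-- fallback opening parenthesis --&gt;
 &lt;rt&gt;World Wide Web&lt;/rt&gt;     &lt;!-- annotation (ruby text) --&gt;
 &lt;rp&gt;)&lt;/rp&gt;                  &lt;!-- fallback closing parenthesis --&gt;
&lt;/ruby&gt;</pre></div></div>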
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="annomark_notes" id="annomark_notes"></a>3.20.2 Notes</h4><p>We should probably start a proposal by looking closely at the Ruby Annotation specification. However, criticism has been leveled at this specification from some quarters because it is very presentation-oriented. We may therefore need to address this.</p><p>The Ruby module of CSS3 will provide styling to indicate the expected behaviour of the base and annotation text.</p><p>When we come to investigating solutions, the following article by Masayasu Ishikawa will be worth consulting: <a title="Implementing&#xA;the Ruby Module" href="#implruby">[Ruby Impl]</a></p></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="datetime" id="datetime"></a>3.21 Identifying Date and Time</h3><p>This requirement might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>.</p><p>The latest drafted text for this requirement can be found <a href="http://esw.w3.org/topic/its0506ReqDateTime">here</a>.</p></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="nestedelems" id="nestedelems"></a>3.22 Nested Elements</h3><p>[R022] <em>Great care must be taken when defining or using nested translatable elements.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="nestedelems_desc" id="nestedelems_desc"></a>3.22.1 Challenges</h4><p>An XML format can allow the recursive nesting of the same elements. In some cases, such structures may make it difficult for translation tools to segment or extract the translatable text.</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1404" id="d2e1404"></a>Example 20: The <code>note</code> element in <a title="Open&#xA;Document Format for Office Applications (OpenDocument) v1.0" href="#odfspec">[OpenDocument]</a>:</div><p>A <code>text:p</code> element can contain a <code>text:note</code> element. The <code>text:note</code> includes a <code>text:note-body</code> element, which in turn can contain one or more <code>text:p</code> elements. Other constructs, such as <code>office:annotation</code> elements, can also be found in paragraphs, allowing for possibly complex nesting combinations.</p><div class="exampleInner"><pre>&lt;text:p text:style-name="P1"&gt;
  Palouse horse
  &lt;text:note text:id="ftn0" text:note-class="footnote"&gt;
   &lt;text:note-citation&gt;2&lt;/text:note-citation&gt; 
   &lt;text:note-body&gt;
    &lt;text:p text:style-name="Footnote"&gt;A Palouse horse is the same as an Appaloosa.&lt;/text:p&gt; 
    &lt;text:p text:style-name="Footnote"&gt;The Nez Perce 
     &lt;office:annotation&gt;
      &lt;dc:date&gt;2006-04-26T00:00:00&lt;/dc:date&gt; 
      &lt;text:p&gt;The native's name "Ni-Mii-Puu" means "the People".&lt;/text:p&gt; 
     &lt;/office:annotation&gt;Indians of the inland Northwest deserve much of the credit for 
the Appaloosa horses we have today.&lt;/text:p&gt;
    &lt;/text:note-body&gt;
   &lt;/text:note&gt; have spotted coats. 
  &lt;/text:p&gt;</pre></div></div><p>Such nesting combinations may be difficult to handle during localization.</p></div></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="lingml" id="lingml"></a>3.23 Linguistic Markup</h3><p>This requirement might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>.</p><p>The latest drafted text for this requirement can be found <a href="http://esw.w3.org/topic/its0908LinguisticMarkup">here</a>.</p></div><div class="div2">
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="variables" id="variables"></a>3.24 Variables</h3><p>This requirement might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>.</p><p>[R024] <em>Software text often includes placeholders for variables that are inserted at runtime and may have an effect on how the text around them should be translated. It should be possible to identify such elements and label them with appropriate information so they can be translated correctly.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="variables_desc" id="variables_desc"></a>3.24.1 Challenges</h4><p>A number of challenges come along with variables. Some of them are:</p><ul><li><p>The text of the variable has gender- or number-specific information that needs to be known for a correct translation of the text around the reference.</p></li><li><p>The size of the text that contains the variable may be limited, making it necessary to know the maximum size of the text of the variable in order to verify the length of the final text.</p></li></ul><p>Additional information on the issues regarding variables can be found in the article <a title="Working&#xA;with Composite Messages" href="#compmsg">[Composite Messages]</a>.</p></div></div><div class="div2">
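<p>One way to picture the labeling asked for in this requirement is sketched below. The <code>msg</code> and <code>var</code> elements and all of their attributes are hypothetical names invented for this illustration. The placeholder carries a description of what will be inserted, its grammatical number, and a maximum length, so that a translator or a checking tool can take them into account.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: a labeled variable placeholder</div><div class="exampleInner"><pre>&lt;!-- maxchars on msg limits the final string; maxchars on var
     states the longest value that may be inserted at runtime. --&gt;
&lt;msg id="m7" maxchars="60"&gt;The
 &lt;var name="device" desc="name of one output device"
      number="singular" maxchars="20"/&gt;
 is disabled.&lt;/msg&gt;</pre></div></div>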
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="elemseg" id="elemseg"></a>3.25 Elements and Segmentation</h3><p>[R025] <em>Methods, independent of the semantics of the elements, must exist to provide hints on how to break down document content into meaningful runs of text.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="elemseg_desc" id="elemseg_desc"></a>3.25.1 Challenges</h4><p>Many applications that process content for linguistic-related tasks need to be able to perform a basic segmentation. They need to be able to do this without knowing about the semantics of the elements. The elements marking up the document content should provide generic clues to help such processing.</p><p>Several types of information are needed:</p><ol class="depth1"><li><p>A way to distinguish:</p><ol class="depth2"><li><p>elements that may hold text content</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1501" id="d2e1501"></a>Example 21: The element <code>p</code> may hold text:</div><div class="exampleInner"><pre>&lt;p&gt; 
 &lt;b&gt;This is bold.&lt;/b&gt;
 &lt;i&gt;This is italic.&lt;/i&gt;
&lt;/p&gt;</pre></div></div></li><li><p>from elements that never have text content</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1512" id="d2e1512"></a>Example 22: The element <code>ul</code> should not hold text:</div><div class="exampleInner"><pre>&lt;ul&gt; 
 &lt;li&gt;This is the first item.&lt;/li&gt;
 &lt;li&gt;This is the second item.&lt;/li&gt;
&lt;/ul&gt;</pre></div></div></li><li><p>from elements that break the content</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1523" id="d2e1523"></a>Example 23: The <code>text:line-break</code> element in <a title="Open&#xA;Document Format for Office Applications (OpenDocument) v1.0" href="#odfspec">[OpenDocument]</a> may break a paragraph:</div><div class="exampleInner"><pre>&lt;text:p text:style-name="Standard"&gt;
 Palouse horses have spotted coats.&lt;text:line-break/&gt;
 (A Palouse horse is the same as an Appaloosa)&lt;/text:p&gt;</pre></div></div></li></ol></li><li><p>A way to distinguish:</p><ol class="depth2"><li><p>independent text content that is nested within another content</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1540" id="d2e1540"></a>Example 24: A footnote in <a title="OASIS&#xA;Darwin Information Typing Architecture (DITA) Language Specification&#xA;v1.0" href="#ditaspec">[DITA]</a>. The text in <code>fn</code> is distinct from the text of <code>p</code>:</div><div class="exampleInner"><pre>&lt;p&gt;Palouse horses&lt;fn callout="#"&gt;A Palouse horse is  
the same as an Appaloosa.&lt;/fn&gt; have spotted coats.&lt;/p&gt;</pre></div></div><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1552" id="d2e1552"></a>Example 25: A note in <a title="Open&#xA;Document Format for Office Applications (OpenDocument) v1.0" href="#odfspec">[OpenDocument]</a>, more complex:</div><div class="exampleInner"><pre>&lt;text:p text:style-name="Standard"&gt;
 Palouse horses
 &lt;text:note text:id="ftn1" text:note-class="footnote"&gt;
  &lt;text:note-citation&gt;1&lt;/text:note-citation&gt;
  &lt;text:note-body&gt;
   &lt;text:p text:style-name="Footnote"&gt;
A Palouse horse is the same as an Appaloosa.&lt;/text:p&gt;
  &lt;/text:note-body&gt;
 &lt;/text:note&gt;
 have spotted coats.&lt;/text:p&gt;</pre></div></div><p>Both examples above correspond to two distinct text runs:</p><ul><li><p>"Palouse horses have spotted coats."</p></li><li><p>"A Palouse horse is the same as an Appaloosa."</p></li></ul></li><li><p>from text content that is part of its parent element's content</p><div class="exampleOuter"><div class="exampleHeader"><a name="d2e1571" id="d2e1571"></a>Example 26: The text in <code>term</code> is part of the text of <code>p</code></div><div class="exampleInner"><pre>&lt;p&gt;&lt;term&gt;Palouse horses&lt;/term&gt; 
have spotted coats.&lt;/p&gt;</pre></div></div><p>This corresponds to a single text run:</p><ul><li><p>"Palouse horses have spotted coats."</p></li></ul></li></ol></li></ol><p>A processor should be able to obtain such information from an explicit method or infer it from the context.</p></div></div><div class="div2">
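<p>The distinctions above could, for instance, be captured in a small set of per-element hints that a tool reads before segmenting. The <code>rules</code> and <code>rule</code> elements and the attribute names and values below are hypothetical and do not come from any existing specification. The element names being classified are those used in Examples 21 to 26.</p><div class="exampleOuter"><div class="exampleHeader">Illustrative sketch: segmentation hints expressed per element</div><div class="exampleInner"><pre>&lt;rules&gt;
 &lt;!-- 1. Elements that may or may not hold text, or that break it --&gt;
 &lt;rule element="p"               holds-text="yes"/&gt;
 &lt;rule element="ul"              holds-text="no"/&gt;
 &lt;rule element="text:line-break" breaks-text="yes"/&gt;

 &lt;!-- 2. Nested content: an independent run, or part of the parent run --&gt;
 &lt;rule element="fn"              within-parent-text="no"/&gt;
 &lt;rule element="term"            within-parent-text="yes"/&gt;
&lt;/rules&gt;</pre></div></div>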
<h3><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="objects" id="objects"></a>3.26 Associated Objects</h3><p>This requirement might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>.</p><p>[R026] <em>A mechanism should exist to attach information to the object associated with XML attribute or element nodes rather than the text in the nodes.</em></p><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="objects_desc" id="objects_desc"></a>3.26.1 Challenges</h4><p>In certain cases, it may be necessary to attach information to objects associated with element or attribute nodes rather than their text. For example, an information architect may have to express that all images (which in his XML vocabulary are referenced via a <code>src</code> attribute on an <code>img</code> element) have to be localized (for instance, because they are only valid for a certain culture).</p></div><div class="div3">
<h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="objects_notes" id="objects_notes"></a>3.26.2 Notes</h4><p>The mechanism used to select an object associated with a node should be different from the mechanism used to select the text of the same node. Furthermore, this requires specific thought about precedence and inheritance.</p></div></div></div></div><div class="back"><div class="div1">
<h2><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="sec-bibliography" id="sec-bibliography"></a>A References (Non-Normative)</h2><dl><dt class="label"><a name="bidiinfo" id="bidiinfo" />Bidi</dt><dd>Richard Ishida.
<a href="http://www.w3.org/International/articles/inline-bidi-markup/"><cite>What you
need to know about the bidi algorithm and inline markup</cite></a>, W3C
Internationalization FAQ. Available at
<a href="http://www.w3.org/International/articles/inline-bidi-markup/">http://www.w3.org/International/articles/inline-bidi-markup/</a>.
</dd><dt class="label"><a name="biditech" id="biditech" />Bidi Technique</dt><dd>Richard Ishida.
<a href="http://www.w3.org/TR/i18n-html-tech-bidi/#ri20030728.094313871"><cite>Authoring
Techniques for XHTML &amp; HTML Internationalization: Handling Bidirectional
Text 1.0</cite></a>, W3C Working Draft 9 May 2004. Available at
<a href="http://www.w3.org/TR/i18n-html-tech-bidi/#ri20030728.094313871">http://www.w3.org/TR/i18n-html-tech-bidi/#ri20030728.094313871</a>.
The latest version of
<a href="http://www.w3.org/TR/i18n-html-tech-bidi/">Bidi
Technique</a> is available at
http://www.w3.org/TR/i18n-html-tech-bidi/.</dd><dt class="label"><a name="bidixhtml2" id="bidixhtml2" />Bidi XHTML2</dt><dd>Jonny Axelsson, Mark Birbeck, et
al. editors. <a href="http://www.w3.org/TR/2005/WD-xhtml2-20050527/"><cite>XHTML™ 2.0</cite></a>, W3C
Working Draft 27 May 2005. Available at
<a href="http://www.w3.org/TR/2005/WD-xhtml2-20050527/">http://www.w3.org/TR/2005/WD-xhtml2-20050527/</a>.
See
<a href="http://www.w3.org/TR/2005/WD-xhtml2-20050527/mod-bidi.html#s_bidimodule">Section
15: XHTML Bi-directional Text Attribute Module</a>. The latest version of <a href="http://www.w3.org/TR/xhtml2/">XHTML2</a> is available at http://www.w3.org/TR/xhtml2/.</dd><dt class="label"><a name="compmsg" id="compmsg" />Composite Messages</dt><dd>Richard Ishida,
<a href="http://www.w3.org/International/articles/composite-messages/"><cite>Working
with Composite Messages</cite></a>, Article of the
<a href="http://www.w3.org/International/">W3C Internationalization
Activity</a>, March 2006.</dd><dt class="label"><a name="ditaspec" id="ditaspec" />DITA</dt><dd>Michael Priestley, JoAnn Hackos, et al., editors.
<a href="http://www.oasis-open.org/committees/download.php/15316/dita10.zip"><cite>OASIS
Darwin Information Typing Architecture (DITA) Language Specification
v1.0</cite></a>, OASIS Standard 9 May 2005. Available at
<a href="http://www.oasis-open.org/committees/download.php/15316/dita10.zip">http://www.oasis-open.org/committees/download.php/15316/dita10.zip</a>.</dd><dt class="label"><a name="docbookspec" id="docbookspec" />DocBook</dt><dd>Norman Walsh, editor.
<a href="http://www.docbook.org/specs/cs-docbook-docbook-4.2.pdf"><cite>The
DocBook Document Type</cite></a>, OASIS Committee Specification, 16 July 2002.
Available at
<a href="http://www.docbook.org/specs/cs-docbook-docbook-4.2.pdf">http://www.docbook.org/specs/cs-docbook-docbook-4.2.pdf</a>.</dd><dt class="label"><a name="dsdl" id="dsdl" />DSDL</dt><dd>ISO/IEC. <a href="http://dsdl.org/"><cite>ISO/IEC
19757 - DSDL, Document Schema Definition Languages</cite></a>. Available at
<a href="http://dsdl.org/">http://dsdl.org/</a>. </dd><dt class="label"><a name="hlink" id="hlink" />HLink</dt><dd>Steven Pemberton, Masayasu Ishikawa, editors.
<a href="http://www.w3.org/TR/2002/WD-hlink-20020913/"><cite>Link recognition
for the XHTML Family</cite></a>, W3C Working Draft 13 September 2002. Available
at <a href="http://www.w3.org/TR/2002/WD-hlink-20020913/">http://www.w3.org/TR/2002/WD-hlink-20020913/</a>.
The latest version of <a href="http://www.w3.org/TR/hlink/">HLink</a> is available at
http://www.w3.org/TR/hlink/.</dd><dt class="label"><a name="geo-i18n-l10n" id="geo-i18n-l10n" />i18n l10n</dt><dd>Richard Ishida, Susan Miller.
<a href="http://www.w3.org/International/questions/qa-i18n"><cite>Localization
vs. Internationalization</cite></a>, Article of the
<a href="http://www.w3.org/International/">W3C Internationalization
Activity</a>, January 2006.</dd><dt class="label"><a name="itsspec" id="itsspec" />ITS</dt><dd>Christian Lieske, Felix
Sasaki, editors. <a href="http://www.w3.org/TR/2006/WD-its-20060518/"><cite>Internationalization Tag Set (ITS)</cite></a>,
W3C Working Draft 18 May 2006. Available at
<a href="http://www.w3.org/TR/2006/WD-its-20060518/">http://www.w3.org/TR/2006/WD-its-20060518/</a>. The latest version of <a href="http://www.w3.org/TR/its/">ITS</a> is available at http://www.w3.org/TR/its/.
</dd><dt class="label"><a name="ldml" id="ldml" />LDML</dt><dd>Mark Davis,
<a href="http://unicode.org/reports/tr35/tr35-5.html"><cite>Locale Data Markup
Language (LDML)</cite></a>, Unicode Technical Standard #35. Available at
<a href="http://unicode.org/reports/tr35/tr35-5.html">http://unicode.org/reports/tr35/tr35-5.html</a>.
The latest version of <a href="http://unicode.org/reports/tr35/">LDML</a> is available at
http://unicode.org/reports/tr35/. </dd><dt class="label"><a name="nrl" id="nrl" />NRL</dt><dd>James Clark,
<a href="http://www.thaiopensource.com/relaxng/nrl.html"><cite>Namespace Routing Language
(NRL)</cite></a>, Thai Open Source Software Center Ltd 2003-06-13. Available at
<a href="http://www.thaiopensource.com/relaxng/nrl.html">http://www.thaiopensource.com/relaxng/nrl.html</a>.
</dd><dt class="label"><a name="nvdl" id="nvdl" />NVDL</dt><dd>ISO/IEC JTC 1/SC 34.
<a href="http://dsdl.org/0525.pdf"><cite>Document Schema Definition
Languages (DSDL) — Part 4: Namespace-based Validation Dispatching Language —
NVDL</cite></a>, 2004-05-31. Available at
<a href="http://dsdl.org/0525.pdf">http://dsdl.org/0525.pdf</a>.
</dd><dt class="label"><a name="odfspec" id="odfspec" />OpenDocument</dt><dd>Michael Brauer, Patrick
Durusau, et al., editors.
<a href="http://www.oasis-open.org/committees/download.php/12572/OpenDocument-v1.0-os.pdf"><cite>Open
Document Format for Office Applications (OpenDocument) v1.0</cite></a>, OASIS
Standard 1 May 2005. Available at
<a href="http://www.oasis-open.org/committees/download.php/12572/OpenDocument-v1.0-os.pdf">http://www.oasis-open.org/committees/download.php/12572/OpenDocument-v1.0-os.pdf</a>.</dd><dt class="label"><a name="rfc1766" id="rfc1766" />RFC 1766</dt><dd>H. Alvestrand, editor.
<a href="http://www.ietf.org/rfc/rfc1766.txt"><cite>Tags for the
Identification of Languages</cite></a>, IETF March 1995. Available at
<a href="http://www.ietf.org/rfc/rfc1766.txt">http://www.ietf.org/rfc/rfc1766.txt</a>.
</dd><dt class="label"><a name="rfc3066" id="rfc3066" />RFC 3066</dt><dd>H. Alvestrand, editor.
<a href="http://www.ietf.org/rfc/rfc3066.txt"><cite>Tags for the
Identification of Languages</cite></a>, IETF January 2001. Available at
<a href="http://www.ietf.org/rfc/rfc3066.txt">http://www.ietf.org/rfc/rfc3066.txt</a>.
</dd><dt class="label"><a name="rfc3066bis" id="rfc3066bis" />RFC 3066bis</dt><dd>Addison Phillips, Mark Davis,
editors.
<a href="http://www.ietf.org/internet-drafts/draft-ietf-ltru-registry-14.txt"><cite>Tags
for Identifying Languages</cite></a>, draft-ietf-ltru-registry-14.txt.
Available at
<a href="http://www.ietf.org/internet-drafts/draft-ietf-ltru-registry-14.txt">http://www.ietf.org/internet-drafts/draft-ietf-ltru-registry-14.txt</a>.
</dd><dt class="label"><a name="whatisruby" id="whatisruby" />Ruby</dt><dd>Richard Ishida.
<a href="http://www.w3.org/International/questions/qa-ruby"><cite>What is
Ruby?</cite></a>, W3C Internationalization FAQ. Available at
<a href="http://www.w3.org/International/questions/qa-ruby">http://www.w3.org/International/questions/qa-ruby</a>.
</dd><dt class="label"><a name="implruby" id="implruby" />Ruby Impl</dt><dd>Masayasu Ishikawa,
<a href="http://www.w3.org/People/mimasa/test/schemas/NOTE-ruby-implementation"><cite>Implementing
the Ruby Module</cite></a>, Personal Note, 14 July 2005. Available at
<a href="http://www.w3.org/People/mimasa/test/schemas/NOTE-ruby-implementation">http://www.w3.org/People/mimasa/test/schemas/NOTE-ruby-implementation</a>.</dd><dt class="label"><a name="schematron" id="schematron" />Schematron</dt><dd><a href="http://www.schematron.com/"><cite>Schematron - A Language for Making Assertions about Patterns Found in XML Documents</cite></a>.
Available at <a href="http://www.schematron.com/">http://www.schematron.com/</a>.
</dd><dt class="label"><a name="svgspec" id="svgspec" />SVG</dt><dd>Jon Ferraiolo, 藤沢
淳 (Fujisawa Jun), Dean Jackson, editors.
<a href="http://www.w3.org/TR/2003/REC-SVG11-20030114/"><cite>Scalable Vector
Graphics (SVG) 1.1 Specification</cite></a>, W3C Recommendation 14 January
2003. Available at
<a href="http://www.w3.org/TR/2003/REC-SVG11-20030114/">http://www.w3.org/TR/2003/REC-SVG11-20030114/</a>.
The latest version is available at <a href="http://www.w3.org/TR/SVG11/">http://www.w3.org/TR/SVG11/</a>.</dd><dt class="label"><a name="tbxlink" id="tbxlink" />TBX-Link</dt><dd>Alan K. Melby, Andrzej Zydroń,
editors. <a href="http://www.lisa.org/standards/tbxlink/tbxlink.html"><cite>TermBase eXchange
Link (TBX Link) 1.0 Specification</cite></a>, Initial Draft 0.1. Available at
<a href="http://www.lisa.org/standards/tbxlink/tbxlink.html">http://www.lisa.org/standards/tbxlink/tbxlink.html</a>.
</dd><dt class="label"><a name="xmlfrag" id="xmlfrag" />XFI</dt><dd>Paul Grosso, Daniel Veillard, editors.
<a href="http://www.w3.org/TR/2001/CR-xml-fragment-20010212"><cite>XML
Fragment Interchange</cite></a>, W3C Candidate Recommendation 12 February 2001.
Available at
<a href="http://www.w3.org/TR/2001/CR-xml-fragment-20010212">http://www.w3.org/TR/2001/CR-xml-fragment-20010212</a>.
The latest version of <a href="http://www.w3.org/TR/xml-fragment">XFI</a> is available at
http://www.w3.org/TR/xml-fragment. </dd><dt class="label"><a name="xinclude" id="xinclude" />XInclude</dt><dd>Jonathan Marsh, David Orchard, editors.
<a href="http://www.w3.org/TR/2004/REC-xinclude-20041220/"><cite>XML
Inclusions (XInclude) Version 1.0</cite></a>, W3C Recommendation 20 December
2004. Available at
<a href="http://www.w3.org/TR/2004/REC-xinclude-20041220/">http://www.w3.org/TR/2004/REC-xinclude-20041220/</a>.
The latest version of <a href="http://www.w3.org/TR/xinclude/">XInclude</a> is available at
http://www.w3.org/TR/xinclude/. </dd><dt class="label"><a name="xml10spec" id="xml10spec" />XML 1.0</dt><dd>Tim
Bray, Jean Paoli, C.M. Sperberg-McQueen, et al., editors.
<a href="http://www.w3.org/TR/2004/REC-xml-20040204/"><cite>Extensible Markup
Language (XML) 1.0 (Third Edition)</cite></a>, W3C Recommendation 04 February
2004. Available at <a href="http://www.w3.org/TR/2004/REC-xml-20040204/">http://www.w3.org/TR/2004/REC-xml-20040204/</a>.
The latest version of <a href="http://www.w3.org/TR/REC-xml/">XML
1.0</a> is available at http://www.w3.org/TR/REC-xml/. </dd><dt class="label"><a name="xml10spec_errata" id="xml10spec_errata" />XML 1.0 Errata</dt><dd>W3C.
<a href="http://www.w3.org/XML/xml-V10-3e-errata"><cite>XML 1.0 Third Edition
Specification Errata</cite></a>. Available at
<a href="http://www.w3.org/XML/xml-V10-3e-errata">http://www.w3.org/XML/xml-V10-3e-errata</a>.
</dd><dt class="label"><a name="xml-i18n-bp" id="xml-i18n-bp" />XML i18n BP</dt><dd>Yves Savourel, Diane Stoick, editors.
<a href="http://www.w3.org/TR/2006/WD-xml-i18n-bp-20060518/"><cite>Best
Practices for XML Internationalization</cite></a>, W3C Working Draft 18 May
2006. Available at
<a href="http://www.w3.org/TR/2006/WD-xml-i18n-bp-20060518/">http://www.w3.org/TR/2006/WD-xml-i18n-bp-20060518/</a>.
The latest version of <a href="http://www.w3.org/TR/xml-i18n-bp/">xml-i18n-bp</a> is available at
http://www.w3.org/TR/xml-i18n-bp/.</dd><dt class="label"><a name="xmlid" id="xmlid" />XML ID</dt><dd>Jonathan Marsh, Daniel Veillard,
Norman Walsh, editors.
<a href="http://www.w3.org/TR/2005/REC-xml-id-20050909/"><cite>xml:id Version
1.0</cite></a>, W3C Recommendation 9 September 2005. Available at
<a href="http://www.w3.org/TR/2005/REC-xml-id-20050909/">http://www.w3.org/TR/2005/REC-xml-id-20050909/</a>.
The latest version of <a href="http://www.w3.org/TR/xml-id/">XML
ID</a> is available at http://www.w3.org/TR/xml-id/.</dd><dt class="label"><a name="xsd" id="xsd" />XSD</dt><dd>Paul V. Biron, Ashok Malhotra, editors.
<a href="http://www.w3.org/TR/2004/REC-xmlschema-2-20041028/"><cite>XML Schema
Part 2: Datatypes Second Edition</cite></a>. Available at
<a href="http://www.w3.org/TR/2004/REC-xmlschema-2-20041028/">http://www.w3.org/TR/2004/REC-xmlschema-2-20041028/</a>.
The latest version of <a href="http://www.w3.org/TR/xmlschema-2/">XSD</a> is available at
http://www.w3.org/TR/xmlschema-2/. </dd></dl></div><div class="div1">
<h2><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="revisionlog" id="revisionlog"></a>B Revision Log (Non-Normative)</h2><p>The following log records changes that have been made to this document since the <a href="http://www.w3.org/TR/2005/WD-itsreq-20051122/">publication in November 2005</a>.</p><ol class="depth1"><li><p>References to the working drafts <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> and <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>, which implement requirements of this document, have been added.</p></li><li><p>The following requirements have been added to this document: <a href="#nestedelems">nested elements</a>, <a href="#lingml">linguistic markup</a>, <a href="#variables">variables</a>, <a href="#elemseg">elements and segmentation</a>, <a href="#objects">associated objects</a>.</p></li><li><p>The requirement about <a href="http://www.w3.org/TR/2005/WD-itsreq-20051122/#culture">cultural aspects of the content</a> has been generalized to a requirement about <a href="#contstyle">content style</a>.</p></li><li><p>It has been clarified for the  following requirements that they might be completed and addressed in future versions of <a title="Internationalization Tag Set (ITS)" href="#itsspec">[ITS]</a> or <a title="Best&#xA;Practices for XML Internationalization" href="#xml-i18n-bp">[XML i18n BP]</a>: <a href="#constraints">indicator of constraints</a>, <a href="#contstyle">content style</a>, <a href="#linkedtext">link to internal / external text</a>, <a href="#metrics">metrics count</a>, <a href="#whitespaces">handling of white-spaces</a>, <a href="#datetime">identifying date and time</a>, <a href="#lingml">linguistic markup</a>, <a href="#variables">variables</a>, <a href="#objects">associated objects</a>.</p></li><li><p>The following requirements, 
which had been mentioned in the previous version of this document, have been rewritten and/or expanded: <a href="#cdata">CDATA sections</a>, <a href="#bidi">bidirectional text support</a>, <a href="#transattr">attributes and translatable text</a>, <a href="#naming">naming scheme</a>, <a href="#locnotes">localization notes</a>, <a href="#multilang">multilingual documents</a>, <a href="#annomark">annotation markup</a>.</p></li></ol></div><div class="div1">
<h2><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents." /></a><a name="acknowledgements" id="acknowledgements"></a>C Acknowledgements (Non-Normative)</h2><p>The initial requirements in this document have been developed and edited on a wiki system driven by several past and present members of the ITS Working Group: Tim Foster (Sun Microsystems), Richard Ishida (W3C), Masaki Itagaki (Invited Expert), Christian Lieske (SAP), Naoyuki Nomura (Ricoh), Yves Savourel (ENLASO), Felix Sasaki (W3C), and Andrzej Zydroń (Invited Expert).</p><p>The other past and present members of the ITS Working Group have also contributed their valuable time and comments to the creation of these requirements: Karunesh Arora (CDAC), Martin Dürst (Invited Expert), Sebastian Rahtz (Invited Expert), François Richard (HP), Goutam Saha (CDAC), Diane Stoick (Boeing), and Najib Tounsi (<span xml:lang="fr" lang="fr">Ecole Mohammadia d’Ingénieurs</span>).</p></div></div></body></html>