Extensible.html
33.5 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="generator" content=
"HTML Tidy for Mac OS X (vers 31 October 2006 - Apple Inc. build 13), see www.w3.org" />
<!-- so that tech reports can be copied -->
<title>
Web Architecture: Extensible languages
</title>
<base href="http://www.w3.org/DesignIssues/Extensible.html" />
<link href="di.css" rel="stylesheet" type="text/css" />
<meta http-equiv="Content-Type" content="text/html" />
</head>
<body>
<!-- in Design Issues space only -->
<div class="header">
<p>
<a href="/"><img align="left" alt="W3C" src=
"/Icons/WWW/w3c_home" /></a>
</p>
<h1 align="center">
Web Architecture: Extensible languages
</h1>
<h3 align="center">
10 Feb 1998
</h3>
<dl>
<dt>
Similar to: W3C Note:
</dt>
<dd>
<a href=
"http://www.w3.org/TR/NOTE-webarch-extlang">http://www.w3.org/TR/1998/NOTE-webarch-extlang</a>
<!-- @@ .html? .pdf?-->
</dd>
<dt>
Editors:
</dt>
<dd>
<a href="http://www.w3.org/People/Berners-Lee/">Tim
Berners-Lee</a> <tt><a href=
"mailto:timbl@w3.org"><timbl@w3.org></a></tt>
W3C<br />
<a href="http://www.w3.org/People/Connolly/">Dan
Connolly</a> <tt><a href=
"mailto:connolly@w3.org"><connolly@w3.org></a></tt>
W3C
</dd>
</dl>
<h2>
Status of This Document
</h2>
<p>
<i>This document is a similar to a NOTE made available by
the W3 Consortium for discussion only. This indicates no
endorsement of its content, nor that the Consortium has,
is, or will be allocating any resources to the issues
addressed by the NOTE.</i>
</p>
<p>
This work is related to the Architecture domain of the W3C,
and particularly to the <a href=
"../XML/Overview.html">XML</a> activity, but is related to
<a href="../MarkUp/Overview.html">HTML</a>, <a href=
"../Protocols/HTTP/Overview.html">HTTP</a> and <a href=
"../Metadata/Overview.html">Metadata</a> activities.
</p>
<p>
Comments should be sent to the authors and <a href=
"mailto:www-talk@w3.org">www-talk@w3.org</a>.
</p>
<p>
This document is meant to be a fairly explanatory synthesis
of the requirements for namespace extension in languages on
the web, and in particular for the general language planned
to be the common basis of many future applications, XML.
It was originally written as part of the "<a href=
"../DesignIssues/Overview.html">Design Issues</a>" series
of notes. Whilst technically the personal opinion of the
authors, it their best attempt as technical coordinators
at outlining common architectural principles for W3C
development.
</p>
<p>
At the time of writing [1998/02], various drafts in the XML
and RDF community address these requirements in various
ways. The document may evolve if further clarity is
sen to be needed, or futher requirements added. Some open
issue are noted.
</p>
<h2>
Abstract
</h2>
<p>
Experience with the task of coordinating developments by
independent groups has allows us to define properties of
languages which will allow the unfettered growth of the Web
technology in a chaotic but still well defined way. These
take the form of constraints on the language features for
making reference to multiple different vocabularies, and on
langauhes for "schema" documents which define those
vocabularies.
</p>
<hr />
<p>
<a href="Overview.html">Up to Design Issues</a>
</p>
<h2>
Contents
</h2>
<p>
Extensible languages
</p>
<ul>
<li>
<a href="#Introduction">Introduction</a>
</li>
<li>
<a href="#Requirements">Requirements</a>
<ul>
<li>
<a href="#Glossary">Glossary</a>
</li>
<li>
<a href="#Mixing">Mixing vocabularies</a>
</li>
<li>
<a href="#Scenario">Scenario</a>
</li>
<li>
<a href="#Local">Local scope</a>
</li>
<li>
<a href="#Ambiguity">Lack of ambiguity</a>
</li>
<li>
<a href="#Evolving">Evolving new scheme languages</a>
</li>
<li>
<a href="#Correctness">Correctness of documents with
multiple vocabularies</a>
</li>
<li>
<a href="#Granularity">Granularity</a>
</li>
<li>
<a href="#Incorporation">Incorporation into the
language</a>
</li>
</ul>
</li>
<li>
<a href="#Related">Related resources</a>
</li>
</ul>
</div><!-- end header division -->
<hr />
<h1>
<i><a name="Extensible" id="Extensible">Extensible
languages</a><br /></i>
</h1>
<h2>
<a name="Introduction" id="Introduction">Introduction</a>
</h2>
<p>
When the World Wide Web Consortium was first put together,
high on the list of goals of the Consortium was making the
web "evolvable". At that time, it was a philosophical
goal and it wasn't clear what it would mean technically.
Since then, W3C has had plenty of experience in the
deployment of new technology, particularly in an
environment of thousands of independent groups developing in
closely related or identical fields.
</p>
<p>
The HTTP and HTML specifications have both grown rapidly in
this environment. The existence of an open and freely usable
standard allows anyone in the world to experiment with
extensions. Deployment of experimental features was enabled
by one simple rule, inherited with care from the Internet
email community:
</p>
<h4>
Rule used to date:
</h4>
<table border="1" cellpadding="4">
<tbody>
<tr>
<td>
<p align="left">
<i>Old rule:</i> If you find a language element you
don't understand, ignore it.
</p>
</td>
</tr>
</tbody>
</table>
<p>
(The exact definition of "Ignore" varies - HTTP headers are
actually ignored and HTML elements are replaced with their
contents (ie unknown tags are ignored) - but the principle
has been the same.)
</p>
<p>
This rule has covered web development from 1989 to the
present. The result has been a very high speed of
growth. However, a state of ambiguity and lack of
interoperability always exists from the introduction of an
experimental feature until the later agreement on a common
standard . This weakened the reliability and credibility of
the Web. Furthermore, there has always been a threat that
lack of consensus on new features would lead to a permanent
fragmentation of the evolutionary paths.
</p>
<p>
The problem was that neither the specification of new
elements nor the effect of ignoring them was ever clearly
defined. Contrast this to the situation in most distributed
object systems. In these cases, objects and support
classes generally have well defined interfaces. Whilst
ensuring interoperability, the rigidity of this system, in
which new interfaces had to be explicitly agreed between
parties, has been one of the factors inhibiting such systems
from spreading in web-like or virus-like manner.
</p>
<p>
Can we have the best of both worlds, and have clearly defined
interfaces, but also allow systems from different communities
to communicate when having only a partial understanding of
each other's specifications? This need has surfaced
from many areas:
</p>
<ul>
<li>HTTP extension -- See the <a href=
"../Protocols/Activity.html#PEPspec">PEP requirements</a>]
</li>
<li>HTML and XML -- See <a href=
"http://www.cs.caltech.edu/~adam/papers/xml/ascent-of-xml.html">
<cite>The Evolution of Web Documents: The Ascent of
XML</cite></a> by Dan Connolly, Rohit Khare, and Adam
Rifkin, W3J special Issue on XML, Vol 2, Number 4, Fall
1997, Pages 119-128
</li>
<li>Metadata: see the <a href="Metadata.html">Metadata
architecture</a>.design issues page and ...
</li>
</ul>
<h2>
<a name="Requirements" id="Requirements">Requirements</a>
</h2>
<p>
The need is for two systems to be able to communicate when
they have a common vocabulary but not complete understanding
of all the features they each use. As these
requirements are derived from experience across many
different systems, we will have to chose which words to use
in this document.
</p>
<h4>
<a name="Glossary" id="Glossary">Glossary</a>
</h4>
<p>
For the purposes of this document, words are used as follows:
</p>
<dl>
<dt>
<b>element</b>
</dt>
<dd>
A range text within of a document, identified by a local
identifier.
</dd>
<dt>
<b>vocabulary</b>
</dt>
<dd>
a set of local identifiers in a document, (which identify
parts of the document), and whose meaning (at some level)
is defined by generic resource. The namespace resource
conceptually represents the vacabulary in general, which
may be represented by one or more schemata.
</dd>
<dt>
<b>schema</b>
</dt>
<dd>
A specific document which defines a vocabulary (at some
level)
</dd>
</dl>
<p>
Although this is a general document, it is hoped that these
terms are not used inconsistently with their use in XML
(element) and RDF (schema).
</p>
<p>
There is some rough correspondence in the soup of terms as
follows.<br />
</p>
<center>
<table border="1" cellpadding="2" align="center">
<tbody>
<tr>
<th>
This document
</th>
<th>
SGML
</th>
<th>
HTTP
</th>
<th>
Programming languages
</th>
<th>
RDF
</th>
</tr>
<tr>
<td>
Element
</td>
<td>
Element
</td>
<td>
Header
</td>
<td>
Function/Procedure/Method call
</td>
<td>
Element
</td>
</tr>
<tr>
<td>
Binding
</td>
<td>
-
</td>
<td>
(PEP header)
</td>
<td>
"Import", external declaration
</td>
<td></td>
</tr>
<tr>
<td>
-
</td>
<td>
Entity declaration
</td>
<td>
-
</td>
<td>
#Include
</td>
<td></td>
</tr>
<tr>
<td>
Declaration
</td>
<td>
Element declaration
</td>
<td>
(http spec)
</td>
<td>
Function declaration
</td>
<td></td>
</tr>
<tr>
<td>
Schema
</td>
<td>
DTD
</td>
<td>
(none!)
</td>
<td>
Module interface definition
</td>
<td>
Schema
</td>
</tr>
<tr>
<td></td>
<td>
Content model
</td>
<td></td>
<td>
Parameter type
</td>
<td></td>
</tr>
<tr>
<td></td>
<td>
Attributes
</td>
<td></td>
<td>
Parameter type
</td>
<td></td>
</tr>
</tbody>
</table>
</center>
<h3>
<a name="Mixing" id="Mixing">Mixing vocabularies</a>
</h3>
<p>
When a message is sent across the Internet as part of a Web
communications protocol, it is tempting as above to compare
the message with a remote procedure call, and to adopt the
characteristics of a procedure/method call from distributed
OO systems. A procedure call identifies the target
object, one of a finite number of methods from the exported
interface, and a set of typed parameters.
</p>
<p>
However, this analogy is not powerful enough. A message
should be considered an expression and, if one takes an
analogy with programming languages, the analogy should be
with an expression or program rather than with a function
call. [Or, if considered a function call, strictly, the
parameters have to be extended to allow other nested function
calls]. In this case, there may be many functions
identified, in many interfaces. In other words, don't
think of an HTTP message or an HTML document as an RPC call,
but rather as the transmission of a expression in some
language.
</p>
<p>
In the case of an XML document, this corresponds to a
document which contains elements whose declarations occur in
many different specifications (SGML: many different DTDs).
This is the requirement brought out under "Metadata
architecture", of <a href="Metadata.html#Mixing">mixing
vocabularies</a>:
</p>
<table border="1" cellpadding="4">
<tbody>
<tr>
<td>
It must be possible at one point in a document for more
than one vocabulary to be in scope.
</td>
</tr>
</tbody>
</table>
<h3>
<a name="Scenario" id="Scenario">Scenario</a>
</h3>
<p>
Imagine that I send you an invoice for an aeroplane part I am
shipping to you. The invoice is mostly in common
business language, and the vocabulary such as item, cost,
quantity, authorising signature, total cost and due date are
well known to both of us. However, the item is
specified in an expression which details exactly which engine
lower inspection hatch door mount bracket lock nut is
involved. Neither you nor I actually have to understand
this vocabulary and references to part numbers and the like.
Only the person or machine loading the part onto the truck,
and the person or machine installing the part in the aircraft
need to know it. It is true that we need to agree about
the cost and the significance of the signing authority, as
that is part of the protocol between us.
</p>
<p>
This sort of thing happens all the time in real life.
Documents mix vacabularies defined in different places. We
are always making decisions about which of the myriad of
things we don't understand are important to us. We are
constantly handling information with partial understanding.
Imagine if an old version of a word processor could
read a file written by a new version with partial
understanding, rather than panicing that it had met a being
from the future. It also happens all the time on the
web, as people bury private elements such as index tags and
editing information inside HTML files.
</p>
<p>
The requirement is for the new vocabularies to be well
defined, like the basic vocabulary.
</p>
<p>
By analogy with a programming language, a Web document or
protocol message should be able to include expressions
combining calls to functions from many modules. This is
so fundamental to programming languages that it has gone
without saying, but it has not been possible in SGML.
</p>
<h4>
Same scope
</h4>
<p>
What does "within the same scope" mean? It means that just
nesting one sort of document inside another is not good
enough. It means that I must be able to write an
expression or compound element which combines elements from
two vocabularies. (In fact, strictly, wherever there is
an expression tree which combines identifiers from more than
one vocabulary, one can in theory break it down to a set of
nested subtrees each of which only uses one vacabulary and
could be considered a "subdocument", but in practice this is
impracticably cumbersome.) For example, if I can extend HTML
to include Math, in this way one is able to use HTML bold
tags still within a Math expression.
</p>
<h3>
<a name="Local" id="Local">Local scope</a>
</h3>
<p>
There is a practical requirement that it must be possible to
introduce a new vocabulary in part of a document in a way
that requires changes only locally within the document.
This means that for example it must be possible to
introduce a new vocabulary within a local block. Here is an
example in an arbitrary syntax, where "NS:using" is the
<b><i>binding</i></b> of local identifiers starting with
"<code>f</code>" to a schema
<code>http://blah/currency</code>
</p>
<pre>
<a:details>
<NS:using href="http://blah/currency" as="f">
<a:price>
<f:chf>4.00</f:chf>
</a:price>
</NS:using>
</a:details>
</pre>
<p>
The binding between the local identifier and the schema is
texually local. There is no need to a binding in the
document's head. In general this makes document management
much easier. It makes checking a document easier, as you can
in some cases verify an embedded piece without having to
check the whole document.
</p>
<h4>
Why?
</h4>
<p>
A specific need for local scoping comes from the fact that
many documents are generated (for example by CGI scripts)
by calling programs to output parts in context, and the
program which generates the parts has no access to the rest
of the document.
</p>
<p>
In theory it would always be possible to take such a document
with nested bindings of namespaces, and find all those
bindings, and generate new local prefixes for each so that
they are unique, and then move all the bindings to the top of
the document. Therefore, a document using local scope can be
converted into one which only uses global scoping. However,
this requires buffering of all the document, and so cannot be
done in pipelined systems, and pipelined systems are often a
necessity in the Web in order to achieve acceptable response
times.
</p>
<p>
Another case involves very long documents using many
namespaces. Typically web applications have to be able to
cope with documents of arbitrary length. Imagine a document
which, every paragraph, refers to a new name space. (A proof
by example would be a document documenting many namespaces..
but image also a list of suppliers each of which has its own
catalog schema.). As processing of the document continues, if
the bindings of namespaces are local, then each is made and
discarded. The working set needed for processing the document
is finite. In the case in which the bindings are global in
scope, then the working set size increases linearly with the
length of the document, and the product of resource
utilization and processing time then rises as the square of
the document size.
</p>
<p>
A third example of a need for local scoping is that for many
uses of XML (take SMIL for example) concatenation of two
documents to make one document should be a simple process.
Indeed, a worthy design goal would be to require that the
concatenation of any two XML documents be an XML document. If
local scoping is not available, the concatenation function
requires the rewriting of one dcoument from begining to end
changing local identifiers where they clash.
</p>
<p>
In general, one can call on all the design experience of the
computer science community which, over the years, has seen
the need for block structured langauges with local scoping.
There have been many factors influencing this, but one
unmentioned to date has been the maintainability of
programs/documents. When the binding of a name and its use
can be close together, for human-maintained documents,
mistakes are much less likely.
</p>
<table border="1" cellpadding="2">
<tbody>
<tr>
<td>
It must be possible to introduce a new vocabulary in
part of a document in a way that requires changes only
locally within the document.
</td>
</tr>
</tbody>
</table>
<h3>
<a name="Ambiguity" id="Ambiguity">Lack of ambiguity</a>
</h3>
<p>
Some programming languages allow one to introduce identifiers
from new name spaces in such a way that it is not possible to
know which namespace a local identifier belongs to without
accessing both the module interface specifications and
checking which one has with the highest priority, or
most recently in the document, redefined a given local
identifier.
</p>
<p>
This may have some uses in a programming language such as
<a href="#java">Java</a>, but it has a serious flaw in that
when one module changes (without the knowledge of the
designers of the other module), it can unwittingly redefine a
local identifier used by the second module, completely
changing the meaning of a previously written document.
Clearly, in the Web world in which modules evolve but
documents must have clearly defined meanings, this is
unacceptable. (Compare with the <a href=
"#mod3">modula-3</a> language which does this unambiguously).
</p>
<table border="1" cellpadding="2">
<tbody>
<tr>
<td>
The syntax must unambiguously associate an identifier
in a document with the related schema without requiring
inspection of that or another schema.
</td>
</tr>
</tbody>
</table>
<p>
This is the reason for the use of a prefix in the XML
namespace proposal to tie the use of an identifier directly
to the specification of the name space. Notice that in the
example above, the fact that the binding element actually
creates a new level of nesting removing all ambiguity.
</p>
<h3>
<a name="Evolving" id="Evolving">Evolving new schema
languages</a>
</h3>
<p>
In SGML the "DTD" defines, for an SGML element, what possible
other elements may be nested inside it. For example, on
an invoice, it may specify that the signing authority must be
either Tom or Joe. It may specify that an item can be any
part number or any accessory number or any book number.
Checking the SGML validity of a document is a process which
can be done automatically from the DTD. This is a check
at a certain low level in that it does not verify semantic
correctness, only structural correctness. But the
structural constraints alone are useful in many ways. For
example, a user interface for constructing a document
can be generated automatically from the structural
constraints.
</p>
<p>
We plan to introduce more powerful languages for describing
not only the structure of a document, but the semantics to an
extent that not only can checking be automated to a higher
level, but also so can the processing of a document and
reasoning about its contents be automated. Therefore it is
essential that when a document is written to refer to a
namespace, the name space definition should be a generic
resource whose instances may include schemas in various
languages at various levels of sophistication. This is
an essential growth point for the web.
</p>
<table border="1" cellpadding="2">
<tbody>
<tr>
<td>
The resource defining a namespace may be generic and
allow definitions of the namespace in varying present
or future languages.
</td>
</tr>
</tbody>
</table>
<h3>
<a name="Correctness" id="Correctness">Correctness of
documents with multiple vocabularies</a>
</h3>
<p>
How does one check the validity/correctness of a document
with multiple namespaces? Clearly one must be able to find
definitions of the namespaces at the appropriate level, and
combine them. Looking at the example above of the invoice, we
notice a difference.
</p>
<p>
In the case of the "content model" for an authorizing person,
the designer of the invoice intended that in fact the schema
should be extensible so that any new object could be included
as an item. For example, one could use a part number
system from any new supplier, just by incorporating the
namespace. However, when it came to the "content model" for
an authorizing person, only Tom or Joe should be able to
sign. No namespace extension should be allowed to redefine
the permissible content model
</p>
<table border="1" cellpadding="2">
<tbody>
<tr>
<td>
There must be a way of indicating when a given content
model may be extended by new schemas.
</td>
</tr>
</tbody>
</table>
<table border="1" cellpadding="2">
<tbody>
<tr>
<td>
There must be a way, in a new schema, of
specifying that a given new content model is
designed an extension to the existing content model of
an existing schema.
</td>
</tr>
</tbody>
</table>
<p>
These are constraints on the schema language. (They are
<a href=
"http://www.w3.org/TR/1998/NOTE-XML-data-0105/Overview.html#OpenClosed">
addressed</a> by the XML-DATA discussion NOTE.)
</p>
<h3>
<a name="Granularity" id="Granularity">Granularity</a>
</h3>
<p>
With what granularity should one be able to define new
vocabularies in XML? The analogy with programming
languages suggest that we can understand how to add new
elements (functions) but that adding new attributes to
existing elements (parameters to existing functions) is
difficult to define when one gets above the structural level.
</p>
<p>
Although scheme languages do not yet exist to define semantic
relations and typing, clearly there will be need for
extension of concepts to type. Perhaps the need for content
model extension will in fact represent the same need.
</p>
<h3>
<a name="Incorporation" id="Incorporation">Incorporation into
the language</a>
</h3>
<p>
The namespace functionality is a very fundamental part
of the language. A language processor which does not
understand it can check what in XML is called
"well-formedness", ie basic syntactic correctness, of a
document, but can do no more.
</p>
<p>
A fundamental processing need outlined above is "partial
understanding". I envisage three ways in which partial
understanding can be accomplished, when a document in an
"original" schema's vocabulary includes some of a "new"
schema's vocabulary:
</p>
<ol>
<li>It may be possible to mathematically deduce what
information can be ignored from properties of the original
schema;
</li>
<li>At a simple level this could be built into the language
itself so that it can be expressed in the document itself;
(analogy with PEP extensions to HTTP).
</li>
<li>The "new" schema may allow one to deduce what can be
ignored. It may even give mappings which allow expressions in
the new schema's vocabulary to be replaced with simpler
expressions in better known vocabularies.
</li>
</ol>
<p>
Notice that the first two ways do not require one to be able
to access or understand the "new" schema in order to decide
whether to ignore it. This is a powerful and important
feature. Taking against the invoice example above, it
is essential to be able to process the invoice at some level
without even looking up on the Web any definition of the part
numbers. It is sufficient for the invoice itself declare that
the item specifications don't matter as far as the validity
of the invoice as an invoice.
</p>
<table border="1" cellpadding="2">
<tbody>
<tr>
<td>
It should be possible to create an original document
schema such that one can determine, without access to
the extension schema, which uses of extensions to
that document can be ignored.
</td>
</tr>
</tbody>
</table>
<p>
The difference between the first two ways above is
whether some functionality is regarded as basic to the
language or part of a very commonly understood namespace of
elements for document construction. This design decision is
not currently clear.
</p>
<h3>
Revision and evolution of namespaces
</h3>
<p>
This document does not define the requirements of schema
langauges, nor of languages with which to assert the
equivalence of assertions made using different vocabularies.
However it is worth noting that the architecture expects
machine-readable documents to describe the relationship
between different schemas, including between a schema and
later evolved versions of the schema. The namespace
functionality itself is not required to address that issue
directly.
</p>
<p>
See: <a href="Evolution.html">Evolvability</a>
</p>
<hr />
<h2>
<a name="Related" id="Related">Related resources</a>
</h2>
<h4>
HTML and XML evolution
</h4>
<p>
"And yet the ability to combine resources that were developed
independently is an essential survival property of technology
in a distributed information system. "
</p>
<p>
<a href=
"http://www.cs.caltech.edu/~adam/papers/xml/ascent-of-xml.html">
<cite>The Evolution of Web Documents: The Ascent of
XML</cite></a> by Dan Connolly, Rohit Khare, and Adam Rifkin,
W3J special Issue on XML, Vol 2, Number 4, Fall 1997, Pages
119-128
</p>
<h4>
<a name="java" id="java">Java names:</a>
</h4>
<p>
Section 6.5 <a href=
"http://java.sun.com/docs/books/jls/html/6.doc.html#20569"><cite>
Determining the Meaning of a Name</cite></a>, in <a href=
"http://java.sun.com/docs/books/jls/html/index.html">The Java
Language Specification</a>, James Gosling, Bill Joy, Guy
Steele, Edition 1.0, (Converted from the printed book, August
1996, first printing)
</p>
<h4>
<a name="mod3" id="mod3">Equivalent for Modula 3:</a>
</h4>
<p>
<a href=
"http://www.research.digital.com/SRC/m3defn/html/units.html">2.5
Modules and interfaces</a> from <a href=
"http://www.research.digital.com/SRC/m3defn/html/m3.html">Modula-3:
Language definition (multi-page)</a>
</p>
<p>
"The original definition of Modula-3 was given in SRC
Research Report 31, August 1988. It was revised in report 52,
November 1989. And finally published in Systems Programming
with Modula-3, November 1989. "
</p>
<h4>
More ...
</h4>
<p>
Umberto Eco, "??" TBD
</p>
<p>
E. Akpotsui, V. Quint and C. Roisin. <a href=
"http://opera.inrialpes.fr/OPERA/BibOpera.html#[Akpotsui97]">Type
Modelling for Document Transformation in Structured Editing
Systems</a>, Mathematical and Computer Modelling, Volume 25,
Number 4, Pages 1-19, 1997.
</p>
<p>
A complete list of references to the issues involved here
would be huge, so the reader is directed to the litterature.
</p>
<hr />
<p>
Related notes: <a href=
"../Architecture/note-webarch-names.html">webarch-names</a>....
</p>
<address>
Tim BL, with much input from DWC, started Febrary 1998.
<p>
Last edit $Id: Extensible.html,v 1.19 2000/09/08 19:01:30
timbl Exp $
</p>
</address>
</body>
</html>