NOTE-ioh-arch 39.5 KB
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987
<!DOCTYPE report public "-//W3C//DTD Report 1997-05-18//EN"
                        "../XML/9705/report.dtd">
<report>
  <head>
    <title>An Evaluation of the World Wide Web with respect to
    Engelbart's Requirements</title>
    <!-- link rel=stylesheet href="test.css" -->
  </head>
  
  <titlepage>
  <H1>An Evaluation of the World Wide Web with respect to Engelbart's
  Requirements</H1>

  <ADDRESS>
    <author><A HREF="../People/Connolly/">Daniel W. Connolly</A>
    <affiliation><A HREF="../../"><br>W3C</A><A
    HREF="http://www.lcs.mit.edu/">MIT LCS</A></></>

      <date><h3>$Date: 2002/08/13 22:52:26 $</h3></>
  </ADDRESS>

  <abstract>
      <status><h2>Status of This Document</h2>

	<p>This document is informational only. It does not specify
	any new specifications or protocols. The editor is working
	towards makin this document a <formal-term><a
	href="../TR/#NOTE">W3C Note</a></>.

	<p>This is based on <a href="../Collaboration/ECommerceEval">an Oct '95 conference paper</a>.</p>

      </status>
      <H3>Abstract</H3>

    <P>The architecture and implementation of the World Wide Web are
    evaluated with respect to Douglas Engelbart's requirements for an
    open hyperdocument system.  Engelbart's requirements are derived
    from experience in using computer-supported collaborative work
    (CSCW) to support large scale electronic commerce.

    <keywords><H3>Keywords</H3><A
    HREF="http://www.cs.indiana.edu/cstr/search?hypertext+system">Hypertext
    Systems</A> <A HREF="../DesignIssues/">Web Design</A>
    </keywords>
  </abstract>
  </titlepage>
  <contents></>
  <sect><H2>Introduction</H2>

    <P>The World Wide Web is targeted at collaborative work: research,
    development, marketing, advertising, negotiation, sales, and
    support, if not more. in this way. It borrows many design
    principles from the research in computer supported collaborative
    work (CSCW), which showed how effective hypermedia systems can be
    for knowledge capture, knowledge exchange, and collaboration in
    general.

    <P>In comparison to even the early hypertext research systems, the
    Web is fairly simple and somewhat limited. And yet it is extremely
    widely deployed and exploited. Is the Web technology ultimately
    limited? Is it a passing phenomenon, or is it a viable long-term
    platform for collaborative work and electronic commerce?

    <P>In <bibref><a href="#Eng90">[Eng90]</a></>, Douglas Engelbart
    condensed the results of many years of research in computer
    supported collaborative work into twelve requirements<bibref><A
    HREF="#ENG90">[ENG90]</A></>. This report evaluates the Web with
    respect to each of the requirements: has it been met? If not, can
    it? What are the obstacles, and what are the most promising
    developments in each area?
  </sect>

  <sect><H2>Essential Elements of an Open Hyperdocument System</H2>

    <P>Engelbart's research was directed at large scale knowledge
    work; for example, in the aircraft industry, the interactions
    between a major manufacturer and its contractors, subcontractors,
    and so on. Research and experimentation led to the following
    requirements:
    <OL>
      <LI><xref><A HREF="#mixdoc">Mixed Object Documents</A></>
      <LI><xref><A HREF="#structdoc">Explicitly Structured
      Documents</A></> <LI><xref><A HREF="#viewcontrol">View Control
      of Object's Form, Sequence, and Content</A></> <LI><xref><A
      HREF="#hyperdoc">The Basic
      <q><c>"</>Hyperdocument<c>"</></q></A></> <LI><xref><A
      HREF="#backlink">Hyperdocument <q><c>"</>Back-Link<c>"</></q>
      Capability</A></> <LI><xref><A HREF="#library-system">The
      Hyperdocument <q><c>"</>Library System<c>"</></q></A></>
      <LI><xref><A HREF="#hypermail">Hyperdocument Mail</A></>
      <LI><xref><A HREF="#digsig">Personal Signature Encryption</A></>
      <LI><xref><A HREF="#access-control">Access Control</A></>
      <LI><xref><A HREF="#human-addr">Link Addresses That Are Readable
      and Interpretable by Humans</A></> <LI><xref><A
      HREF="#alladdr">Every Object Addressable</A></> <LI><xref><A
      HREF="#printlink">Hard-Copy Print Options to Show Address of
      Objects and Address Specification of Links</A></>
    </OL>
  </sect>
  <sect>
    <H2>Overview of Web Architecture</H2>

    <P>In order to evaluate the Web with respect to Engelbart's
    requirements, some background on the architecture is essential.

    <P>The Web is a hypermedia distributed information system. A
    distributed information system allows users to access (read,
    search, navigate) documents from a wide variety of computers
    connected via a network. A hypertext system is one where the
    documents have links in and between each other, so that readers
    are not constrained to a sequential reading order. Finally, a
    hypermedia system is one that integrates images, sounds, video,
    and other media with text in documents.

    <P>The Web incorporates a wide variety of information sources on
    the Internet into a coherent information system. The user
    experience on the Web is:

    <OL>
      <LI>An initial document is presented to the user, consisting of
      text, graphics, sounds, animation, etc.

      <LI>The user can choose one of the links in the document and
      request to visit the document on the other end of the link. The
      link specifies the address of another document.

      <LI>The system retrieves the document at that address and
      presents it to the user, and the process repeats.
    </OL>

    <subsects>
      <subsect><H3>Protocols and Addressing Schemes</H3>

	<P>A URI, or Uniform Resource Identifier, is a name or address
	for an object in the Web<bibref><A
	HREF="#URI">[URI]</A></>. For example:

	<PRE>
http://www.w3.org/ ftp://ds.internic.net/rfc/rfc822.txt
	</PRE>

	<P>Each URI begins with <TT><VAR>scheme</VAR>:</TT>, where
	<VAR>scheme</VAR> refers to an addressing or naming
	scheme. For example, URIs beginning with <TT>http: </TT>refer
	to objects accessible via the HTTP protocol; <TT>ftp:</TT>
	refers to FTP objects, and so on. New schemes can be added
	over time.

	<P>There are a number of information retrieval protocols on
	the Internet: FTP, gopher, WAIS, etc. In the Web user
	experience, they behave similarly: a server makes a collection
	of documents available. A client contacts a server and makes a
	request to access one or more of the documents by giving a
	command telling what sort of access (read, write, etc.) and
	some parameters that indicate which is the relevant document
	(s). Finally, the server fulfills the request by, for example,
	transmitting the relevant document to the client.

	<P>HTTP is a protocol designed specifically for the
	Web<bibref><A HREF="#HTTP">[HTTP]</A></>. It is an extensible
	protocol with support for the widely used features of existing
	protocols, such as file transfer and index searching, plus
	support for some novel features such as redirection and format
	negotiation.

      </subsect>

      <subsect><H3>Data Formats</H3>

	<P>There is no one data format for all Web documents<emdash>
	-- </>each document may have its own format. In fact, it may
	be available in many formats.

	<P>Each data format has a name, or an Internet Media
	Type. Internet Media Types, aka content types, are part of
	MIME<bibref><A HREF="#MIME">[MIME]</A></>, a collection of
	enhancements to Internet mail to accomodate rich media. Most
	Internet Media Types are just standardized names for existing
	data formats; for example, <TT>text/plain</TT> for normal
	ASCII text; <TT>image/gif</TT> for images in GIF format, etc.

	<P>The MIME standard also introduces some new data
	formats. The <TT>multipart/mixed</TT> is a <DFN>compound</DFN>
	data format<emdash> -- </>it allows several pieces of media to
	be combined into one, for transmission via email, storage in a
	file, etc.

	<P>
	  Hypertext Markup Language (HTML) is a data format designed
	  specifically for the Web. It combines the features of a
	  typical structured markup language (paragraphs, titles,
	  lists) with hypertext linking features.
	<P>
	  HTML is an application of SGML, the Standard Generalized
	  Markup Language<bibref><A HREF="#SGML">[SGML]</A></>. SGML
	  is a technology for specifying structured document
	  types. HTML is one such document type, but there are many
	  others<emdash> -- </>as many as anyone cares to dream
	  up. TEI, DocBook, and Pinnacles are just a few of the types
	  of SGML documents used in the web<bibref><A
	  HREF="#NCSASGML">[NCSASGML]</A></>.
      </subsect>
    </subsects>
  </sect>
  <sect>
    <H2>Evaluating the World Wide Web</H2>
    <P>
      This evaluation measures the World Wide Web against each of
      Engelbart's requirements, discussing strengths, weaknesses, and
      promising developments.</>
    <subsects>
      <subsect>
	<H3><A NAME="mixdoc"><num>1. </>Mixed Object
	Documents</A></H3>
	<BLOCKQUOTE>
	  <p>to provide for an arbitrary mix of text, diagrams,
	  equations, tables, raster-scan images (single frames, or
	  even live video), spread sheets, recorded sound,
	  etc.<emdash> -- </>all bundled within a common
	  <q><c>"</>envelope<c>"</></q> to be stored, transmitted,
	  read (played) and printed as a coherent entity called a
	  <q><c>"</>document.<c>"</></q>
	</BLOCKQUOTE>
	<P>
	  The first Web documents contained only text, but support for
	  icons and images was added to NCSA Mosaic in 1993. Since
	  then, Web pages with mixed text and graphics have been the
	  rule, and sound and video are not uncommon. As a recent
	  development, tables are widely deployed. Support for
	  equations is still essentially in development, and diagrams
	  are generally limited to rasterized snapshots.
	<P>
	  Exchange of spreadsheets and other rich data sets associated
	  with proprietary software packages is supported to some
	  extent, but its use is generally limited to small
	  communities of interest who agree to use the packages.
	<P>
	  This demonstrates that at least to some extent, the
	  requirement for mixed object documents is met. But it is not
	  met completely: the tools for composing mixed object
	  documents are primitive, and many features of a
	  comprehensive compound document architecture are lacking.</>
	<subsubsects>
	  <subsubsect>
	    <H4>
	      Authoring Tools
	    </H4>
	    <P>
	      The intent of the original design of the web was that
	      documents would be composed in direct-manipulation
	      fashion from a rich set of media objects.  The initial
	      prototype was done on the NeXTStep platform, which
	      allows drag-and-drop editing of rich text documents
	      including raster images, encapsulated postscript images,
	      sounds, diagrams, etc. The NeXTStep platform includes an
	      architecture and a set of development tools for adding
	      new object types to the mix available on the desktop.

	    <P>System such as <bibref><a
	    href="#hyperg">[Hyper-G]</A></> provide many of these
	    features, and recently products such as <bibref><A
	    HREF="#navisoft">[NaviPress]</A></> and <bibref><A
	    HREF="#frontpage">[FrontPage]</A></> beginning to provide
	    these features to the web user base at large.

	  </subsubsect>
	  <subsubsect>
	    <H4>
	      Compound Document Architecture
	    </H4>
	    <P>
	      The two technologies used to create compound documents
	      on the web are URIs and MIME. Objects can be linked
	      together by using their addresses. For example, the HTML
	      A, IMG, FORM, and LINK elements specify URIs of linked
	      objects.  Links are typed to indicate various
	      relationships between objects such as parent/child,
	      precedes/succeeds, supports/refutes, etc. The MIME
	      multipart facility allows several pieces of content to
	      be combined into one entity.  Support for typed links
	      and multipart data is deployed only to a limited extent.
	    <P>
	      The combination of URIs and MIME supports the entire
	      existing web information space. Still, compared with
	      compound document architectures such as OpenDoc, OLE,
	      Fresco, LINCKS<bibref><A HREF="#LINCKS">[LINCKS]</A></>,
	      or HyTime, many facilities are lacking.
	    <P>
	      Facilities for diagrams, equations, screen real-estate
	      arbitration, event management, versioning, transactions,
	      link indirection, and aggregation have been developed
	      for various web applications, but there are no standard
	      facilities. Standards for such facilities might result
	      in a critical mass of shared of technology that would
	      make feasible classes of applications that were
	      previously too costly.
	  </subsubsect>
	</subsubsects>
      </subsect>
      <subsect>
	<H3><A NAME="structdoc"><num>2. </>Explicitly Structured
	Documents</A></H3>

	<BLOCKQUOTE>
	  <p>where the objects comprising a document are arranged in
	  an explicit hierarchical structure, and compound-object
	  substructures may be explicitly addressed for access or
	  manipulation of the structural relationships.
	</BLOCKQUOTE>
	<P>
	  The Hypertext Markup Language used to represent most web
	  documents is a structured document format. Elements of HTML
	  documents are explicitly tagged, and structure can be
	  inferred from the tags. Not all substructures may be
	  explicitly addressed<emdash> -- </>only anchor elements and
	  embedded objects.
	<P>
	  Fragment identifiers in URIs allow compound-object
	  substructures to be explicitly addressed.
	<P>
	  Other hierarchical structures are possible, but not yet
	  supported. One facility that is notably lacking from Web
	  implementations is transclusion<emdash> -- </>the ability to
	  include one text object inside another by reference. For
	  example, to include an excerpt of one document inside
	  another, or to build a document out of section and
	  subsection objects.
	<P>
	  Even if transclusion were supported, compound text documents
	  would probably be prohibitively expensive: the overhead for
	  a transaction in the current version of the HTTP protocol is
	  very high; hence the protocol is inefficient for retrieving
	  a number of small objects in succession. This inefficiency
	  is being addressed in efforts to revise the protocol.
	<P>
	  Because HTML is essential to interoperability, it is
	  restricted to document structures that are universally
	  applicable. In many situations, a custom document type would
	  support more expressive collaboration. Widespread support
	  for custom SGML document types would enable such
	  collaboration.
	<P>
	  SGML is not the only structured document technology
	  available. The multipart media types support explicit
	  hierarchical structure, and support for them is being
	  deployed.
	<P>
	  The Web architecture clearly supports the requirement for
	  structured documents.  But the deployed software provides
	  limited support. The Web is predominately used in a
	  <q><c>"</>publish and browse<c>"</></q> fashion; data
	  transmitted across the Web is largely throw-away data that
	  looks good but has little structure. In order to use the Web
	  as a rich collaboration platform, much more support for
	  structured documents will be needed.
      </subsect>
      <subsect>

	<H3><A NAME="viewcontrol"><num>3. </>View Control of Object's
	Form, Sequence, and Content</A></H3>

	<BLOCKQUOTE>
	  <p>where a structured, mixed-object document may be
	  displayed in a window according to a flexible choice of
	  viewing options<emdash> -- </>especially by selective level
	  clipping (outline for viewing), but also by filtering on
	  content, by truncation or some algorithmic view that
	  provides a more useful view of structure and/or object
	  content (including new sequences or groupings of objects
	  that actually reside in other documents). Editing on
	  structure or object content from such special views would be
	  allowed whenever appropriate.
	</BLOCKQUOTE>
	<P>View control can be achieved on the Web by custom
	processing by the information provider. A number of views can
	be provided, and the consumer can express their choice via
	links or HTML forms. For example, gateways to database query
	systems and fulltext search systems are commonplace. Another
	technique is to provide multiple views of an SGML document
	repository<bibref><A HREF="#dynatext">[dynatext, </A></>
	<bibref><A HREF="#I4I">I4I]</A></>.

	<P>Another approach to view control is client-side processing:
	after the document is transmitted, the reader's software could
	filter, sort, or truncate the data. About the only such
	control in wide deployment is the ability to turn off embedded
	images. Outline views with folding have been proposed
	<bibref><A HREF="#FOLD">[FOLD]</A></>, but the cost of
	transmitting text that isn't displayed has presented a
	barrier.
	<P>
	  Stylesheets are a mechanism for presenting the same data in
	  different forms, using fonts, colors, and space to give
	  visual structure. Support for stylesheets is an important
	  ongoing development.
	<P>
	  But in some systems<bibref><A HREF="#LINCKS">[LINCKS, </A>
	  <A HREF="#dynatext">dynatext]</A></>, stylesheets are much
	  more than that: they control form, sequence, and content as
	  Engelbart described. For example, in LINCKS, the same
	  document can be presented and edited in abstract form,
	  outline form, or full form depending on which stylesheet
	  (generic presentation descriptor, in their jargon) is in
	  effect.
	<P>
	  At the extreme end of the spectrum, stylesheets give way to
	  arbitrary programs that display data and interact with the
	  reader. In a distributed system, running arbitrary programs
	  can have dangerous consequences. But advances such as
	  Safe-Tcl and Java make this technique of <q><c>"</>active
	  objects<c>"</></q> feasible.
	<P>
	  It's clear that the Web architecture supports custom view
	  control. It remains to be seen whether some view control
	  mechanisms are sufficiently valuable to be standardized.
      </subsect>
      <subsect>
	<H3><A NAME="hyperdoc"><num>4. </>The Basic
	<q><c>"</>Hyperdocument<c>"</></q></A>
	</H3>
	<BLOCKQUOTE>
	  <p>where embedded objects called <q><c>"</>links<c>"</></q>
	  can point to any arbitrary object within the document, or
	  within another document in a specified domain of
	  documents<emdash> -- </>and the link can be actuated by a
	  user or an automatic process to <q><c>"</>go see what is at
	  the other end,<c>"</></q> or <q><c>"</>bring the other-end
	  object to this location,<c>"</></q> or <q><c>"</>execute the
	  process identified a the other end.<c>"</></q> (These
	  executable processes may control peripheral devices such as
	  CD ROM, video-disk players, etc.)
	</BLOCKQUOTE>
	<P>
	  This requirement is clearly met. The hyperdocument as
	  described above is the epitome of the Web page.
	<P>
	  The only exception is that Engelbart refers to links as
	  <q><c>"</>special objects,<c>"</></q> whereas in the Web,
	  links are not addressable <q><c>"</>first-class<c>"</></q>
	  objects<emdash> -- </>not in implementations, and not in the
	  architecture.
      </subsect>
      <subsect>
	<H3><A NAME="backlink"><num>5. </>Hyperdocument
	<q><c>"</>Back-Link<c>"</></q> Capability</A>
	</H3>
	<BLOCKQUOTE>
	  <p>when reading a hyperdocument online, a worker can utilize
	  information about links from other objects within this or
	  other hyperdocuments that point to this
	  hyperdocument<emdash> -- </>or to designated objects or
	  passages of interest in this hyperdocument.
	</BLOCKQUOTE>
	<P>
	  The design of the Web trades link consistency guarantees for
	  global scalability.  Links are one way , and the reverse
	  link is not guaranteed to exist.
	<P>
	  Aside from the intractability of maintaining global link
	  consistency, another barrier to a distributed back-link
	  service is privacy. Some links are sensitive, and their
	  owners do not want them easily discovered.
	<P>
	  These barriers do not, however, prevent Web users from
	  utilizing back-link information. Some Web server tools
	  <bibref><A HREF="#frontpage">[FrontPage]</A></>) maintain
	  back-link information for the local site. And the HTTP
	  protocol includes a mechanism<emdash> -- </>the
	  <TT>Referer:</TT> field <emdash> -- </>that allows
	  information providers to gather back-link information for
	  their site.
	<P>
	  Finally, there are search services that traverse the web
	  building full-text search indexes. Some <bibref><A
	  HREF="#altavista">[Altavista]</A></> take advantage of the
	  links they encounter to offer back-link services.
      </subsect>
      <subsect>
	<H3><A name="library-system"><num>6. </>The Hyperdocument
	<q><c>"</>Library System<c>"</></q></A></H3>

	<BLOCKQUOTE>
	  <p>where hyperdocuments can be submitted to a library-like
	  service that catalogs them and guarantees access when
	  referenced by its catalog number, or <q><c>"</>jumped
	  to<c>"</></q> with an appropriate link. Links within newly
	  submitted hyperdocuments can cite any passages within any of
	  the prior documents, and the back-link service lets the
	  online reader of a document detect and <q><c>"</>go
	  examine<c>"</></q> any passage of a subsequent document that
	  has a link citing that passage.
	</BLOCKQUOTE>

	<P>There are no guarantees of access on the web today. A few
	commercial web service providers <bibref><A
	HREF="#navipress">[NaviService]</A></> guarantee server
	availability on a 24x7 basis, but this doesn't guarantee
	access to the entire global user base<emdash> -- </>any
	network interruption between the reader and the provider can
	prevent access.

	<P>
	  In a distributed system, absolute guarantees are
	  impossible. But reliability can be made arbitrarily good by
	  investing in redundancy. A number of strategies for caching
	  and replication (including <bibref><A
	  HREF="#harvest">Harvest</A></>) are being explored,
	  standardized, and deployed.
	<P>
	  Catalog numbering systems have not matured either. This is
	  known as <q><c>"</>the URN problem.<c>"</></q> A number of
	  promising proposals have been made <bibref><A
	  HREF="#PATH">[PATH, </A><A HREF="#STANF">STANF]</A></>, but
	  none is widely deployed.

	<P>Digital Libraries is an active field of research. ARPA is
	funding research, and the Online Computer Library Center is
	conducting experiments.

      </subsect>
      <subsect>
	<H3><A NAME="hypermail"><num>7. </>Hyperdocument Mail</A></H3>
	<BLOCKQUOTE>
	  <p>where an integrated, general-purpose mail service enables
	  a hyperdocument of any size to be mailed. Any embedded links
	  are also faithfully transmitted<emdash> -- </>and any
	  recipient can then follow those links to their designated
	  targets in other mail items, in common-access files, or in
	  <q><c>"</>library<c>"</></q> items.
	</BLOCKQUOTE>
	<P>
	  Internet Mail is possibly the world's most widely deployed
	  information system.  MIME, the Multipurpose Internet Mail
	  Extensions, standardizes facilities for attachments and
	  compound documents, among other things.
	<P>
	  Though nearly every new mail system supports MIME, the
	  installed base of pre-MIME mail systems is still significant
	  <emdash> -- </>a majority by many estimates.
	<P>
	  User interfaces that integrate email (and USENET news) into
	  the Web user experience are anticipated, but not deployed.
      </subsect>
      <subsect>
	<H3><A NAME="digsig"><num>8. </>Personal Signature
	Encryption</A></H3>
	<BLOCKQUOTE>
	  <p>where a user can affix his personal signature to a
	  document, or a specified segment within the document, using
	  the private signature key. Users can verify that the
	  signature is authentic and that no bit of the signed
	  document or document segment has been altered since it was
	  signed.
	</BLOCKQUOTE>
	<P>
	  There are a number of digital signature standards, but none
	  has been globally adopted and deployed on the web.
	<P>
	  One barrier is patent licensing. A critical feature of web
	  technology that led to its rapid deployment was its
	  royalty-free copyright status. Patents on public-key
	  cryptography prevent digital signature technology from being
	  deployed without license negotiations.
	<P>
	  Another barrier is export control
	  legislation. Implementations of cryptographic techniques
	  such as encryption are considered munitions by many
	  governments, and there are strict controls on the export of
	  such technologies.
	<P>
	  But the largest barrier is the social and educational
	  one. Digital signature techniques will have to be tested in
	  production use, and users will have to be educated about the
	  related issues before commerce can depend on this
	  technology.
      </subsect>
      <subsect>
	<H3><A NAME="access-control"><num>9. </>Access
	Control</A></H3>
	<BLOCKQUOTE>
	  <p>Hyperdocuments in personal, group, and library files can
	  have access restrictions down to the object level.
	</BLOCKQUOTE>
	<P>
	  The distributed nature of the Web allows information
	  providers to implement any access control policy they
	  choose, down to the object level.
	<P>
	  Minimal support for username/password authentication is
	  widely deployed.  This allows information providers to
	  implement access control based on users and groups of
	  users. But this basic facility is not robust in the face of
	  concerted attack.
	<P>
	  A number of mechanisms for strong authentication and
	  confidentiality, as well as billing and payment are being
	  standardized. A complete discussion of these mechanisms is
	  beyond the scope of this document.
      </subsect>
      <subsect>
	<H3><A NAME="human-addr"><num>10. </>Link Addresses That Are
	Readable and Interpretable by Humans</A></H3>
	<BLOCKQUOTE>
	  <p>one of the <q><c>"</>viewing options<c>"</></q> for
	  displaying/printing a link object should provide a
	  human-readable description of the <q><c>"</>address
	  path<c>"</></q> leading to the cited object; AND, that the
	  human must be able to read the path description, interpret
	  it, and follow it (find the destination <q><c>"</>by
	  hand<c>"</></q> so to speak).
	</BLOCKQUOTE>
	<P>
	  Document addresses in the web are designed so that they can
	  be transcribed<emdash> -- </>written on envelopes, recited
	  over the phone, etc. Each URI scheme has an associated
	  public specification of how to interpret and follow its path
	  description.
	<P>
	  By and large, URIs are sensible to those familiar with the
	  conventions<emdash> -- </><TT>http://www.ford.com</TT> is
	  the address of Ford Motor Company, for example.

	<P>But portions of URIs are allowed to be opaque by
	design<emdash> -- </>they may be pointers into an index,
	checksums, dates, etc.

      </subsect>
      <subsect>
	<H3><A NAME="alladdr"><num>11. </>Every Object
	Addressable</A></H3>
	<BLOCKQUOTE>
	  <p>in principal, every object that someone might validly
	  want/need to cite should have an unambiguous address
	  (capable of being portrayed in a manner as to be human
	  readable and interpretable). (E.g. not acceptable to be
	  unable to link to an object within a
	  <q><c>"</>frame<c>"</></q> or <q><c>"</>card.<c>"</></q>)
	</BLOCKQUOTE>
	<P>
	  Every object on the web is addressable. But not every
	  substructure within objects that someone might need to cite
	  has a standard addressing mechanism.  For example,
	  individual pages in a postscript document, lines in a text
	  file, pixels in an image.
	<P>
	  These structures are, in principle, addressable. Only a
	  standard syntax for URI fragment identifiers to address them
	  is lacking.
	<P>
	  In HTML documents, elements can be named and addressed. But
	  there is no mechanism to address unnamed elements. For
	  parties that do not have write access to a document, this
	  presents a problem.
	<P>
	  One solution would be to allow elements to be addressed by
	  their structural position. There are a number of standard
	  technologies for addressing elements of SGML documents (and
	  hence HTML documents): TEI pointers, HyTime location
	  ladders, and DSSSL queries. Any of these could be
	  incorporated into web software.
	<P>
	  Another possibility is to address strings within a document
	  by pattern matching.  One annotation system<bibref><A
	  HREF="#BRIO">[BRIO]</A></> uses patricia trees for stable
	  pointers into documents.
      </subsect>
      <subsect>
	<H3><A NAME="printlink"><num>12. </>Hard-Copy Print Options to
	Show Address of Objects and Address Specification of Links</A>
	</H3>
	<BLOCKQUOTE>
	  <p>so that, besides online workers being able to follow a
	  link-citation path (manually, or via an automatic link
	  jump), people working with associated hard copy can read and
	  interpret the link-citation, and follow the indicated path
	  to the cited object in the designated hard-copy document.
	  <P>
	    Also, suppose that a hard-copy worker wants to have a link
	    to a given object established in the online file. By
	    visual inspection of the hard copy, he should be able to
	    determine a valid address path to that object and for
	    instance hand-write an appropriate link specification for
	    later online entry, or dictate it over a phone to a
	    colleague
	</BLOCKQUOTE>
	<P>
	  Most of the installed base of web client software allows
	  users to view link address. But ironically, that option is
	  not available for printing in many cases. It would be a
	  straightforward enhancement, well within the bounds of the
	  existing architecture.
      </subsect>
    </subsects>
  </sect>
  <sect>
    <H2>Summary</H2>
    <P>
      In the following table, each row represents one of Engelbart's
      requirements.  The columns are as follows:
    <DL>
      <DT>
	Architecture Support
	<DD>
	  Is the requirement explicitly addressed in the Web
	  architecture? Does the architecture passively allow the
	  requirement to be met by applications? Or does the
	  architecture conflict with the requirement?
      <DT>
	Standard Facilities
	<DD>
	  Are there standardized facilities that meet the requirement?
	  Partially meet the requirement?
      <DT>
	Ubiquitous Facilities
	<DD>
	  Some facilities are standardized, but not required of all
	  systems, or not supported by all installations. What
	  facilities are assumed to be supported by all participants
	  in the web?
      <DT>
	Local/Proprietary Facilities
	<DD>
	  Has the requirement been met within the architecture by
	  local applications or with the use of proprietary
	  facilities?
    </DL>
    <P>
      Each cell contains an evaluation of whether the requirement is
      met (YES, NO, PASSive, or PARTial) followed by a list of
      relevant facilities. Missing facilities are marked with *.

    <TABLE>
      <caption>Evaluation</caption>
      <TR>
	<TH>Requirement</TH> <TH>Architecture Support</TH>
	<TH>Standard Facilities</TH> <TH>Ubiquitous Facilities</TH>
	<TH>Local/Proprietary Facilities</TH>
      </TR>
      <TR>
	<TD>1. Mixed Object Documents</TD> <TD>YES: format
	  negotiation, typed links</TD> <TD>PART: URI, HTML, IMG,
	  INSERT, MIME<BR> link types*</TD> <TD>PART: GIF in HTML</TD>
	<TD>YES: JPEG in HTML, Java/Safe-Tcl in HTML, OLE, OpenDoc,
	  Fresco<BR>
	  <BR>
	</TD>
      </TR>
      <TR>
	<TD>2. Explicitly Structured Documents</TD> <TD>YES:
	  fragment identifiers</TD> <TD>YES: HTML, SGML, MIME</TD>
	<TD>PART: HTML</TD> <TD>YES: Panorama, OLE, LINCKS</TD>
      </TR>
      <TR>
	<TD>3. View Control of Object's Form, Sequence, and
	  Content</TD> <TD>PASS</TD> <TD>PART: HTTP, CGI</TD>
	<TD>NO</TD> <TD>YES: DynaWeb, Java/Safe-Tcl, Style
	  Sheets</TD>
      </TR>
      <TR>
	<TD>4. The Basic <q><c>"</>Hyperdocument<c>"</></q></TD>
	<TD>YES</TD> <TD>YES: URI, HTML</TD> <TD>YES: URI, HTML</TD>
	<TD></TD>
      </TR>
      <TR>
	<TD>5. Hyperdocument <q><c>"</>Back-Link<c>"</></q>
	  Capability</TD> <TD>PASS</TD> <TD>PART: Referer</TD>
	<TD>NO</TD> <TD>YES: local link map, back-link service</TD>
      </TR>
      <TR>
	<TD>6. The Hyperdocument <q><c>"</>Library
	    System<c>"</></q></TD> <TD>PASS</TD> <TD>NO</TD> <TD>NO</TD>
	<TD>NO</TD>
      </TR>
      <TR>
	<TD>7. Hyperdocument Mail</TD> <TD>PASS</TD> <TD>YES:
	  MIME</TD> <TD>YES: MIME</TD> <TD></TD>
      </TR>
      <TR>
	<TD>8. Personal Signature Encryption</TD> <TD>PASS</TD>
	<TD>NO</TD> <TD>NO</TD> <TD>YES: S-HTTP, PEM, S/MIME, PGP,
	  PKCS-7</TD>
      </TR>
      <TR>
	<TD>9. Access Control</TD> <TD>YES</TD> <TD>PART: Basic
	  auth</TD> <TD>PART: Basic Auth</TD> <TD>YES: MD5, SSL,<BR>
	  S-HTTP, PGP, smart cards, etc.</TD>
      </TR>
      <TR>
	<TD>10. Link Addresses That Are Readable and Interpretable
	  by Humans</TD> <TD>PART: URI</TD> <TD>PART: URI</TD>
	<TD>PART: URI</TD> <TD></TD>
      </TR>
      <TR>
	<TD>11. Every Object Addressable</TD> <TD>objects: YES
	  substructures: PASS</TD> <TD>PART: URI</TD> <TD>PART:
	  URI</TD> <TD></TD>
      </TR>
      <TR>
	<TD>12. Hard-Copy Print Options to Show Address of Objects
	  and Address Specification of Links</TD> <TD>PASS</TD>
	<TD>NO</TD> <TD>NO</TD> <TD>YES: HTML2LaTeX</TD>
      </TR>
    </TABLE>
    <P>
  </sect>
  <sect><H2>Conclusions</H2>

    <P>Support for Engelbart's requirements is far from
    ubiquitous. But the architecture in no way prevents them from
    being realized, and the quantity of resources integrated into the
    system provides ample motivation for research and development.
    <P>
      In each area where facilities to meet the requirement are not
      ubiquitous, a demonstration of sufficient facilities has taken
      place.
    <P>
      This gives confidence that the requirements will eventually be
      met and become infrastructure.
    <P>
      If in fact Engelbart's requirements are an effective way to
      measure the viability of a platform for electronic commerce, the
      Web is very likely to be a viable platform for some time to
      come.
  </sect>
  <sect>
    <H2>
      Acknowledgements
    </H2>
    <P>
      This paper has been in development since I met Douglas Engelbart
      and Tim Berners-Lee on the same day at Hypertext '91.
    <P>
      Since then, a number of collaborators have influenced my
      perspective on web design: Roy Fielding, Larry Masinter, Terry
      Allen, and Stu Wiebel to name a few.
    <P>
      Thanks to Jim Miller for his encouragement and review comments.

    <P>I would also like to thank Ravi Kalakota, who invited me to
    write this paper and present it at <bibref><a
    href="#austin-ecommerce">[UT95]</a>, and Keith Instone and the
    participants of the <a href="http://instone.org/keith/hrweb/ht96-call.html">HRWeb workshop</a> at Hypertext '96 in Washington D.C. </>.

  </sect>
  <references><H2>References</H2>
    <DL>
      <DT><A name=BRIO>[BRIO]</A>

	<DD><ref.inproceedings><ref.title><A
	HREF="http://www-diglib.stanford.edu/diglib/pub/reports/brio_www95.html">Beyond
	Browsing: Shared Comments, SOAPs, Trails and On-line
	Communities</A></> <ref.author>by M.  R&ouml;scheisen, C. Mogensen
	and T. Winograd</> <ref.intitle>appeared in <A
	HREF="http://www.igd.fhg.de/www95.html">Fifth International
	World Wide Web Conference</A></> <ref.pub.addr>Darmstadt</>,
	<ref.date num=1995>1995</>. See also <A
	HREF="http://www-diglib.stanford.edu/diglib/pub/reports/COMMENTOR">ComMentor
	Documentation</A>.[html document, 88k] added Feb. 21, 1995</>

      <DT><A name=ENG90>[ENG90]</A>

	<DD><ref.inproceedings><ref.title><A
	HREF="http://www.bootstrap.org/augment/AUGMENT/132082.html"> Knowledge-Domain
	Interoperability and an Open Hyperdocument System</A></>
	<ref.author>Douglas C.  Engelbart</> <ref.intitle>Proceedings
	of the Conference on Computer-Supported Collaborative Work</>,
	<ref.pub.addr>Los Angeles, CA</> <ref.date num="19901007">Oct
	7-10,</> <ref.pages>pp. 143-156.</> (AUGMENT, 132082).</>

      <DT><A name=FOLD>[FOLD]</A>

	<DD><ref.email><ref.title>Proposal<emdash> -- </>Foldable
	Anchor Inclusion</> ,<ref.author><a
	href="mailto:torrance@ai.mit.edu">Mark Torrance</a></> message
	to <ref.pub.addr>www-talk@w3.org</>, <ref.date
	num="19940727">Tue, 26 Jul 94 13:40:42 EDT</>. See: <A
	HREF="http://www.eit.com/cgi-bin/wwwwais?keywords=Outline+Fold&amp;selection=WWW-HTML%2C+1994-5&amp;maxhits=40">archives</A>.</>
	
	  
	  
	  
      <DT><A name=HTML>[HTML]</A>

	<DD><ref.report><ref.title>HyperText Markup Language
	Specification<emdash> -- </>2.0</>, <ref.institution><A
	HREF="http://www.ietf.org/">IETF</A></> <ref.num><A
	HREF="ftp://ds.internic.net/rfc/rfc1866.txt">RFC 1866</A></>
	<ref.author>T. Berners-Lee and D. Connolly</>, <ref.date
	num="199511">November 1995</>.</>

      <DT><A name=HTTP>[HTTP]</A>

	<DD><ref.report><A
	HREF="ftp://ds.internic.net/internet-drafts/draft-ietf-http-v10-spec-00.ps">Hypertext
	Transfer Protocol - HTTP/1.0</A>. T. Berners-Lee,
	R. T. Fielding, and H. Frystyk Nielsen. Work in Progress, MIT,
	UC Irvine, CERN, March 1995.</>

      <DT><A name=LINCKS>[LINCKS]</A>

	<DD><ref.report>LINCKS<emdash> -- </>a platform for
	cooperative information systems, P. Lambrix, M.  Sj&ouml;lin, and
	L. Pagdham, unpublished manuscript, Department of Computer and
	Information Science, LiTH, Sweden, 1993. See: <A
	HREF="http://www.ida.liu.se/~lincks/">LINCKS Home Page</A></>

      <DT><A name=MIME>[MIME]</A>

	<DD><ref.report>MIME (Multipurpose Internet Mail Extensions)
	Part One: Mechanisms for Specifying and Describing the Format
	of Internet Message Bodies. N.  Borenstein &amp;
	N. Freed. September 1993. (Format: TXT=187424, <A
	HREF="http://info.internet.isi.edu:80/in-notes/rfc/files/rfc1521.ps">PS</A>=393670
	bytes) (Obsoletes RFC1341) (Updated by RFC1590)</>

      <DT><A name=NCSASGML>[NCSASGML]</A>

	<DD><ref.report><A
	HREF="http://www.ncsa.uiuc.edu/SDG/Software/Mosaic/WebSGML.html">Welcome
	to SGML on the Web</A>, Lucy Ventresca, NCSA and
	SoftQuad. Work in Progress.</>

      <DT><A name=PATH>[PATH]</A>

	<DD><ref.report><A
	HREF="http://union.ncsa.uiuc.edu/~liberte/www/path.html">The
	Path URN Specification </A>, D. LaLiberte, M. Shapiro,
	Internet Draft 07/26/1995</>

      <DT><A name=STANF>[STANF]</A>

	<DD><ref.report><A
	HREF="http://www.saintjoe.edu/URI/stanf.html">table Network
	File URLs as a Mechanism for Uniform Naming </A> Terry
	Winograd. Early draft version 12/2/93</>

      <DT><A name=SGML>[SGML]</A>

	<DD><ref.report><A
	HREF="http://www.iso.ch/cate/d16387.html">ISO
	8879</A>. Information Processing<emdash> -- </>Text and Office
	Systems - Standard Generalized Markup Language (SGML),
	1986.</>

      <DT><A name=URI>[URI]</A>
	
	<DD><ref.report><A
	href="http://info.internet.isi.edu:80/in-notes/rfc/files/rfc1630.txt">rfc1630.txt</A>
	<emdash> -- </>Universal Resource Identifiers in WWW: A
	Unifying Syntax for the Expression of Names and Addresses of
	Objects on the Network as used in the World-Wide
	Web. T. Berners-Lee. June 1994. (Format: TXT=57601 bytes)</>

      <dt><a name=hyperg>[HyperG]</a> <dd><ref.email><a
      HREF="http://hyperg.iicm.tu-graz.ac.at/Chyperg">Hyper-G</A>@@</>
	  
	  
      <dt><a name=navipress>[Navipress]</a> <dd><ref.email><A
      HREF="http://www.navisoft.com/">NaviPress</A>, <a
      HREF="http://www.naviservice.com/">NaviService</A>@@</>
	  
      <dt><a name=frontpage>[FrontPage]</a> <dd><ref.email><A
      HREF="http://www.vermeer.com/">FrontPage</A>@@</>
	  
      <dt><a name=dynatext>[Dynatext]</a> <dd><ref.email><A
      HREF="http://www.ebt.com/">dynatext</A>@@</>
	  
      <dt><a name="I4I">[I4I]</a> <dd><ref.email><A
      HREF="http://www.i4i.org/vision.html">I4I</A>@@</>
	  
      <dt><a name="altavista">[AltaVista]</a> <dd><ref.email><a
      HREF="http://www.altavista.digital.com/">Altavista</A>@@</>
	  
      <dt><a name="harvest">[Harvest]</a> <dd><ref.email><A
      HREF="http://harvest.cs.colorado.edu/">Harvest</A>@@</>
	  
      <dt><a name="oclc">[OCLC]</a> <dd><ref.email><A
      HREF="http://www.oclc.org/">OCLC</A> research@@</>
	  
      <dt><A name="austin-ecommerce">[UT95]</a> <dd><ref.email><a
      href="http://cism.bus.utexas.edu/ravi/ecomm.html">The First
      International Conference on Electronic Commerce</A> at the
      University of Texas at Austin, Austin, October 1995 @@</>
	  
    </DL>
  </references>
</report>