WD-swbp-thesaurus-pubguide-20050517
21.3 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
<link rel="stylesheet" type="text/css" href="../WD-swbp-skos-core-guide-20050510/guideextras.css" />
<link rel="stylesheet" type="text/css" href="http://www.w3.org/StyleSheets/TR/W3C-WD" />
<title>Quick Guide to Publishing a Thesaurus on the Semantic Web</title>
</head>
<body>
<!-- HEADER SECTION -->
<div class="head">
<a href="http://www.w3.org/"><img src="http://www.w3.org/Icons/w3c_home"
alt="W3C" height="48" width="72" /></a>
<h1 id="title">Quick Guide to Publishing a Thesaurus on the Semantic Web</h1>
<h2>W3C Working Draft 17 May 2005</h2>
<dl>
<dt>This version:</dt>
<dd><a
href="http://www.w3.org/TR/2005/WD-swbp-thesaurus-pubguide-20050517">http://www.w3.org/TR/2005/WD-swbp-thesaurus-pubguide-20050517</a></dd>
<dt>Latest version:</dt>
<dd><a
href="http://www.w3.org/TR/swbp-thesaurus-pubguide">http://www.w3.org/TR/swbp-thesaurus-pubguide</a></dd>
<dt>Previous version:</dt>
<dd><a
href="http://www.w3.org/TR/2005/WD-swbp-thesaurus-pubguide-20050510">http://www.w3.org/TR/2005/WD-swbp-thesaurus-pubguide-20050510</a></dd>
<dt>Editors:</dt>
<dd><a href="http://www.bitd.clrc.ac.uk/Person/A.J.Miles">Alistair
Miles</a>, CCLRC</dd>
</dl>
<p class="copyright"><a
href="http://www.w3.org/Consortium/Legal/ipr-notice#Copyright">Copyright</a>
©2005 <a href="http://www.w3.org/"><acronym
title="World Wide Web Consortium">W3C</acronym></a> <sup>®</sup> (<a
href="http://www.csail.mit.edu/"><acronym
title="Massachusetts Institute of Technology">MIT</acronym></a>, <a
href="http://www.ercim.org/"><acronym
title="European Research Consortium for Informatics and Mathematics">ERCIM</acronym></a>,
<a href="http://www.keio.ac.jp/">Keio</a>), All Rights Reserved. W3C <a
href="http://www.w3.org/Consortium/Legal/ipr-notice#Legal_Disclaimer">liability</a>,
<a
href="http://www.w3.org/Consortium/Legal/ipr-notice#W3C_Trademarks">trademark</a>
and <a href="http://www.w3.org/Consortium/Legal/copyright-documents">document
use</a> rules apply.</p>
</div>
<hr/>
<h2><a id="abstract">Abstract</a></h2>
<p>This document describes in brief how to express the content and structure of a thesaurus, and metadata about a thesaurus, in RDF. Using RDF allows data to be linked to and/or merged with other RDF data by semantic web applications. The Semantic Web, which is based on the Resource Description Framework (RDF), provides a common framework that allows data to be shared and reused across application, enterprise, and community boundaries. </p>
<hr/>
<h2 id="status">Status of this Document</h2>
<p><em>This section describes the status of this document at the time
of its publication. Other documents may supersede this document. A
list of current W3C publications and the latest revision of this
technical report can be found in the <a href="http://www.w3.org/TR/"
>W3C technical reports index</a> at http://www.w3.org/TR/.</em></p>
<p>This document is an updated First Public Working Draft published by the <a href="http://www.w3.org/2001/sw/BestPractices/">Semantic Web Best Practices and Deployment Working Group</a>, part of the <a href="http://www.w3.org/2001/sw/">W3C Semantic Web Activity</a>. The Working Group intends the Quick Guide to Publishing a Thesaurus on the Semantic Web to become a W3C Working Group Note.</p>
<p>This Quick Guide accompanies the <a
href="http://www.w3.org/TR/2005/WD-swbp-skos-core-spec-20050510">SKOS
Core Vocabulary Specification</a> and <a
href="http://www.w3.org/TR/2005/WD-swbp-skos-core-guide-20050510">SKOS
Core Guide</a>.</p>
<p>We encourage public comments. Please send comments to <a href="mailto:public-esw-thes@w3.org?subject=comment:">public-esw-thes@w3.org</a> [<a href="http://lists.w3.org/Archives/Public/public-esw-thes/">archive</a>] and start the subject line of the message with "comment:".</p>
<p>Publication as a Working Draft does not imply endorsement by the
W3C Membership. This is a draft document and may be updated,
replaced or obsoleted by other documents at any time. It is
inappropriate to cite this document as other than work in
progress.</p>
<p><strong>Change History</strong>: The initial (previous) version of
this document was published with incorrect URIs in the UKAT examples.
The URIs used in the previous version resolved to textual descriptions
of UKAT concepts and were not intended to be the names of the concepts
themselves. This last-minute publication change is corrected in this
version.</p>
<hr/>
<h2 id="contents">Contents</h2>
<ul>
<li><a href="#secIntro">Introduction</a></li>
<li><a href="#secExpressingThesaurus">Expressing a Thesaurus in RDF</a></li>
<li><a href="#secCreatingRDF">Creating and Maintaining an RDF Representation of a Thesaurus</a></li>
<li><a href="#secExpressingMetadata">Expressing Thesaurus Metadata in RDF</a></li>
<li><a href="#secPublishing">Publishing RDF Data</a></li>
<li><a href="#secReading">Further Reading</a></li>
<li><a href="#secReferences">References</a></li>
</ul>
<hr/>
<h2 id="secIntro">Introduction</h2>
<p> The Semantic Web provides a common framework that allows data to be shared and reused across application, enterprise, and community boundaries [<a href="#refSemanticWeb">Semantic Web Activity</a>]. It is based on the Resource Description Framework (RDF) [<a href="#refRDF">RDF</a>], which provides a simple data formalism for talking about things, their properties, inter-relationships, and categories (classes). For an overview of RDF, see [<a href="#refRDFConcepts">RDF Concepts</a>]</p>
<p>This document describes in brief how to express the content and structure of a thesaurus, and metadata about a thesaurus, in RDF. Using RDF allows your data to be linked to and/or merged with other RDF data by semantic web applications. In practice, this means that data sources can be distributed across the web in a decentralised way, but still be meaningfully composed and integrated by applications, often in novel and unanticipated ways.</p>
<p>The examples in this guide are given as a visualisation of the RDF graph, e.g.</p>
<p class="caption">example</p>
<p><img src="http://www.w3.org/2004/02/skos/core/guide/img/ex-triple.png" alt="Graph of an RDF triple"/></p>
<p>An RDF graph can be serialised (i.e. encoded as a series of characters) according to any of three currently defined RDF syntaxes: RDF/XML [<a href="#refRDFSyntax">RDF Syntax</a>], N3/Turtle [<a href="#refTurtle">Turtle</a>][<a href="#refN3">N3</a>], N-Triple [<a href="#refNTriples">N-Triple</a>]. All examples in this guide use the RDF/XML syntax for consistency. </p>
<p>The examples in this document use the SKOS Core Vocabulary, which is a set of properties and classes that can be used to express the conceptual content of a thesaurus as an RDF graph. SKOS Core is designed to be used with not only thesauri, but also other types of 'concept scheme', such as classification schemes, subject heading systems, controlled vocabularies, glossaries, taxonomies etc. For a complete description of SKOS Core, see [<a href="#refSKOSCoreGuide">SKOS Core Guide</a>].</p>
<p>The examples in this document also use the DCMI Metadata Terms, which are properties and classes for describing resource metadata. For more about DCMI Terms, see [<a href="#refDCMITerms">DCMI Terms</a>].</p>
<p>Note that the prefix <code>skos:</code> in this document stands for <code>http://www.w3.org/2004/02/skos/core#</code> - so for example <code>skos:prefLabel</code> is an abbreviation of <code>http://www.w3.org/2004/02/skos/core#prefLabel</code>.</p>
<hr/>
<h2 id="secExpressingThesaurus">Expressing a Thesaurus in RDF</h2>
<p>Below is an extract from the UK Archival Thesaurus (UKAT) [<a href="#refUKAT">UKAT</a>]:</p>
<pre class="code">
Term: Economic cooperation
Used For:
Economic co-operation
Broader terms:
Economic policy
Narrower terms:
Economic integration
European economic cooperation
European industrial cooperation
Industrial cooperation
Related terms:
Interdependence
Scope Note:
Includes cooperative measures in banking, trade, industry etc., between and among countries.
</pre>
<p>This example, expressed as an RDF graph using the SKOS Core Vocabulary, looks like:</p>
<p><img src="http://www.w3.org/2004/02/skos/core/guide/img/ex-ukat-1.png" alt="Graph of extract from UKAT"/></p>
<p>Each of the blue circles in the image above represents a concept from the UKAT. The red circle represents the UKAT itself.</p>
<p>Each concept from the UKAT has an allocated URI. URIs are globally unique identifiers that may be used to refer to a resource unambiguously from any context. Anything can be a 'resource', not just web documents, therefore URIs can be used as identifiers for anything. For example, the URI:</p>
<p><code>http://www.ukat.org.uk/thesaurus/concept/1750</code></p>
<p>... denotes the concept from the UKAT whose preferred term is 'Economic cooperation'. (The URIs are not shown in the visualisation of the graph above for the sake of readability.)</p>
<p><strong>Allocating URIs to the concepts in a thesaurus allows anybody to refer to them unambiguously from any context.</strong></p>
<p>For a complete description of considerations relevant to allocating URIs, see [<a href="#refWEBARCH">WEBARCH</a>]. See also the section 'HTTP URIs for Concepts' in [<a href="#refSKOSCoreGuide">SKOS Core Guide</a>].</p>
<p>An RDF/XML serialisation of the RDF description of the 'Economic cooperation' concept from the UKAT is below:</p>
<pre id="rdfxml_example" class="code">
<rdf:RDF
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:skos="http://www.w3.org/2004/02/skos/core#">
<skos:Concept rdf:about="http://www.ukat.org.uk/thesaurus/concept/1750">
<skos:prefLabel>Economic cooperation</skos:prefLabel>
<skos:altLabel>Economic co-operation</skos:altLabel>
<skos:scopeNote>Includes cooperative measures in banking, trade, industry etc.,
between and among countries.</skos:scopeNote>
<skos:broader rdf:resource="http://www.ukat.org.uk/thesaurus/concept/4382"/>
<skos:narrower rdf:resource="http://www.ukat.org.uk/thesaurus/concept/2108"/>
<skos:narrower rdf:resource="http://www.ukat.org.uk/thesaurus/concept/9505"/>
<skos:narrower rdf:resource="http://www.ukat.org.uk/thesaurus/concept/15053"/>
<skos:narrower rdf:resource="http://www.ukat.org.uk/thesaurus/concept/18987"/>
<skos:related rdf:resource="http://www.ukat.org.uk/thesaurus/concept/3250"/>
<skos:inScheme rdf:resource="http://www.ukat.org.uk/thesaurus"/>
</skos:Concept>
</rdf:RDF>
</pre>
<p>Note that, in expressing the content of a thesaurus such as the UKAT in RDF using SKOS Core, each descriptor (preferred term) becomes a <em>preferred label for a concept</em>, and each non-descriptor (non-preferred term) becomes an <em>alternative label for a concept</em>.</p>
<p>Note also the <code>skos:inScheme</code> property - this property may be used to assert a link between a concept and the concept scheme(s) in which it participates. SKOS Core places no restriction on the number of schemes in which a concept may participate.</p>
<hr/>
<h2 id="secCreatingRDF">Creating and Maintaining an RDF Representation of a Thesaurus</h2>
<p>Most thesauri are managed via a thesaurus management system. Where the thesaurus management system stores its data in a relational database, or in an XML or structured text file format, or where the standard output of the thesaurus management system is an XML or structured text format, it is usually possible to create an RDF representation of the thesaurus via an automated procedure (e.g. database report, text parsing program, XSLT transformation).</p>
<p>A full discussion of conversion techniques and best practice is beyond the scope of this document. However, note that when using an automated procedure, care must be taken to ensure that the output generated is sensible and conforms to the recommended usage of the SKOS Core Vocabulary. For example, if an XML format contains an XML element whose name is 'scopenote' it should not be automatically assumed that the textual content of these elements should be converted to the value of a <code>skos:scopeNote</code> property. Perhaps these 'scopenote' elements actually contain definitions, in which case the <code>skos:definition</code> property should be used; or perhaps the 'scopenote' elements have been used very loosely and contain all kinds of note types, in which case the more general <code>skos:publicNote</code> property would be more appropriate.</p>
<p>Also note that maintaining an RDF representation of a thesaurus requires clear policies for versioning and change management. For example, users need to know if the meaning associated with a URI is stable, and if it isn't they need to know how and when it may change. A fuller discussion of management best practice is currently being undertaken by the Vocabulary Management Task Force of the Semantic Web Best Practices and Deployment Working Group.</p>
<hr/>
<h2 id="secExpressingMetadata">Expressing Thesaurus Metadata in RDF</h2>
<p>RDF can also be used to express metaproperties of a thesaurus, such as it's title, description, date of modification and so on. The DCMI Metadata Terms [<a href="#refDCMITerms">DCMI Terms</a>] include a number of useful properties for this purpose. For example, below is an RDF/XML serialisation of the UKAT metadata:</p>
<pre class="code">
<rdf:RDF
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:skos="http://www.w3.org/2004/02/skos/core#"
xmlns:dc="http://purl.org/dc/elements/1.1/">
<skos:ConceptScheme rdf:about="http://www.ukat.org.uk/thesaurus">
<dc:title>The UK Archival Thesaurus</dc:title>
<dc:description>A subject thesaurus produced to support indexing in the UK
archive sector.</dc:description>
<dc:creator>UK Archival Thesaurus project</dc:creator>
<dc:date>2004-08-22</dc:date>
<dc:language>en</dc:language>
<dc:rights>All rights reserved. Data in the UK Archival Thesaurus may be freely
used and copied, without prior permission, for educational and other non-commercial
purposes. These purposes include (but are not limited to) the incorporation of UKAT
data into indexes, thesauri and finding aids created by organisations and projects
in the archive sector and the wider heritage sector, in the UK and elsewhere. Under
no circumstances may copies of UKAT data be sold without prior written permission
from the UKAT Project (support@ukat.org.uk).</dc:rights>
<skos:hasTopConcept rdf:resource="http://www.ukat.org.uk/thesaurus/field/1"/>
<skos:hasTopConcept rdf:resource="http://www.ukat.org.uk/thesaurus/field/2"/>
<skos:hasTopConcept rdf:resource="http://www.ukat.org.uk/thesaurus/field/3"/>
<skos:hasTopConcept rdf:resource="http://www.ukat.org.uk/thesaurus/field/4"/>
<skos:hasTopConcept rdf:resource="http://www.ukat.org.uk/thesaurus/field/5"/>
<skos:hasTopConcept rdf:resource="http://www.ukat.org.uk/thesaurus/field/6"/>
<skos:hasTopConcept rdf:resource="http://www.ukat.org.uk/thesaurus/field/8"/>
</skos:ConceptScheme>
</rdf:RDF>
</pre>
<p>See [<a href="#refDCMITerms">DCMI Terms</a>] for a description of the recommended usage of these properties.</p>
<p>To make statements about a thesaurus in RDF, the thesaurus must have an allocated URI. For example, the URI:</p>
<p><code>http://www.ukat.org.uk/thesaurus</code></p>
<p>... denotes the UKAT itself.</p>
<p>Note that SKOS Core models a thesaurus as a 'concept scheme'. For more about this, see the section 'Concept Schemes' in [<a href="#refSKOSCoreGuide">SKOS Core Guide</a>].</p>
<p>Note also that, in the example above, a link has been asserted between the UKAT thesaurus and the top concepts in the UKAT thesaurus (in the UKAT they are known as 'fields') using the <code>skos:hasTopConcept</code> property. Using this property is recommended, as it gives applications an efficient way of locating the top concepts for a given scheme.</p>
<hr/>
<h2 id="secPublishing">Publishing RDF Data</h2>
<p>The simplest way to publish RDF data is to create one or more RDF documents containing your data, and publish them on the web via a normal HTTP server.</p>
<p>Note that, although the examples above all use the RDF/XML serialisation syntax (i.e. file format), there are two other alternative syntaxes for RDF: N3/Turtle [<a href="#refN3">N3</a>][<a href="#refTurtle">Turtle</a>] and N-Triples [<a href="#refNTriples">N-Triples</a>]. For documents containg RDF data in the RDF/XML format, the 'content-type' field in the HTTP header for that document should be 'application/rdf+xml'. </p>
<p>You can also publish your RDF data on the web via a dedicated RDF server such as Joseki [<a href="#refJoseki">Joseki</a>] or Sesame [<a href="#refSesame">Sesame</a>]. Publishing via an RDF server allows anyone to query the thesaurus over the web via an RDF query language such as SPARQL [<a href="#refSPARQL">SPARQL</a>].</p>
<p>Current practice suggests that if you use HTTP URIs as identifiers for your thesaurus and its concepts, then those URIs should dereference to an RDF description of the thing they identify.</p>
<hr/>
<h2 id="secReading">Further Reading</h2>
<ul>
<li><a href="http://www.w3.org/2004/02/skos/core/guide/">SKOS Core Guide</a></li>
<li><a href="http://www.w3.org/TR/rdf-primer/">RDF Primer</a></li>
<li><a href="http://www.w3.org/TR/webarch/">Architecture of the World Wide Web, Volume One</a></li>
</ul>
<hr/>
<h2 id="secReferences">References</h2>
<dl>
<dt id="refDCMITerms">[DCMI Terms]</dt>
<dd><span class="title">DCMI Metadata Terms</span>. Dublin Core Metadata Initiative, 2004. (See <a href="http://dublincore.org/documents/dcmi-terms/">http://dublincore.org/documents/dcmi-terms/</a>) </dd>
<dt id="refJoseki">[Joseki]</dt>
<dd><span class="title">Joseki Jena RDF Server</span>. Sourceforge. (See <a href="http://www.joseki.org/">http://www.joseki.org/</a>)</dd>
<dt id="refN3">[N3]</dt>
<dd>Tim Berners-Lee. <span class="title">Primer: Getting into RDF & Semantic Web using N3</span>. World Wide Web Consortium, 2004. (See <a href="http://www.w3.org/2000/10/swap/Primer">http://www.w3.org/2000/10/swap/Primer</a>)</dd>
<dt id="refNTriples">[N-Triples]</dt>
<dd>Jan Grant, Dave Beckett, editors. <span class="title">RDF Test Cases (Section 3. N-Triples)</span>. World Wide Web Consortium, 2004. (See <a href="http://www.w3.org/TR/rdf-testcases/#ntriples">http://www.w3.org/TR/rdf-testcases/#ntriples</a>) </dd>
<dt id="refRDF">[RDF]</dt>
<dd><span class="title">Resource Description Framework (RDF)</span>. (See <a href="http://www.w3.org/RDF/">http://www.w3.org/RDF/</a>)</dd>
<dt id="refRDFConcepts">[RDF Concepts]</dt>
<dd>Graham Klyne, Jeremy Carroll, editors.<span class="title">Resource Description Framework (RDF): Concepts and Abstract Syntax</span>. World Wide Web Consortium, 2004. (See <a href="http://www.w3.org/TR/rdf-concepts/">http://www.w3.org/TR/rdf-concepts/</a>)</dd>
<dt id="refRDFSyntax">[RDF Syntax]</dt>
<dd>Dave Beckett, editor. <span class="title">RDF/XML Syntax Specification (Revised)</span>. World Wide Web Consortium, 2004. (See <a href="http://www.w3.org/TR/rdf-syntax-grammar/">http://www.w3.org/TR/rdf-syntax-grammar/</a>) </dd>
<dt id="refSemanticWeb">[Semantic Web Activity]</dt>
<dd><span class="title">Semantic Web Activity Statement</span>. World Wide Web Consortium, 2001. (See <a href="http://www.w3.org/2001/sw/Activity">http://www.w3.org/2001/sw/Activity</a>)</dd>
<dt id="refSesame">[Sesame]</dt>
<dd><span class="title">Sesame RDF Database</span>. (See <a href="http://www.openrdf.org/">http://www.openrdf.org/</a>)</dd>
<dt id="refSKOSCoreGuide">[SKOS Core Guide]</dt>
<dd>Alistair Miles, Dan Brickley, editors. <span class="title">SKOS Core Guide</span>. World Wide Web Consortium, 2004. (See <a href="http://www.w3.org/2004/02/skos/core/guide/">http://www.w3.org/2004/02/skos/core/guide/</a>) </dd>
<dt id="refSPARQL">[SPARQL]</dt>
<dd>Eric Prud'hommeaux, Andy Seaborne, editors. <span class="title">SPARQL Query Language for RDF</span>. World Wide Web Consortium, 2004. (See <a href="http://www.w3.org/TR/rdf-sparql-query/">http://www.w3.org/TR/rdf-sparql-query/</a>)</dd>
<dt id="refTurtle">[Turtle]</dt>
<dd>Dave Beckett. <span class="title">Turtle - Terse RDF Triple Language</span>. ILRT, University of Bristol, 2004. (See <a href="http://www.ilrt.bris.ac.uk/discovery/2004/01/turtle/">http://www.ilrt.bris.ac.uk/discovery/2004/01/turtle/</a>) </dd>
<dt id="refUKAT">[UKAT]</dt>
<dd><span class="title">The UK Archival Thesaurus</span>. See (<a href="http://www.ukat.org.uk/">http://www.ukat.org.uk/</a>)</dd>
<dt id="refWEBARCH">[WEBARCH]</dt>
<dd>Ian Jacobs, Norman Walsh, editors. <span class="title">Architecture of the World Wide Web, Volume One</span>. World Wide Web Consortium, 2004. (See <a href="http://www.w3.org/TR/webarch/">http://www.w3.org/TR/webarch/</a>) </dd>
</dl>
</body>
</html>