   MPEG-7 and the Semantic Web
   W3C Incubator Group Editor's Draft 14 August 2007 

   <a name="introduction">
    1. Introduction

MPEG-7 was developed to provide standardized tools for describing different
aspects of multimedia at different levels of abstraction. Its XML-based
syntax enables smooth interchange across applications and over the web,
but the lack of precise semantics hinders metadata interoperability.
Two representative examples include:
    <li>Semantically identical metadata can be represented in multiple ways. For example, an
image depicting a player scoring a goal can be annotated using a free
text annotation ("Zinedine Zidane scoring against England"), a keyword annotation
(Zidane, goal, France, England), a structured annotation with labels, or
the built-in (non-formal) semantic descriptors.

  <div class="exampleInner" style="clear: both">
&lt;FreeTextAnnotation xml:lang="en"&gt;Zinedine Zidane scoring against England.&lt;/FreeTextAnnotation&gt;

<h4 id="free-text-annotation">Using the free text annotation</h4>

  <div class="exampleInner" style="clear: both">
&lt;KeywordAnnotation xml:lang="en"&gt; 

<h4 id="keyword-annotation">Using the keyword annotation</h4>

  <div class="exampleInner" style="clear: both">
    &lt;Name xml:lang="en"&gt;Zinedine Zidane&lt;/Name&gt;
    &lt;Name xml:lang="en"&gt;Zinedine Zidane scoring against England.&lt;/Name&gt;

<h4 id="structured-annotation">Using a structured annotation with labels</h4>

  <div class="exampleInner" style="clear: both">
&lt;Semantic id="FormalAbstractionDescription"&gt; 
  &lt;SemanticBase xsi:type="AgentObjectType" id="Zidane"&gt;
    &lt;Label&gt;&lt;Name&gt;Zidane &lt;/Name&gt;&lt;/Label&gt;
    &lt;Agent xsi:type="PersonType"&gt;
  &lt;SemanticBase xsi:type="EventType" id="scoring"&gt;
      &lt;Name&gt;Zinedine Zidane scoring against England.&lt;/Name&gt;

<h4 id="mpeg7-annotation">Using MPEG-7-built-in (non-formal) semantic descriptor</h4>
  <li>The intended semantics underlying the structure of descriptions defined
within MPEG-7, for example the decomposition relation between an image
and its constituent segments, are not formal and as such cannot be
exploited by applications. An image annotated as depicting Zidane and an image, a
segment of which is annotated as depicting Zidane, won't both be
retrieved by a corresponding 'semantic' query unless customized query
expansion is performed to cover both cases.
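
The second problem can be illustrated with a minimal Python sketch. It assumes a toy in-memory annotation store; the identifiers image01, image02 and region01 and both helper functions are hypothetical and are not part of MPEG-7. A plain lookup only finds the image that is annotated directly, whereas a lookup expanded over the decomposition relation finds both images.

```python
# Toy annotation store: which media item is annotated as depicting what.
# All identifiers below are hypothetical illustrations.
DEPICTS = {
    "image01": {"Zidane"},    # the whole image is annotated directly
    "region01": {"Zidane"},   # only a segment is annotated
}

# Decomposition relation: segment -> the image it is a part of.
SEGMENT_OF = {"region01": "image02"}

IMAGES = {"image01", "image02"}


def plain_query(concept):
    """Return images whose own annotation mentions the concept."""
    return {m for m in IMAGES if concept in DEPICTS.get(m, set())}


def expanded_query(concept):
    """Also return images that have a segment annotated with the concept."""
    hits = plain_query(concept)
    for segment, parent in SEGMENT_OF.items():
        if concept in DEPICTS.get(segment, set()) and parent in IMAGES:
            hits.add(parent)
    return hits


print(sorted(plain_query("Zidane")))     # ['image01']
print(sorted(expanded_query("Zidane")))  # ['image01', 'image02']
```

The expansion step is hand-written here, which is exactly the kind of customized logic that every application has to reimplement when the decomposition semantics are not formalized.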

   <a name="mpeg7-ontologies">
    2. Existing MPEG-7 ontologies

To alleviate the resulting interoperability issues, efforts have been
undertaken to translate MPEG-7 into an ontology and, through appropriate
frameworks, to enable its integration with other ontologies. The two
main methodologies are the
proposals by Hunter et al. and Tsinaraki et al. Both approaches aim
to provide a framework for interoperable, MPEG-7 compliant multimedia
metadata. However, given the continuously growing research interest in
formalizing multimedia-related semantics and building a common metadata
framework, the question of how interoperable these proposals are
with each other becomes particularly important.

   <a name="hunter-ontology">
    2.1. Using the MPEG-7/ABC Ontology

In the approach proposed by Hunter, the ABC ontology serves as the core
ontology, providing attachment points for integrating the MPEG-7 ontology and domain
specific ontologies. More specifically, the <tt>mpeg7:MultimediaContent</tt>
class (and the subsequent multimedia and segment hierarchy) is defined
as a subclass of the <tt>abc:Manifestation</tt> class, while the
domain ontologies are assumed to be attached to the
corresponding ABC classes.

A first observation is that, apart from the structure-related
description schemes, MPEG-7 also includes descriptions of other
aspects (e.g., the semantic part), for which it is not
clear how they should be mapped to ABC or how they relate to possibly
relevant domain specific definitions. For example, <tt>mpeg7:Agent</tt> could
be mapped to <tt>abc:Agent</tt>. A domain specific class <tt>o:Person</tt>
would in turn have to be linked to <tt>abc:Agent</tt> as an equivalent class, as a subclass, or
through some property. This raises the question of how
<tt>mpeg7:Agent</tt> and <tt>o:Person</tt> relate semantically, which in turn reduces
interoperability between pre-existing MPEG-7 based annotation
metadata and metadata newly created under the ABC core ontology framework.

Let us assume that someone follows the approach by Hunter, using the
Multimedia Description Scheme (MDS) part of the MPEG-7 ontology to
address the structural aspects, in order to annotate an image depicting
Zidane scoring. Assuming a soccer ontology s, the involved classes
would be <tt>s:Goal</tt>, <tt>s:Player</tt>, <tt>s:Scoring</tt> and <tt>mpeg7:Image</tt> (at least in a
simple case where spatiotemporal decomposition is not taken into
account). One possible way to represent this annotation would be using
the following statements:

  <div class="exampleInner" style="clear: both">
:image01 rdf:type mpeg7:Image 
:goal01 rdf:type s:Goal
:scoring01 rdf:type s:Scoring

:image01 mpeg7:depicts :goal01
:goal01 abc:hasAction :scoring01
:scoring01 abc:hasAgent _:b1
_:b1 :hasName 'Zinedine Zidane'

where additionally the following hold:

  <div class="exampleInner" style="clear: both">
mpeg7:Image rdfs:subClassOf mpeg7:MultimediaContent 
mpeg7:MultimediaContent rdfs:subClassOf abc:Manifestation
s:Scoring rdfs:subClassOf abc:Action
s:Goal rdfs:subClassOf abc:Event

Notice that under this framework, having attached this annotation to a specific image region rather than the whole image, i.e.

  <div class="exampleInner" style="clear: both">
:region01 rdf:type mpeg7:StillRegion 
:region01 mpeg7:depicts :goal01

we would still be able to retrieve the corresponding image when querying for
images depicting Zinedine Zidane scoring, due to the subclass relation<br/>
<tt>mpeg7:StillRegion rdfs:subClassOf mpeg7:Image</tt>,<br/>
something that is not inherently possible with MPEG-7 itself.
Leaving aside individual modeling decisions (e.g., whether
still regions should be modeled as a subclass of image or related to
the latter through partonomic decomposition relations only), one
sees evidence for the value of using an upper ontology that is sufficiently
generic to allow consistent integration between an MPEG-7 ontology
and domain specific ones.
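
The effect of the subclass relation on retrieval can be sketched in a few lines of Python. The sketch reuses the class and instance names from the statements above, but the dictionary-based store and the two helper functions are our own illustrative assumptions; they are not part of ABC or of any MPEG-7 ontology, and a real system would delegate this step to an RDFS or OWL reasoner.

```python
# Subclass statements from the example above (child -> parent).
# The tiny "reasoner" below is an illustrative assumption only.
SUBCLASS_OF = {
    "mpeg7:StillRegion": "mpeg7:Image",
    "mpeg7:Image": "mpeg7:MultimediaContent",
    "mpeg7:MultimediaContent": "abc:Manifestation",
}

# Asserted types and depiction statements for the two resources.
TYPE = {"image01": "mpeg7:Image", "region01": "mpeg7:StillRegion"}
DEPICTS = {("image01", "goal01"), ("region01", "goal01")}


def superclasses(cls):
    """Return cls together with all its ancestors in the subclass chain."""
    found = [cls]
    while found[-1] in SUBCLASS_OF:
        found.append(SUBCLASS_OF[found[-1]])
    return found


def instances_depicting(cls, target, infer=True):
    """Resources of type cls (asserted or inferred) that depict target."""
    result = set()
    for subject, obj in DEPICTS:
        if obj != target:
            continue
        types = superclasses(TYPE[subject]) if infer else [TYPE[subject]]
        if cls in types:
            result.add(subject)
    return result


# Without inference only the directly typed image matches.
print(sorted(instances_depicting("mpeg7:Image", "goal01", infer=False)))
# ['image01']

# With subclass inference the annotated region is returned as well.
print(sorted(instances_depicting("mpeg7:Image", "goal01")))
# ['image01', 'region01']
```

Note that the region itself is returned as an image here, which is a direct consequence of modeling still regions as a subclass of image rather than as parts of an image.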

   <a name="tsinaraki-ontology">
    2.2. Using the MPEG-7/Tsinaraki Ontology

In the approach of Tsinaraki, on the other hand, the semantic part of MPEG-7 is translated
into an ontology that serves as the core ontology for the attachment of
domain specific ontologies, in order to achieve MPEG-7 compliant domain
specific annotations. A first observation is that under this approach
the original conceptualization of the domain specific ontologies needs
to be "mapped" to the MPEG-7 modeling rationale. Consequently,
annotation metadata produced following this approach would not be
interoperable with metadata from approaches that couple domain specific ontologies with
an MPEG-7-like ontology by a procedure similar to the one proposed
by Hunter.

   <a name="mpeg7-interoperability">
    3. Solving the interoperability problems

In this section we present possible solutions to the interoperability
problems that arise from the different translations/formalisations of
the MPEG-7 standard. The specific interoperability problems have been
illustrated in the motivating example. Three approaches in
the literature try to overcome such interoperability problems:

  <p>TODO: Michael to describe syntactic (XML, XML-Schema) and semantic (RDF/OWL/rules) aspects.
  <ol type="1">
<li>Create mappings between terms of two or more standards (e.g.
CIDOC CRM vs. Dublin Core). The proposed solution exploits the
expressive power and reasoning support of OWL and SWRL (or another
rule language layered on top of ontologies) in order to create syntactic as well as
semantic mappings.
<li>Align the domain ontologies with a multimedia core ontology (or framework) that
ensures interoperability. This approach covers the work that is in
progress in the K-Space project.
<li>Use MPEG-7 profiles. This approach will be mainly covered by Michael.
  </ol>
The aim of this section is not to present the solutions in detail but
rather the mechanisms that ensure interoperability in MPEG-7 based multimedia
applications. In addition, we will present the interoperability
problems that are solved and the new ones that are introduced.
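
As a rough illustration of the mapping-based approach, the following Python sketch answers a class query across vocabularies by following explicit mapping statements. It reuses the class names discussed in Section 2.1; the choice of relation for each pair, the instance identifiers zidane_a and zidane_b, and both helper functions are hypothetical. In practice such mappings would be expressed in OWL (and possibly SWRL) and evaluated by a reasoner rather than by hand-written code.

```python
# Hypothetical mappings between vocabularies. The relation chosen for each
# pair (equivalent vs. subclass) is exactly the modeling decision a real
# mapping would have to justify.
MAPPINGS = [
    ("mpeg7:Agent", "equivalentClass", "abc:Agent"),
    ("o:Person", "subClassOf", "abc:Agent"),
]

# Hypothetical annotations produced under two different frameworks.
ANNOTATIONS = [
    ("zidane_a", "rdf:type", "mpeg7:Agent"),  # MPEG-7 based metadata
    ("zidane_b", "rdf:type", "o:Person"),     # domain ontology based metadata
]


def classes_answering(query_class):
    """Classes whose instances should be returned for query_class."""
    accepted = {query_class}
    changed = True
    while changed:
        changed = False
        for source, relation, target in MAPPINGS:
            # A subclass or an equivalent class of an accepted class counts.
            if target in accepted and source not in accepted:
                accepted.add(source)
                changed = True
            # Equivalence also holds in the other direction.
            if (relation == "equivalentClass" and source in accepted
                    and target not in accepted):
                accepted.add(target)
                changed = True
    return accepted


def instances_of(query_class):
    """Resources typed with query_class or with any class mapped onto it."""
    accepted = classes_answering(query_class)
    return {s for s, p, o in ANNOTATIONS if p == "rdf:type" and o in accepted}


print(sorted(instances_of("abc:Agent")))  # ['zidane_a', 'zidane_b']
print(sorted(instances_of("o:Person")))   # ['zidane_b']
```

Changing the second mapping from a subclass to an equivalence would make a query for o:Person return both resources, which shows how strongly the retrieval results depend on the chosen mapping semantics.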

   <a name="references" id="references">

    <a id="DublinCore" name="DublinCore">[Dublin Core]</a>
    The Dublin Core Metadata Initiative,
    <a href="http://dublincore.org/documents/dces/">Dublin Core Metadata Element Set, Version 1.1: Reference Description</a>

    <a id="Hunter01" name="Hunter01">[Hunter, 2001]</a>
    J. Hunter. 
    <!-- official link broken? a href="http://www.semanticweb.org/SWWS/program/full/paper59.pdf" -->
    <a href="http://www.itee.uq.edu.au/~jane/jane-hunter/swws.pdf">Adding
    Multimedia to the Semantic Web &#8212; Building an MPEG-7
    Ontology</a>. In <i><a
    href="http://www.semanticweb.org/SWWS/">International Semantic Web
    Working Symposium (SWWS 2001)</a></i>, Stanford University,
    California, USA, July 30 - August 1, 2001

    <a id="MPEG-7" name="MPEG-7">[MPEG-7]</a>
    Information Technology - Multimedia Content Description Interface (MPEG-7).
    Standard No. ISO/IEC 15938:2001, International Organization for Standardization (ISO), 2001

    <a id="Ossenbruggen04" name="Ossenbruggen04">[Ossenbruggen, 2004]</a>
    J. van Ossenbruggen, F. Nack, and L. Hardman. That Obscure Object of Desire: Multimedia Metadata on the Web (Part I). In:
    IEEE Multimedia 11(4), pp. 38-48 October-December 2004

    <a id="Ossenbruggen05" name="Ossenbruggen05">[Ossenbruggen, 2005]</a>
    F. Nack, J. van Ossenbruggen, and L. Hardman. That Obscure Object of Desire: Multimedia Metadata on the Web (Part II). In:
    IEEE Multimedia 12(1), pp. 54-63 January-March 2005

    <a name="OWL-Guide" id="OWL-Guide">[OWL Guide]</a>
       <a href="http://www.w3.org/TR/2004/REC-owl-guide-20040210/">
        OWL Web Ontology Language Guide</a></cite>, Michael K.
        Smith, Chris Welty, and Deborah L. McGuinness, Editors, W3C
        Recommendation, 10 February 2004,
        http://www.w3.org/TR/2004/REC-owl-guide-20040210/ .
        <a href="http://www.w3.org/TR/owl-guide/">Latest
        version</a> available at http://www.w3.org/TR/owl-guide/

    <a name="OWL" id="OWL">[OWL Semantics and Abstract Syntax]</a></dt>
     <a href=
      "http://www.w3.org/TR/2004/REC-owl-semantics-20040210/">OWL Web
     Ontology Language Semantics and Abstract Syntax</a></cite>, Peter
     F. Patel-Schneider, Patrick Hayes, and Ian Horrocks, Editors, W3C
     Recommendation 10 February 2004,
     http://www.w3.org/TR/2004/REC-owl-semantics-20040210/ . <a href=
     "http://www.w3.org/TR/owl-semantics/">Latest version</a>
     available at http://www.w3.org/TR/owl-semantics/</dd>

   <dt><a id="RDF-Primer" name="RDF-Primer">[RDF Primer]</a></dt>
     <cite><a href="http://www.w3.org/TR/2004/REC-rdf-primer-20040210/">RDF
   Primer</a></cite>, F. Manola, E. Miller, Editors, W3C Recommendation, 10 February 2004.  
   <a href="http://www.w3.org/TR/2004/REC-rdf-primer-20040210/">This
   version</a> is
   http://www.w3.org/TR/2004/REC-rdf-primer-20040210/. The <a href="http://www.w3.org/TR/rdf-primer/">latest version</a> is at

   <dt><a id="RDF" name="RDF"></a>[RDF Syntax]</dt>
     <a href="http://www.w3.org/TR/2004/REC-rdf-syntax-grammar-20040210/">
      RDF/XML Syntax Specification (Revised)</a>
    </cite>, Dave Beckett,
      Editor, W3C Recommendation, 10 February 2004,
      http://www.w3.org/TR/2004/REC-rdf-syntax-grammar-20040210/ .  <a
      version</a> available at

    <a id="Stamou05" name="Stamou05">[Stamou, 2005]</a>
    G. Stamou and S. Kollias (eds). Multimedia Content and the
    Semantic Web: Methods, Standards and Tools.  John Wiley &amp; Sons
    Ltd, 2005

    <a id="Troncy03" name="Troncy03">[Troncy, 2003]</a>
    R. Troncy. <a
    Integrating Structure and Semantics into Audio-visual
    Documents</a>. In <i><a
    href="http://iswc2003.semanticweb.org/">Second International
    Semantic Web Conference (ISWC 2003)</a></i>, pages 566 &#8211;
    581, Sanibel Island, Florida, USA, October 20-23,
    2003. Springer-Verlag Heidelberg

    <a id="Tsinaraki" name="Tsinaraki">
    Tsinaraki, C.: OWL soccer ontology available at
    <a href="http://elikonas.ced.tuc.gr/ontologies/soccer.zip">http://elikonas.ced.tuc.gr/ontologies/soccer.zip</a>

    <a id="VDO" name="VDO">[VDO]</a>
    aceMedia Visual Descriptor Ontology, available from <a
     <a id="XML-NS" name="XML-NS">[XML NS]</a>
     <a href="http://www.w3.org/TR/1999/REC-xml-names-19990114/">Namespaces
     in XML</a></cite>, Bray T., Hollander D., Layman A.
     (Editors), World Wide Web Consortium, 14 January 1999. <a href="http://www.w3.org/TR/1999/REC-xml-names-19990114/">This
     version</a> is http://www.w3.org/TR/1999/REC-xml-names-19990114/. The <a href="http://www.w3.org/TR/REC-xml-names/">latest version</a>
     is http://www.w3.org/TR/REC-xml-names/


   <a id="acknowledgments" name="acknowledgments">Acknowledgments</a>

