From multimedia to the semantic Web using MPEG-7 and computational intelligence

Tummarello, G.; Morbidoni, Christian; Puliti, P.; Dragoni, Aldo Franco; Piazza, Francesco

doi:10.1109/WDM.2004.1358100

We present an architecture that provides semantic Web annotations of sound clips described by MPEG-7 audio descriptions. The great flexibility of the MPEG-7 standard makes especially difficult to compare descriptions coming from heterogeneous sources. To cope with this, the architecture would first obtain "normalized" versions of the audio descriptions using different adaptation techniques. Once in a "normalized" format, descriptions can be then projected into uniform and semantically relevant vector spaces, ready to be processed by a variety of well known computational intelligence techniques. As higher semantic results are then available, these can be exported as interoperable (RDF) annotations about the resource that was originally fed into the system. As novel aspect, through the use and interchange of MPEG-7 descriptions, the framework allows building applications (e.g. classificators) which can provide annotations on distributed audio resource sets.

From multimedia to the semantic Web using MPEG-7 and computational intelligence / G., T., Morbidoni, C., P., P., Dragoni, A.F., Piazza, F.. - STAMPA. - (2004), pp. 52-59. (WEDELMUSIC 2004 Conference Barcelona; Spain 13-14 Sept. 2004) [10.1109/WDM.2004.1358100].