Extracting semantic knowledge from Web context for multimedia IR: A taxonomy, survey and challenges

T. Bracamonte; B. Bustos; B. Poblete; T. Schreck

doi:10.1007/s11042-017-4997-y

Extracting semantic knowledge from Web context for multimedia IR: A taxonomy, survey and challenges

T. Bracamonte, B. Bustos, B. Poblete, T. Schreck

Institute of Computer Graphics and Knowledge Visualisation (7110)

Research output: Contribution to journal › Article › peer-review

Abstract

Since its invention, the Web has evolved into the largest multimedia repository that has ever existed. This evolution is a direct result of the explosion of user-generated content, explained by the wide adoption of social network platforms. The vast amount of multimedia content requires effective management and retrieval techniques. Nevertheless, Web multimedia retrieval is a complex task because users commonly express their information needs in semantic terms, but expect multimedia content in return. This dissociation between semantics and content of multimedia is known as the semantic gap. To solve this, researchers are looking beyond content-based or text-based approaches, integrating novel data sources. New data sources can consist of any type of data extracted from the context of multimedia documents, defined as the data that is not part of the raw content of a multimedia file. The Web is an extraordinary source of context data, which can be found in explicit or implicit relation to multimedia objects, such as surrounding text, tags, hyperlinks, and even in relevance-feedback. Recent advances in Web multimedia retrieval have shown that context data has great potential to bridge the semantic gap. In this article, we present the first comprehensive survey of context-based approaches for multimedia information retrieval on the Web. We introduce a data-driven taxonomy, which we then use in our literature review of the most emblematic and important approaches that use context-based data. In addition, we identify important challenges and opportunities, which had not been previously addressed in this area.

Original language	English
Pages (from-to)	13853 - 13889
Number of pages	37
Journal	Multimedia Tools and Applications
Volume	77
Issue number	11
Early online date	2017
DOIs	https://doi.org/10.1007/s11042-017-4997-y
Publication status	Published - Jun 2018

Fields of Expertise

Information, Communication & Computing

Access to Document

10.1007/s11042-017-4997-y

Cite this

@article{12ffecc53fc04f7ba918ce447d26f3e8,

title = "Extracting semantic knowledge from Web context for multimedia IR: A taxonomy, survey and challenges",

abstract = "Since its invention, the Web has evolved into the largest multimedia repository that has ever existed. This evolution is a direct result of the explosion of user-generated content, explained by the wide adoption of social network platforms. The vast amount of multimedia content requires effective management and retrieval techniques. Nevertheless, Web multimedia retrieval is a complex task because users commonly express their information needs in semantic terms, but expect multimedia content in return. This dissociation between semantics and content of multimedia is known as the semantic gap. To solve this, researchers are looking beyond content-based or text-based approaches, integrating novel data sources. New data sources can consist of any type of data extracted from the context of multimedia documents, defined as the data that is not part of the raw content of a multimedia file. The Web is an extraordinary source of context data, which can be found in explicit or implicit relation to multimedia objects, such as surrounding text, tags, hyperlinks, and even in relevance-feedback. Recent advances in Web multimedia retrieval have shown that context data has great potential to bridge the semantic gap. In this article, we present the first comprehensive survey of context-based approaches for multimedia information retrieval on the Web. We introduce a data-driven taxonomy, which we then use in our literature review of the most emblematic and important approaches that use context-based data. In addition, we identify important challenges and opportunities, which had not been previously addressed in this area.",

author = "T. Bracamonte and B. Bustos and B. Poblete and T. Schreck",

year = "2018",

month = jun,

doi = "10.1007/s11042-017-4997-y",

language = "English",

volume = "77",

pages = "13853 -- 13889",

journal = "Multimedia Tools and Applications",

publisher = "Springer Netherlands",

number = "11",

}

TY - JOUR

T1 - Extracting semantic knowledge from Web context for multimedia IR: A taxonomy, survey and challenges

AU - Bracamonte, T.

AU - Bustos, B.

AU - Poblete, B.

AU - Schreck, T.

PY - 2018/6

Y1 - 2018/6

N2 - Since its invention, the Web has evolved into the largest multimedia repository that has ever existed. This evolution is a direct result of the explosion of user-generated content, explained by the wide adoption of social network platforms. The vast amount of multimedia content requires effective management and retrieval techniques. Nevertheless, Web multimedia retrieval is a complex task because users commonly express their information needs in semantic terms, but expect multimedia content in return. This dissociation between semantics and content of multimedia is known as the semantic gap. To solve this, researchers are looking beyond content-based or text-based approaches, integrating novel data sources. New data sources can consist of any type of data extracted from the context of multimedia documents, defined as the data that is not part of the raw content of a multimedia file. The Web is an extraordinary source of context data, which can be found in explicit or implicit relation to multimedia objects, such as surrounding text, tags, hyperlinks, and even in relevance-feedback. Recent advances in Web multimedia retrieval have shown that context data has great potential to bridge the semantic gap. In this article, we present the first comprehensive survey of context-based approaches for multimedia information retrieval on the Web. We introduce a data-driven taxonomy, which we then use in our literature review of the most emblematic and important approaches that use context-based data. In addition, we identify important challenges and opportunities, which had not been previously addressed in this area.

AB - Since its invention, the Web has evolved into the largest multimedia repository that has ever existed. This evolution is a direct result of the explosion of user-generated content, explained by the wide adoption of social network platforms. The vast amount of multimedia content requires effective management and retrieval techniques. Nevertheless, Web multimedia retrieval is a complex task because users commonly express their information needs in semantic terms, but expect multimedia content in return. This dissociation between semantics and content of multimedia is known as the semantic gap. To solve this, researchers are looking beyond content-based or text-based approaches, integrating novel data sources. New data sources can consist of any type of data extracted from the context of multimedia documents, defined as the data that is not part of the raw content of a multimedia file. The Web is an extraordinary source of context data, which can be found in explicit or implicit relation to multimedia objects, such as surrounding text, tags, hyperlinks, and even in relevance-feedback. Recent advances in Web multimedia retrieval have shown that context data has great potential to bridge the semantic gap. In this article, we present the first comprehensive survey of context-based approaches for multimedia information retrieval on the Web. We introduce a data-driven taxonomy, which we then use in our literature review of the most emblematic and important approaches that use context-based data. In addition, we identify important challenges and opportunities, which had not been previously addressed in this area.

U2 - 10.1007/s11042-017-4997-y

DO - 10.1007/s11042-017-4997-y

M3 - Article

VL - 77

SP - 13853

EP - 13889

JO - Multimedia Tools and Applications

JF - Multimedia Tools and Applications

IS - 11

ER -

Extracting semantic knowledge from Web context for multimedia IR: A taxonomy, survey and challenges

Abstract

Fields of Expertise

Access to Document

Fingerprint

Cite this