Learning to Recognize Faces from Videos and Weakly Related Information Cues

Martin Köstinger; Paul Wohlhart; Peter Roth; Horst Bischof

isbn:978-145770845-9

Institute of Computer Graphics and Vision (7100)

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

Videos are often associated with additional information that could be valuable for interpretation of their content. This especially applies for the recognition of faces within video streams, where often cues such as transcripts and subtitles are available. However, this data is not completely reliable and might be ambiguously labeled. To overcome these limitations, we take advantage of semi-supervised (SSL) and multiple instance learning (MIL) and propose a new semi-supervised multiple instance learning (SSMIL) algorithm. Thus, during training we can weaken the prerequisite of knowing the label for each instance and can integrate unlabeled data, given only probabilistic information in form of priors. The benefits of the approach are demonstrated for face recognition in videos on a publicly available benchmark dataset. In fact, we show exploring new information sources can considerably improve the classification results.

Original language	English
Title of host publication	2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2011
Pages	23 - 28
ISBN (Electronic)	978-145770845-9
Publication status	Published - 2011
Event	IEEE International Conference on Advanced Video and Signal Based Surveillance: AVSS 2011, Klagenfurt, Austria, 30 Aug 2011 → 2 Sept 2011

Conference

Conference	IEEE International Conference on Advanced Video and Signal Based Surveillance
Country/Territory	Austria
City	Klagenfurt
Period	30/08/11 → 2/09/11

Fields of Expertise

Information, Communication & Computing

Access to Document

978-145770845-9

MDL - Multimedia Documentation Lab: Wissensrepräsentation und -Organisation bei multimedialen Inhalten als Voraussetzung für sicherheitsrelevante Analysen
Wohlhart, P., Köstinger, M. & Bischof, H.
1/02/09 → 31/01/12
Project: Research project

Cite this

Köstinger, M, Wohlhart, P, Roth, P & Bischof, H 2011, Learning to Recognize Faces from Videos and Weakly Related Information Cues. in 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2011, 6027287, pp. 23-28, IEEE International Conference on Advanced Video and Signal Based Surveillance, Klagenfurt, Austria, 30/08/11. ISBN 978-145770845-9

@inproceedings{711c851912234ca4908437d55211e4bf,

title = "Learning to Recognize Faces from Videos and Weakly Related Information Cues",

abstract = "Videos are often associated with additional information that could be valuable for interpretation of their content. This especially applies for the recognition of faces within video streams, where often cues such as transcripts and subtitles are available. However, this data is not completely reliable and might be ambiguously labeled. To overcome these limitations, we take advantage of semi-supervised (SSL) and multiple instance learning (MIL) and propose a new semi-supervised multiple instance learning (SSMIL) algorithm. Thus, during training we can weaken the prerequisite of knowing the label for each instance and can integrate unlabeled data, given only probabilistic information in form of priors. The benefits of the approach are demonstrated for face recognition in videos on a publicly available benchmark dataset. In fact, we show exploring new information sources can considerably improve the classification results.",

author = "Martin K{\"o}stinger and Paul Wohlhart and Peter Roth and Horst Bischof",

year = "2011",

isbn = "978-145770845-9",

language = "English",

pages = "23--28",

booktitle = "2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2011",

note = "IEEE International Conference on Advanced Video and Signal Based Surveillance: AVSS 2011; Conference date: 30-08-2011 through 02-09-2011",

}

TY - GEN

T1 - Learning to Recognize Faces from Videos and Weakly Related Information Cues

AU - Köstinger, Martin

AU - Wohlhart, Paul

AU - Roth, Peter

AU - Bischof, Horst

PY - 2011

Y1 - 2011

AB - Videos are often associated with additional information that could be valuable for interpretation of their content. This especially applies for the recognition of faces within video streams, where often cues such as transcripts and subtitles are available. However, this data is not completely reliable and might be ambiguously labeled. To overcome these limitations, we take advantage of semi-supervised (SSL) and multiple instance learning (MIL) and propose a new semi-supervised multiple instance learning (SSMIL) algorithm. Thus, during training we can weaken the prerequisite of knowing the label for each instance and can integrate unlabeled data, given only probabilistic information in form of priors. The benefits of the approach are demonstrated for face recognition in videos on a publicly available benchmark dataset. In fact, we show exploring new information sources can considerably improve the classification results.

SN - 978-145770845-9

M3 - Conference paper

SP - 23

EP - 28

BT - 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2011

T2 - IEEE International Conference on Advanced Video and Signal Based Surveillance

Y2 - 30 August 2011 through 2 September 2011

ER -
