UPC System for the 2016 MediaEval Multimodal Person Discovery in Broadcast TV task

India M, Martí G, Cotillas C, Bouritsas G, Sayrol E, Morros JR, et al.. UPC System for the 2016 MediaEval Multimodal Person Discovery in Broadcast TV task. In MediaEval 2016 Workshop. Hilversum, The Netherlands; 2016.

Google Scholar
BibTex

(174.87 KB)

Abstract

The UPC system works by extracting monomodal signal segments (face tracks, speech segments) that overlap with the person names overlaid in the video signal. These segments are assigned directly with the name of the person and used as a reference to compare against the non-overlapping (unassigned) signal segments. This process is performed independently both on the speech and video signals. A simple fusion scheme is used to combine both monomodal annotations into a single one.

Projects

	Camomile - Collaborative Annotation of multi-MOdal, MultI-Lingual and multi-mEdia documents
	BigGraph - Heterogeneous information and graph signal processing for the Big Data era. Application to high-throughput, remote sensing, multimedia and human computer interfaces.

Image Processing Group

Search form

User login

Abstract

Projects