Search Publication
Transcription-Enriched Joint Embeddings or Spoken Descriptions of Images and Videos. In CVPR 2020 Workshop on Egocentric Perception, Interaction and Computing. Seattle, WA, USA: arXiv; 2020. (96.79 KB)
. Multimodal Person Identification in a Smart Room. In IV Jornadas en Tecnología del Habla. 2006. pp. 327–331.
. Multimodal identification and localization of users in a smart environment. Journal on Multimodal user interfaces. 2008;2:75–91.
. Audio, Video and Multimodal Person Identification in a Smart Room. In Lecture notes in computer science - Multimodal Technologies for Perception of Humans. 2006. pp. 258–269. (321.95 KB)
. Audio, Video and Multimodal Person Identification in a Smart Room. In CLEAR'06 Evaluation Campaign and Workshop - Classification of Events, Activities and Relationships. 2007. pp. 258–269. (294.95 KB)
.