Abstract

DCU participated, in collaboration with colleagues from NUIG and UPC, in two tasks: INS and VTT. For the INS task we developed a framework consisting of face detection and representation, and place detection and representation, with user annotation of top-ranked videos. For the VTT task we ran 1,000 concept detectors from the VGG-16 deep CNN on 10 keyframes per video and submitted 4 runs for caption re-ranking, based on BM25, Fusion, Word2Vec, and a fusion of the baseline BM25 and Word2Vec. With the same pre-processing, for caption generation we used the open-source image-to-caption CNN-RNN toolkit NeuralTalk2 to generate a caption for each keyframe and combined them.
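
To make the caption re-ranking step concrete, the sketch below illustrates BM25-based scoring of candidate captions against a video's detected concept terms. It is only illustrative: the concept labels, tokenisation, and the use of detector outputs as the query are assumptions for the example, not the exact configuration used in our runs.

    import math
    from collections import Counter

    def bm25_scores(query_terms, documents, k1=1.2, b=0.75):
        """Score each tokenised document (candidate caption) against the query with BM25."""
        n_docs = len(documents)
        avg_len = sum(len(d) for d in documents) / n_docs
        # Document frequency of each query term across the candidate captions.
        df = {t: sum(1 for d in documents if t in d) for t in set(query_terms)}
        scores = []
        for doc in documents:
            tf = Counter(doc)
            score = 0.0
            for t in query_terms:
                if t not in tf:
                    continue
                idf = math.log((n_docs - df[t] + 0.5) / (df[t] + 0.5) + 1.0)
                denom = tf[t] + k1 * (1 - b + b * len(doc) / avg_len)
                score += idf * tf[t] * (k1 + 1) / denom
            scores.append(score)
        return scores

    # Hypothetical example: concept labels detected on a video's keyframes act as
    # the query; the candidate captions are re-ranked by their BM25 score.
    detected_concepts = "dog grass park running".split()
    candidate_captions = [
        "a dog is running through the grass in a park".split(),
        "a man is playing a guitar on stage".split(),
        "two dogs play with a ball outdoors".split(),
    ]
    ranked = sorted(
        zip(bm25_scores(detected_concepts, candidate_captions), candidate_captions),
        key=lambda x: -x[0],
    )
    for score, caption in ranked:
        print(f"{score:.3f}  {' '.join(caption)}")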