Cultural Event Recognition with Computer Vision (software)

Resource Type Date
Software 2015-03-18

Description

Authors:

Amaia Salvador Daniel Manchon-Vizuete Andrea Calafell Matthias Zeppelzauer Xavier Giró-i-Nieto

Amaia Salvador

(UPC)

Daniel Manchon-Vizuete

(UPC)

Andrea Calafell

(UPC)

Matthias Zeppelzauer 

(FH St Pölten)

Xavier Giró-i-Nieto

(UPC)

 

This package contains the software and extracted features for our submission to the challenge in the CVPR Workshop 2015ChaLearn Looking at People 2015 - Track 3/4: Cultural Event Recognition.

Awarded with the second prize !

Second prize

 

Overview

We fine tuned the a pretrained Convolutional Network (CaffeNet) using Caffe, a deep learning framework, using at first only the training data (partitioning it as 80% for traininig and 20% for validation).  Once the validation labels were provided, we fine tuned our network with the remaining 20% of training images using the real validation data. 

The last layer of our fine tuned network gives us the confidence score for an image for each of the classes. Results using those scores improved the baseline, but still we tried some late fusion strategies training an SVM on the neural codes generated on each of the last three layers of the network (FC6, FC7 and FC8). We combined the descriptors extracted from both our fine tuned network and the pretrained one, achieving our maximum result by adding a final temporal refinement. The temporal refinement was applied only to images with time stamps in their EXIF metadata, where high classification scores based on visual features were penalized when their time stamp did not match well an event-specific temporal distribution learned from the training and validation data.

 

Download source code for ICCV 2015 workshop

Source code (v2.1, 12/09/2015): Fine-tunning of convolutional networks and feature extraction was run with the Python interface to Caffe, while SVM classifiers where created in Matlab.

 

Download source code and models for CVPR 2015 workshop

Problems with the download or software ? Drop us an e-mail at amaia.salvador@upc.edu, Matthias.Zeppelzauer@fhstp.ac.at and xavier.giro@upc.edu.

 

Acknowledgements

 

We would like to especially thank Albert Gil Moreno and Josep Pujal from our technical support team at the Image Processing Group at the UPC. Albert Gil Moreno Josep Pujal
  Albert Gil Moreno Josep Pujal

 

We gratefully acknowledge the support of NVIDIA Corporation with the donation of the GeoForce GTX Titan Z used in this work. NVidia logo
 The Image ProcessingGroup at the UPC is a SGR14 Consolidated Research Group recognized and sponsored by the Catalan Government (Generalitat de Catalunya) through its  AGAUR office. Departament d'Economica i Coneixement 
This work has been developed in the framework of the project BigGraph TEC2013-43935-R, funded by the Spanish Ministerio de Economía y Competitividad and the European Regional Development Fund (ERDF).  Ministerio de Economia y Competitividad

 

  

 

UPC event viewer for Chalearn 2015 Cultural Event Recognition

People involved

Xavier Giró Associate Professor
Amaia Salvador PhD Candidate