Xavier Giró

Position: Associate Professor
E-mail: xavier.giro@upc.edu
Office: D5-117 (Barcelona - Campus Nord), TR2-104 (Terrassa - ESEIAAT)
Phone: +34 934 015 769


Xavier Giró-i-Nieto is an associate professor at the Universitat Politecnica de Catalunya (UPC). He graduated in Telecommunications Engineering at ETSETB (UPC) in 2000, after completing his master's thesis on image compression at the Vrije Universiteit in Brussels (VUB) under the direction of Professor Peter Schelkens. In 2001 he worked in the digital television group of Sony Brussels, before returning to Barcelona and joining the Image Processing Group at the UPC. Since 2003, he has created and taught graduate and undergraduate courses for Electrical Engineering degrees at the ESEIAAT and ETSETB schools of the UPC. In 2013 he participated in the design of the Master in Computer Vision of Barcelona, a joint program of the UPC, UAB, UPF and UOC universities, where he lectures on deep learning, image retrieval and video processing. He has taught several international courses in the framework of the European Erasmus program. He obtained his PhD on image retrieval in 2012, under the supervision of Professor Ferran Marqués from UPC and Professor Shih-Fu Chang from Columbia University. He was a visiting scholar during the summers of 2008 to 2014 at the Digital Video and MultiMedia laboratory at Columbia University, in New York. His relations with industry include collaborations with Mediapro, Catalan Broadcast Corporation (TV3), Pixable, Catchoom, Narrative and Vilynx.





Current students and research topics

Research topics covered: Vision, Language, Speech, Audio, Knowledge, Emotions.

Student | Co-advisor | Program
Amaia Salvador | Ferran Marqués | PhD'18
Míriam Bellver (BSC) | Jordi Torres (BSC), Ferran Marqués and Jordi Pont-Tuset (ETHZ) | PhD'20
Víctor Campos (BSC) | Jordi Torres (BSC), Brendan Jou (Google) and Shih-Fu Chang (Columbia University) | PhD'20
Dèlia Fernandez (Vilynx) | Elisenda Bou (Vilynx) | PhD'20
Eduard Ramon (Crisalix) | Jaime Garcia (Crisalix) | PhD'20
Amanda Duarte | Jordi Torres (BSC), Amaia Salvador and Dídac Surís (Telefónica) | PhD'20
Ceren Güzel (Gazi University) | Hasan Sakir Bilge (Gazi University) | PhD'20
Carlos Arenas | Víctor Campos (BSC) and Damian Borth (DFKI) | MSc'18
Daniel Moreno | Marta R. Costa-Jussà | MSc'18
Xunyu Lin | Jordi Torres (BSC), Víctor Campos (BSC) and Cristian Canton (Facebook) | MSc'19
Miquel Oliver | Amaia Salvador | BSc'18
Dani Fojo | Víctor Campos (BSC) | BSc'18
Fran Roldan | Kevin McGuinness | MSc'18
Janna Escur | Marta R. Costa-Jussà | BSc'18
Marta Coll | Kevin McGuinness | BSc'18
Al-lodí Jutglà | Mathias Lux (University of Klagenfurt) | BSc'18



Awards: Best scanpath prediction in Salient360 ICME Challenge 2017, Best poster award at LSCVS NIPS workshop 2016, Best poster award at ICMR 2016, Among Top 10% papers in ICIP 2015, Winner of the LSUN Saliency prediction challenge in CVPRW 2015, 2nd place in ChaLearn Cultural Event Recognition Challenge in CVPRW 2015, 2nd place in MediaEval Social Event Detection 2014, 3rd place in MediaEval Social Event Detection 2013, Winner of the Videobrowser Showdown in MMM 2012.

Scientific IDs: Google Scholar, WoK Researcher ID: M-5834-2013, ORCID: 0000-0002-9935-5332, Scopus Author ID: 35098596700, UPC Futur

Selected Service: Associate Editor of IEEE Transactions on Multimedia (2017-2019), Associate Editor of ACM SIGMM Records, Area Chair of ACM Multimedia 2016, Organizer of the Lifelogging Tools and Applications (LTA) workshop at ACM Multimedia 2016 & 2017.

Conference Committees: NIPS 2017, ICCV 2017, ICMR 2017, ACM Multimedia (2017, 2016, 2014), ICIP (2014, 2003), EUSIPCO 2011.

Workshop Committees: MUSA 2017@ACMMM, VSM 2016@ECCV & 2017@ICCV, EPIC 2016@ECCV & 2017@ICCV, ISM 2016, CBMI (2016, 2015, 2014), CrowdMM 2015, MMSys 2015 Dataset Track, SMAP 2015, MediaEval 2014, SMAP 2014, SEWM 2014, MMSys Dataset 2014, SMAP 2013, ICIP 2003.

Journal reviewer: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Multimedia Tools and Applications (MTAP), EURASIP Journal on Image and Video Processing, Multimedia Systems (MMSJ), Image and Vision Computing (IMAVIS).


Book Chapters and Books

C. Ventura, Martos, M., Giró-i-Nieto, X., Vilaplana, V., and Marqués, F., Hierarchical Navigation and Visual Search for Video Keyframe Retrieval, in Advances in Multimedia Modeling, vol. 7131, Springer Berlin / Heidelberg, 2012, pp. 652-654.
E. Carcel, Martos, M., Giró-i-Nieto, X., and Marqués, F., Rich Internet Application for Semi-automatic Annotation of Semantic Shots on Keyframes, in Computational Intelligence for Multimedia Understanding, vol. 7242, Pisa, Italy: Springer-Verlag, 2012, pp. 172-182.
C. Ferran, Giró-i-Nieto, X., Marqués, F., and Casas, J., BPT Enhancement based on Syntactic and Semantic criteria, in Semantic Multimedia, vol. 4306, Berlin / Heidelberg: Springer, 2006, pp. 184–198.
X. Giró-i-Nieto and Marqués, F., From partition trees to semantic trees, in Multimedia Content Representation, Classification and Security, vol. 4105/2006, 2006, pp. 306–313.
X. Giró-i-Nieto, Vilaplana, V., Marqués, F., and Salembier, P., Automatic extraction and analysis of visual objects information, in Multimedia content and the semantic web, Wiley, 2005, pp. 203–221.

Conference Papers

V. Campos, Jou, B., Giró-i-Nieto, X., Torres, J., and Chang, S.-F., Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks, 08/2017, Submitted.
In Press
D. Fernàndez, Woodward, A., Campos, V., Jou, B., Giró-i-Nieto, X., and Chang, S.-F., More cat than cute? Interpretable Prediction of Adjective-Noun Pairs, in ACM Multimedia 2017 Workshop on Multimodal Understanding of Social, Affective and Subjective Attributes, Mountain View, CA (USA), In Press.
C. Gurrin, Giró-i-Nieto, X., Radeva, P., Dimiccoli, M., Dang-Nguyen, D.-T., and Joho, H., LTA 2017: The Second Workshop on Lifelogging Tools and Applications, in ACM Multimedia, Mountain View, California USA, In Press.
A. Lidon, Bolaños, M., Dimiccoli, M., Radeva, P., Garolera, M., and Giró-i-Nieto, X., Semantic Summarization of Egocentric Photo Stream Events, in ACM Multimedia 2017 Workshop on Lifelogging Tools and Applications, In Press.
D. Fernàndez, Varas, D., Espadaler, J., Ferreira, J., Woodward, A., Rodríguez, D., Giró-i-Nieto, X., Riveiro, J. Carlos, and Bou, E., ViTS: Video Tagging System from Massive Web Multimedia Collections, in ICCV 2017 Workshop on Web-scale Vision and Social Media, Venice, Italy, In Press.

Theses

Research Areas

Region-based image and video processing | Internal | Jan
Lifelogging | Internal | Feb
Affective Computing | Internal | Jan
Deep learning | Internal | Jun
Multimedia Retrieval | Internal | Sep

Teaching

Acronym | Title | Level | College
PAE | Advanced Project in Science and Telecommunication Technologies (CDIO) | 3rd year | ETSETB - Telecom BCN
BIOM | Biometric Technologies | Master in Telecommunications Engineering (MET) | ETSETB - Telecom BCN
READCV | Computer Vision Reading Group | Master in Telecommunications Engineering (MET) | ETSETB - Telecom BCN
DLCV | Deep Learning for Computer Vision | Master in Telecommunications Engineering (MET) | ETSETB - Telecom BCN
DLMM | Deep Learning for Multimedia | Master in Telecommunications Engineering (MET) | ETSETB - Telecom BCN
DLSL | Deep Learning for Speech and Language | BSc, MSc & PhD | ETSETB - Telecom BCN
CA562 | Information access | Master on E-Commerce | Dublin City University
GDSA | Multimedia Content Management and Delivery | Degree in Audiovisual Systems (3rd year) | Escola Superior d'Enginyeries Industrials, Aeroespacial i Audiovisual de Terrassa (ESEIAAT)
VA | Video Analysis | Master in Computer Vision (MCV) | UAB, UOC, UPC & UPF
VR | Visual Recognition | Master in Computer Vision (MCV) | UAB, UOC, UPC & UPF