Xavier Giró

Positionsort descending e-mail
Associate Professor xavier.giro@upc.edu
Office Phone
D5-117 (Barcelona - Campus Nord)
TR2-102 (Terrassa - ESEIAAT)
+34 934 015 769


Xavier Giro-i-Nieto is an associate professor at the Universitat Politecnica de Catalunya (UPC) in Barcelona, member of the Image Processing Group (GPI), Intelligent Data Science and Artificial Intelligence Research Center (IDEAI-UPC), Institute of Industrial Robotics (IRI), and also a visiting researcher at Barcelona Supercomputing Center (BSC). He graduated in Telecommunications Engineering at ETSETB (UPC) in 2000, after completing his master thesis on image compression at the Vrije Universiteit in Brussels (VUB) with Prof. Peter Schelkens. After working one year in Sony Brussels, he returned to UPC to obtain a PhD on computer vision, supervised by Prof. Ferran Marqués and Prof. Shih-Fu Chang from the Digital Video and MultiMedia laboratory at Columbia University, that he repeateadly visited between 2008-2014. Dr. Giró is the director of the Postgraduate on Artificial Intelligence with Deep Learning at UPC School, and also teaches undergradute and graduate course on deep learning at ESEIAAT and ETSETB schools at UPC, as well as the Master in Computer Vision of BarcelonaHe regularly collaborates with the Insight Center of Data Analytics at Dublin City Universityand is a member of the Governance Committee of the Science Foundation Ireland Centre for Research Training in Machine Learning. From a transfer technology perspective, he is a member of the scientific advisory committee of Vilynx, and collaborates with Telefónica R&DMediapro, BBC R&D and Crisalix. He serves as associate editor at IEEE Transactions in Multimedia and reviews for top tier conferences in machine learning (NeurIPS, ICML), computer vision (CVPR, ECCV, ICCV) and multimedia (ACMMM, ICMR). 




Latest News


External activities

Scientific IDs:  Google ScholarWoK Researcher ID: M-5834-2013, ORCID: 0000-0002-9935-5332, Scopus Author ID35098596700, UPC Futur 


Book Chapters and Bookstop

V. Campos, Giró-i-Nieto, X., Jou, B., Torres, J., and Chang, S. - F., Sentiment concept embedding for visual affect recognition, in Multimodal Behavior Analysis in theWild, 1st ed., Elsevier, 2018.
E. Mohedano, Salvador, A., McGuinness, K., Giró-i-Nieto, X., O'Connor, N., and Marqués, F., Object Retrieval with Deep Convolutional Features, in Deep Learning for Image Processing Applications, vol. 31, Amsterdam, The Netherlands: IOS Press, 2017.
M. Bellver, Giró-i-Nieto, X., Marqués, F., and Torres, J., Hierarchical Object Detection with Deep Reinforcement Learning, in Deep Learning for Image Processing Applications, vol. 31, Amsterdam, The Netherlands: IOS Press, 2017.
C. Ventura, Martos, M., Giró-i-Nieto, X., Vilaplana, V., and Marqués, F., Hierarchical Navigation and Visual Search for Video Keyframe Retrieval, in Advances in Multimedia Modeling, vol. 7131, Springer Berlin / Heidelberg, 2012, pp. 652-654.
E. Carcel, Martos, M., Giró-i-Nieto, X., and Marqués, F., Rich Internet Application for Semi-automatic Annotation of Semantic Shots on Keyframes, in Computational Intelligence for Multimedia Understanding, vol. 7242, Pisa, Italy: Springer-Verlag, 2012, pp. 172-182. (6.93 MB)

Conference Papers top

In Press
A. Duarte, Palaskar, S., Ventura, L., Ghadiyaram, D., DeHaan, K., Metze, F., Torres, J., and Giró-i-Nieto, X., How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language, in CVPR 2021, In Press. (744.94 KB)
L. Ventura, Duarte, A., and Giró-i-Nieto, X., Can Everybody Sign Now? Exploring Sign Language Video Generation from 2D Poses, in ECCV 2020 Workshop on Sign Language recognition, Production and Translation (SLRTP), 2020. (3.85 MB)
M. Caros, Garolera, M., Radeva, P., and Giró-i-Nieto, X., Automatic Reminiscence Therapy for Dementia, in ACM International Conference on Multimedia Retrieval (ICMR), Dublin, Ireland, 2020. (4.37 MB)
V. Campos, Trott, A., Xiong, C., Socher, R., Giró-i-Nieto, X., and Torres, J., Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills, in International Conference on Machine Learning (ICML) 2020, 2020. (6.89 MB)
X. Giró-i-Nieto, One Perceptron to Rule Them All: Language, Vision, Audio and Speech (tutorial), in ACM International Conference on Multimedia Retrieval (ICMR) 2020, Dublin, Ireland, 2020. (313.96 KB)

Theses top

V. Campos, Deep Learning that Scales: Leveraging Compute and Data, Universitat Politècnica de Catalunya, Barcelona, Catalonia, 2020.
A. Salvador, Computer Vision beyond the visible: Image understanding through language, Universitat Politecnica de Catalunya, Barcelona, 2019.
C. Ventura, Visual Object Analysis using Regions and Local Features, 2016. (2.5 MB)
X. Giró-i-Nieto, Part-Based Object Retrieval With Binary Partition Trees, Universitat Politècnica de Catalunya (UPC), Barcelona, Catalonia, 2012. (16.34 MB)

Research Areas top

Region-based image and video processing Internal Jan
Lifelogging Internal Feb
Affective Computing Internal Jan
Deep learning Internal Jun
Saliency prediction Internal Feb

Teaching top

Acronym Title Level College
BIOM Biometric Technologies Master in Telecommunications Engineering (MET) ETSETB - Telecom BCN
DLAI Deep Learning for Artificial Intelligence Master MET ETSETB TelecomBCN
DLCV Deep Learning for Computer Vision Master in Telecommunications Engineering (MET) ETSETB Telecom BCN
DLMM Deep Learning for Multimedia Master & PhD Dublin City University
DLSL Deep Learning for Speech and Language BSc, MSc & Phd ETSETB TelecomBCN
IDL Introduction to Deep Learning BSc ETSETB TelecomBCN
GDSA Multimedia Content Management and Delivery Degree in Audiovisual Systems (3rd year) Escola Superior d'Enginyeries Industrials, Aeroespacial i Audiovisual de Terrassa (ESEIAAT)
VA Video Analysis Master in Computer Vision (MCV) UAB, UOC, UPC & UPF