Xavier Giro-i-Nieto is an associate professor at the Universitat Politècnica de Catalunya (UPC). He graduated in Electrical Engineering from ETSETB (UPC) in 2000, after completing his master's thesis on image compression at the Vrije Universiteit in Brussels (VUB) under the direction of Professor Peter Schelkens. In 2001 he worked in the digital television group of Sony Brussels, before returning to Barcelona and joining the Image Processing Group at the UPC. Since 2003, he has created and taught graduate and undergraduate courses for Electrical Engineering degrees at the ESEIAAT and ETSETB schools of UPC. In 2013 he participated in the design of the Master in Computer Vision of Barcelona, offered jointly by the UPC, UAB, UPF and UOC universities, where he lectures on deep learning, image retrieval and video processing. He has taught several international courses in the framework of the European Erasmus program. He obtained his PhD on image retrieval in 2012, under the supervision of Professor Ferran Marqués from UPC and Professor Shih-Fu Chang from Columbia University. He was a visiting scholar during the summers of 2008 to 2014 at the Digital Video and MultiMedia laboratory at Columbia University, in New York. His industry collaborations include Mediapro, Catalan Broadcast Corporation (TV3), Pixable, Catchoom, Narrative and Vilynx.
- (22/03/2017): Appointed as Associate Editor of IEEE Transactions on Multimedia.
- (15/03/2017): Guest Editor for Special Issue on Egocentric Vision and Lifelogging Tools in Journal of Visual Communication and Image Representation.
- (25/01/2017): Member of the Technical Program Committee of the ACM Multimedia 2017.
- (24/01/2017): Chair of the Deep Learning for Speech and Language Winter School at UPC ETSETB TelecomBCN (slides & videos available).
- (12/01/2017): Member of the Technical Program Committee of the ACM International Conference on Multimedia Retrieval (ICMR) 2017.
- (27/12/2016): Member of the Technical Program Committee of the DeLIMMA workshop @ ICME 2017.
- (10/12/2016): Best Poster Award at the 1st NIPS Workshop on Large Scale Computer Vision Systems 2016.
- (28/10/2016): Paper accepted at Deep Reinforcement Learning Workshop, NIPS 2016.
- (16/09/2016): Our student Dèlia Fernàndez has received the best thesis award of the Master in Computer Vision Barcelona.
- (03/08/2016): Chair of the Deep Learning for Computer Vision Summer School 2016 at UPC ETSETB TelecomBCN (slides & videos available).
- (08/06/2016): Best Poster Award at ACM ICMR 2016.
Current students and research topics
|Student||Co-advisor||Program||Saliency prediction||Wearables||Affective Computing||Multimedia Retrieval||Scene Understanding|
|Amaia Salvador||Ferran Marqués||PhD'18|
|Míriam Bellver (BSC)||Jordi Torres (BSC), Jordi Pont-Tuset (ETHZ) and Ferran Marqués||PhD'20|
|Víctor Campos (BSC)||Jordi Torres (BSC) and Shih-Fu Chang (Columbia University)||PhD'20|
|Junting Pan||Shih-Fu Chang (Columbia University)||MSc'17|
|Albert Jiménez||Jose M Alvarez (Data61, Australia)||MSc'17|
|Manel Baradad||Amaia Salvador||MSc'17|
|Dídac Surís||Víctor Campos (BSC)||MSc'18|
|Alex Woodward||Víctor Campos (BSC) and Dèlia Fernàndez (Vilynx)||MSc'18|
|Xunyu Lin||Jordi Torres (BSC) and Víctor Campos (BSC)||BSc'17|
Petia Radeva (UB)
|Marc Assens||Kevin McGuinness (DCU, Ireland) & Eva Mohedano (DCU, Ireland)||BSc'17|
|Fran Roldan||Santi Pascual and Issey Masuda||BSc'17|
|Francesc Lluís||Deniz Erdogmus & Dana Brooks (Northeastern University, USA)||BSc'17|
|Marc Gorriz||Emmanuel Faure and Axel Carlier (ENSEEIHT, France)||BSc'17|
|Lluc Cardoner||Maia Zaharieva (TU Vienna, Austria)||BSc'17|
|Oriol Bernal||Maia Zaharieva (TU Vienna, Austria)||BSc'17|
- Eva Mohedano, PhD candidate at Insight-Dublin City University.
- Bruna Girvent, PhD candidate at Northeastern University, Boston.
- Ana García del Molino, PhD candidate at Nanyang Technological University, Singapore. Awarded a "La Caixa" scholarship (2015).
- Marcel Tella-Amo, PhD candidate at University College London.
- Gabriel de Oliveira, PhD candidate at University of Barcelona.
- Dèlia Fernàndez: Best Thesis in Master of Computer Vision 2016. Engineer at Vilynx.
Awards: Best poster award at LSCVS NIPS workshop 2016, Best poster award at ICMR 2016, Among Top 10% papers in ICIP 2015, Winner of the LSUN Saliency prediction challenge in CVPRW 2015, 2nd place in ChaLearn Cultural Event Recognition Challenge in CVPRW 2015, 2nd place in MediaEval Social Event Detection 2014, 3rd place in MediaEval Social Event Detection 2013, Winner of the Videobrowser Showdown in MMM 2012.
Program Committee: Area Chair of ACM Multimedia 2016 "Multimedia and Vision", Organizer of Lifelogging Tools and Applications (LTAA) workshop at ACM Multimedia 2016, VSM 2016, EPIC 2016, ISM 2016, CBMI 2016, CrowdMM 2015, MMSys 2015 Dataset Track, SMAP 2015, CBMI 2015, MediaEval 2014, ICIP 2014, SMAP 2014, ACM MultiMedia Doctoral Symposium 2014, CBMI 2014, SEWM 2014, MMSys Dataset 2014, SMAP 2013, EUSIPCO 2011, ICIP 2003.
Journal reviewer: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Multimedia Tools and Applications (MTAP), EURASIP Journal on Image and Video Processing, Multimedia Systems (MMSJ), Image and Vision Computing (IMAVIS).
Journal Articles
|“From Pixels to Sentiment: Fine-tuning CNNs for Visual Sentiment Prediction”, Image and Vision Computing, 2017.|
|“Assessment of Crowdsourcing and Gamification Loss in User-Assisted Object Segmentation”, Multimedia Tools and Applications, vol. 75, no. 23, 2016.|
|“Improving Object Segmentation by using EEG signals and Rapid Serial Visual Presentation”, Multimedia Tools and Applications, 2015.|
|“From Global Image Annotation to Interactive Object Segmentation”, Multimedia Tools and Applications, vol. 70, 2014.|
|“Improving retrieval accuracy of Hierarchical Cellular Trees for generic metric spaces”, Multimedia Tools and Applications, 2013.|
Book Chapters and Books
|“Hierarchical Navigation and Visual Search for Video Keyframe Retrieval”, in Advances in Multimedia Modeling, vol. 7131, Springer Berlin / Heidelberg, 2012, pp. 652–654.|
|“Rich Internet Application for Semi-automatic Annotation of Semantic Shots on Keyframes”, in Computational Intelligence for Multimedia Understanding, vol. 7242, Pisa, Italy: Springer-Verlag, 2012, pp. 172–182.|
|“BPT Enhancement based on Syntactic and Semantic criteria”, in Semantic Multimedia, vol. 4306, Berlin / Heidelberg: Springer, 2006, pp. 184–198.|
|“From partition trees to semantic trees”, in Multimedia Content Representation, Classification and Security, vol. 4105/2006, 2006, pp. 306–313.|
|“Automatic extraction and analysis of visual objects information”, in Multimedia content and the semantic web, Wiley, 2005, pp. 203–221.|
Conference Papers
|“Scaling a Convolutional Neural Network for classification of Adjective Noun Pairs with TensorFlow on GPU Clusters”, in 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Madrid, Spain, In Press.|
|“The Impact of Segmentation on the Accuracy and Sensitivity of a Melanoma Classifier based on Skin Lesion Images”, in Annual Meeting of the Society of Imaging Informatics in Medicine (SIIM), Pittsburgh, PA, USA, In Press.|
|“Distributed training strategies for a computer vision deep learning algorithm on a distributed GPU cluster”, in International Conference on Computational Science (ICCS), Zurich, Switzerland, In Press.|
|“Skin Lesion Classification from Dermoscopic Images using Deep Learning”, in The 13th IASTED International Conference on Biomedical Engineering (BioMed 2017), Innsbruck, Austria, 2017.|
|“Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks”, in 1st NIPS Workshop on Large Scale Computer Vision Systems 2016, 2016.|
|“Visual Object Analysis using Regions and Local Features”, 2016.|
|“Part-Based Object Retrieval With Binary Partition Trees”, Universitat Politècnica de Catalunya (UPC), Barcelona, Catalonia, 2012.|
|“SalGAN: Visual Saliency Prediction with Generative Adversarial Networks”. Submitted.||Unpublished|
|“Semantic Summarization of Egocentric Photo Stream Events”. Submitted.||Unpublished|
|“The impact of visual saliency prediction in image classification”. 2017.||MSc Thesis|
|“Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Networks”. 2017.||MSc Thesis|
|“Multi-label Remote Sensing Image Retrieval based on Deep Features”. 2017.||MSc Thesis|
|MALEGRA - Multimodal Signal Processing and Machine Learning on Graphs||National||Jan 2017||Dec 2020|
|BigGraph - Heterogeneous information and graph signal processing for the Big Data era. Application to high-throughput, remote sensing, multimedia and human computer interfaces.||National||Jan 2014||Dec 2017|
|SGR14 - Image and Video Processing Group||National||Jan 2014||Apr 2017|
|MuViPro - Multicamera Video Processing||National||Jan 2011||Aug 2014|
|SGR09 - Processament de Video Multicamera||National||Oct 2009||Dec 2013|
Research Areas
Demos and Resources
|EgoMon Gaze & Video Dataset||Dataset||Jul 2016|
|UPC at CVPRW Visual Question Answering Challenge 2016||Software||Jun 2016|
|UPC at CVPRW ActivityNet Challenge 2016||Software||Jun 2016|
|Faster R-CNN Features for Instance Search (software)||Software||May 2016|
|Sentiment maps generator||Software||Apr 2016|
|PAE||Advanced Project in Science and Telecommunication Technologies (CDIO)||3rd year||ETSETB TelecomBCN|
|BIOM||Biometric Technologies||Master in Telecommunications Engineering (MET)||ETSETB TelecomBCN|
|READCV||Computer Vision Reading Group||Master in Telecommunications Engineering (MET)||ETSETB TelecomBCN|
|DLCV||Deep Learning for Computer Vision||Master in Telecommunications Engineering (MET)||ETSETB TelecomBCN|
|DLMM||Deep Learning for Multimedia||Master in Telecommunications Engineering (MET)||ETSETB TelecomBCN|
|DLSL||Deep Learning for Speech and Language||BSc, MSc & PhD||ETSETB TelecomBCN|
|CA562||Information access||Master on E-Commerce||Dublin City University|
|GDSA||Multimedia Content Management and Delivery||Degree in Audiovisual Systems (3rd year)||Escola Superior d'Enginyeries Industrials, Aeroespacial i Audiovisual de Terrassa (ESEIAAT)|
|SiS||Signals and Systems (Terrassa)||Degree in Audiovisual Systems (2nd year)||Escola Superior d'Enginyeries Industrials, Aeroespacial i Audiovisual de Terrassa (ESEIAAT)|
|VA||Video Analysis||Master in Computer Vision (MCV)||UAB, UOC, UPC & UPF|