Other Publications by Xavier Giró back

2022
M. A. Mohamed, “Exploring Visual Representations for Sign Language Translation”. 2022. Google Scholar BibTex (700.12 KB)	Ms Thesis
P. Cabot, “Sign Language Translation based on Transformers for the How2Sign Dataset”, 2022. Google Scholar BibTex (1.56 MB)	Report
Á. Budria, “Multimodal 3D Hand Pose Enhancement for Sign Language”, 2022. Google Scholar BibTex (1.2 MB)	Report
M. Barrabés, “Machine Learning for Genomic Sequence Processing”, 2022. Google Scholar BibTex	Report
P. Cabot, Tarrés, L., and Giró-i-Nieto, X., “Sign-Language Translation with Pseudo-Glosses”. 2022. Google Scholar BibTex (7.51 MB)	Ms Thesis
M. Perera, “Ancestry-conditioned Generative Models for Genotyping”, 2022. Google Scholar BibTex	Report
Á. Budria, “Topic Detection from Sign Language Videos”. 2022. Google Scholar BibTex	Ms Thesis
T. Domenech, “Hiding Images in their Spoken Narratives”. 2022. Google Scholar BibTex (15.23 MB)	Ms Thesis
X. Giró-i-Nieto and Duarte, A., “Towards Sign Language Translation and Production”. 2022. Google Scholar BibTex	Presentation

2021
X. Giró-i-Nieto, “Learning Representations for Sign Language Videos”. 2021. Google Scholar BibTex	Presentation
M. Escobar, “Object Model Adaptation for Multiple Object Tracking”, 2021. Google Scholar BibTex (785.3 KB)	Report
X. Giró-i-Nieto, “Deep Learning Representations for All (a.ka. the AI hype)”. 2021. Google Scholar BibTex (10.95 MB)	Presentation
M. Geleta, “Unsupervised learning with applications in genomic”, vol. BSc Data Science Engineering. 2021. Google Scholar BibTex	Ms Thesis
B. Oriol, “Species-agnostic Local Ancestry Inference on Genomic Data with Convolutions”. 2021. Google Scholar BibTex	Ms Thesis
R. Creus, “Unsupervised skill learning from pixels”. 2021. Google Scholar BibTex (19.61 MB)	Ms Thesis
A. Iturralde, “Towards video alignment across cameras with sign language 2D poses”. 2021. Google Scholar BibTex (3.45 MB)	Ms Thesis
J. Aguilar, “2D-to-3D Lifting of Sign Language Body Poses with Recurrent Neural Networks”, UPC ETSETB TelecomBCN, Barcelona, 2021. Google Scholar BibTex (340.44 KB)	Report
L. Tarrés, “GAN-based Image Colourisation with Feature Reconstruction Loss”. 2021. Google Scholar BibTex (12.4 MB)	Ms Thesis
J. José Nieto, “Discovery and Learning of Navigation Goals from Pixels in Minecraft”. 2021. Google Scholar BibTex (16.15 MB)	Ms Thesis
X. Giró-i-Nieto and Ventura, C., “Object Detection with Deep Learning”. 2021. Google Scholar BibTex	Presentation
X. Giró-i-Nieto, “Sign Language Translation and Production Multimedia and Multimodal Challenges for All”. 2021. Google Scholar BibTex	Presentation
P. Caselles, “Disentangling neural network structure from the weights space”. 2021. Google Scholar BibTex (17.1 MB)	Ms Thesis

2020
P. Pérez-Granero, “2D to 3D body pose estimation for sign language with Deep Learning”. 2020. Google Scholar BibTex (2.97 MB)	Ms Thesis
P. Muschik, “Learn2Sign : sign language recognition and translation using human keypoint estimation and transformer model”. 2020. DOI Google Scholar BibTex (5.32 MB)	Ms Thesis
J. Escur, “Attention-based multi-view 3D reconstruction models”. 2020. Google Scholar BibTex	Ms Thesis
C. Puntí, “PixInPix: Hidding Pixels in Pixels”. 2020. Google Scholar BibTex (2.13 MB)	Ms Thesis
I. Kazakos, “Generation of Synthetic Referring Expressions for Object Segmentation in Videos”. 2020. Google Scholar BibTex (4.14 MB)	Ms Thesis
O. Mañas, “Self-Supervised Visual Representation Learning for Remote Sensing”. 2020. Google Scholar BibTex	Ms Thesis
M. Gonzalez-i-Calabuig, “Curriculum Learning for Recurrent Video Object Segmentation”. 2020. Google Scholar BibTex	Ms Thesis
X. Giró-i-Nieto and Ventura, C., “Image Segmentation with Deep Learning”. 2020. Google Scholar BibTex	Presentation
X. Giró-i-Nieto, “Image and Video Object Segmentation with Low Supervision”. 2020. Google Scholar BibTex	Presentation
X. Giró-i-Nieto, “Deep Self-Supervised Learning for All”. 2020. Google Scholar BibTex	Presentation

2019
E. Ramon, Villar, J., Ruiz, G., Batard, T., and Giró-i-Nieto, X., “Plug-and-Train Loss for Model-Based Single View 3D Reconstruction”, BMVA Technical Meeting: 3D vision with Deep Learning. UPC, London, UK, 2019. Google Scholar BibTex (3.97 MB)	Unpublished
J. José Nieto, “Video Saliency Prediction with Deep Neural Networks”. 2019. Google Scholar BibTex (2.17 MB)	Ms Thesis
M. Caros, “A Generative Dialogue System for Reminiscence Therapy”. 2019. Google Scholar BibTex (4.89 MB)	Ms Thesis
M. Tubau, “Wav2Pix: Enhancement and Evaluation of a Speech-conditioned Image Generator”. 2019. Google Scholar BibTex (9.65 MB)	Ms Thesis
A. Comas, “Exploring Methods for Enhancing Linear Prediction of Video Sequences”. 2019. Google Scholar BibTex	Ms Thesis
P. Caselles, “Integrating low-level motion cues in deep video saliency”. 2019. Google Scholar BibTex (10.04 MB)	Ms Thesis
J. LLadós, Bou, E., Bressan, M., Sala, O., and Giró-i-Nieto, X., “The pillars of the Computer Vision Catalan Alliance”. 2019. Google Scholar BibTex	Presentation
M. Granero, “A Video Database for Analyzing Affective Physiological Responses”. 2019. Google Scholar BibTex (23.66 MB)	Ms Thesis
A. Herrera-Palacio, “Recurrent Instance Segmentation with Linguistic Referring Expressions”. 2019. Google Scholar BibTex (3.6 MB)	Ms Thesis
B. Oriol, “Multimodal Hate Speech Detection in Memes”. 2019. Google Scholar BibTex (1.66 MB)	Ms Thesis
X. Giró-i-Nieto, “One Perceptron to Rule Them All: Language and Vision”. 2019. Google Scholar BibTex (15.61 MB)	Presentation

2018
J. Escur, “Exploring Automatic Speech Recognition with TensorFlow”. 2018. Google Scholar BibTex (829.82 KB)	Ms Thesis
E. Ramon, “Deep Learning algorithms for 3D Reconstruction and Simulation of Aesthetic Procedures”. 2018. Google Scholar BibTex	Unpublished
A. Alsina, “An interactive Lifelog Search Engine for LSC2018”. 2018. Google Scholar BibTex (2.75 MB)	Ms Thesis
M. Coll-Pol, “The Importance of Time in Visual Attention Models”. 2018. Google Scholar BibTex (5.46 MB)	Ms Thesis
X. Giró-i-Nieto, “One Perceptron to Rule them All”. 2018. Google Scholar BibTex (8.44 MB)	Presentation
C. Arenas, “Video Understanding through the Disentanglement of Appearance and Motion”. 2018. Google Scholar BibTex (1.06 MB)	Ms Thesis
X. Giró-i-Nieto, “Learning Where and When to Look”. 2018. Google Scholar BibTex	Presentation
S. Roca, “Block-based Speech-to-Speech Translation”. 2018. Google Scholar BibTex (505.01 KB)	Ms Thesis
D. Moreno, Costa-jussà, M. R., and Giró-i-Nieto, X., “English to ASL Translator for Speech2Signs”. 2018. Google Scholar BibTex (1.54 MB)	Unpublished
F. Roldán, “Speech-conditioned Face Generation with Deep Adversarial Networks”. 2018. Google Scholar BibTex (1.79 MB)	Ms Thesis
D. Fojo, “Reproducing and Analyzing Adaptive Computation Time in PyTorch and TensorFlow”. 2018. Google Scholar BibTex (1.41 MB)	Ms Thesis
D. Fernàndez, Bou-Balust, E., and Giró-i-Nieto, X., “Multimodal Knowledge Base Population from News Streams for Media Applications”. 2018. Google Scholar BibTex	Unpublished

2017
O. Bernal, “Predicting emotion in movies: Recurrent and convolutional models applied to videos”. 2017. Google Scholar BibTex (3.05 MB)	Ms Thesis
V. Campos, “Learning to Skip State Updates in Recurrent Neural Networks”. 2017. Google Scholar BibTex (961.49 KB)	Ms Thesis
M. Górriz, “Active Deep Learning for Medical Imaging Segmentation”. 2017. Google Scholar BibTex (2.84 MB)	Ms Thesis
M. Bellver, “Detection-aided medical image segmentation using deep learning”. 2017. Google Scholar BibTex (7.07 MB)	Ms Thesis
M. Assens, “The Temporal Dimension of Visual Attention Models”. 2017. Google Scholar BibTex (6.98 MB)	Ms Thesis
A. Jiménez, “Class Weighted Convolutional Features for Image Retrieval”. 2017. Google Scholar BibTex	Ms Thesis
M. Compri, “Multi-label Remote Sensing Image Retrieval based on Deep Features”. 2017. Google Scholar BibTex (1.99 MB)	Ms Thesis
F. Roldán, “Visual Question Answering 2.0”. 2017. Google Scholar BibTex (2.59 MB)	Ms Thesis
E. Arazo, “The impact of visual saliency prediction in image classification”. 2017. Google Scholar BibTex (828.66 KB)	Ms Thesis
A. Bozal, “Personalized Image Classication from EEG Signals using Deep Learning”. 2017. Google Scholar BibTex (4.51 MB)	Ms Thesis
E. Mohedano, McGuinness, K., Giró-i-Nieto, X., and O'Connor, N., “Fine-tuning of CNN models for Instance Search with Pseudo-Relevance Feedback”. NIPS 2017 Women in Machine Learning Workshop, Long Beach, CA, USA, 2017. Google Scholar BibTex (341.96 KB)	Unpublished
A. Romero-Lopez, “Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Networks”. 2017. Google Scholar BibTex	Ms Thesis
L. L. Cardoner, “Predicting Media Interestingness”. 2017. Google Scholar BibTex (1.78 MB)	Ms Thesis
X. Giró-i-Nieto, Pascual-deLaPuente, S., Miró, V., and Esteve, O., “La meitat de les notícies que consumirem el 2022 seran falses”, 2017. . Google Scholar BibTex	Web Article

2016
C. Reyes, Mohedano, E., McGuinness, K., O'Connor, N., and Giró-i-Nieto, X., “Where did I leave my phone ?”, 4th Workshop on Egocentric (First-Person) Vision, CVPR 2016, Las Vegas, NV, USA, 2016. Google Scholar BibTex (312.27 KB)	Report
M. Chertó, “EgoMon Gaze and Video Dataset for Visual Saliency Prediction”. 2016. Google Scholar BibTex (1.48 MB)	Ms Thesis
D. Fernàndez, Campos, V., Jou, B., Giró-i-Nieto, X., and Chang, S. - F., “Is a “happy dog” more “happy” than “dog”? - Adjective and Noun Contributions for Adjective-Noun Pair prediction”, NIPS Women in Machine Learning Workshop. Barcelona, 2016. Google Scholar BibTex (3.11 MB)	Unpublished
M. Carné-Herrera, “Detect Snap Points in Egocentric Images with Physiological Signals”. 2016. Google Scholar BibTex (4.63 MB)	Ms Thesis
C. Reyes, “Time-sensitive Egocentric Image Retrieval for Fidings Objects in Lifelogs”. 2016. Google Scholar BibTex (10.23 MB)	Ms Thesis
M. Bellver, Giró-i-Nieto, X., and Marqués, F., “Efficient search of objects in images using deep reinforcement learning”, NIPS Women in Machine Learning Workshop. Barcelona., 2016. Google Scholar BibTex	Unpublished
I. Masuda-Mora, “Open-Ended Visual Question-Answering”. 2016. Google Scholar BibTex (7.03 MB)	Ms Thesis
A. Nespereira, “Siamese Convolutional Neural Network for Learning Object Similarities in RGB-D Images”. 2016. Google Scholar BibTex	Ms Thesis
A. Montes, “Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks”. 2016. Google Scholar BibTex (27.84 MB)	Ms Thesis
A. Calafell, “Video Retrieval of Specific Persons in Specific Locations”. 2016. Google Scholar BibTex (17.14 MB)	Ms Thesis
I. Masuda-Mora, Pascual-deLaPuente, S., and Giró-i-Nieto, X., “Towards Automatic Generation of Question Answer Pairs from Images”, Visual Question Answering Challenge Workshop, CVPR 2016, Las Vegas, NV, USA, 2016. Google Scholar BibTex (206.92 KB)	Report
D. Fernàndez, “Clustering and Prediction of Adjective-Noun Pairs for Affective Computing”. 2016. Google Scholar BibTex (10.38 MB)	Ms Thesis
M. Carné-Herrera, Giró-i-Nieto, X., and Gurrin, C., “EgoMemNet: Visual Memorability Adaptation to Egocentric Images”, 4th Workshop on Egocentric (First-Person) Vision, CVPR 2016, Las Vegas, NV, USA, 2016. Google Scholar BibTex (265.4 KB)	Report
M. Carné-Herrera, “Visual Memorability for Egocentric Cameras”. 2016. Google Scholar BibTex (98.86 MB)	Ms Thesis
A. Ferri, “Object Tracking in Video with TensorFlow”. 2016. Google Scholar BibTex (22.63 MB)	Ms Thesis

2015
R. Mestre, “Visual Summary of Egocentric Photostreams by Representative Keyframes”. 2015. Google Scholar BibTex (1.36 MB)	Ms Thesis
M. Bellver, “Efficient Exploration of Region Hierarchies for Semantic Segmentation”. 2015. Google Scholar BibTex (11.62 MB)	Ms Thesis
S. Porta, “Rapid Serial Visual Presentation for Relevance Feedback in Image Retrieval with EEG Signals”. 2015. Google Scholar BibTex (1.38 MB)	Ms Thesis
V. Campos, “Layer-wise CNN Surgery for Visual Sentiment Prediction”. 2015. Google Scholar BibTex (1.51 MB)	Ms Thesis
G. de Oliveira-Barra, “LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System”. 2015. Google Scholar BibTex (8.3 MB)	Ms Thesis
C. Ventura, Giró-i-Nieto, X., Vilaplana, V., McGuinness, K., Marqués, F., and O'Connor, N. E., “Improving Spatial Codification in Semantic Segmentation (Supplementary Material)”, 2015. Google Scholar BibTex (18.81 MB)	Report
J. Pan and Giró-i-Nieto, X., “End-to-end Convolutional Network for Saliency Prediction”, arXiv, Boston, MA (USA), 2015. Google Scholar BibTex (1.18 MB)	Report
A. Lidon, “Semantic and Diverse Summarization of Egocentric Photo Events”. 2015. Google Scholar BibTex (5.34 MB)	Ms Thesis
F. Cabezas, “Co-filtering human interaction and object segmentation”. 2015. Google Scholar BibTex (1.82 MB)	Ms Thesis
E. Fontdevila-Bosch, “Region-oriented Convolutional Networks for Object Retrieval”. 2015. Google Scholar BibTex (8.02 MB)	Ms Thesis
C. Ramos-Caballero, “Keyframe-based Video Summarization Designer”. 2015. Google Scholar BibTex (2.64 MB)	Ms Thesis
I. Gris-Sarabia, “Pyxel, una llibreria per a l’anotació automàtica de fotografies”. 2015. Google Scholar BibTex (1.12 MB)	Ms Thesis
J. Roldan-Carlos, “Visual Search for Musical Performances and Endoscopic Videos”. 2015. Google Scholar BibTex (12.35 MB)	Ms Thesis
A. Calafell, “Fine-tuning a Convolutional Network for Cultural Event Recognition”. 2015. Google Scholar BibTex (11.14 MB)	Ms Thesis
J. Pan, “Visual Saliency Prediction using Deep learning Techniques”. 2015. Google Scholar BibTex (1.57 MB)	Ms Thesis

2014
D. Almendros-Gutiérrez, “Visual instance mining of news videos using a graph-based approach”. 2014. Google Scholar BibTex (4.12 MB)	Ms Thesis
M. Tella, “Contextless Object Recognition with Shape-enriched SIFT and Bags of Features”. 2014. Google Scholar BibTex (4.82 MB)	Ms Thesis
S. Imedio-Pereira, “An investigation of eye gaze tracking utilities in image object recognition”. 2014. Google Scholar BibTex (1.63 MB)	Ms Thesis
D. Manchon-Vizuete, “Low computational cost algorithms for photo clustering and mail signature detection in the cloud”. 2014. Google Scholar BibTex	Ms Thesis
A. Salvador, “Exploiting User Interaction and Object Candidates for Instance Retrieval and Object Segmentation”. 2014. Google Scholar BibTex (8.97 MB)	Ms Thesis
J. Sánchez-Escué, “Bundling interest points for object classification”. 2014. Google Scholar BibTex (2.15 MB)	Ms Thesis
M. Ferrarons-Betrian, “Mobile Visual Search at Catchoom”. 2014. Google Scholar BibTex	Ms Thesis

2013
C. Ventura, “Visual Object Analysis Using Regions and Interest Points”, ACM Multimedia. 2013. DOI Google Scholar BibTex (132.2 KB)	Ms Thesis
M. Martos, “Content-based Video Summarisation to Object Maps”. 2013. Google Scholar BibTex (3.73 MB)	Ms Thesis
A. Garcia-delMolino, “Extension of Instance Search Technique by Geometric Coding and Quantization Error Compensation”. 2013. Google Scholar BibTex	Ms Thesis
L. Tort, “Video Clustering Using Camera Motion”. 2013. Google Scholar BibTex (8.87 MB)	Ms Thesis
C. Ventura, “Visual Object Analysis Using Regions and Interest Points”. 2013. Google Scholar BibTex (4.62 MB)	Ms Thesis
A. Salvador, “Crowdsourced Object Segmentation with a Game”. 2013. Google Scholar BibTex (1.34 MB)	Ms Thesis
E. Mohedano, “Investigating EEG for Saliency and Segmentation Applications in Image Processing”. 2013. Google Scholar BibTex (332.54 KB)	Ms Thesis
J. Antoja-Sabin, “El telèfon mòbil com a eina d'aprenentatge informal”. 2013. Google Scholar BibTex (1.55 MB)	Ms Thesis

2011
A. Rubiano, “Búsqueda Visual con Retroacción de Relevancia Basada en Actualizacion de Pesos”. 2011. Google Scholar BibTex (1.07 MB)	Ms Thesis
C. Ventura, “Tools for Image Retrieval in Large Multimedia Databases”. 2011. Google Scholar BibTex (6.33 MB)	Ms Thesis
M. Alfaro, “Reordenació i agrupament d'imatges d'una cerca de vídeo”. 2011. Google Scholar BibTex (24.81 MB)	Ms Thesis
M. Tella, “Interactive Image Processing demonstrations for the web”. 2011. Google Scholar BibTex (1.77 MB)	Ms Thesis
E. Carcel, “Rich Internet Application for the Semi-Automatic Annotation of Semantic Shots on Keyframes”. 2011. Google Scholar BibTex (6.58 MB)	Ms Thesis

2010
S. Cortés, “GOS: búsqueda visual de imágenes”, 25, 2010. Google Scholar BibTex	Report
C. Ruiz-Sancho, “Tweet@TV: Televisió social en 140 caràcters”. 2010. Google Scholar BibTex (6.63 MB)	Ms Thesis
P. Muñoz-Trallero, “Extensió d'una interfície de cerca d'imatges a les consultes amb regions”. 2010. Google Scholar BibTex	Ms Thesis
B. Girvent, “Servei de vídeos a la carta per a l'iPhone”. 2010. Google Scholar BibTex (10.26 MB)	Ms Thesis
M. Gimeno, “Interfície gràfica d'usuari per a l'avaluació de classificadors d'imatges”. 2010. Google Scholar BibTex	Ms Thesis

2009
S. Cortés, “Interfaz gráfica de usuario para la búsqueda de imágenes basada en imágenes”. 2009. Google Scholar BibTex (2.84 MB)	Ms Thesis
R. Salla-Rovira, “Aplicació rica d'internet per a la consulta amb text i imatge a la Corporació Catalana de Mitjans Audiovisuals”. 2009. Google Scholar BibTex (6.01 MB)	Ms Thesis

2004
X. Giró-i-Nieto, “La convergència de la TV cap al PC”, Diari Avui, Barcelona, Catalonia, 2004. Google Scholar BibTex	Report

2000
X. Giró-i-Nieto, “Volumetric Data Compression based on Cube-Splitting and Embedded Block Coding by Optimized Truncation”. 2000. Google Scholar BibTex (1.36 MB)	Ms Thesis