Other Publications by Xavier Giró back

2022
M. A. Mohamed, Exploring Visual Representations for Sign Language Translation. 2022. (700.12 KB) Ms Thesis
P. Cabot, Sign Language Translation based on Transformers for the How2Sign Dataset, 2022. (1.56 MB) Report
Á. Budria, Multimodal 3D Hand Pose Enhancement for Sign Language, 2022. (1.2 MB) Report
M. Barrabés, Machine Learning for Genomic Sequence Processing, 2022. Report
P. Cabot, Tarrés, L., and Giró-i-Nieto, X., Sign-Language Translation with Pseudo-Glosses. 2022. (7.51 MB) Ms Thesis
M. Perera, Ancestry-conditioned Generative Models for Genotyping, 2022. Report
Á. Budria, Topic Detection from Sign Language Videos. 2022. Ms Thesis
T. Domenech, Hiding Images in their Spoken Narratives. 2022. (15.23 MB) Ms Thesis
X. Giró-i-Nieto and Duarte, A., Towards Sign Language Translation and Production. 2022. Presentation
2021
X. Giró-i-Nieto, Learning Representations for Sign Language Videos. 2021. Presentation
M. Escobar, Object Model Adaptation for Multiple Object Tracking, 2021. (785.3 KB) Report
X. Giró-i-Nieto, Deep Learning Representations for All (a.ka. the AI hype). 2021. (10.95 MB) Presentation
M. Geleta, Unsupervised learning with applications in genomic, vol. BSc Data Science Engineering. 2021. Ms Thesis
B. Oriol, Species-agnostic Local Ancestry Inference on Genomic Data with Convolutions. 2021. Ms Thesis
R. Creus, Unsupervised skill learning from pixels. 2021. (19.61 MB) Ms Thesis
A. Iturralde, Towards video alignment across cameras with sign language 2D poses. 2021. (3.45 MB) Ms Thesis
J. Aguilar, 2D-to-3D Lifting of Sign Language Body Poses with Recurrent Neural Networks, UPC ETSETB TelecomBCN, Barcelona, 2021. (340.44 KB) Report
L. Tarrés, GAN-based Image Colourisation with Feature Reconstruction Loss. 2021. (12.4 MB) Ms Thesis
J. José Nieto, Discovery and Learning of Navigation Goals from Pixels in Minecraft. 2021. (16.15 MB) Ms Thesis
X. Giró-i-Nieto and Ventura, C., Object Detection with Deep Learning. 2021. Presentation
X. Giró-i-Nieto, Sign Language Translation and Production Multimedia and Multimodal Challenges for All. 2021. Presentation
P. Caselles, Disentangling neural network structure from the weights space. 2021. (17.1 MB) Ms Thesis
2020
P. Pérez-Granero, 2D to 3D body pose estimation for sign language with Deep Learning. 2020. (2.97 MB) Ms Thesis
P. Muschik, Learn2Sign : sign language recognition and translation using human keypoint estimation and transformer model. 2020. (5.32 MB) Ms Thesis
J. Escur, Attention-based multi-view 3D reconstruction models. 2020. Ms Thesis
C. Puntí, PixInPix: Hidding Pixels in Pixels. 2020. (2.13 MB) Ms Thesis
I. Kazakos, Generation of Synthetic Referring Expressions for Object Segmentation in Videos. 2020. (4.14 MB) Ms Thesis
O. Mañas, Self-Supervised Visual Representation Learning for Remote Sensing. 2020. Ms Thesis
M. Gonzalez-i-Calabuig, Curriculum Learning for Recurrent Video Object Segmentation. 2020. Ms Thesis
X. Giró-i-Nieto and Ventura, C., Image Segmentation with Deep Learning. 2020. Presentation
X. Giró-i-Nieto, Image and Video Object Segmentation with Low Supervision. 2020. Presentation
X. Giró-i-Nieto, Deep Self-Supervised Learning for All. 2020. Presentation
2019
E. Ramon, Villar, J., Ruiz, G., Batard, T., and Giró-i-Nieto, X., Plug-and-Train Loss for Model-Based Single View 3D Reconstruction, BMVA Technical Meeting: 3D vision with Deep Learning. UPC, London, UK, 2019. (3.97 MB) Unpublished
J. José Nieto, Video Saliency Prediction with Deep Neural Networks. 2019. (2.17 MB) Ms Thesis
M. Caros, A Generative Dialogue System for Reminiscence Therapy. 2019. (4.89 MB) Ms Thesis
M. Tubau, Wav2Pix: Enhancement and Evaluation of a Speech-conditioned Image Generator. 2019. (9.65 MB) Ms Thesis
A. Comas, Exploring Methods for Enhancing Linear Prediction of Video Sequences. 2019. Ms Thesis
P. Caselles, Integrating low-level motion cues in deep video saliency. 2019. (10.04 MB) Ms Thesis
J. LLadós, Bou, E., Bressan, M., Sala, O., and Giró-i-Nieto, X., The pillars of the Computer Vision Catalan Alliance. 2019. Presentation
M. Granero, A Video Database for Analyzing Affective Physiological Responses. 2019. (23.66 MB) Ms Thesis
A. Herrera-Palacio, Recurrent Instance Segmentation with Linguistic Referring Expressions. 2019. (3.6 MB) Ms Thesis
B. Oriol, Multimodal Hate Speech Detection in Memes. 2019. (1.66 MB) Ms Thesis
X. Giró-i-Nieto, One Perceptron to Rule Them All: Language and Vision. 2019. (15.61 MB) Presentation
2018
J. Escur, Exploring Automatic Speech Recognition with TensorFlow. 2018. (829.82 KB) Ms Thesis
E. Ramon, Deep Learning algorithms for 3D Reconstruction and Simulation of Aesthetic Procedures. 2018. Unpublished
A. Alsina, An interactive Lifelog Search Engine for LSC2018. 2018. (2.75 MB) Ms Thesis
M. Coll-Pol, The Importance of Time in Visual Attention Models. 2018. (5.46 MB) Ms Thesis
X. Giró-i-Nieto, One Perceptron to Rule them All. 2018. (8.44 MB) Presentation
C. Arenas, Video Understanding through the Disentanglement of Appearance and Motion. 2018. (1.06 MB) Ms Thesis
X. Giró-i-Nieto, Learning Where and When to Look. 2018. Presentation
S. Roca, Block-based Speech-to-Speech Translation. 2018. (505.01 KB) Ms Thesis
D. Moreno, Costa-jussà, M. R., and Giró-i-Nieto, X., English to ASL Translator for Speech2Signs. 2018. (1.54 MB) Unpublished
F. Roldán, Speech-conditioned Face Generation with Deep Adversarial Networks. 2018. (1.79 MB) Ms Thesis
D. Fojo, Reproducing and Analyzing Adaptive Computation Time in PyTorch and TensorFlow. 2018. (1.41 MB) Ms Thesis
D. Fernàndez, Bou-Balust, E., and Giró-i-Nieto, X., Multimodal Knowledge Base Population from News Streams for Media Applications. 2018. Unpublished
2017
O. Bernal, Predicting emotion in movies: Recurrent and convolutional models applied to videos. 2017. (3.05 MB) Ms Thesis
V. Campos, Learning to Skip State Updates in Recurrent Neural Networks. 2017. (961.49 KB) Ms Thesis
M. Górriz, Active Deep Learning for Medical Imaging Segmentation. 2017. (2.84 MB) Ms Thesis
M. Bellver, Detection-aided medical image segmentation using deep learning. 2017. (7.07 MB) Ms Thesis
M. Assens, The Temporal Dimension of Visual Attention Models. 2017. (6.98 MB) Ms Thesis
A. Jiménez, Class Weighted Convolutional Features for Image Retrieval. 2017. Ms Thesis
M. Compri, Multi-label Remote Sensing Image Retrieval based on Deep Features. 2017. (1.99 MB) Ms Thesis
F. Roldán, Visual Question Answering 2.0. 2017. (2.59 MB) Ms Thesis
E. Arazo, The impact of visual saliency prediction in image classification. 2017. (828.66 KB) Ms Thesis
A. Bozal, Personalized Image Classi cation from EEG Signals using Deep Learning. 2017. (4.51 MB) Ms Thesis
E. Mohedano, McGuinness, K., Giró-i-Nieto, X., and O'Connor, N., Fine-tuning of CNN models for Instance Search with Pseudo-Relevance Feedback. NIPS 2017 Women in Machine Learning Workshop, Long Beach, CA, USA, 2017. (341.96 KB) Unpublished
A. Romero-Lopez, Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Networks. 2017. Ms Thesis
L. L. Cardoner, Predicting Media Interestingness. 2017. (1.78 MB) Ms Thesis
X. Giró-i-Nieto, Pascual-deLaPuente, S., Miró, V., and Esteve, O., La meitat de les notícies que consumirem el 2022 seran falses, 2017. . Web Article
2016
C. Reyes, Mohedano, E., McGuinness, K., O'Connor, N., and Giró-i-Nieto, X., Where did I leave my phone ?, 4th Workshop on Egocentric (First-Person) Vision, CVPR 2016, Las Vegas, NV, USA, 2016. (312.27 KB) Report
M. Chertó, EgoMon Gaze and Video Dataset for Visual Saliency Prediction. 2016. (1.48 MB) Ms Thesis
D. Fernàndez, Campos, V., Jou, B., Giró-i-Nieto, X., and Chang, S. - F., Is a “happy dog” more “happy” than “dog”? - Adjective and Noun Contributions for Adjective-Noun Pair prediction, NIPS Women in Machine Learning Workshop. Barcelona, 2016. (3.11 MB) Unpublished
M. Carné-Herrera, Detect Snap Points in Egocentric Images with Physiological Signals. 2016. (4.63 MB) Ms Thesis
C. Reyes, Time-sensitive Egocentric Image Retrieval for Fidings Objects in Lifelogs. 2016. (10.23 MB) Ms Thesis
M. Bellver, Giró-i-Nieto, X., and Marqués, F., Efficient search of objects in images using deep reinforcement learning, NIPS Women in Machine Learning Workshop. Barcelona., 2016. Unpublished
I. Masuda-Mora, Open-Ended Visual Question-Answering. 2016. (7.03 MB) Ms Thesis
A. Nespereira, Siamese Convolutional Neural Network for Learning Object Similarities in RGB-D Images. 2016. Ms Thesis
A. Montes, Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks. 2016. (27.84 MB) Ms Thesis
A. Calafell, Video Retrieval of Specific Persons in Specific Locations. 2016. (17.14 MB) Ms Thesis
I. Masuda-Mora, Pascual-deLaPuente, S., and Giró-i-Nieto, X., Towards Automatic Generation of Question Answer Pairs from Images, Visual Question Answering Challenge Workshop, CVPR 2016, Las Vegas, NV, USA, 2016. (206.92 KB) Report
D. Fernàndez, Clustering and Prediction of Adjective-Noun Pairs for Affective Computing. 2016. (10.38 MB) Ms Thesis
M. Carné-Herrera, Giró-i-Nieto, X., and Gurrin, C., EgoMemNet: Visual Memorability Adaptation to Egocentric Images, 4th Workshop on Egocentric (First-Person) Vision, CVPR 2016, Las Vegas, NV, USA, 2016. (265.4 KB) Report
M. Carné-Herrera, Visual Memorability for Egocentric Cameras. 2016. (98.86 MB) Ms Thesis
A. Ferri, Object Tracking in Video with TensorFlow. 2016. (22.63 MB) Ms Thesis
2015
R. Mestre, Visual Summary of Egocentric Photostreams by Representative Keyframes. 2015. (1.36 MB) Ms Thesis
M. Bellver, Efficient Exploration of Region Hierarchies for Semantic Segmentation. 2015. (11.62 MB) Ms Thesis
S. Porta, Rapid Serial Visual Presentation for Relevance Feedback in Image Retrieval with EEG Signals. 2015. (1.38 MB) Ms Thesis
V. Campos, Layer-wise CNN Surgery for Visual Sentiment Prediction. 2015. (1.51 MB) Ms Thesis
G. de Oliveira-Barra, LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System. 2015. (8.3 MB) Ms Thesis
C. Ventura, Giró-i-Nieto, X., Vilaplana, V., McGuinness, K., Marqués, F., and O'Connor, N. E., Improving Spatial Codification in Semantic Segmentation (Supplementary Material), 2015. (18.81 MB) Report
J. Pan and Giró-i-Nieto, X., End-to-end Convolutional Network for Saliency Prediction, arXiv, Boston, MA (USA), 2015. (1.18 MB) Report
A. Lidon, Semantic and Diverse Summarization of Egocentric Photo Events. 2015. (5.34 MB) Ms Thesis
F. Cabezas, Co-filtering human interaction and object segmentation. 2015. (1.82 MB) Ms Thesis
E. Fontdevila-Bosch, Region-oriented Convolutional Networks for Object Retrieval. 2015. (8.02 MB) Ms Thesis
C. Ramos-Caballero, Keyframe-based Video Summarization Designer. 2015. (2.64 MB) Ms Thesis
I. Gris-Sarabia, Pyxel, una llibreria per a l’anotació automàtica de fotografies. 2015. (1.12 MB) Ms Thesis
J. Roldan-Carlos, Visual Search for Musical Performances and Endoscopic Videos. 2015. (12.35 MB) Ms Thesis
A. Calafell, Fine-tuning a Convolutional Network for Cultural Event Recognition. 2015. (11.14 MB) Ms Thesis
J. Pan, Visual Saliency Prediction using Deep learning Techniques. 2015. (1.57 MB) Ms Thesis
2014
D. Almendros-Gutiérrez, Visual instance mining of news videos using a graph-based approach. 2014. (4.12 MB) Ms Thesis
M. Tella, Contextless Object Recognition with Shape-enriched SIFT and Bags of Features. 2014. (4.82 MB) Ms Thesis
S. Imedio-Pereira, An investigation of eye gaze tracking utilities in image object recognition. 2014. (1.63 MB) Ms Thesis
D. Manchon-Vizuete, Low computational cost algorithms for photo clustering and mail signature detection in the cloud. 2014. Ms Thesis
A. Salvador, Exploiting User Interaction and Object Candidates for Instance Retrieval and Object Segmentation. 2014. (8.97 MB) Ms Thesis
J. Sánchez-Escué, Bundling interest points for object classification. 2014. (2.15 MB) Ms Thesis
M. Ferrarons-Betrian, Mobile Visual Search at Catchoom. 2014. Ms Thesis
2013
C. Ventura, Visual Object Analysis Using Regions and Interest Points, ACM Multimedia. 2013. (132.2 KB) Ms Thesis
M. Martos, Content-based Video Summarisation to Object Maps. 2013. (3.73 MB) Ms Thesis
A. Garcia-delMolino, Extension of Instance Search Technique by Geometric Coding and Quantization Error Compensation. 2013. Ms Thesis
L. Tort, Video Clustering Using Camera Motion. 2013. (8.87 MB) Ms Thesis
C. Ventura, Visual Object Analysis Using Regions and Interest Points. 2013. (4.62 MB) Ms Thesis
A. Salvador, Crowdsourced Object Segmentation with a Game. 2013. (1.34 MB) Ms Thesis
E. Mohedano, Investigating EEG for Saliency and Segmentation Applications in Image Processing. 2013. (332.54 KB) Ms Thesis
J. Antoja-Sabin, El telèfon mòbil com a eina d'aprenentatge informal. 2013. (1.55 MB) Ms Thesis
2011
A. Rubiano, Búsqueda Visual con Retroacción de Relevancia Basada en Actualizacion de Pesos. 2011. (1.07 MB) Ms Thesis
C. Ventura, Tools for Image Retrieval in Large Multimedia Databases. 2011. (6.33 MB) Ms Thesis
M. Alfaro, Reordenació i agrupament d'imatges d'una cerca de vídeo. 2011. (24.81 MB) Ms Thesis
M. Tella, Interactive Image Processing demonstrations for the web. 2011. (1.77 MB) Ms Thesis
E. Carcel, Rich Internet Application for the Semi-Automatic Annotation of Semantic Shots on Keyframes. 2011. (6.58 MB) Ms Thesis
2010
S. Cortés, GOS: búsqueda visual de imágenes, 25, 2010. Report
C. Ruiz-Sancho, Tweet@TV: Televisió social en 140 caràcters. 2010. (6.63 MB) Ms Thesis
P. Muñoz-Trallero, Extensió d'una interfície de cerca d'imatges a les consultes amb regions. 2010. Ms Thesis
B. Girvent, Servei de vídeos a la carta per a l'iPhone. 2010. (10.26 MB) Ms Thesis
M. Gimeno, Interfície gràfica d'usuari per a l'avaluació de classificadors d'imatges. 2010. Ms Thesis
2009
S. Cortés, Interfaz gráfica de usuario para la búsqueda de imágenes basada en imágenes. 2009. (2.84 MB) Ms Thesis
R. Salla-Rovira, Aplicació rica d'internet per a la consulta amb text i imatge a la Corporació Catalana de Mitjans Audiovisuals. 2009. (6.01 MB) Ms Thesis
2004
X. Giró-i-Nieto, La convergència de la TV cap al PC, Diari Avui, Barcelona, Catalonia, 2004. Report
2000
X. Giró-i-Nieto, Volumetric Data Compression based on Cube-Splitting and Embedded Block Coding by Optimized Truncation. 2000. (1.36 MB) Ms Thesis