Amaia Salvador
Contributions while at GPI
Journal Articles
| “Mask-guided sample selection for Semi-Supervised Instance Segmentation”, Multimedia Tools and Applications, 2020. |
| “Assessment of Crowdsourcing and Gamification Loss in User-Assisted Object Segmentation”, Multimedia Tools and Applications, vol. 75, no. 23, 2016. |
Book Chapters and Books
| “Object Retrieval with Deep Convolutional Features”, in Deep Learning for Image Processing Applications, vol. 31, Amsterdam, The Netherlands: IOS Press, 2017. |
Conference Papers
| “Budget-aware Semi-Supervised Semantic and Instance Segmentation”, in CVPR 2019 DeepVision Workshop, Long Beach, CA, USA, 2019. |
| “Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks”, in ICASSP, Brighton, UK, 2019. |
| “RVOS: End-to-End Recurrent Network for Video Object Segmentation”, in CVPR, Long Beach, CA, USA, 2019. |
| “Inverse Cooking: Recipe Generation from Food Images”, in CVPR, Long Beach, CA, USA, 2019. |
| “Recurrent Neural Networks for Semantic Instance Segmentation”, in ECCV 2018 Women in Computer Vision (WiCV) Workshop, 2018. |
Theses
| “Computer Vision beyond the visible: Image understanding through language”, Universitat Politècnica de Catalunya, Barcelona, 2019. |
Other
| “Speech-conditioned Face Generation with Deep Adversarial Networks”. 2018. | Ms Thesis |
| “MIT is building a system that can identify a recipe using pictures of food”, 2017. | Web Article |
| “Snap a photo, get a recipe: pic2recipe uses AI to predict food ingredients”, 2017. | Web Article |
| “Artificial intelligence suggests recipes based on food photos”, 2017. | Web Article |
| “Object Tracking in Video with TensorFlow”. 2016. | Ms Thesis |
Projects
| Cross-modal Deep Learning between Vision, Language, Audio and Speech | European | Oct 2018 | Sep 2021 |
| SGR17 - Image and Video Processing Group | National | Jan 2017 | Sep 2021 |
| BigGraph - Heterogeneous information and graph signal processing for the Big Data era. Application to high-throughput, remote sensing, multimedia and human computer interfaces. | National | Jan 2014 | Dec 2017 |
| SGR14 - Image and Video Processing Group | National | Jan 2014 | Apr 2017 |
Research Areas
| Language and Vision | Internal | Feb 2016 | Dec 2021 |
| Region-based image and video processing | Internal | Jan 1992 | Dec 2020 |
| Deep learning | Internal | Jun 2014 | Dec 2020 |
| Multimedia Retrieval | Internal | Sep 2001 | Dec 2018 |
| Crowdsourcing | Internal | Jan 2013 | Dec 2015 |
Demos and Resources
| UPC at CVPRW ActivityNet Challenge 2016 | Software | Jun 2016 |
| Faster R-CNN Features for Instance Search (software) | Software | May 2016 |
| C3D Model for Keras trained over Sports 1M | Software | Apr 2016 |
| Diving Deep into Sentiment: Understanding Fine-tuned CNNs for Visual Sentiment Prediction (software) | Software | Mar 2016 |
| Terrassa Buildings 4126 | Dataset | Dec 2015 |
| Cultural Event Recognition with Computer Vision (software) | Software | Mar 2015 |
| Filters of users and clicks for noisy interactions in object segmentation | Software | Feb 2015 |
| Pyxel, a Python library for the automatic annotation of web photos | Software | Jan 2015 |
| Click'n'Cut: Online interactive segmentation | Demo | Oct 2014 |
| Simulator of Clicks and Object Segmentation for Ask'n'Seek | Software | Jul 2013 |
| User traces and additional simulated results from Ask'n'Seek | Results | Jul 2013 |
