Amaia Salvador
Contributions while at GPI
Journal Articles
“Mask-guided sample selection for Semi-Supervised Instance Segmentation”, Multimedia Tools and Applications, 2020. (2.2 MB) | ,
“Assessment of Crowdsourcing and Gamification Loss in User-Assisted Object Segmentation”, Multimedia Tools and Applications, vol. 23, no. 75, 2016. (5.05 MB) | ,
Book Chapters and Books
“Object Retrieval with Deep Convolutional Features”, in Deep Learning for Image Processing Applications, vol. 31, Amsterdam, The Netherlands: IOS Press, 2017. | ,
Conference Papers
“Budget-aware Semi-Supervised Semantic and Instance Segmentation”, in CVPR 2019 DeepVision Workshop, Long Beach, CA, USA, 2019. (6.59 MB) | ,
“Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks”, in ICASSP, Brighton, UK, 2019. (4.42 MB) | ,
“RVOS: End-to-End Recurrent Network for Video Object Segmentation”, in CVPR, Long Beach, CA, USA, 2019. (5.76 MB) | ,
“Inverse Cooking: Recipe Generation from Food Images”, in CVPR, Long Beach, CA, USA, 2019. | ,
“Recurrent Neural Networks for Semantic Instance Segmentation”, in ECCV 2018 Women in Computer Vision (WiCV) Workshop, 2018. (2.55 MB) | ,
Theses
“Computer Vision beyond the visible: Image understanding through language”, Universitat Politecnica de Catalunya, Barcelona, 2019. | ,
Other
“Speech-conditioned Face Generation with Deep Adversarial Networks”. 2018. (1.79 MB) | ,Ms Thesis |
“MIT is building a system that can identify a recipe using pictures of food”, 2017. . | ,Web Article |
“Snap a photo, get a recipe: pic2recipe uses AI to predict food ingredients”, 2017. . | ,Web Article |
“Artificial intelligence suggests recipes based on food photos”, 2017. . | ,Web Article |
“Object Tracking in Video with TensorFlow”. 2016. (22.63 MB) | ,Ms Thesis |
Projects
Cross-modal Deep Learning between Vision, Language, Audio and Speech | European | Oct 2018 | Sep 2021 | |
SGR17 - Image and Video Processing Group | National | Jan 2017 | Sep 2021 | |
BigGraph - Heterogeneous information and graph signal processing for the Big Data era. Application to high-throughput, remote sensing, multimedia and human computer interfaces. | National | Jan 2014 | Dec 2017 | |
SGR14 - Image and Video Processing Group | National | Jan 2014 | Apr 2017 |
Research Areas
Language and Vision | Internal | Feb 2016 | Dec 2021 | |
Region-based image and video processing | Internal | Jan 1992 | Dec 2020 | |
Deep learning | Internal | Jun 2014 | Dec 2020 | |
Multimedia Retrieval | Internal | Sep 2001 | Dec 2018 | |
Crowdsourcing | Internal | Jan 2013 | Dec 2015 |
Demos and Resources
UPC at CVPRW ActivityNet Challenge 2016 | Software | Jun 2016 | |
Faster R-CNN Features for Instance Search (software) | Software | May 2016 | |
C3D Model for Keras trained over Sports 1M | Software | Apr 2016 | |
Diving Deep into Sentiment: Understanding Fine-tuned CNNs for Visual Sentiment Prediction (software) | Software | Mar 2016 | |
Terrassa Buildings 4126 | Dataset | Dec 2015 | |
Cultural Event Recognition with Computer Vision (software) | Software | Mar 2015 | |
Filters of users and clicks for noisy interactions in object segmentation | Software | Feb 2015 | |
Pyxel, a Python library for the automatic annotation of web photos | Software | Jan 2015 | |
Click'n'Cut: Online interactive segmentation | Demo | Oct 2014 | |
Simulator of Clicks and Object Segmentation for Ask'n'Seek | Software | Jul 2013 | |
User traces and additional simulated results from Ask'n'Seek | Results | Jul 2013 |