Amaia Salvador
Contributions while at GPI
Journal Articles
|
, “Mask-guided sample selection for Semi-Supervised Instance Segmentation”, Multimedia Tools and Applications, 2020. |
|
, “Assessment of Crowdsourcing and Gamification Loss in User-Assisted Object Segmentation”, Multimedia Tools and Applications, vol. 23, no. 75, 2016. |
Book Chapters and Books
| , “Object Retrieval with Deep Convolutional Features”, in Deep Learning for Image Processing Applications, vol. 31, Amsterdam, The Netherlands: IOS Press, 2017. |
Conference Papers
|
, “Budget-aware Semi-Supervised Semantic and Instance Segmentation”, in CVPR 2019 DeepVision Workshop, Long Beach, CA, USA, 2019. |
|
, “Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks”, in ICASSP, Brighton, UK, 2019. |
|
, “RVOS: End-to-End Recurrent Network for Video Object Segmentation”, in CVPR, Long Beach, CA, USA, 2019. |
| , “Inverse Cooking: Recipe Generation from Food Images”, in CVPR, Long Beach, CA, USA, 2019. |
|
, “Recurrent Neural Networks for Semantic Instance Segmentation”, in ECCV 2018 Women in Computer Vision (WiCV) Workshop, 2018. |
Theses
| , “Computer Vision beyond the visible: Image understanding through language”, Universitat Politecnica de Catalunya, Barcelona, 2019. |
Other
|
, “Speech-conditioned Face Generation with Deep Adversarial Networks”. 2018. |
Ms Thesis |
| , “MIT is building a system that can identify a recipe using pictures of food”, 2017. . | Web Article |
| , “Snap a photo, get a recipe: pic2recipe uses AI to predict food ingredients”, 2017. . | Web Article |
| , “Artificial intelligence suggests recipes based on food photos”, 2017. . | Web Article |
|
, “Object Tracking in Video with TensorFlow”. 2016. |
Ms Thesis |
Projects
|
|
Cross-modal Deep Learning between Vision, Language, Audio and Speech | European | Oct 2018 | Sep 2021 |
|
|
SGR17 - Image and Video Processing Group | National | Jan 2017 | Sep 2021 |
|
|
BigGraph - Heterogeneous information and graph signal processing for the Big Data era. Application to high-throughput, remote sensing, multimedia and human computer interfaces. | National | Jan 2014 | Dec 2017 |
|
|
SGR14 - Image and Video Processing Group | National | Jan 2014 | Apr 2017 |
Research Areas
|
|
Language and Vision | Internal | Feb 2016 | Dec 2021 |
|
|
Region-based image and video processing | Internal | Jan 1992 | Dec 2020 |
|
|
Deep learning | Internal | Jun 2014 | Dec 2020 |
|
|
Multimedia Retrieval | Internal | Sep 2001 | Dec 2018 |
|
|
Crowdsourcing | Internal | Jan 2013 | Dec 2015 |
Demos and Resources
|
|
UPC at CVPRW ActivityNet Challenge 2016 | Software | Jun 2016 |
|
|
Faster R-CNN Features for Instance Search (software) | Software | May 2016 |
|
|
C3D Model for Keras trained over Sports 1M | Software | Apr 2016 |
|
|
Diving Deep into Sentiment: Understanding Fine-tuned CNNs for Visual Sentiment Prediction (software) | Software | Mar 2016 |
|
|
Terrassa Buildings 4126 | Dataset | Dec 2015 |
|
|
Cultural Event Recognition with Computer Vision (software) | Software | Mar 2015 |
|
|
Filters of users and clicks for noisy interactions in object segmentation | Software | Feb 2015 |
|
|
Pyxel, a Python library for the automatic annotation of web photos | Software | Jan 2015 |
|
|
Click'n'Cut: Online interactive segmentation | Demo | Oct 2014 |
|
|
Simulator of Clicks and Object Segmentation for Ask'n'Seek | Software | Jul 2013 |
|
|
User traces and additional simulated results from Ask'n'Seek | Results | Jul 2013 |