Carles Ventura
Contributions while at GPI
Journal Articles
“RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation”, Multimedia Tools and Applications, 2022.
“Multiresolution co-clustering for uncalibrated multiview segmentation”, Signal Processing: Image Communication, 2019.
“Improving retrieval accuracy of Hierarchical Cellular Trees for generic metric spaces”, Multimedia Tools and Applications, 2013.
Book Chapters and Books
“Hierarchical Navigation and Visual Search for Video Keyframe Retrieval”, in Advances in Multimedia Modeling, vol. 7131, Springer Berlin / Heidelberg, 2012, pp. 652-654.
Conference Papers
“SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation”, in NAACL Visually Grounded Interaction and Language (ViGIL) Workshop, Virtual, 2021.
“Curriculum Learning for Recurrent Video Object Segmentation”, in ECCV 2020 Women in Computer Vision Workshop, 2020.
“RVOS: End-to-End Recurrent Network for Video Object Segmentation”, in CVPR, Long Beach, CA, USA, 2019.
“Video Object Linguistic Grounding”, in ACM Multimedia Workshop on Multimodal Understanding and Learning for Embodied Applications (MULEA), Nice, France, 2019.
“Recurrent Instance Segmentation using Sequences of Referring Expressions”, in NeurIPS Workshop on Visually Grounded Interaction and Language (ViGIL), Vancouver, Canada, 2019.
Theses
“Visual Object Analysis using Regions and Local Features”, 2016.
Other
“Object Detection with Deep Learning”, 2021. Presentation.
“Object Model Adaptation for Multiple Object Tracking”, 2021. Report.
“Curriculum Learning for Recurrent Video Object Segmentation”, 2020. Master's Thesis.
“Image Segmentation with Deep Learning”, 2020. Presentation.
“Recurrent Instance Segmentation with Linguistic Referring Expressions”, 2019. Master's Thesis.
Projects
BigGraph - Heterogeneous information and graph signal processing for the Big Data era. Application to high-throughput, remote sensing, multimedia and human computer interfaces. | National | Jan 2014 | Dec 2017
SGR14 - Image and Video Processing Group | National | Jan 2014 | Apr 2017
MuViPro - Multicamera Video Processing | National | Jan 2011 | Aug 2014
SGR09 - Processament de Video Multicamera | National | Oct 2009 | Dec 2013
Buscamedia - Hacia una adaptación semántica de medios digitales multi-red multi-terminal | National | Jan 2010 | Dec 2012
Research Areas
Multimedia Retrieval | Internal | Sep 2001 | Dec 2018
Demos and Resources
Automatic Keyframe Selection over TVC database | Results | Jun 2013
GOS, Graphical Object Searcher | Software | Jul 2012
Teaching activity
Acronym | Title | Level | College |
---|---|---|---|
AIDL | Artificial Intelligence with Deep Learning | Postgraduate | UPC School |