Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks. In ICASSP. Brighton, UK: IEEE; 2019.
Deep Learning that Scales: Leveraging Compute and Data. Computer Architecture. [Barcelona, Catalonia]: Universitat Politècnica de Catalunya; 2020.
Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills. In International Conference on Machine Learning (ICML) 2020. 2020.
Mask-guided sample selection for Semi-Supervised Instance Segmentation. Multimedia Tools and Applications. 2020.
How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language. In CVPR 2021. 2021.
Image and Video Object Segmentation in Low Supervision Scenarios. Computer Architectures. [Barcelona]: Universitat Politècnica de Catalunya; 2021.
Data and methods for a visual understanding of sign languages. Signal Theory and Communications. 2022.
RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation. Multimedia Tools and Applications. 2022.
Tackling Low-Resourced Sign Language Translation: UPC at WMT-SLT 22. In EMNLP 2022 Seventh Conference on Machine Translation (WMT22). 2022.
Topic Detection in Continuous Sign Language Videos. In Accessibility, Vision, and Autonomy Meet (AVA) CVPR Workshop. 2022.
Sign Language Translation from Instructional Videos. In CVPR 2023 Women in Computer Vision Workshop. Vancouver, Canada: Computer Vision Foundation / IEEE; 2023.