Search Publication
Video Object Linguistic Grounding. In ACM Multimedia Workshop on Multimodal Understanding and Learning for Embodied Applications (MULEA). Nice, France: ACM; 2019.
(441.12 KB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
Mode dependent vector quantization with a rate-distortion optimized codebook for residue coding in video compression. In IEEE Int. Conf. on Acoustics Speech and Signal Processing, ICASSP 2015. Brisbane, Australia: IEEE; 2015.
(287.4 KB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
Study of Manifold Geometry using Multiscale Non-Negative Kernel Graphs. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece; 2023.
(1.4 MB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
An investigation of eye gaze tracking utilities in image object recognition. . 2014.
(1.63 MB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
UPC System for the 2015 MediaEval Multimodal Person Discovery in Broadcast TV task. In MediaEval 2015 Workshop. Wurzen, Germany; 2015.
(163.11 KB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
UPC System for the 2016 MediaEval Multimodal Person Discovery in Broadcast TV task. In MediaEval 2016 Workshop. Hilversum, The Netherlands; 2016.
(174.87 KB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
UPC Multimodal Speaker Diarization System for the 2018 Albayzin Challenge. In IberSpeech 2018. Barcelona; 2018.
(379.14 KB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
Efficient strategies for navigation through very large JPEG2000 image. . Université Catholique de Louvain (UCL); 2008.
. PROMEDS: An adaptive robust fundamental matrix estimation approach. In 3DTV Conference. Zurich, Switzerland: IEEE; 2012.
(576.37 KB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
CNN-based bacilli detection in sputum samples for tuberculosis diagnosis. In International Symposium on Biomedical Imaging (ISBI 2019). 2019.
. Towards video alignment across cameras with sign language 2D poses. . 2021.
(3.45 MB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
.
Registration of Multi-Modal Neuroimaging Datasets by Considering the Non-Overlapping Field of View into the NMI Calculation. In IEEE International Symposium on Biomedical Imaging, ISBI 2012. Barcelona, Spain; 2012.
. Class Weighted Convolutional Features for Visual Instance Search. In 28th British Machine Vision Conference (BMVC). London, UK; 2017.
(3.56 MB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
Breast Cancer Molecular Subtyping from H&E Whole Slide Images using Foundation Models and Transformers. In Deep Breast Workshop on AI and Imaging for Diagnostic and Treatment Challenges in Breast Care, MICCAI 2024. In Press.
. . . Una Enginyeria per a la Societat del Coneixement. In II Congrés d'Enginyeria en Llengua Catalana. 2004.
. .
SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation. In NAACL Visually Grounded Interaction and Language (ViGIL) Workshop. Virtual; 2021.
(794.97 KB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
Gesture controlled interactive rendering in a panoramic scene. In European Interactive TV Conference, EuroITV. Como, Italy; 2013.
(466.78 KB)
. ![application/pdf](/web/modules/file/icons/application-pdf.png)
Standardized Assessment of Automatic Segmentation of White Matter Hyperintensities; Results of the WMH Segmentation Challenge. IEEE Transactions on Medical Imaging. 2019;.
. Cooperative background modelling using multiple cameras towards human detection in smart-room. In 14th European Signal Processing Conference. 2006. pp. 1–5.
. A Unified Framework for Consistent 2D/3D Foreground Object Detection. . Universitat Politècnica de Catalunya (UPC); 2008.
. Reconstruction of 3D shapes considering inconsistent 2D silhouettes. In International Conference on Image Processing. 2006. pp. 1–4.
.