Search Publication
Video Object Linguistic Grounding. In ACM Multimedia Workshop on Multimodal Understanding and Learning for Embodied Applications (MULEA). Nice, France: ACM; 2019. (441.12 KB)
. Mode dependent vector quantization with a rate-distortion optimized codebook for residue coding in video compression. In IEEE Int. Conf. on Acoustics Speech and Signal Processing, ICASSP 2015. Brisbane, Australia: IEEE; 2015. (287.4 KB)
. Study of Manifold Geometry using Multiscale Non-Negative Kernel Graphs. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece; 2023. (1.4 MB)
. An investigation of eye gaze tracking utilities in image object recognition. . 2014. (1.63 MB)
. UPC System for the 2015 MediaEval Multimodal Person Discovery in Broadcast TV task. In MediaEval 2015 Workshop. Wurzen, Germany; 2015. (163.11 KB)
. UPC System for the 2016 MediaEval Multimodal Person Discovery in Broadcast TV task. In MediaEval 2016 Workshop. Hilversum, The Netherlands; 2016. (174.87 KB)
. UPC Multimodal Speaker Diarization System for the 2018 Albayzin Challenge. In IberSpeech 2018. Barcelona; 2018. (379.14 KB)
. Efficient strategies for navigation through very large JPEG2000 image. . Université Catholique de Louvain (UCL); 2008.
. PROMEDS: An adaptive robust fundamental matrix estimation approach. In 3DTV Conference. Zurich, Switzerland: IEEE; 2012. (576.37 KB)
. CNN-based bacilli detection in sputum samples for tuberculosis diagnosis. In International Symposium on Biomedical Imaging (ISBI 2019). 2019.
. Towards video alignment across cameras with sign language 2D poses. . 2021. (3.45 MB)
. .
Registration of Multi-Modal Neuroimaging Datasets by Considering the Non-Overlapping Field of View into the NMI Calculation. In IEEE International Symposium on Biomedical Imaging, ISBI 2012. Barcelona, Spain; 2012.
. Class Weighted Convolutional Features for Visual Instance Search. In 28th British Machine Vision Conference (BMVC). London, UK; 2017. (3.56 MB)
. Breast Cancer Molecular Subtyping from H&E Whole Slide Images using Foundation Models and Transformers. In Deep Breast Workshop on AI and Imaging for Diagnostic and Treatment Challenges in Breast Care, MICCAI 2024. In Press.
. . . Una Enginyeria per a la Societat del Coneixement. In II Congrés d'Enginyeria en Llengua Catalana. 2004.
. .
SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation. In NAACL Visually Grounded Interaction and Language (ViGIL) Workshop. Virtual; 2021. (794.97 KB)
. Gesture controlled interactive rendering in a panoramic scene. In European Interactive TV Conference, EuroITV. Como, Italy; 2013. (466.78 KB)
. Standardized Assessment of Automatic Segmentation of White Matter Hyperintensities; Results of the WMH Segmentation Challenge. IEEE Transactions on Medical Imaging. 2019;.
. Cooperative background modelling using multiple cameras towards human detection in smart-room. In 14th European Signal Processing Conference. 2006. pp. 1–5.
. A Unified Framework for Consistent 2D/3D Foreground Object Detection. . Universitat Politècnica de Catalunya (UPC); 2008.
. A Unified Framework for Consistent 2D/3D Foreground Object Detection. IEEE transactions on circuits and systems for video technology. 2008;18:1040–1051.
.