Search Publication
Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network. Sensors. 2019;19(8). (4.75 MB)
. Disentangling Motion, Foreground and Background Features in Videos. In CVPR 2017 Workshop Brave New Motion Representations. 2017. (370.64 KB)
. Temporally Coherent 3D Point Cloud Video Segmentation in Generic Scenes. IEEE Transactions on Image Processing. 2018;27(6):3087 - 3099. (24.37 MB)
. 3D Point Cloud Video Segmentation Based on Interaction Analysis. In ECCV 2016: Computer Vision – ECCV 2016 Workshops. Amsterdam: Springer; 2016. pp. 821 - 835. (10 MB)
. Semantic and Generic Object Segmentation for Scene Analysis Using RGB-D Data. . Signal Theory and Communications (TSC). [download link]: Universitat Politècnica de Catalunya (UPC); 2018.
. 3D Point Cloud Segmentation Using a Fully Connected Conditional Random Field. In The 25th European Signal Processing Conference (EUSIPCO 2017). Kos island, Greece: Eurasip/IEEE; 2017. (2.34 MB)
. 3D Point Cloud Segmentation Oriented to The Analysis of Interactions. In The 24th European Signal Processing Conference (EUSIPCO 2016). Budapest, Hungary: Eurasip; 2016. (10.54 MB)
. Graph based Dynamic Segmentation of Generic Objects in 3D. In CVPR SUNw: Scene Understanding Workshop. Las Vegas, US; 2016. (956.15 KB)
. One Shot Learning for Generic Instance Segmentation in RGBD Videos. In International Conference on Computer Vision, Theory and Applications. Prague: SciTePress; 2019. (1.64 MB)
. UPC-UB-STP @ MediaEval 2015 Diversity Task: Iterative Reranking of Relevant Images. In MediaEval 2015 Workshop. 2015. (158.17 KB)
. Semantic and Diverse Summarization of Egocentric Photo Events. . 2015. (5.34 MB)
. Semantic Summarization of Egocentric Photo Stream Events. In ACM Multimedia 2017 Workshop on Lifelogging Tools and Applications. Mountain View, CA, USA: ACM; 2017. (3.08 MB)
. Region-based caption text extraction. In Lecture Notes in Electrical Engineering (Analysis, Retrieval and Delivery of Multimedia Content). New York: Springer; 2013. pp. 21-36.
. Caption text extraction for indexing purposes using a hierarchical region-based image model. In 16th International Conference on Image Processing. 2009. pp. 1869–1872.
. Region-based caption text extraction. In 11th. International Workshop on Image Analysis for Multimedia Application Services. 2010. pp. 1–4.
. Towards large scale multimedia indexing: A case study on person discovery in broadcast news. In International Workshop on Content-Based Multimedia Indexing - CBMI 2017. Firenze, Italy; 2017. (831.12 KB)
. Cooperative background modelling using multiple cameras towards human detection in smart-room. In 14th European Signal Processing Conference. 2006. pp. 1–5.
. A Unified Framework for Consistent 2D/3D Foreground Object Detection. . Universitat Politècnica de Catalunya (UPC); 2008.
. A Unified Framework for Consistent 2D/3D Foreground Object Detection. IEEE transactions on circuits and systems for video technology. 2008;18:1040–1051.
. Reconstruction of 3D shapes considering inconsistent 2D silhouettes. In International Conference on Image Processing. 2006. pp. 1–4.
. Shape from Inconsistent Silhouette for Free Viewpoint Video. In IEEE International Conference on Image Processing. 2008. pp. 213–216.
. HMM recognition of expressions in unrestrained video intervals. In International conference on Acoustics, Speech, and Signal Processing. 2003. pp. 197–200.
. Robust Tracking and Object Classification Towards Automated Video Surveillance. In International Conference on Image Analysis and Recognition. 2004. pp. 46333–470.
. Robust Tracking and Object Classification Towards Automated Video Surveillance. Lecture notes in computer science. 2004;3212:463–470.
. Foreground Regions Extraction and Characterization Towards Real-Time Object Tracking. In 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms. 2005. pp. 241–249.
.