Graph Convolutional Neural Networks for 3D Data Analysis. Signal Theory and Communications. [Barcelona]: Universitat Politècnica de Catalunya; 2023.
SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters. In IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR). New Orleans, USA; 2022.
2D–3D Geometric Fusion network using Multi-Neighbourhood Graph Convolution for RGB-D indoor scene classification. Information Fusion. 2021;76.
The CHIL Audiovisual Corpus for Lecture and Meeting Analysis inside Smart Rooms. Language resources and evaluation. 2007;41(3):389–407.
Simbad: a tool for speech analysis and synthesis. In IASTED INT. CONF. SIGNAL PROC. & DIG. FILT. 1990.
Multi-view Body Tracking with a Detector-Driven Hierarchical Particle Filter. In 7th International Conference AMDO 2012. Port d'Andratx, Mallorca: Springer; 2012.
Multi-view Body Tracking with a Detector-Driven Hierarchical Particle Filter. In: Lecture Notes in Computer Science: Articulated Motion and Deformable Objects. Berlin / Heidelberg: Springer; 2012. pp. 82–91.
Multimodal Integration of Sensor Network. In Artificial Intelligence Applications and Innovations. Boston: Springer; 2006. pp. 312–323.
Integration of audiovisual sensors and technologies in a smart room. Personal and ubiquitous computing. 2009;13:15–23.
Multimodal Integration of Sensor Network. In Proceedings of 3rd IFIP Conference on Artificial Intelligence Applications & Innovations. Athens, Greece: Springer; 2006.
NII-HITACHI-UIT at TRECVID 2015 Instance Search. In TRECVID 2015 Workshop. Gaithersburg, MD, USA: NIST; 2015.
Advanced visual rendering, gesture-based interaction and distributed delivery for immersive and interactive media services. In International Broadcasting Convention 2011. 2011. pp. 1–8.
Towards A Format-agnostic Approach for Production, Delivery and Rendering of Immersive Media. In ACM Multimedia Systems. Oslo, Norway; 2013.
Activity Classification. In Computers in the Human Interaction Loop. London: Springer; 2009. pp. 107–119.
PiCoEDL: Discovery and Learning of Minecraft Navigation Goals from Pixels and Coordinates. In CVPR 2021 Embodied AI Workshop. 2021.
Video Saliency Prediction with Deep Neural Networks. 2019.
Discovery and Learning of Navigation Goals from Pixels in Minecraft. 2021.
Unsupervised Skill-Discovery and Skill-Learning in Minecraft. In ICML 2021 Workshop on Unsupervised Reinforcement Learning (URL). 2021.
A contour-based approach to binary shape coding using a multiple grid chain code. Signal processing: image communication. 2000;15:585–599.
Region and object segmentation algorithms in the Qimera segmentation platform. In Third International Workshop on Content-Based Multimedia Indexing. 2003. pp. 95–103.
Media Aesthetics Based Multimedia Storytelling. Universitat Politècnica de Catalunya (UPC); 2011.
Tou - Brazo Robot Asistencial: Control Verbal. In 2º Congreso de la Asociación Española de Robótica. 1991. pp. 93–100.