Josep R. Casas

Position![]() |
|
---|---|
Associate Professor | josep.ramon.casas@upc.edu |
Office | Phone |
---|---|
D5-213 | +34 934 054 002 |
Biography
Josep R. Casas has been principal investigator of projects MuViPro and PROVEC (Plan Nacional de I+D+i, 2007-2014), and led or contributed to several European Framework Program projects (FascinatE, ACTIBIO, CHIL, SCHEMA, ADViSOR) and industry-sponsored projects (VISION, HESPERIA, D'Ocon) since 1995. He was a visiting researcher at CSIRO in Canberra, Australia in 2001, and served as Finance Chair for the IEEE conference ICIP 2003.
Josep's current research interests focus on multi-view analysis and representation from multiple sensor data: 3D reconstruction and analysis, human body tracking, model-based human motion analysis and gesture recognition. Applications are in the fields of human interaction and audio-visual communication.
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
Journal Articles top
“Correspondence matching in unorganized 3D point clouds using Convolutional Neural Networks”, Image and Vision Computing, vol. 83-84, 2019.![]() |
,
“Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network”, Sensors, vol. 19, no. 8, 2019.![]() |
,
“Temporally Coherent 3D Point Cloud Video Segmentation in Generic Scenes”, IEEE Transactions on Image Processing, vol. 27, no. 6, pp. 3087 - 3099, 2018.![]() |
,
“A closed-loop approach for tracking a humanoid robot using particle filtering and depth data”, Intelligent Service Robotics, vol. 10, no. 4, pp. 297–312, 2017.![]() |
,
“Real-time Fingertip Localization Conditioned on Hand Gesture Classification”, Image and Vision Computing, vol. 32, no. 8, pp. 522 - 532, 2014.![]() |
,
Book Chapters and Bookstop
“Multi-view Body Tracking with a Detector-Driven Hierarchical Particle Filter”, in Lecture Notes in Computer Science: Articulated Motion and Deformable Objects, vol. 7378, Berlin / Heidelberg: Springer , 2012, pp. 82-91. | ,
“INTAIRACT: Joint Hand Gesture and Fingertip Classification for Touchless Interaction”, in Computer Vision – ECCV 2012, vol. 7585, Heidelberg: Springer, 2012, pp. 602-606.![]() |
,
“Skeleton and shape adjustment and tracking in multicamera environments”, in Lecture notes in computer science, vol. 6169/2010, Berlin / Heidelberg: Springer, 2010, pp. 88–97. | ,
“Computers in the Human Interaction Loop”, in Handbook on Ambient Intelligence and Smart Environments (AISE), Boston, MA: Springer, 2010, pp. 1071–1116. | ,
“The Memory Jog Service”, in Computers in the Human Interaction Loop, London: Springer, 2009, pp. 207–234. | ,
Conference Papers top
“One Shot Learning for Generic Instance Segmentation in RGBD Videos”, in International Conference on Computer Vision, Theory and Applications, Prague, 2019.![]() |
,
“HybridNet for Depth Estimation and Semantic Segmentation”, in ICASSP 2018, Calgary, Alberta, Canada, 2018.![]() |
,
“SLAM-based 3D outdoor reconstructions from LIDAR data”, in IC3D, Brussels, Belgium, 2018.![]() |
,
“Segmentation-based Multi-Scale Edge Extraction to Measure the Persistence of Features in Unorganized Point Clouds”, in International Conference on Computer Vision Theory and Applications, Porto, Portugal, 2017.![]() |
,
“Collaborative voting of 3D features for robust gesture estimation”, in International Conference on Acoustics, Speech and Signal Processing, New Orleans, USA, 2017.![]() |
,
Theses top
“Semantic and Generic Object Segmentation for Scene Analysis Using RGB-D Data”, Universitat Politècnica de Catalunya (UPC), download link, 2018. | ,
“Stochastic optimization and interactive machine learning for human motion analysis”, UPC, Barcelona, 2014.![]() |
,
“Human body analysis using depth data”, Universitat Politècnica de Catalunya (UPC), 2013.![]() |
,
“Articulated Models for Human Motion Analysis”, Universitat Politècnica de Catalunya (UPC), 2012.![]() |
,
“Media Aesthetics Based Multimedia Storytelling”, Universitat Politècnica de Catalunya (UPC), 2011.![]() |
,
Other top
“Sistema de gestió de vídeo off-line per una smart-room”. 2007.![]() |
, Ms Thesis |
Projects top
![]() |
MALEGRA - Multimodal Signal Processing and Machine Learning on Graphs | National | Jan 2017 | Dec 2020 |
![]() |
Image processing for Plasma Facing Components protection | European | Jan 2019 | Dec 2020 |
![]() |
BigGraph - Heterogeneous information and graph signal processing for the Big Data era. Application to high-throughput, remote sensing, multimedia and human computer interfaces. | National | Jan 2014 | Dec 2017 |
![]() |
Camomile - Collaborative Annotation of multi-MOdal, MultI-Lingual and multi-mEdia documents | European | Feb 2013 | Aug 2017 |
![]() |
SGR14 - Image and Video Processing Group | National | Jan 2014 | Apr 2017 |
Research Areas top
![]() |
Multi-view/Multi-sensor scene capture, analysis and representation | Internal | Mar 2004 | Nov 2015 |
Demos and Resources top
![]() |
Correspondence matching in unorganized 3D point clouds using Convolutional Neural Networks | Software | Oct 2018 |
![]() |
Temporally Coherent 3D Point Cloud Video Segmentation in Generic Scenes | Results | Aug 2017 |
![]() |
3D Point Cloud Segmentation using a Fully Connected Conditional Random Field | Results | Mar 2017 |
![]() |
Registration of images to unorganized 3D point clouds using contour cues | Software | Sep 2016 |
![]() |
Interactive registration for 3D data fusion | Demo | Jul 2016 |
Teaching top
Acronym | Title | Level | College |
---|---|---|---|
TAD | Analog and Digital Television Systems | Telecommunications Engineering, 5 years degree | TelecomBCN, ETSETB |
TPA | Audiovisual Production Technology | Bachelor's degree in Telecommunications Technologies and Services Engineering (Major: Audiovisual Systems) | TelecomBCN, ETSETB |
K3D | K3D: 3D with Kinect | Degree in Engineering of Audiovisual Systems | TelecomBCN, ETSETB |
VA | Video Analysis | Master in Computer Vision (MCV) | UAB, UOC, UPC & UPF |