Javier Ruiz Hidalgo

Position![]() |
|
---|---|
Associate Professor | j.ruiz@upc.edu |
Office | Phone |
---|---|
D5-008 (Barcelona - ETSETB)TR2-103 (Terrassa - ESEIAAT) | +34 9373 98980 |
Biography
Javier Ruiz Hidalgo received a degree in Telecommunications Engineering at the Universitat Politècnica de Catalunya (UPC), Barcelona, Spain in 1997. From 1998 to 1999, he developed an MSc by Research on the field of Computer Vision by the University of East Anglia (UEA) in Norwich, UK. During 1999 he joined the Image Processing Group at UPC working on image and video indexing in the context of the MPEG-7 standard. In 2006, he received his PhD. in the field of image processing.
Since 1999 he has been involved in various European Projects as a researcher from the Image Processing Group at UPC. During 1999 and 2000 he worked in the ACTS(AC308) DICEMAN project developing new descriptors and representations for image and video sequences. Since 2001 he is also involved in the IST/FET(2000-26467) project MASCOT developing an efficient compression scheme exploiting metadata information. In 2009 he worked as principal researcher for the national project HESPERIA involved in improving the security of large infrastructures such as airports and power plants. From 2010 to 2013 he was principal researcher for the EU project FASCINATE working on interactive human computer interfaces using 3D data.
Since 2001 he is an Associate Professor at the Universitat Politècnica de Catalunya. He is currently lecturing on the area of digital signal and systems and image processing. His current research interests include 3D video coding, 3D analysis and super-resolution.
Journal Articles top
“Correspondence matching in unorganized 3D point clouds using Convolutional Neural Networks”, Image and Vision Computing, vol. 83-84, 2019.![]() |
,
“Multi-modal Deep Learning for Fuji Apple Detection Using RGB-D Cameras and their Radiometric Capabilities”, Computers and Electronics in Agriculture, vol. 162, 2019. | ,
“KFuji RGB-DS database: Fuji apple multi-modal images for fruit detection with color, depth and range-corrected IR data”, Data in Brief, 2019.![]() |
,
“Fruit Detection in an Apple Orchard Using a Mobile Terrestrial Laser Scanner”, Biosystems Engineering, vol. 187, 2019. | ,
“3D hierarchical optimization for multi-view depth map coding”, Multimedia Tools and Applications, 2017.![]() |
,
Book Chapters and Bookstop
Media Production, Delivery and Interaction for Platform Independent Systems. Wiley, 2014. | ,
“INTAIRACT: Joint Hand Gesture and Fingertip Classification for Touchless Interaction”, in Computer Vision – ECCV 2012, vol. 7585, Heidelberg: Springer, 2012, pp. 602-606.![]() |
,
“How are digital TV programs compressed to allow broadcasting?”, in Applied signal processing, 2009, pp. 311–359. | ,
“How are digital images compressed in the web?”, in Applied signal processing, 2009, pp. 265–310. | ,
“Multimodal Integration of Sensor Network”, in Artificial Intelligence Applications and Innovations, vol. 204, Boston: Springer, 2006, pp. 312–323. | ,
Conference Papers top
“Residual Attention Graph Convolutional Network for Geometric 3D Scene Classification”, in IEEE Conference on Computer Vision Workshop (ICCVW), Seoul, Korea, 2019.![]() |
,
“Segmentation-based Multi-Scale Edge Extraction to Measure the Persistence of Features in Unorganized Point Clouds”, in International Conference on Computer Vision Theory and Applications, Porto, Portugal, 2017.![]() |
,
“Spatio-Temporal Road Detection from Aerial Imagery using CNNs”, in International Conference on Computer Vision Theory and Applications, Porto, Portugal, 2017.![]() |
,
“Collaborative voting of 3D features for robust gesture estimation”, in International Conference on Acoustics, Speech and Signal Processing, New Orleans, USA, 2017.![]() |
,
“Registration of Images to Unorganized 3D Point Clouds Using Contour Cues”, in The 25th European Signal Processing Conference (EUSIPCO 2017), Kos island, Greece, 2017.![]() |
,
Theses top
“Manifold Learning for Super Resolution”, Leibniz Universität Hannover, Hannover, 2017.![]() |
,
“Multi-view depth coding based on a region representation combining color and depth information”, Universitat Politècnica de Catalunya (UPC), 2017. | ,
“Human body analysis using depth data”, Universitat Politècnica de Catalunya (UPC), 2013.![]() |
,
“On the Synergy between indexing and compression representations for video sequences”, Universitat Politècnica de Catalunya (UPC), 2006.![]() |
,
Other top
“Method for upscaling an image and apparatus for upscaling an image”, U.S. Patent US 20170132759 A12017. | ,Patent |
“The representation of images using scale trees”, University of East Anglia, 1999.![]() |
, Report |
Projects top
![]() |
A European AI On Demand Platform and Ecosystem | European | Jan 2019 | Dec 2021 |
![]() |
MALEGRA - Multimodal Signal Processing and Machine Learning on Graphs | National | Jan 2017 | Dec 2020 |
![]() |
BigGraph - Heterogeneous information and graph signal processing for the Big Data era. Application to high-throughput, remote sensing, multimedia and human computer interfaces. | National | Jan 2014 | Dec 2017 |
![]() |
SGR14 - Image and Video Processing Group | National | Jan 2014 | Apr 2017 |
![]() |
MuViPro - Multicamera Video Processing | National | Jan 2011 | Aug 2014 |
Research Areas top
![]() |
Region-based image and video processing | Internal | Jan 1992 | Dec 2020 |
![]() |
Multiview Coding | Internal | Jul 2010 | Jul 2018 |
![]() |
Multi-view/Multi-sensor scene capture, analysis and representation | Internal | Mar 2004 | Nov 2015 |
Demos and Resources top
![]() |
Correspondence matching in unorganized 3D point clouds using Convolutional Neural Networks | Software | Oct 2018 |
![]() |
Spatio-Temporal Road Detection from Aerial Imagery using CNNs - Dataset | Dataset | Dec 2016 |
![]() |
Registration of images to unorganized 3D point clouds using contour cues | Software | Sep 2016 |
![]() |
Interactive registration for 3D data fusion | Demo | Jul 2016 |
![]() |
Interactive registration for 3D data fusion | Demo | Jul 2016 |
Teaching top
Acronym | Title | Level | College |
---|---|---|---|
PAE | Advanced Project in Science and Telecommunication Technologies (CDIO) | 3rd year | Telecom BCN - ETSETB |
CM | Codificació Multimèdia | Degree in AudioVisual Systems | ESEIAAT |
VC | Computer Vision | Degree in Engineering of Audiovisual Systems | ESEIAAT |
DLAI | Deep Learning for Artificial Intelligence | Master MET | ETSETB TelecomBCN |
DLCV | Deep Learning for Computer Vision | Master in Telecommunications Engineering (MET) | ETSETB Telecom BCN |
IPSAV | Introduction to Audiovisual Signal Processing | Degree in Engineering of Audiovisual Systems | TelecomBCN, ETSETB |
ICV | Introduction to Computer Vision | Master in Telecommunications Engineering (MET) | ETSETB - Telecom BCN |
IDL | Introduction to Deep Learning | BSc | ETSETB TelecomBCN |
IHCV | M1. Introduction to Humand and Computer Vision | Master in Computer Vision (MCV) | UAB, UOC, UPC & UPF |
APA | Programming Audio-Visual Algorithms | Degree in Engineering of Audiovisual Systems | ESEIAAT |