Javier Ruiz Hidalgo

Biography
Javier Ruiz Hidalgo received a degree in Telecommunications Engineering from the Universitat Politècnica de Catalunya (UPC), Barcelona, Spain, in 1997. From 1998 to 1999, he completed an MSc by Research in the field of Computer Vision at the University of East Anglia (UEA) in Norwich, UK. In 1999 he joined the Image Processing Group at UPC, working on image and video indexing in the context of the MPEG-7 standard. In 2006, he received his PhD in the field of image processing.
Since 1999 he has been involved in various European projects as a researcher in the Image Processing Group at UPC. During 1999 and 2000 he worked on the ACTS (AC308) DICEMAN project, developing new descriptors and representations for image and video sequences. Since 2001 he has also been involved in the IST/FET (2000-26467) project MASCOT, developing an efficient compression scheme exploiting metadata information. In 2009 he worked as principal researcher for the national project HESPERIA, which focused on improving the security of large infrastructures such as airports and power plants. From 2010 to 2013 he was principal researcher for the EU project FASCINATE, working on interactive human-computer interfaces using 3D data.
Since 2001 he has been an Associate Professor at the Universitat Politècnica de Catalunya. He currently lectures in the areas of digital signals and systems and image processing. His current research interests include 3D video coding, 3D analysis and super-resolution.
Journal Articles
“Fruit detection and 3D location using instance segmentation neural networks and structure-from-motion photogrammetry”, Computers and Electronics in Agriculture, vol. 169, 2020.
“Fuji-SfM dataset: A collection of annotated images and point clouds for Fuji apple detection and location using structure-from-motion photogrammetry”, Data in Brief, vol. 30, 2020.
“FuCiTNet: Improving the generalization of deep learning networks by the fusion of learned class-inherent transformations”, Information Fusion, vol. 63, no. 195, 2020.
“Correspondence matching in unorganized 3D point clouds using Convolutional Neural Networks”, Image and Vision Computing, vol. 83-84, 2019.
“Multi-modal Deep Learning for Fuji Apple Detection Using RGB-D Cameras and their Radiometric Capabilities”, Computers and Electronics in Agriculture, vol. 162, 2019.
Book Chapters and Books
Media Production, Delivery and Interaction for Platform Independent Systems. Wiley, 2014.
“INTAIRACT: Joint Hand Gesture and Fingertip Classification for Touchless Interaction”, in Computer Vision – ECCV 2012, vol. 7585, Heidelberg: Springer, 2012, pp. 602–606.
“How are digital TV programs compressed to allow broadcasting?”, in Applied signal processing, 2009, pp. 311–359.
“How are digital images compressed in the web?”, in Applied signal processing, 2009, pp. 265–310.
“Multimodal Integration of Sensor Network”, in Artificial Intelligence Applications and Innovations, vol. 204, Boston: Springer, 2006, pp. 312–323.
Conference Papers
“Residual Attention Graph Convolutional Network for Geometric 3D Scene Classification”, in IEEE Conference on Computer Vision Workshop (ICCVW), Seoul, Korea, 2019.
“Segmentation-based Multi-Scale Edge Extraction to Measure the Persistence of Features in Unorganized Point Clouds”, in International Conference on Computer Vision Theory and Applications, Porto, Portugal, 2017.
“Spatio-Temporal Road Detection from Aerial Imagery using CNNs”, in International Conference on Computer Vision Theory and Applications, Porto, Portugal, 2017.
“Collaborative voting of 3D features for robust gesture estimation”, in International Conference on Acoustics, Speech and Signal Processing, New Orleans, USA, 2017.
“Registration of Images to Unorganized 3D Point Clouds Using Contour Cues”, in The 25th European Signal Processing Conference (EUSIPCO 2017), Kos island, Greece, 2017.
Theses
“Learning to extract features for 2D-3D multimodal registration”, Universitat Politècnica de Catalunya (UPC), 2020.
“Manifold Learning for Super Resolution”, Leibniz Universität Hannover, Hannover, 2017.
“Multi-view depth coding based on a region representation combining color and depth information”, Universitat Politècnica de Catalunya (UPC), 2017.
“Human body analysis using depth data”, Universitat Politècnica de Catalunya (UPC), 2013.
“On the Synergy between indexing and compression representations for video sequences”, Universitat Politècnica de Catalunya (UPC), 2006.
Other
“Method for upscaling an image and apparatus for upscaling an image”, U.S. Patent US 20170132759 A1, 2017. | Patent |
“The representation of images using scale trees”, University of East Anglia, 1999. | Report |
Projects
A European AI On Demand Platform and Ecosystem | European | Jan 2019 | Dec 2021 |
MALEGRA - Multimodal Signal Processing and Machine Learning on Graphs | National | Jan 2017 | Jun 2021 |
BigGraph - Heterogeneous information and graph signal processing for the Big Data era. Application to high-throughput, remote sensing, multimedia and human computer interfaces. | National | Jan 2014 | Dec 2017 |
SGR14 - Image and Video Processing Group | National | Jan 2014 | Apr 2017 |
MuViPro - Multicamera Video Processing | National | Jan 2011 | Aug 2014 |
Research Areas
Region-based image and video processing | Internal | Jan 1992 | Dec 2020 |
Multiview Coding | Internal | Jul 2010 | Jul 2018 |
Multi-view/Multi-sensor scene capture, analysis and representation | Internal | Mar 2004 | Nov 2015 |
Demos and Resources
Correspondence matching in unorganized 3D point clouds using Convolutional Neural Networks | Software | Oct 2018 |
Spatio-Temporal Road Detection from Aerial Imagery using CNNs - Dataset | Dataset | Dec 2016 |
Registration of images to unorganized 3D point clouds using contour cues | Software | Sep 2016 |
Interactive registration for 3D data fusion | Demo | Jul 2016 |
Teaching
Acronym | Title | Level | College |
---|---|---|---|
VA | Video Analysis | Master in Computer Vision (MCV) | UAB, UOC, UPC & UPF |