Javier Ruiz Hidalgo

Position	e-mail
Associate Professor	j.ruiz@upc.edu

Office	Phone
D5-008 (Barcelona - ETSETB) TR2-103 (Terrassa - ESEIAAT)	+34 9373 98980

Journals

Biography

Javier Ruiz Hidalgo received a degree in Telecommunications Engineering at the Universitat Politècnica de Catalunya (UPC), Barcelona, Spain in 1997. From 1998 to 1999, he developed an MSc by Research on the field of Computer Vision by the University of East Anglia (UEA) in Norwich, UK. During 1999 he joined the Image Processing Group at UPC working on image and video indexing in the context of the MPEG-7 standard. In 2006, he received his PhD. in the field of image processing.

Since 1999 he has been involved in various European Projects as a researcher from the Image Processing Group at UPC. During 1999 and 2000 he worked in the ACTS(AC308) DICEMAN project developing new descriptors and representations for image and video sequences. Since 2001 he is also involved in the IST/FET(2000-26467) project MASCOT developing an efficient compression scheme exploiting metadata information. In 2009 he worked as principal researcher for the national project HESPERIA involved in improving the security of large infrastructures such as airports and power plants. From 2010 to 2013 he was principal researcher for the EU project FASCINATE working on interactive human computer interfaces using 3D data. During 2017 to 2021 he was the principal researcher for the national project MALEGRA developing tools combining graph signal representation and processing ideas with machine learning technology.

Since 2001 he is an Associate Professor at the Universitat Politècnica de Catalunya. He is currently lecturing on the area of digital signal and systems, image processing and computer vision. His current research interests include 3D video coding and analysis, graph neural networks, conditional generative networks and super-resolution.

Journal Articles top

2024
J. Gené-Mola, Ferrer-Ferrer, M., Hemming, J., Dalfsen, P., Hoog, D., Sanz-Cortiella, R., Rosell-Polo, J. R., Morros, J. R., Vilaplana, V., Ruiz-Hidalgo, J., and Gregorio, E., “AmodalAppleSize_RGB-D dataset: RGB-D images of apple trees annotated with modal and amodal segmentation masks for fruit detection, visibility and size estimation”, Data in Brief, vol. 52, 2024. DOI Google Scholar BibTex

2023
M. Ferrer-Ferrer, Ruiz-Hidalgo, J., Gregorio, E., Vilaplana, V., Morros, J. R., and Gené-Mola, J., “Simultaneous Fruit Detection and Size Estimation Using Multitask Deep Neural Networks  ”, Biosystems Engineering, vol. 233, pp. 63-75, 2023. DOI Google Scholar BibTex (10.36 MB)
J. Gené-Mola, Ferrer-Ferrer, M., Gregorio, E., Blok, P. M., Hemming, J., Morros, J. R., Rosell-Polo, J. R., Vilaplana, V., and Ruiz-Hidalgo, J., “Looking behind occlusions: A study on amodal segmentation for robust on-tree apple fruit size estimation”, Computers and Electronics in Agriculture, vol. 209, 2023. DOI Google Scholar BibTex (9.02 MB)

2021
A. Mosella-Montoro and Ruiz-Hidalgo, J., “2D–3D Geometric Fusion network using Multi-Neighbourhood Graph Convolution for RGB-D indoor scene classification”, Information Fusion, vol. 76, 2021. DOI Google Scholar BibTex (771.86 KB)

2020
J. Gené-Mola, Sanz, R., Rosell-Polo, J. R., Morros, J. R., Ruiz-Hidalgo, J., Vilaplana, V., and Gregorio, E., “Fruit detection and 3D location using instance segmentation neural networks and structure-from-motion photogrammetry”, Computers and Electronics in Agriculture, vol. 169, 2020. DOI Google Scholar BibTex

see all

Book Chapters and Bookstop

2014
O. Schreer, Macq, J., Niamut, O., Ruiz-Hidalgo, J., Shirley, B., Thallinger, G., and Thomas, G., Media Production, Delivery and Interaction for Platform Independent Systems, vol. ISBN 978-1-118-60533-2. Wiley, ISBN 978-1-118-60533-2, 2014. Google Scholar BibTex

2012
X. Suau, Alcoverro, M., López-Méndez, A., Ruiz-Hidalgo, J., and Casas, J., “INTAIRACT: Joint Hand Gesture and Fingertip Classification for Touchless Interaction”, in Computer Vision – ECCV 2012, vol. 7585, Heidelberg: Springer, 2012, pp. 602-606. DOI Google Scholar BibTex (214.4 KB)

2009
F. Marqués, Menezes, M., and Ruiz-Hidalgo, J., “How are digital TV programs compressed to allow broadcasting?”, in Applied signal processing, T. Dutoit and Marqués, F. 2009, pp. 311–359. Google Scholar BibTex
F. Marqués, Menezes, M., and Ruiz-Hidalgo, J., “How are digital images compressed in the web?”, in Applied signal processing, T. Dutoit and Marqués, F. 2009, pp. 265–310. Google Scholar BibTex

2006
J. Neumann, Casas, J., Macho, D., and Ruiz-Hidalgo, J., “Multimodal Integration of Sensor Network”, in Artificial Intelligence Applications and Innovations, vol. 204, Boston: Springer, 2006, pp. 312–323. DOI Google Scholar BibTex

Conference Papers top

2023
C. Hurtado, Shekkizhar, S., Ruiz-Hidalgo, J., and Ortega, A., “Study of Manifold Geometry using Multiscale Non-Negative Kernel Graphs”, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023. DOI Google Scholar BibTex (1.4 MB)
J. Gené-Mola, Felip-Pomés, M., Net-Barnés, F., Morros, J. R., Miranda, J. C., J. Satorra, A., L. Jones, A., J. Sanahuja, L., Ruiz-Hidalgo, J., and Gregorio, E., “Video-Based Fruit Detection and Tracking for Apple Counting and Mapping”, in IEEE International Workshop on Metrology for Agriculture and Forestry (MetroAgriFor), 2023. DOI Google Scholar BibTex (680.49 KB)

2022
D. Bonet, Ortega, A., Ruiz-Hidalgo, J., and Shekkizhar, S., “Channel Redundancy and Overlap in Convolutional Neural Networks with Channel-Wise NNK Graphs”, in International Conference on Acoustics, Speech and Signal Processing, 2022. Google Scholar BibTex (1.13 MB)
A. Mosella-Montoro and Ruiz-Hidalgo, J., “SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters”, in IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), New Orleans, USA, 2022. Google Scholar BibTex (5.45 MB)

2021
D. Bonet, Ortega, A., Ruiz-Hidalgo, J., and Shekkizhar, S., “Channel-Wise Early Stopping without a Validation Set via NNK Polytope Interpolation”, in Asia Pacific Signal and Information Processing Association Annual Summit, APSIPA, Tokyo, Japan, 2021. Google Scholar BibTex (995.84 KB)

see all

Theses top

2023
A. Mosella-Montoro, “Graph Convolutional Neural Networks for 3D Data Analysis”, Universitat Politècnica de Catalunya, Barcelona, 2023. Google Scholar BibTex

2020
A. Pujol-Miró, “Learning to extract features for 2D-3D multimodal registration”, Universitat Politècnica de Catalunya (UPC), 2020. Google Scholar BibTex (14.22 MB)

2017
E. Perez-Pellitero, “Manifold Learning for Super Resolution”, Leibniz Universität Hannover, Hannover, 2017. Google Scholar BibTex (18.6 MB)
M. Maceira, “Multi-view depth coding based on a region representation combining color and depth information”, Universitat Politècnica de Catalunya (UPC), 2017. Google Scholar BibTex (15.24 MB)

2013
X. Suau, “Human body analysis using depth data”, Universitat Politècnica de Catalunya (UPC), 2013. Google Scholar BibTex (10.67 MB)

see all

Other top

2022
R. Terradas, Domingo, P., Grau, M., Alarcón, E., and Ruiz-Hidalgo, J., “A method, system and computer programs to automatically transform an image”, 2022. Google Scholar BibTex	Patent

2021
D. Bonet, “Improved Neural Network Generalization using Channel-Wise NNK Graph Constructions”. Final Year Project, UPC, 2021. Google Scholar BibTex (3.67 MB)	Unpublished

2018
E. Perez-Pellitero, Salvador, J., Ruiz-Hidalgo, J., and Rosenhahn, B., “Method for upscaling an image and apparatus for upscaling an image”, U.S. Patent US 20170132759 A12018. Google Scholar BibTex	Patent

1999
J. Ruiz-Hidalgo, “The representation of images using scale trees”, University of East Anglia, 1999. Google Scholar BibTex (2.33 MB)	Report

Projects top

AI-Enhanced Fiber-Wireless Optical 6G Network in Support for Connected Mobility	European	Jan 2024	Dec 2026
DeeLight: Efficient Deep Learning for Video Sequences and Point Clouds	National	Sep 2021	Aug 2025
Generación automática de estadísticas y vídeos para la mejora deportiva de equipos semiprofesionales mediante IA	Other	Jan 2023	Jun 2024
Artificial Intelligence for Magnetic Devices in Quantum and Neuromorphic Computing	National	Jan 2023	Dec 2023
A European AI On Demand Platform and Ecosystem	European	Jan 2019	Dec 2021

see all

Research Areas top

Region-based image and video processing	Internal	Jan 1992	Dec 2020
Multiview Coding	Internal	Jul 2010	Jul 2018
Multi-view/Multi-sensor scene capture, analysis and representation	Internal	Mar 2004	Nov 2015

Demos and Resources top

Correspondence matching in unorganized 3D point clouds using Convolutional Neural Networks	Software	Oct 2018
Spatio-Temporal Road Detection from Aerial Imagery using CNNs - Dataset	Dataset	Dec 2016
Registration of images to unorganized 3D point clouds using contour cues	Software	Sep 2016
Interactive registration for 3D data fusion	Demo	Jul 2016
Interactive registration for 3D data fusion	Demo	Jul 2016

see all

Teaching top

Acronym	Title	Level	College
PAE	Advanced Project in Science and Telecommunication Technologies (CDIO)	3rd year	Telecom BCN - ETSETB
IHCV	C1. Introduction to Humand and Computer Vision	Master in Computer Vision (MCV)	UAB, UB, UOC, UPC & UPF
CVDL	Computer Vision with Deep Learning	Master in Telecommunications Engineering (MET)	ETSETB - Telecom BCN
IPSAV	Introduction to Audiovisual Signal Processing	Degree in Engineering of Audiovisual Systems	TelecomBCN, ETSETB
IDL	Introduction to Deep Learning	BSc	ETSETB TelecomBCN
VA	Video Analysis	Master in Computer Vision (MCV)	UAB, UB, UOC, UPC & UPF

Search form

User login

Javier Ruiz Hidalgo

Biography

Journal Articles top

Book Chapters and Bookstop

Conference Papers top

Theses top

Other top

Projects top

Research Areas top

Demos and Resources top

Teaching top