MALEGRA - Multimodal Signal Processing and Machine Learning on Graphs

Type | Start | End |
---|---|---|
National | Jan 2017 | Jun 2021 |
Responsible | URL |
---|---|
Javier Ruiz-Hidalgo / Xavier Giró |
Reference
MALEGRA, TEC2016-75976-R, financed by the Spanish Ministerio de Economía, Industria y Competitividad and the European Regional Development Fund (ERDF)
Description
The goal of this project is to study and develop tools combining graph signal representation and processing ideas with machine learning technology. These tools will be used in the context of applications where the size and/or the heterogeneity of the data represent challenges of the Big Data era. The development of technologies related to the capture, storage, search, distribution, transfer, analysis and visualization of ever growing heterogeneous datasets entails tremendous difficulties. At the same time, these difficulties open new opportunities and this development has become a major trend in the field of Information and Communication Technology. The research performed in this project targets applications such as multi-view representations, video analysis, remote sensing for earth monitoring, person identification, health monitoring, medical imaging, genomics, etc.
The project has 4 major objectives. The first two objectives concentrate most of the development of theoretical and basic tools within the project. Within them, we will investigate the creation, analysis, segmentation, filtering and merging of graph structures of heterogeneous multimodal data and on the combination of these ideas with machine learning techniques. This combination with machine learning will be used for several different purposes. In particular, to provide a classification decision, to learn a mapping or a model to be used in a data processing architecture, to learn features that outperform handcrafted equivalents or to aggregate several features to create a signal to be further processed.
The last two objectives of the project focus on the application of the techniques and tools developed in the first two objectives in complex challenges that deal with big and heterogeneous data. In particular, these techniques and tools will be used to study the identification of persons in broadcast TV programs, the optimal encoding of depth maps in multi-view plus depth representations, the radiometric estimation and object detection in SAR and PolSAR images, the classification of multispectral and hyperspectral images, the understanding of brain changes during the evolution of Alzheimers disease, the inference of gene regulatory networks and the segmentation, tracking, indexing and super-resolution of multimodal video sequences.
Publications
3D hierarchical optimization for multi-view depth map coding. Multimedia Tools and Applications. 2017 .![]() |
.
Temporal-aware Cross-modal Embeddings for Video and Audio Retrieval. In: NIPS 2017 Women in Machine Learning Workshop (WiML). NIPS 2017 Women in Machine Learning Workshop (WiML). Long Beach, CA, USA: NIPS 2017 Women in Machine Learning Workshop; 2017.![]() |
.
Cost-Effective Active Learning for Melanoma Segmentation. In: ML4H: Machine Learning for Health Workshop at NIPS 2017. ML4H: Machine Learning for Health Workshop at NIPS 2017. Long Beach, CA, USA; 2017.![]() |
.
Active Deep Learning for Medical Imaging Segmentation. In: Medical Image meets NIPS 2017 Workshop. Medical Image meets NIPS 2017 Workshop. ; 2017.![]() |
.
ViTS: Video Tagging System from Massive Web Multimedia Collections. In: ICCV 2017 Workshop on Web-scale Vision and Social Media . ICCV 2017 Workshop on Web-scale Vision and Social Media . Venice, Italy; 2017.![]() |
.
Registration of Images to Unorganized 3D Point Clouds Using Contour Cues. In: The 25th European Signal Processing Conference (EUSIPCO 2017). The 25th European Signal Processing Conference (EUSIPCO 2017). Kos island, Greece: Eurasip; 2017.![]() |
.
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks. In: NIPS Time Series Workshop 2017. NIPS Time Series Workshop 2017. Long Beach, CA, USA; 2017.![]() |
.
3D Point Cloud Segmentation Using a Fully Connected Conditional Random Field. In: The 25th European Signal Processing Conference (EUSIPCO 2017). The 25th European Signal Processing Conference (EUSIPCO 2017). Kos island, Greece: Eurasip/IEEE; 2017.![]() |
.
Multi-view depth coding based on a region representation combining color and depth information . Signal Theory and Communications (TSC). 2017 .![]() |
.
Hierarchical stack filtering: a bitplane-based algorithm for massively parallel processors. Journal of Real-Time Image Processing. 2017 .![]() |
.
Pages
Collaborators
Javier Ruiz Hidalgo | Associate Professor | j.ruiz@upc.edu |
Xavier Giró | Associate Professor | xavier.giro@upc.edu |
Ferran Marqués | Professor | ferran.marques@upc.edu |
Albert Oliveras | Associate Professor | albert@tsc.upc.edu |
Philippe Salembier | Professor | philippe.salembier@upc.edu |
Elisa Sayrol | Associate Professor | elisa.sayrol@upc.edu |
Josep R. Casas | Associate Professor | josep.ramon.casas@upc.edu |
Veronica Vilaplana | Associate Professor | veronica.vilaplana@upc.edu |
Montse Pardàs | Professor | montse.pardas@upc.edu |
Marc Maceira | PhD Candidate | marc.maceira@upc.edu |
Xiao Lin | PhD Candidate | xiao.lin@upc.edu |
Alba Pujol | PhD Candidate | alba.pujol@upc.edu |
Adrià Casamitjana | PhD Candidate | adria.casamitjana@upc.edu |
Albert Gil Moreno | Software Engineer | albert.gil@upc.edu |