MASCOT - distributed Metadata for Advanced Scalable video COding Tools

Type: European
Start: May 2001
End: Apr 2003
Responsible: Javier Ruiz Hidalgo

Reference

IST/FET (2000-26467)

Description

The explosion of multimedia applications is driving a rapid expansion of video transmission over heterogeneous channels such as the Internet, mobile networks and In-Home Digital Networks, which raises new issues in terms of varying transport conditions (e.g., bandwidth, error rate) and receiver capabilities (e.g., CPU, display). In the future, the Internet is expected to become the major carrier for all forms of audio-visual information and data, and the requirements for bandwidth availability and for quick and easy access to large multimedia databases will become increasingly stringent. New techniques are therefore required to overcome the limitations of current coding techniques and to enable such access to large multimedia data repositories. In this context, the main goal of MASCOT is to achieve a breakthrough in video coding and access through the exploitation of two innovative techniques:

  • New decompositions enabling scalable compression: Scalability is the functionality expected to introduce a high degree of flexibility into coding/decoding systems. However, for currently available interactive multimedia applications, which are very demanding in terms of video quality and coding efficiency, the cost and the limited performance of the scalability offered by current standards remain unacceptable. For this reason, an intrinsically scalable video coding scheme providing fully progressive bitstreams will first be designed within the MASCOT project by exploiting novel wavelet decomposition methods and more efficient prediction techniques. A major contribution of MASCOT will be the investigation of new nonlinear wavelets in the framework of scalable coding. Motion compensation, a basic ingredient of current video compression standards, is one example of a prediction technique. It will be further developed and enriched to fit into a scalable coding framework and to reduce coding redundancy by handling other image transformation models, such as colour and illumination changes as well as transitions.
  • Use of metadata information to improve coding: In the future, a very large number of audio-visual documents can be expected to be indexed, and metadata information will be rather easy to create. As a result, audio-visual material will in many circumstances be available together with the metadata describing its content. Future image and video sequence encoders will therefore be able to use this metadata to improve their efficiency or to optimise their strategy. One of the main objectives of this project is to demonstrate the validity of this approach and to develop an efficient compression scheme exploiting metadata information.
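To make the idea of a scalable wavelet decomposition more concrete, the following is a minimal sketch of one level of the classic integer Haar transform written as lifting steps (often called the S-transform). It is an illustration only and is not the decomposition developed in MASCOT; the function names `haar_lift_forward` and `haar_lift_inverse` are hypothetical. The rounding in the update step makes the transform nonlinear while keeping it exactly invertible, and the coarse band alone can be decoded as a lower-quality base layer.

```python
def haar_lift_forward(signal):
    """One level of integer Haar lifting on an even-length list of ints.

    Returns (approx, detail): a coarse band and the refinement band.
    """
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    even = signal[0::2]
    odd = signal[1::2]
    # Predict step: each odd sample is predicted by its even neighbour.
    detail = [o - e for e, o in zip(even, odd)]
    # Update step: floor(detail / 2) turns the even sample into a rounded
    # pairwise average. The rounding is what makes the transform nonlinear.
    approx = [e + (d >> 1) for e, d in zip(even, detail)]
    return approx, detail


def haar_lift_inverse(approx, detail):
    """Undo haar_lift_forward by running the lifting steps in reverse."""
    even = [a - (d >> 1) for a, d in zip(approx, detail)]
    odd = [d + e for e, d in zip(even, detail)]
    out = []
    for e, o in zip(even, odd):
        out.extend((e, o))
    return out


if __name__ == "__main__":
    samples = [10, 12, 14, 20, 7, 3, 0, 255]
    approx, detail = haar_lift_forward(samples)
    # Full decoding: both bands give back the original samples exactly.
    print(haar_lift_inverse(approx, detail))
    # Base-layer decoding: details dropped, only the coarse band is used.
    print(haar_lift_inverse(approx, [0] * len(detail)))
```

A decoder that receives only `approx` reconstructs a coarse version of the signal, and receiving `detail` afterwards refines it to a lossless result, which is the basic behaviour a progressive bitstream relies on.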

 

Consortium:

  • Centre for Mathematics and Computer Science
  • Centre de Morphologie Mathématique
  • Heinrich-Hertz-Institut
  • Poznan University of Technology
  • Philips Research Lab.
  • École Nationale Supérieure des Télécommunications
  • Universitat Politècnica de Catalunya
  • Vrije Universiteit Brussel

Publications

Salembier P, Wilkinson MHF. Connected operators: A review of region-based morphological image processing techniques. IEEE Signal Processing Magazine. 2009;6:136–157.
Ruiz-Hidalgo J, Salembier P. Comparison of MPEG-7 descriptors for long term selection of reference frames. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009. Taipei, Taiwan; 2009. pp. 941–944.
Ruiz-Hidalgo J, Salembier P. Long term selection of reference frame sub-blocks using MPEG-7 indexing metadata. In: International Conference on Acoustics, Speech and Signal Processing, ICASSP 2007. Honolulu, Hawaii; 2007. pp. 669–672.
Solé J, Salembier P. Quadratic Interpolation and Linear Lifting Design. EURASIP Journal on Applied Signal Processing. 2007;1:1–9.
Salembier P, Benitez AB. Structure description tools. Journal of the American Society for Information Science. 2007;58:1329–1337.
Ruiz-Hidalgo J, Salembier P. On the use of indexing metadata to improve the efficiency of video compression. IEEE Transactions on Circuits and Systems for Video Technology. 2006;16:410–419.
Ruiz-Hidalgo J, Salembier P. Metadata-based coding tools for hybrid video codecs. In: Picture Coding Symposium, PCS 2003. Saint-Malo, France; 2003. pp. 473–477.
Salembier P, Llach J, Garrido L. Visual Segment Tree Creation for MPEG-7 Description Schemes. Pattern Recognition. 2002;35:563–579.
Salembier P. Overview of the MPEG-7 Standard and of Future Challenges for Visual Information Analysis. EURASIP Journal on Applied Signal Processing. 2002;4:1–11.
Avaro O, Salembier P. MPEG-7 Systems: overview. IEEE Transactions on Circuits and Systems for Video Technology. 2001;11:760–764.
