CPSC 538A: Time-Sensitive Distributed Applications

Overview of fine granularity scalability in MPEG-4 video standard

Weiping Li, IEEE Transactions on Circuits and Systems for Video Technology, Volume 11, Issue: 3. March 2001

Presentation: PowerPoint, PDF

Summary:

Fine granularity scalability (FGS) is an amendment on the MPEG-4 standard. This work aims to optimize the video quality over a given range of bit rate. This was motivated by streaming video over the internet, where two critical assumptions come into play:

This requires the bitstream should be partially decodable at any bit rate within the supported bit rate range, which is not supported by some other layered scalable techniques such as signal-to-noise ratio coding, temporal scaling or spatial scaling. The goal was to have the quality at a given bit rate as close as possible to what non-scalable technique would give, which represents the optimum.

The technique uses two layers, a base layer and an enhancement layer. The full base layer is required for the decoding of the video stream, and the enhancement layer is structured such that even a partial bitstream can be used to improve the video quality. This is done using bit-plane coding. A bit-plane is defined by taking one bit at the same significant position from each quantized DCT coefficient (absolute, in binary form). The so-formed bit-planes are encoded using variable length encoding, starting form the most significant bit-plane. This also gives a 20% bit saving over run-length coding.

Some advanced features have also been added to FGS, these are:

Two profiles were defined to facilitate FGS in MPEG, one for each layer:

The coding efficiency of FGS was compared to simulcast, SNR, and non-scalable coding. FGS was found to outperform multicast at the low and high end of the bit-rate range, and much more efficient than SNR, particularly at the upper end of the bit-range. Compared to non-scalable, SNR was about 2-dB worse at the high end.

Class discussion:

The discussion brought up a few points: