image and video compression plays an important role in providing high quality image/video services under the limited capabilities of transmission networks and storage. The redundancies within images and videos are fundamentally important for image and video compression, including spatial redundancy, visual redundancy and statistical redundancy.Besides, the temporal redundancy existing in video sequences enables the video compression to achieve higher compression ratio compared with image compression.For image compression, the early methods mainly realize compression by directly utilizing the entropy coding to reduce statistical redundancy within the image, such as Huffman coding , Golomb code and arithmetic coding. In later 1960s, transform coding was proposed for imagecompression by encoding the spatial frequencies, including Fourier transform and Hadamard transform. In 1974, Ahmed et al. proposed Discrete Cosine Transform (DCT) for image coding, which can compact image energy in the low frequency domain such that compression in the frequency domain becomes much more efficient.Besides reducing statistical redundancy by entropy coding and transform techniques, the prediction and quantization techniques are further proposed to reduce spatial redundancy and visual redundancy in images. The most popular image compression standard, JPEG, is a successful image compression system by integrating its preceding coding techniques. It first divides image into blocks and then transforms blocks into the DCT domain. For each block, the differential pulse code modulation (DPCM) [7] is applied to its DC components, such that the prediction residuals of DC components between neighboring DCT blocks are compressed instead of compressing the DC value directly. To reduce the visual redundancy, a special quantization table is designed to well preserve low-frequency information and discard more highfrequency (noise-like) details as humans are less sensitive to the information loss in high frequency parts [8]. Another well-known still image compression standard, JPEG 2000 [9], applies the 2D wavelet transform instead of DCT to represent images in a compact form, and utilizes an efficient arithmetic coding method, EBCOT [10], to reduce the statistical redundancy existing in wavelet coefficients.
image and video compression plays an important role in providing high quality image/video services under the limited capabilities of transmission networks and storage. The redundancies within images and videos are fundamentally important for image and video compression, including spatial redundancy, visual redundancy and statistical redundancy.Besides, the temporal redundancy existing in video sequences enables the video compression to achieve higher compression ratio compared with image compression.For image compression, the early methods mainly realize compression by directly utilizing the entropy coding to reduce statistical redundancy within the image, such as Huffman coding , Golomb code and arithmetic coding. In later 1960s, transform coding was proposed for imagecompression by encoding the spatial frequencies, including Fourier transform and Hadamard transform. In 1974, Ahmed et al. proposed Discrete Cosine Transform (DCT) for image coding, which can compact image energy in the low frequency domain such that compression in the frequency domain becomes much more efficient.Besides reducing statistical redundancy by entropy coding and transform techniques, the prediction and quantization techniques are further proposed to reduce spatial redundancy and visual redundancy in images. The most popular image compression standard, JPEG, is a successful image compression system by integrating its preceding coding techniques. It first divides image into blocks and then transforms blocks into the DCT domain. For each block, the differential pulse code modulation (DPCM) [7] is applied to its DC components, such that the prediction residuals of DC components between neighboring DCT blocks are compressed instead of compressing the DC value directly. To reduce the visual redundancy, a special quantization table is designed to well preserve low-frequency information and discard more highfrequency (noise-like) details as humans are less sensitive to the information loss in high frequency parts [8]. Another well-known still image compression standard, JPEG 2000 [9], applies the 2D wavelet transform instead of DCT to represent images in a compact form, and utilizes an efficient arithmetic coding method, EBCOT [10], to reduce the statistical redundancy existing in wavelet coefficients.<br>
正在翻译中..