The Discrete Cosine Transform (DCT)-zhiqiang0071-ChinaUnix博客

嵌入式linux系统与图像处理seen.blog.chinaunix.net

首页　| 　博文目录　| 　关于我

zhiqiang0071

博客访问： 1988115
博文数量： 185
博客积分： 10707
博客等级：上将
技术积分： 1777
用户组：普通用户
注册时间： 2008-09-19 17:31

文章分类

全部博文（185）

Algorithm（3）
others（4）
Audio（1）
Software Testing（1）
Soft Architectur（2）
Hardware（0）
linux filesys（5）
Video（7）
TCP/IP（1）
GUI（8）
Linux others（28）
Linux Device Dri（12）
Linux Programmin（17）
Linux Kernel（24）
Embedded（32）
USB（9）
Davinci（15）
未分配的博文（16）

文章存档

2014年（1）

2012年（6）

2011年（27）

2010年（13）

2009年（75）

2008年（63）

我的朋友

最近访客

推荐博文

The Discrete Cosine Transform (DCT)

分类：

2011-06-06 12:03:13

原文地址：The Discrete Cosine Transform (DCT) 作者：steven_miao

The discrete cosine transform (DCT) helps separate the image into parts (or spectral sub-bands) of differing importance (with respect to the image's visual quality). The DCT is similar to the discrete Fourier transform: it transforms a signal or image from the spatial domain to the frequency domain (Fig ).

DCT Encoding

The general equation for a 1D (N data items) DCT is defined by the following equation:

$.begin{displaymath} F(u) = .left(.frac{2}{N}.right)^{.frac{1}{2}} .sum_{i=0}^{N-1} .Lambda(i).cos.left[ .frac{.pi.u}{2.N}(2i+1) .right]f(i).end{displaymath}$

and the corresponding inverse 1D DCT transform is simple F^-1(u), i.e.:

where

$.begin{displaymath} .Lambda(i) = .left.{ .begin{array} {ll} .frac{1}{.sqrt{2}} & {.rm for} .xi = 0. 1 & {.rm otherwise}.end{array} .right..end{displaymath}$

The general equation for a 2D (N by M image) DCT is defined by the following equation:

$.begin{displaymath} F(u,v) = .left(.frac{2}{N}.right)^{.frac{1}{2}} .left(.frac{... ...}(2i+1) .right]cos.left[ .frac{.pi.v}{2.M}(2j+1) .right].f(i,j).end{displaymath}$

and the corresponding inverse 2D DCT transform is simple F^-1(u,v), i.e.:

where

$.begin{displaymath} .Lambda(.xi) = .left.{ .begin{array} {ll} .frac{1}{.sqrt{2}} & {.rm for} .xi = 0 . 1 & {.rm otherwise}.end{array} .right..end{displaymath}$

The basic operation of the DCT is as follows:

The input image is N by M;
f(i,j) is the intensity of the pixel in row i and column j;
F(u,v) is the DCT coefficient in row k1 and column k2 of the DCT matrix.
For most images, much of the signal energy lies at low frequencies; these appear in the upper left corner of the DCT.
Compression is achieved since the lower right values represent higher frequencies, and are often small - small enough to be neglected with little visible distortion.
The DCT input is an 8 by 8 array of integers. This array contains each pixel's gray scale level;
8 bit pixels have levels from 0 to 255.
Therefore an 8 point DCT would be:
where

$.begin{displaymath} .Lambda(.xi) = .left.{ .begin{array} {ll} .frac{1}{.sqrt{2}} & {.rm for} .xi = 0 . 1 & {.rm otherwise}.end{array} .right..end{displaymath}$

Question: What is F[0,0]?
answer: They define DC and AC components.
The output array of DCT coefficients contains integers; these can range from -1024 to 1023.
It is computationally easier to implement and more efficient to regard the DCT as a set of basis functions which given a known input array size (8 x 8) can be precomputed and stored. This involves simply computing values for a convolution mask (8 x8 window) that get applied (summ values x pixelthe window overlap with image apply window accros all rows/columns of image). The values as simply calculated from the DCT formula. The 64 (8 x 8) DCT basis functions are illustrated in Fig .

DCT basis functions
Why DCT not FFT?
DCT is similar to the Fast Fourier Transform (FFT), but can approximate lines well with fewer coefficients (Fig )

DCT/FFT Comparison
Computing the 2D DCT
- Factoring reduces problem to a series of 1D DCTs (Fig ):
  - apply 1D DCT (Vertically) to Columns
  - apply 1D DCT (Horizontally) to resultant Vertical DCT above.
  - or alternatively Horizontal to Vertical.
  The equations are given by:
- Most software implementations use fixed point arithmetic. Some fast implementations approximate coefficients so all multiplies are shifts and adds.
- World record is 11 multiplies and 29 adds. (C. Loeffler, A. Ligtenberg and G. Moschytz, "Practical Fast 1-D DCT Algorithms with 11 Multiplications", Proc. Int'l. Conf. on Acoustics, Speech, and Signal Processing 1989 (ICASSP `89), pp. 988-991)

阅读(1716) | 评论(0) | 转发(0) |

上一篇：JPEG算法简介

下一篇：时域和频域的概念

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6