Graduation Semester and Year
2008
Language
English
Document Type
Dissertation
Degree Name
Doctor of Philosophy in Electrical Engineering
Department
Electrical Engineering
First Advisor
Kamisetty R Rao
Abstract
This research proposes a block-based video codec design with two objectives. The first goal is to propose a method for intraframe that improves the rate-distortion (peak signal-to-noise ratio versus bit rate) of a fixed-size transform encoder. The proposed method uses three integer transform sizes (4x4), (8x8), and (16x16). The codec also adopts H.264-like spatial prediction to intraframe encoding. For simplicity of the design, Huffman variable-length code is used as entropy encoding. For intraframe encoding, the simulations show rate-distortion improvement over JPEG and JPEG2000. In some test sequences, the simulations also show improvement over H.264 (baseline profile at low complexity mode without rate-distortion optimization) with a small increase of operations on each macroblock at the decoder side. The second goal of this research is to study rate-distortion behavior of the interframe codec with novel motion estimation based on structural similarity (SSIM) and the codec with conventional motion estimation based on pixel error distortion (sum of absolute difference). A study from previous literature shows that the structural similarity metric provides better image assessment than a pixel error based metric (mean square error and peak signal-to-noise ratio). Structural similarity measurement on the true color components (RGB) with equal weight for each component is proposed. The results on rate-distortion show that both structural similarity and peak signal-to-noise ratio (PSNR) provide similar measurements. Both sum of absolute difference (SAD)- and structural similarity (SSIM)-based distortions in motion prediction of large block sizes, {(16x16), (8x8)}, have similar performances. For the small block size of (4x4), SAD-based distor-tion provides better rate-distortion performance. Distortion calculation for SSIM requires more operations compared to SAD.
Disciplines
Electrical and Computer Engineering | Engineering
License
This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 International License.
Recommended Citation
Kruafak, Att, "Hybrid Video Coding Design With Variable Size Integer Transforms And Structural Similarity" (2008). Electrical Engineering Dissertations. 71.
https://mavmatrix.uta.edu/electricaleng_dissertations/71
Comments
Degree granted by The University of Texas at Arlington