Journal of Northeastern University(Natural Science) ›› 2022, Vol. 43 ›› Issue (6): 783-791.DOI: 10.12068/j.issn.1005-3026.2022.06.004

• Information & Control • Previous Articles     Next Articles

High Perceptual Image Compression Based on Conditional GAN

ZHANG Xue-feng, XU Hua-wen, YANG Mian-zimei   

  1. School of Sciences, Northeastern University, Shenyang 110819, China.
  • Revised:2021-07-06 Accepted:2021-07-06 Published:2022-07-01
  • Contact: ZHANG Xue-feng
  • About author:-
  • Supported by:
    -

Abstract: A conditional generative adversarial network-based high perceptual image compression(HPIC) model was proposed to obtain a compressed image that conforms to human visual perception. In HPIC, the original image was encoded and quantized based on a hyper prior probability model. A generator based on the conditional additional label and residual module was used for the reconstruction of the compressed image, and a discriminator based on the convolutional neural network was used to distinguish the difference between the compressed image and the real image. Based on the bit rate-distortion-perception optimization theory to design the loss function, the perceptual distortion metric was chosen based on the eigenvalues of the pre-trained inception network to achieve image compression reconstruction with high perceptual quality on the one hand, and use the adversarial generation network loss to eliminate compression artifacts and improve the compression accuracy on the other hand. The experimental results show that HPIC achieves a good balance in the bit rate-distortion-perception tradeoff and outperforms the current common algorithms in all perceptual metrics scores, even though the latter uses twice as much bit rate as ours. HPIC can achieve compression with high perceptual quality.

Key words: image compression; bit rate-distortion-perception theory; conditional generative adversarial network; loss function

CLC Number: