Journal of Northeastern University(Natural Science) ›› 2026, Vol. 47 ›› Issue (1): 11-19.DOI: 10.12068/j.issn.1005-3026.2026.20259020

• Smart Healthcare Column • Previous Articles     Next Articles

Joint Optimization Approach for Medical Image Compression and Vision Tasks

Chao YAO1(), Zi-xuan GAO2, Jun-ru CHEN3, Yi-peng LU4   

  1. 1.School of Computer & Communication Engineering,University of Science and Technology Beijing,Beijing 100083,China
    2.School of Computer Science & Technology,Beijing Jiaotong University,Beijing 100044,China
    3.School of Integrated Circuit Science and Engineering,Wuxi University,Wuxi 214105,China
    4.School of Integrated Circuits,Peking University,Beijing 100871,China. cn
  • Received:2025-06-06 Online:2026-01-15 Published:2026-03-17
  • Contact: Chao YAO

Abstract:

In medical image processing, the reliance on independent encoding components makes it impossible to achieve joint optimization of data compression and machine vision tasks. To address this issue, an end-to-end machine vision task-driven medical image compression network (MVMICNet) was proposed, achieving harmonious unification of data compression and medical image analysis in an end-to-end manner. To maintain the performance of machine vision tasks before and after medical image compression, a task-aware improved code rate-accuracy loss function was designed. By introducing task-related loss terms, it dynamically balanced the relationship among code rate, reconstructed image distortion, and machine vision task accuracy during the optimization process. Furthermore, the MVMICNet model adopted a stage-wise training approach, specifically optimizing for the different characteristics of machine vision tasks to ensure that the model can accurately capture the feature information crucial for diagnosis. This has achieved a simultaneous improvement in compression efficiency and task performance, thus demonstrating superior robustness in complex medical application scenarios. Finally, the effectiveness of the framework was verified in semantic segmentation and object detection tasks.

Key words: medical image compression, semantic segmentation, object detection, convolutional neural network (CNN), task-driven optimization

CLC Number: