医疗图像大数据差分隐私保护方案

Differential privacy protection scheme for medical image big data with multi-resolution analysis

  • 摘要: 医疗图像大数据共享能够有效提升医疗服务质量,为防止医疗图像数据共享过程中发生患者隐私泄露,需要对医疗图像进行隐私保护。差分隐私作为一种具备严格的数学定义和隐私保护强度证明的安全机制,已经应用于图像数据隐私保护。针对医疗图像大数据的隐私保护问题,提出了一种基于多分辨率分析的医疗图像大数据差分隐私保护方案。该方案在Hadoop大数据平台上设计图像数据格式,基于MapReduce计算框架设计医疗图像大数据差分隐私保护算法;现有图像差分隐私方案对图像中的所有数据进行同等强度的保护,没有考虑不同的数据存在不同的隐私需求,针对此问题,结合医疗图像处理领域常用的小波变换技术,基于小波多分辨率分析提出一种隐私预算分配算法,该算法在小波域内根据各小波频带的隐私需求进行隐私预算分配,根据不同小波频带系数的隐私保护需求进行不同强度的差分隐私保护;最后,设计像素差分扰动算法,基于差分隐私指数机制对图像矩阵中的每个像素进行差分隐私扰动。实验结果表明,该方案能够根据各小波频带的隐私保护需求进行差分隐私保护,且在相同的隐私预算下,该方案的图像视觉效用相比对照方案最多可提升97.7%,图像分类效用最多可提升87.2%。在Hadoop集群上进行性能测试,该方案能够实现高效的医疗图像大数据差分隐私保护。

     

    Abstract: The release and sharing of medical image big data can effectively improve the quality of medical services. Medical images contain sensitive information of patients, in order to prevent the disclosure of patient privacy during the sharing of medical image data, it is necessary to protect the privacy of medical images. As a security mechanism with strict mathematical definition and proof of privacy protection strength, differential privacy has been widely used in image data privacy protection. To achieve the privacy protection for medical image big data, this paper proposes a differential privacy scheme for medical image big data with wavelet multi-resolution analysis. This scheme designs the medical image data format on the Hadoop platform and designs the differential privacy protection algorithm of medical image big data based on the MapReduce framework. Existing image differential privacy methods protect entire image data with a same privacy level without considering the different privacy requirements of different data. To solve this problem, combining with the wavelet transform technology commonly used in medical image processing, this paper proposes a privacy budget allocation algorithm based on wavelet multi-resolution analysis. The algorithm measures the importance and privacy requirement of different wavelet subbands in wavelet domain, and allocates privacy budget according to the privacy requirement of each wavelet subband. Finally, this paper proposes a pixel differential disturbance algorithm, which disturbs every pixel based on differential privacy exponential mechanism. The experimental results show that the proposed scheme can implement differential privacy protection according to the privacy protection requirements of each wavelet subband. Under the same privacy budget, the image visual effect of this scheme can be improved by up to 97.7% compared with the control scheme, and the image classification effect can be improved by up to 87.2%. The performance experiment on the big data platform shows the proposed scheme can realize efficient differential privacy protection of medical image big data.

     

/

返回文章
返回