运筹学学报 ›› 2022, Vol. 26 ›› Issue (3): 1-16.doi: 10.15960/j.cnki.issn.1007-6093.2022.03.001

• • 上一篇    下一篇

k-均值问题的差分隐私算法综述

袁藩1, 徐大川1, 张冬梅2,*()   

  1. 1. 北京工业大学北京科学与工程计算研究院, 北京 100124
    2. 山东建筑大学计算机科学与技术学院, 山东济南 250101
  • 收稿日期:2021-12-20 出版日期:2022-09-15 发布日期:2022-09-07
  • 通讯作者: 张冬梅 E-mail:zhangdongmei@sdjzu.edu.cn
  • 作者简介:张冬梅, E-mail: zhangdongmei@sdjzu.edu.cn
  • 基金资助:
    国家自然科学基金(11871081);国家自然科学基金(12131003)

A survey of differential privacy algorithms for the $k$-means problem

Fan YUAN1, Dachuan XU1, Dongmei ZHANG2,*()   

  1. 1. Beijing Institute for Scientific and Engineering Computing, Beijing University of Technology, Beijing 100124, China
    2. School of Computer Science and Technology, Shandong Jianzhu University, Jinan 250101, Shandong, China
  • Received:2021-12-20 Online:2022-09-15 Published:2022-09-07
  • Contact: Dongmei ZHANG E-mail:zhangdongmei@sdjzu.edu.cn

摘要:

$k$-均值问题是机器学习和组合优化领域十分重要的问题。它是经典的NP-难问题, 被广泛的应用于数据挖掘、企业生产决策、图像处理、生物医疗科技等领域。随着时代的发展, 人们越来越注重于个人的隐私保护:在决策通常由人工智能算法做出的情况下, 如何保证尽可能多地从数据中挖掘更多信息,同时不泄露个人隐私。近十年来不断有专家学者研究探索带隐私保护的$k$-均值问题, 得到了许多具有理论指导意义和实际应用价值的结果, 本文主要介绍关于$k$-均值问题的差分隐私算法供读者参考。

关键词: $k$-均值问题, 差分隐私, 近似算法, 指数机制, 拉普拉斯机制

Abstract:

The $k$-means problem is a very important problem in the field of machine learning and combinatorial optimization. It is a classic NP-hard problem, which is widely used in data mining, business production decision-making, image processing, biomedical technology, and more. As people in these fields pay more and more attention to personal privacy protection, which raises a question: In the case that decisions are usually made by artificial intelligence algorithms, how to ensure that as much information as possible is extracted from the data without revealing personal privacy? In the past ten years, experts and scholars have continuously studied and explored the $k$-means problem with privacy protection, and has also obtained many results with theoretical guiding significance and practical application value. In this paper we mainly introduce these differential privacy algorithms of the $k$-means problem for readers.

Key words: $k$-means problem, differential privacy, approximate algorithm, exponential mechanism, Laplace mechanism

中图分类号: