
Article Information

  • Title: Practical Privacy-Preserving K-means Clustering
  • Authors: Payman Mohassel; Mike Rosulek; Ni Trieu
  • Journal: Proceedings on Privacy Enhancing Technologies
  • Electronic ISSN: 2299-0984
  • Year: 2020
  • Volume: 2020
  • Issue: 4
  • Pages: 414-433
  • DOI: 10.2478/popets-2020-0080
  • Language: English
  • Publisher: Sciendo
  • Abstract: Clustering is a common technique for data analysis, which aims to partition data into similar groups. When the data comes from different sources, it is highly desirable to maintain the privacy of each database. In this work, we study a popular clustering algorithm (K-means) and adapt it to the privacy-preserving context. Specifically, to construct our privacy-preserving clustering algorithm, we first propose an efficient batched squared Euclidean distance computation protocol in the amortized setting, where one needs to compute the distance from the same point to many other points. Furthermore, we construct a customized garbled circuit for computing the minimum value among shared values. We believe these new constructions may be of independent interest. We implement and evaluate our protocols to demonstrate their practicality and show that they can handle much larger datasets, and run much faster, than previous work. The numerical results also show that the proposed protocol achieves almost the same accuracy as a plaintext K-means clustering algorithm.
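
The two building blocks named in the abstract, a batched squared Euclidean distance from one point to many others and a minimum over the resulting values, correspond to the assignment step of ordinary K-means. The following is a minimal plaintext sketch of that step only. It is not the paper's privacy-preserving protocol: it involves no secret sharing or garbled circuits, and the function names `squared_distances` and `nearest_centroid` are hypothetical, chosen for illustration.

```python
from typing import List, Sequence, Tuple

Number = float  # plain numbers; the paper instead operates on secret-shared values


def squared_distances(point: Sequence[Number],
                      centroids: Sequence[Sequence[Number]]) -> List[Number]:
    """Squared Euclidean distance from one point to every centroid (the 'batched' case)."""
    return [sum((p - c) ** 2 for p, c in zip(point, centroid))
            for centroid in centroids]


def nearest_centroid(point: Sequence[Number],
                     centroids: Sequence[Sequence[Number]]) -> Tuple[int, Number]:
    """Return (index, squared distance) of the closest centroid; ties go to the lowest index."""
    dists = squared_distances(point, centroids)
    best = min(range(len(dists)), key=dists.__getitem__)
    return best, dists[best]


if __name__ == "__main__":
    centroids = [[3, 4], [1, 1]]
    print(squared_distances([0, 0], centroids))
    print(nearest_centroid([0, 0], centroids))
```

Squared distances are used rather than true distances because the square root does not change which centroid is closest, so it can be skipped when only the minimum matters.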