首页    期刊浏览 2024年10月05日 星期六
登录注册

文章基本信息

  • 标题:Efficient Privacy Preserving Protocols for Similarity Join
  • 本地全文:下载
  • 作者:Bilal Hawashin ; Farshad Fotouhi ; Traian Marius Truta
  • 期刊名称:Transactions on Data Privacy
  • 印刷版ISSN:1888-5063
  • 电子版ISSN:2013-1631
  • 出版年度:2012
  • 卷号:5
  • 期号:1
  • 页码:297-331
  • 出版社:IIIA-CSIC
  • 摘要:

    During the similarity join process, one or more sources may not allow sharing its data with other sources. In this case, a privacy preserving similarity join is required. We showed in our previous work [4] that using long attributes, such as paper abstracts, movie summaries, product descriptions, and user feedbacks, could improve the similarity join accuracy using supervised learning. However, the existing secure protocols for similarity join methods can not be used to join sources using these long attributes. Moreover, the majority of the existing privacy‐preserving protocols do not consider the semantic similarities during the similarity join process. In this paper, we introduce a secure efficient protocol to semantically join sources when the join attributes are long attributes. We provide two secure protocols for both scenarios when a training set exists and when there is no available training set. Furthermore, we introduced the multi‐label supervised secure protocol and the expandable supervised secure protocol. Results show that our protocols can efficiently join sources using the long attributes by considering the semantic relationships among the long string values. Therefore, it improves the overall secure similarity join performance.

国家哲学社会科学文献中心版权所有