首页    期刊浏览 2025年06月05日 星期四
登录注册

文章基本信息

  • 标题:A Machine Learning-Based Method for Content Verification in the E-Commerce Domain
  • 本地全文:下载
  • 作者:Theodoros Alexakis ; Nikolaos Peppes ; Konstantinos Demestichas
  • 期刊名称:Information
  • 电子版ISSN:2078-2489
  • 出版年度:2022
  • 卷号:13
  • 期号:3
  • 页码:116
  • DOI:10.3390/info13030116
  • 语种:English
  • 出版社:MDPI Publishing
  • 摘要:Analysis of extreme-scale data is an emerging research topic; the explosion in available data raises the need for suitable content verification methods and tools to decrease the analysis and processing time of various applications. Personal data, for example, are a very valuable source of information for several purposes of analysis, such as marketing, billing and forensics. However, the extraction of such data (referred to as person instances in this study) is often faced with duplicate or similar entries about persons that are not easily detectable by the end users. In this light, the authors of this study present a machine learning- and deep learning-based approach in order to mitigate the problem of duplicate person instances. The main concept of this approach is to gather different types of information referring to persons, compare different person instances and predict whether they are similar or not. Using the Jaro algorithm for person attribute similarity calculation and by cross-examining the information available for person instances, recommendations can be provided to users regarding the similarity or not between two person instances. The degree of importance of each attribute was also examined, in order to gain a better insight with respect to the declared features that play a more important role.
国家哲学社会科学文献中心版权所有