首页    期刊浏览 2025年02月17日 星期一
登录注册

文章基本信息

  • 标题:Gradual Rules: A Heuristic Based Method and Application to Outlier Extraction
  • 本地全文:下载
  • 作者:Lisa Di Jorio ; Anne Laurent ; Maguelonne Teisseire
  • 期刊名称:International Journal of Computer Information Systems and Industrial Management Applications
  • 印刷版ISSN:2150-7988
  • 电子版ISSN:2150-7988
  • 出版年度:2009
  • 卷号:1
  • 页码:145-154
  • 出版社:Machine Intelligence Research Labs (MIR Labs)
  • 摘要:Nowaday, in spite of more and more efficent data mining tools, tackling databases containing discrete values or having a value for each item, like gene expression data, remains challenging. On such data, existing approaches either transform the data to classical binary attributes, or use discretisation, including fuzzy partition to deal with the data. However, binary mapping of such databases drives to a loss of information and extracted knowledge is not exploitable for end-users. Thus, powerful tools designed for this kind of data are needed. On the other hand, existing fuzzy approaches hardly take gradual notions into account, or are not scalable enougth to tackle the problem. In this paper, we thus propose a heuristic in order to extract tendencies, in the form of gradual association rules. A gradual rule can be read as "The more X and the less Y, then the more V and the less W". Instead of using fuzzy sets, we apply our method directly on valued data and we propose an efficient heuristic, thus reducing combinatorial complexity and scalability. Experiments on synthetic datasets show the interest of our method. Moreover, we propose to use our method for an outlier extraction process. Experiments lead on real dataset shows the efficiency of our method.
  • 关键词:Gradual Rules; Data Mining; Trends; Outlier
国家哲学社会科学文献中心版权所有