期刊名称:International Journal of Innovative Research in Computer and Communication Engineering
印刷版ISSN:2320-9798
电子版ISSN:2320-9801
出版年度:2015
卷号:3
期号:2
DOI:10.15680/ijircce.2015.0302041
出版社:S&S Publications
摘要:For extracting useful knowledge which is hidden in large set of data, Data mining is a very importanttechnology. There are some negative perceptions about data mining. This perception may contain unfairly treatingpeople who belongs to some specific group. Classification rule mining technique has covered the way for makingautomatic decisions like loan granting/denial and insurance premium computation etc. These are automated datacollection and data mining techniques. According to discrimination attributes if training data sets are biases thendiscriminatory decisions may ensue. Thus in data mining antidiscrimination techniques with discrimination discoveryand prevention are included. It can be direct or indirect. When decisions are made based on sensitive attributes thattime the discrimination is indirect. When decisions are made based on nonsensitive attributes which are stronglycorrelated with biased sensitive ones that time the discrimination is indirect. The proposed system tries to tacklediscrimination prevention in data mining. It proposes new improved techniques applicable for direct or indirectdiscrimination prevention individually or both at the same time. Discussions about how to clean training data sets andoutsourced data sets in such a way that direct and/or indirect discriminatory decision rules are converted to legitimateclassification rules are done. New metrics to evaluate the utility of the proposed approaches are proposes andcomparison of these approaches is also done.
关键词:Antidiscrimination; data mining; direct and indirect discrimination prevention; rule protection; rule;generalization; privacy.