Journal: International Journal of Innovative Research in Computer and Communication Engineering
Print ISSN: 2320-9798
Online ISSN: 2320-9801
Publication year: 2015
Volume: 3
Issue: 11
DOI: 10.15680/IJIRCCE.2015.0311091
Publisher: S&S Publications
Abstract: Probabilistic data is generated by automated data analysis and enrichment techniques such as entity resolution, information extraction, and speech processing. The legacy systems considered here are pre-existing web applications such as Picasa and Flickr. Our goal is to generate a deterministic representation of probabilistic data that optimizes the quality of the end application built on the deterministic data. We explore this problem in the context of two very different data processing tasks: triggers and selection queries. We show that approaches such as thresholding or top-1 selection, which are traditionally used for determinization, lead to suboptimal performance for such applications. Instead, we develop a query-aware strategy and show its advantages over existing solutions through a comprehensive empirical evaluation over real and synthetic datasets.
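
The two baseline determinization approaches named in the abstract, thresholding and top-1 selection, can be illustrated with a minimal sketch. The function names, the `photo_tags` data, and the cutoff `tau` below are hypothetical and chosen only for illustration; this is not the paper's query-aware strategy, which the abstract does not specify.

```python
from typing import Dict, Set


def threshold_determinize(tag_probs: Dict[str, float], tau: float) -> Set[str]:
    """Keep every tag whose probability is at least tau (assumed cutoff)."""
    return {tag for tag, p in tag_probs.items() if p >= tau}


def top1_determinize(tag_probs: Dict[str, float]) -> Set[str]:
    """Keep only the single most probable tag; empty input gives an empty set."""
    if not tag_probs:
        return set()
    return {max(tag_probs, key=tag_probs.get)}


# Hypothetical probabilistic tags for one photo, as an automated tagger might emit.
photo_tags = {"beach": 0.8, "sunset": 0.55, "dog": 0.1}

thresholded = threshold_determinize(photo_tags, tau=0.5)
top1 = top1_determinize(photo_tags)
```

With these made-up values, thresholding at 0.5 keeps "beach" and "sunset", while top-1 keeps only "beach". Neither choice takes into account the triggers or selection queries that will later run over the deterministic data, which is the limitation the abstract attributes to these approaches.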