首页    期刊浏览 2024年09月18日 星期三
登录注册

文章基本信息

  • 标题:RANdom SAmple Consensus (RANSAC) algorithm for material-informatics: application to photovoltaic solar cells
  • 本地全文:下载
  • 作者:Omer Kaspi ; Abraham Yosipof ; Hanoch Senderowitz
  • 期刊名称:Journal of Cheminformatics
  • 印刷版ISSN:1758-2946
  • 电子版ISSN:1758-2946
  • 出版年度:2017
  • 卷号:9
  • 期号:1
  • 页码:34
  • DOI:10.1186/s13321-017-0224-0
  • 语种:English
  • 出版社:BioMed Central
  • 摘要:An important aspect of chemoinformatics and material-informatics is the usage of machine learning algorithms to build Quantitative Structure Activity Relationship (QSAR) models. The RANdom SAmple Consensus (RANSAC) algorithm is a predictive modeling tool widely used in the image processing field for cleaning datasets from noise. RANSAC could be used as a “one stop shop” algorithm for developing and validating QSAR models, performing outlier removal, descriptors selection, model development and predictions for test set samples using applicability domain. For “future” predictions (i.e., for samples not included in the original test set) RANSAC provides a statistical estimate for the probability of obtaining reliable predictions, i.e., predictions within a pre-defined number of standard deviations from the true values. In this work we describe the first application of RNASAC in material informatics, focusing on the analysis of solar cells. We demonstrate that for three datasets representing different metal oxide (MO) based solar cell libraries RANSAC-derived models select descriptors previously shown to correlate with key photovoltaic properties and lead to good predictive statistics for these properties. These models were subsequently used to predict the properties of virtual solar cells libraries highlighting interesting dependencies of PV properties on MO compositions.
  • 关键词:RANSAC ; Material-informatics ; QSAR ; Photovoltaics ; Solar Cells
国家哲学社会科学文献中心版权所有