首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:RMol: a toolset for transforming SD/Molfile structure information into R objects
  • 本地全文:下载
  • 作者:Martin Grabner ; Kurt Varmuza ; Matthias Dehmer
  • 期刊名称:Source Code for Biology and Medicine
  • 印刷版ISSN:1751-0473
  • 电子版ISSN:1751-0473
  • 出版年度:2012
  • 卷号:7
  • 期号:1
  • 页码:12
  • DOI:10.1186/1751-0473-7-12
  • 语种:English
  • 出版社:BioMed Central
  • 摘要:The graph-theoretical analysis of molecular networks has a long tradition in chemoinformatics. As demonstrated frequently, a well designed format to encode chemical structures and structure-related information of organic compounds is the Molfile format. But when it comes to use modern programming languages for statistical data analysis in Bio- and Chemoinformatics, R as one of the most powerful free languages lacks tools to process Molfile data collections and import molecular network data into R. We design an R object which allows a lossless information mapping of structural information from Molfiles into R objects. This provides the basis to use the RMol object as an anchor for connecting Molfile data collections with R libraries for analyzing graphs. Associated with the RMol objects, a set of R functions completes the toolset to organize, describe and manipulate the converted data sets. Further, we bypass R-typical limits for manipulating large data sets by storing R objects in bz-compressed serialized files instead of employing RData files. By design, RMol is a R toolset without dependencies to other libraries or programming languages. It is useful to integrate into pipelines for serialized batch analysis by using network data and, therefore, helps to process sdf-data sets in R efficiently. It is freely available under the BSD licence. The script source can be downloaded from http://sourceforge.net/p/rmol-toolset
  • 关键词:Molecular Network ; Serialize File ; Modern Programming Language ; Chemical Structure Information ; Exist Data Collection
国家哲学社会科学文献中心版权所有