首页    期刊浏览 2024年10月07日 星期一
登录注册

文章基本信息

  • 标题:A continuous binning for discrete, sparse and concentrated observations
  • 本地全文:下载
  • 作者:Rafael Prieto Curiel ; Carmen Cabrera Arnau ; Mara Torres Pinedo
  • 期刊名称:MethodsX
  • 印刷版ISSN:2215-0161
  • 电子版ISSN:2215-0161
  • 出版年度:2020
  • 卷号:7
  • 页码:1-7
  • DOI:10.1016/j.mex.2019.10.020
  • 语种:English
  • 出版社:Elsevier
  • 摘要:Graphical abstractDisplay OmittedAbstractDiscrete observations from data which are obtained from sparse, and yet concentrated events are often observed (e.g. road accidents or murders). Traditional methods to compute summary statistics often include placing the data in discrete bins but for this type of data this approach often results in large numbers of empty bins for which no function or summary statistic can be computed.Here, a method for dealing with sparse and concentrated observations is constructed, based on a sequence of non-overlapping bins of varying size, which gives a continuous interpolation of data for computing summary statistics of the values for the data, such as the mean.The method presented here overcomes the problem which sparsity and concentration present when computing functions to represent the data. Implementation of the method presented here is facilitated via open access to the code.•A new method for computing functions over sparse and concentrated data is constructed.•The method allows straightforward functions to be computed over partitions of the data, such as the mean, but also more complicated functions, such as coefficients, ratios, correlations, regressions and others.
  • 关键词:Sparse data;Discrete data;Continuous binning
国家哲学社会科学文献中心版权所有