首页    期刊浏览 2025年02月17日 星期一
登录注册

文章基本信息

  • 标题:Direct regression modelling of high-order moments in big data
  • 作者:Ruibin Xi ; Nan Lin
  • 期刊名称:Statistics and Its Interface
  • 印刷版ISSN:1938-7989
  • 电子版ISSN:1938-7997
  • 出版年度:2016
  • 卷号:9
  • 期号:4
  • 页码:445-452
  • DOI:10.4310/SII.2016.v9.n4.a4
  • 出版社:International Press
  • 摘要:Big data problems present great challenges to statistical analyses, especially from the computational side. In this paper, we consider regression estimation of high-order moments in big data problems based on the U-statistic-based Functional Regression Model (U-FRM) model. The U-FRM model is a nonparametric method that allows direct estimation of higher-order moments without imposing parametric assumptions on the high order-moments. Despite this modeling advantage, its estimation relies on a U-statistics-based estimating equation whose computational complexity is generally too high for big data. In this paper, we propose using the “divide-and-conquer” strategy to construct a computationally more succinct surrogate estimating equation. Through both theoretical proof and simulations, we show that our method significantly reduces the computational time and meanwhile enjoys the same asymptotic behavior as the original estimation method.We then apply our method to a genomic problem to illustrate its performance on real data.
  • 关键词:big data; higher-order moment; U-statistics; estimating equation; divide-and-conquer; aggregation; consistency; asymptotic normality; data cube
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有