首页    期刊浏览 2025年07月14日 星期一
登录注册

文章基本信息

  • 标题:Large-Scale Dataset of Local Java Software Build Results
  • 本地全文:下载
  • 作者:Matúš Sulír ; Michaela Bačíková ; Matej Madeja
  • 期刊名称:Data
  • 印刷版ISSN:2306-5729
  • 出版年度:2020
  • 卷号:5
  • 期号:3
  • 页码:86-96
  • DOI:10.3390/data5030086
  • 出版社:MDPI Publishing
  • 摘要:When a person decides to inspect or modify a third-party software project, the first necessary step is its successful compilation from source code using a build system.However, such attempts often end in failure.In this data descriptor paper, we provide a dataset of build results of open source Java software systems.We tried to automatically build a large number of Java projects from GitHub using their Maven, Gradle, and Ant build scripts in a Docker container simulating a standard programmer’s environment.The dataset consists of the output of two executions: 7264 build logs from a study executed in 2016 and 7233 logs from the 2020 execution.In addition to the logs, we collected exit codes, file counts, and various project metadata.The proportion of failed builds in our dataset is 38% in the 2016 execution and 59% in the 2020 execution.The published data can be helpful for multiple purposes, such as correlation analysis of factors affecting build success, build failure prediction, and research in the area of build breakage repair.
  • 关键词:build tool; program compilation; failure; Ant; Maven; Gradle
国家哲学社会科学文献中心版权所有