首页    期刊浏览 2024年10月07日 星期一
登录注册

文章基本信息

  • 标题:Systematic Labeling Bias in Galaxy Morphologies
  • 本地全文:下载
  • 作者:Guillermo Cabrera-Vives ; Christopher J. Miller ; Jeff Schneider
  • 期刊名称:The Astronomical journal
  • 印刷版ISSN:0004-6256
  • 电子版ISSN:1538-3881
  • 出版年度:2018
  • 卷号:156
  • 期号:6
  • 页码:1-11
  • DOI:10.3847/1538-3881/aae9f4
  • 语种:English
  • 出版社:American Institute of Physics
  • 摘要:We present a metric to quantify systematic labeling bias in galaxy morphology data sets stemming from the quality of the labeled data. This labeling bias is independent from labeling errors and requires knowledge about the intrinsic properties of the data with respect to the observed properties. We conduct a relative comparison of label bias for different low-redshift galaxy morphology data sets. We show our metric is able to recover previous de-biasing procedures based on redshift as biasing parameter. By using the image resolution instead, we find biases that have not been addressed. We find that the morphologies based on supervised machine learning trained over features such as colors, shape, and concentration show significantly less bias than morphologies based on expert or citizen-science classifiers. This result holds even when there is underlying bias present in the training sets used in the supervised machine learning process. We use catalog simulations to validate our bias metric and show how to bin the multi-dimensional intrinsic and observed galaxy properties used in the bias quantification. Our approach is designed to work on any other labeled multi-dimensional data set, and the code is publicly available (https://github.com/guille-c/labeling_bias).
  • 关键词:galaxies: statistics;methods: data analysis;methods: statistical
国家哲学社会科学文献中心版权所有