A machine learning approach to galaxy properties: joint redshift–stellar mass probability distributions with Random Forest

ABSTRACT We demonstrate that highly accurate joint redshift–stellar mass probability distribution functions (PDFs) can be obtained using the Random Forest (RF) machine learning (ML) algorithm, even with few photometric bands available. As an example, we use the Dark Energy Survey (DES), combined wit...

Full description

Saved in:
Bibliographic Details
Published inMonthly notices of the Royal Astronomical Society Vol. 502; no. 2; pp. 2770 - 2786
Main Authors Mucesh, S, Hartley, W G, Palmese, A, Lahav, O, Whiteway, L, Bluck, A F L, Alarcon, A, Amon, A, Bechtol, K, Bernstein, G M, Carnero Rosell, A, Carrasco Kind, M, Choi, A, Eckert, K, Everett, S, Gruen, D, Gruendl, R A, Harrison, I, Huff, E M, Kuropatkin, N, Sevilla-Noarbe, I, Sheldon, E, Yanny, B, Aguena, M, Allam, S, Bacon, D, Bertin, E, Bhargava, S, Brooks, D, Carretero, J, Castander, F J, Conselice, C, Costanzi, M, Crocce, M, da Costa, L N, Pereira, M E S, De Vicente, J, Desai, S, Diehl, H T, Drlica-Wagner, A, Evrard, A E, Ferrero, I, Flaugher, B, Fosalba, P, Frieman, J, García-Bellido, J, Gaztanaga, E, Gerdes, D W, Gschwend, J, Gutierrez, G, Hinton, S R, Hollowood, D L, Honscheid, K, James, D J, Kuehn, K, Lima, M, Lin, H, Maia, M A G, Melchior, P, Menanteau, F, Miquel, R, Morgan, R, Paz-Chinchón, F, Plazas, A A, Sanchez, E, Scarpine, V, Schubnell, M, Serrano, S, Smith, M, Suchyta, E, Tarle, G, Thomas, D, To, C, Varga, T N, Wilkinson, R D
Format Journal Article
LanguageEnglish
Norwegian
Published United States Oxford University Press 01.04.2021
Oxford University Press (OUP): Policy P - Oxford Open Option A
Royal Astronomical Society
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:ABSTRACT We demonstrate that highly accurate joint redshift–stellar mass probability distribution functions (PDFs) can be obtained using the Random Forest (RF) machine learning (ML) algorithm, even with few photometric bands available. As an example, we use the Dark Energy Survey (DES), combined with the COSMOS2015 catalogue for redshifts and stellar masses. We build two ML models: one containing deep photometry in the griz bands, and the second reflecting the photometric scatter present in the main DES survey, with carefully constructed representative training data in each case. We validate our joint PDFs for 10 699 test galaxies by utilizing the copula probability integral transform and the Kendall distribution function, and their univariate counterparts to validate the marginals. Benchmarked against a basic set-up of the template-fitting code bagpipes, our ML-based method outperforms template fitting on all of our predefined performance metrics. In addition to accuracy, the RF is extremely fast, able to compute joint PDFs for a million galaxies in just under 6 min with consumer computer hardware. Such speed enables PDFs to be derived in real time within analysis codes, solving potential storage issues. As part of this work we have developed galpro1, a highly intuitive and efficient python package to rapidly generate multivariate PDFs on-the-fly. galpro is documented and available for researchers to use in their cosmology and galaxy evolution studies.
Bibliography:NFR/287772
FERMILAB-PUB-20-653-AE; DES-2020-0542; arXiv:2012.05928
USDOE Office of Science (SC), High Energy Physics (HEP)
AC02-07CH11359; SC0019193; AC02-76SF00515; ST/P006736/1; ST/R000476/1; ST/M001334/1; 695671; TESTDE FP7/291329; AC05-00OR22725; AC02-06CH11357
USDOE Office of Science (SC), Basic Energy Sciences (BES)
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
National Science Foundation (NSF)
European Research Council (ERC)
United Kingdom Science and Technology Facilities Council
ISSN:0035-8711
1365-2966
DOI:10.1093/mnras/stab164