TY - JOUR
T1 - Revealing the Milky Way's most recent major merger with a Gaia EDR3 catalogue of machine-learned line-of-sight velocities
AU - Dropulic, Adriana
AU - Liu, Hongwan
AU - Ostdiek, Bryan
AU - Lisanti, Mariangela
N1 - Funding Information:
The authors would like to thank V. Belokurov, T. Cohen, J. Han, Y. Kahn, L. Necib, D. Roberts, and S. Yaida for fruitful conversations. ML gratefully acknowledges financial support from the Schmidt DataX Fund at Princeton University made possible through a major gift from the Schmidt Futures Foundation. BO was supported in part by the U.S. Department of Energy under contract DE-SC0013607 and DE-SC0020223. HL and ML are supported by the U.S. Department of Energy under Award Number DE-SC0007968. Additionally, HL is supported by National Science Foundation grant PHY-1915409 and the Simons Foundation. AD is supported by National Science FoundationGraduate Research Fellowship Program under grant number DGE-2039656. This work was supported by the National Science Foundation under cooperative agreement PHY-2019786 (The NSF AI Institute for Artificial Intelligence and Fundamental Interactions, http://iaifi.org/ ) and was performed in part at the Aspen Center for Physics, which is supported by National Science Foundation grant PHY-1607611. The work presented in this paper was performed on computational resources managed and supported by Princeton Research Computing. It made use of the astropy (Robitaille et al. ), gala (M. Price-Whelan ), corner (Foreman-Mackey ), h5py (Collette ), IPython (Perez & Granger ), Jupyter (Kluyver et al. ), matplotlib (Hunter ), NumPy (van der Walt, Colbert & Varoquaux ), pandas (Wes McKinney ), and SciPy (Jones et al. ) software packages.
Publisher Copyright:
© 2023 The Author(s) Published by Oxford University Press on behalf of Royal Astronomical Society.
PY - 2023/5/1
Y1 - 2023/5/1
N2 - Machine learning can play a powerful role in inferring missing line-of-sight velocities from astrometry in surveys such as Gaia. In this paper, we apply a neural network to Gaia Early Data Release 3 (EDR3) and obtain line-of-sight velocities and associated uncertainties for ∼92 million stars. The network, which takes as input a star's parallax, angular coordinates, and proper motions, is trained and validated on ∼6.4 million stars in Gaia with complete phase-space information. The network's uncertainty on its velocity prediction is a key aspect of its design; by properly convolving these uncertainties with the inferred velocities, we obtain accurate stellar kinematic distributions. As a first science application, we use the new network-completed catalogue to identify candidate stars that belong to the Milky Way's most recent major merger, Gaia-Sausage-Enceladus (GSE). We present the kinematic, energy, angular momentum, and spatial distributions of the ∼450 000 GSE candidates in this sample, and also study the chemical abundances of those with cross matches to GALAH and APOGEE. The network's predictive power will only continue to improve with future Gaia data releases as the training set of stars with complete phase-space information grows. This work provides a first demonstration of how to use machine learning to exploit high-dimensional correlations on data to infer line-of-sight velocities, and offers a template for how to train, validate, and apply such a neural network when complete observational data is not available.
AB - Machine learning can play a powerful role in inferring missing line-of-sight velocities from astrometry in surveys such as Gaia. In this paper, we apply a neural network to Gaia Early Data Release 3 (EDR3) and obtain line-of-sight velocities and associated uncertainties for ∼92 million stars. The network, which takes as input a star's parallax, angular coordinates, and proper motions, is trained and validated on ∼6.4 million stars in Gaia with complete phase-space information. The network's uncertainty on its velocity prediction is a key aspect of its design; by properly convolving these uncertainties with the inferred velocities, we obtain accurate stellar kinematic distributions. As a first science application, we use the new network-completed catalogue to identify candidate stars that belong to the Milky Way's most recent major merger, Gaia-Sausage-Enceladus (GSE). We present the kinematic, energy, angular momentum, and spatial distributions of the ∼450 000 GSE candidates in this sample, and also study the chemical abundances of those with cross matches to GALAH and APOGEE. The network's predictive power will only continue to improve with future Gaia data releases as the training set of stars with complete phase-space information grows. This work provides a first demonstration of how to use machine learning to exploit high-dimensional correlations on data to infer line-of-sight velocities, and offers a template for how to train, validate, and apply such a neural network when complete observational data is not available.
KW - catalogues
KW - Galaxy: kinematics and dynamics
KW - Galaxy: structure
KW - methods: statistical
KW - techniques: radial velocities
UR - http://www.scopus.com/inward/record.url?scp=85160944713&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85160944713&partnerID=8YFLogxK
U2 - 10.1093/mnras/stad209
DO - 10.1093/mnras/stad209
M3 - Article
AN - SCOPUS:85160944713
SN - 0035-8711
VL - 521
SP - 1633
EP - 1645
JO - Monthly Notices of the Royal Astronomical Society
JF - Monthly Notices of the Royal Astronomical Society
IS - 2
ER -