CEERS Key Paper. IX. Identifying Galaxy Mergers in CEERS NIRCam Images Using Random Forests and Convolutional Neural Networks

Rose, Caitlin; Kartaltepe, Jeyhan S.; Snyder, Gregory F.; Huertas-Company, Marc; Yung, L. Y. Aaron; Arrabal Haro, Pablo; Bagley, Micaela B.; Bisigello, Laura; Calabrò, Antonello; Cleri, Nikko J.; Dickinson, Mark; Ferguson, Henry C.; Finkelstein, Steven L.; Fontana, Adriano; Grazian, Andrea; Grogin, Norman A.; Holwerda, Benne W.; Iyer, Kartheik G.; Kewley, Lisa J.; Kirkpatrick, Allison; Kocevski, Dale D.; Koekemoer, Anton M.; Lotz, Jennifer M.; Lucas, Ray A.; Napolitano, Lorenzo; Papovich, Casey; Pentericci, Laura; Pérez-González, Pablo G.; Pirzkal, Nor; Ravindranath, Swara; Somerville, Rachel S.; Straughn, Amber N.; Trump, Jonathan R.; Wilkins, Stephen M.; Yang, Guang
Bibliographical reference: The Astrophysical Journal
Advertised on: 11/2024
Number of authors: 35
IAC number of authors: 1
Citations: 2
Refereed citations: 0
Description
A crucial yet challenging task in galaxy evolution studies is the identification of distant merging galaxies, a task that suffers from a variety of issues ranging from telescope sensitivities and limitations to the inherently chaotic morphologies of young galaxies. In this paper, we use random forests and convolutional neural networks to identify high-redshift JWST Cosmic Evolution Early Release Science Survey (CEERS) galaxy mergers. We train these algorithms on simulated 3 < z < 5 CEERS galaxies created from the IllustrisTNG subhalo morphologies and the Santa Cruz SAM light cone. We apply our models to observed CEERS galaxies at 3 < z < 5. We find that our models correctly classify ∼60%–70% of simulated merging and nonmerging galaxies; better performance on the merger class comes at the expense of misclassifying more nonmergers. We could achieve more accurate classifications, as well as test for a dependency on physical parameters such as gas fraction, mass ratio, and relative orbits, by curating larger training sets. When applied to real CEERS galaxies using visual classifications as ground truth, the random forests correctly classified 40%–60% of mergers and nonmergers at 3 < z < 4 but tended to classify most objects as nonmergers at 4 < z < 5 (misclassifying ∼70% of visually classified mergers). On the other hand, the CNNs tended to classify most objects as mergers across all redshifts (misclassifying 80%–90% of visually classified nonmergers). We investigate what features the models find most useful, as well as the characteristics of false positives and false negatives, and also calculate merger rates derived from the identifications made by the models.
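The abstract reports performance as the fraction of mergers and of nonmergers that are correctly classified, and notes that gains on the merger class cost accuracy on the nonmerger class. The sketch below illustrates that trade-off with per-class recall at different decision thresholds. It is not the authors' pipeline: the function name `per_class_recall`, the toy labels and scores, and the thresholds are all hypothetical, and the scores stand in for whatever merger probability a random forest or CNN might output.

```python
from typing import Sequence, Tuple


def per_class_recall(
    y_true: Sequence[int],
    scores: Sequence[float],
    threshold: float,
) -> Tuple[float, float]:
    """Return (merger recall, nonmerger recall) at a decision threshold.

    Labels: 1 = merger, 0 = nonmerger.
    An object is predicted to be a merger when score >= threshold.
    """
    tp = fn = tn = fp = 0
    for label, score in zip(y_true, scores):
        predicted = 1 if score >= threshold else 0
        if label == 1:
            if predicted == 1:
                tp += 1  # merger correctly identified
            else:
                fn += 1  # merger missed (false negative)
        else:
            if predicted == 0:
                tn += 1  # nonmerger correctly identified
            else:
                fp += 1  # nonmerger flagged as merger (false positive)
    merger_recall = tp / (tp + fn) if (tp + fn) else float("nan")
    nonmerger_recall = tn / (tn + fp) if (tn + fp) else float("nan")
    return merger_recall, nonmerger_recall


# Toy data for illustration only; these values are not taken from the paper.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.55, 0.35, 0.6, 0.45, 0.3, 0.1]

for threshold in (0.65, 0.5, 0.3):
    merger, nonmerger = per_class_recall(labels, scores, threshold)
    print(f"threshold={threshold}: mergers={merger:.2f}, nonmergers={nonmerger:.2f}")
```

In this toy example, lowering the threshold recovers more mergers while misclassifying more nonmergers, which is the same kind of behavior the abstract describes for the CNNs on real CEERS galaxies.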