The miniJPAS survey: star-galaxy classification using machine learning

Bibcode

2021A&A...645A..87B

DOI

10.1051/0004-6361/202038986

Baqui, P. O.; Marra, V.; Casarini, L.; Angulo, R.; Díaz-García, L. A.; Hernández-Monteagudo, C.; Lopes, P. A. A.; López-Sanjuan, C.; Muniesa, D.; Placco, V. M.; Quartin, M.; Queiroz, C.; Sobral, D.; Solano, E.; Tempel, E.; Varela, J.; Vílchez, J. M.; Abramo, R.; Alcaniz, J.; Benitez, N.; Bonoli, S.; Carneiro, S.; Cenarro, A. J.; Cristóbal-Hornillos, D.; de Amorim, A. L.; de Oliveira, C. M.; Dupke, R.; Ederoclite, A.; González Delgado, R. M.; Marín-Franch, A.; Moles, M.; Vázquez Ramió, H.; Sodré, L.; Taylor, K.

Bibliographical reference

Astronomy and Astrophysics

Advertised on:

2021

Journal

Astronomy and Astrophysics

Number of authors

IAC number of authors

Citations

Refereed citations

Description

Context. Future astrophysical surveys such as J-PAS will produce very large datasets, the so-called "big data", which will require the deployment of accurate and efficient machine-learning (ML) methods. In this work, we analyze the miniJPAS survey, which observed about ∼1 deg2 of the AEGIS field with 56 narrow-band filters and 4 ugri broad-band filters. The miniJPAS primary catalog contains approximately 64 000 objects in the r detection band (magAB ≲ 24), with forced-photometry in all other filters.
Aims: We discuss the classification of miniJPAS sources into extended (galaxies) and point-like (e.g., stars) objects, which is a step required for the subsequent scientific analyses. We aim at developing an ML classifier that is complementary to traditional tools that are based on explicit modeling. In particular, our goal is to release a value-added catalog with our best classification.
Methods: In order to train and test our classifiers, we cross-matched the miniJPAS dataset with SDSS and HSC-SSP data, whose classification is trustworthy within the intervals 15 ≤ r ≤ 20 and 18.5 ≤ r ≤ 23.5, respectively. We trained and tested six different ML algorithms on the two cross-matched catalogs: K-nearest neighbors, decision trees, random forest (RF), artificial neural networks, extremely randomized trees (ERT), and an ensemble classifier. This last is a hybrid algorithm that combines artificial neural networks and RF with the J-PAS stellar and galactic loci classifier. As input for the ML algorithms we used the magnitudes from the 60 filters together with their errors, with and without the morphological parameters. We also used the mean point spread function in the r detection band for each pointing.
Results: We find that the RF and ERT algorithms perform best in all scenarios. When the full magnitude range of 15 ≤ r ≤ 23.5 is analyzed, we find an area under the curve AUC = 0.957 with RF when photometric information alone is used, and AUC = 0.986 with ERT when photometric and morphological information is used together. When morphological parameters are used, the full width at half maximum is the most important feature. When photometric information is used alone, we observe that broad bands are not necessarily more important than narrow bands, and errors (the width of the distribution) are as important as the measurements (central value of the distribution). In other words, it is apparently important to fully characterize the measurement.
Conclusions: ML algorithms can compete with traditional star and galaxy classifiers; they outperform the latter at fainter magnitudes (r ≳ 21). We use our best classifiers, with and without morphology, in order to produce a value-added catalog.

Full Table 2 is only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (ftp://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/cat/J/A+A/645/A87

The catalog is available at http://j-pas.org/datareleases via the ADQL table minijpas.StarGalClass. The ML models are available at http://github.com/J-PAS-collaboration/StarGalClass-MachineLearning.

It may interest you

Refereed

Connecting laboratory and spectroscopic observations of aerospace materials to characterize the reflectivity of artificial space objects and debris in LEO regimes

Increasing space activities, especially in low-Earth orbits (LEO), lead to more orbital debris and night-sky pollution. Spectroscopic analysis of light reflected from artificial space objects can aid to identify dominant surface materials and their reflective properties. Satellites interact with sunlight in diffuse and specular reflections, which

Žilková, Danica et al.

Advertised on:

11

2025
Bibcode

2025AcAau.236..479Z

Citations

0

Read more
Refereed

E-INSPIRE – I. Bridging the gap with the local Universe: stellar population of a statistical sample of ultra-compact massive galaxies at z < 0.3

This paper presents the first effort to Extend the Investigation of Stellar Populations In RElics (E-INSPIRE). We present a catalogue of 430 spectroscopically confirmed ultra-compact massive galaxies (UCMGs) from the Sloan Digital Sky Survey at redshifts $0.01< z< 0.3$. This increases the original INSPIRE sample eightfold, bridging the gap with the

Mills, John et al.

Advertised on:

8

2025
Bibcode

2025MNRAS.541.2440M

Citations

0

Read more
Refereed

GRB 221009A: Observations with LST-1 of CTAO and Implications for Structured Jets in Long Gamma-Ray Bursts

GRB 221009A is the brightest gamma-ray burst (GRB) observed to date. Extensive observations of its afterglow emission across the electromagnetic spectrum were performed, providing the first strong evidence of a jet with a nontrivial angular structure in a long GRB. We carried out an extensive observation campaign in very-high-energy (VHE) gamma

Abe, K. et al.

Advertised on:

8

2025
Bibcode

2025ApJ...988L..42A

Citations

0

Read more