Toggle Main Menu Toggle Search

Open Access padlockePrints

Improved photometric redshift estimations through self-organizing map-based data augmentation

Lookup NU author(s): Dr Marika AsgariORCiD

Downloads


Licence

This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).


Abstract

© The Author(s) 2025. Published by Oxford University Press on behalf of Royal Astronomical Society. We introduce a framework for the enhanced estimation of photometric redshifts using self-organizing maps (SOMs). Our method projects galaxy spectral energy distributions (SEDs) onto a 2D map, identifying regions that are sparsely sampled by existing spectroscopic observations. These undersampled areas are then augmented with simulated galaxies, yielding a more representative spectroscopic training data set. To assess the efficacy of this SOM-based data augmentation in the context of the forthcoming Legacy Survey of Space and Time (LSST), we employ mock galaxy catalogues from the OpenUniverse2024 project and generate synthetic data sets that mimic the expected photometric selections of LSST after one (Y1) and ten (Y10) years of observation. We construct 501 degraded realizations of synthetic spectroscopic surveys by sampling galaxy colours, magnitudes, redshifts, and spectroscopic success rates, in order to emulate the diverse compilation of spectroscopic data sets that may exist for LSST analysis. Augmenting the degraded mock data sets with simulated galaxies from the independent CosmoDC2 catalogues significantly improves the performance of our photometric-redshift estimates – particularly at high redshift (Formula presented) – even in the presence of differences in the underlying galaxy SED modelling between the two catalogues. This improvement is manifested in notably reduced systematic biases and a decrease in catastrophic failures by up to approximately a factor of 2, along with a reduction in information loss in the conditional density estimations. These results underscore the effectiveness of SOM-based augmentation in refining photometric redshift estimation, thereby enabling more robust analyses in cosmology and astrophysics for the NSF-DOE Vera C. Rubin Observatory.


Publication metadata

Author(s): Zhang Y-H, Zuntz J, Moskowitz I, Gawiser E, Kuijken K, Asgari M, Hoekstra H, Malz AI, Yan Z, Zhang T

Publication type: Article

Publication status: Published

Journal: Monthly Notices of the Royal Astronomical Society

Year: 2026

Volume: 547

Issue: 4

Print publication date: 01/04/2026

Online publication date: 19/12/2025

Acceptance date: 11/12/2025

Date deposited: 06/05/2026

ISSN (print): 0035-8711

ISSN (electronic): 1365-2966

Publisher: Oxford University Press

URL: https://doi.org/10.1093/mnras/staf2226

DOI: 10.1093/mnras/staf2226

Data Access Statement: All scripts associated with this study are publicly available in the GitHub repository SOMZCloud (https://github.com/CosmoCloudZhang/SOMZCloud). The generated data sets and accompanying products–including photometric redshift estimates–have been deposited in the LSST DESC Community File System (CFS) at the National Energy Research Scientific Computing Center (NERSC).


Altmetrics

Altmetrics provided by Altmetric


Share