Toggle Main Menu Toggle Search

Open Access padlockePrints

Machine Learning Approach to Redefining Risk in Railway Drainage Systems

Lookup NU author(s): Dr Manuel HerreraORCiD

Downloads


Licence

This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).


Abstract

Inadequate railway drainage can trigger flooding, accelerate track degradation, and ultimately cause the sudden failure of tracks, slopes, and embankments. Routine visual inspections score each drainage asset’s service-level on a scale of one to five, yet existing prediction studies typically assume a linear mapping between readily measured drainage-asset characteristics (e.g., pipe material, shape, slope, and local rainfall) and those ordinal service-level grades. Such linearity greatly oversimplifies the nonlinear, multifactor deterioration processes that govern drainage performance. This study introduce a risk-based redefinition of service level based on the direction and magnitude of change between consecutive inspections (low, medium, or high risk). Using this reformulated target, we build and compare three supervised machine learning models, namely, multinomial logistic regression, random forests, and artificial neural networks, to predict the drainage asset’s future risk category from its physical, environmental, and operational attributes. The modeling framework explicitly tackles practical data challenges: misclassification bias in visual grades, repeated measurements of the same asset, severe class imbalance, and the categorical nature of most variables. The approach is demonstrated on two UK main-line routes (London–Cardiff and Edinburgh–Glasgow) comprising approximately 10,000 drainage assets. After careful oversampling, random forests delivered the highest recall for high-risk assets, outperforming the linear baseline and the other nonlinear model. Results confirm that abandoning the linearity assumption, redefining risk to capture service-level dynamics and applying tailored preprocessing markedly improved predictive accuracy and interpretability. This study therefore provides railway owners with a data-driven tool to prioritize drainage maintenance and reduce network disruption.


Publication metadata

Author(s): Okazaki S, Herrera M, Sasidharan M, McNaughton J, Raja J, Parlikad AK

Publication type: Article

Publication status: Published

Journal: Journal of Infrastructure Systems

Year: 2026

Volume: 32

Issue: 1

Print publication date: 01/03/2026

Online publication date: 02/01/2026

Acceptance date: 20/10/2025

Date deposited: 03/01/2026

ISSN (print): 1076-0342

ISSN (electronic): 1943-555X

Publisher: American Society of Civil Engineers

URL: https://doi.org/10.1061/JITSE4.ISENG-2614

DOI: 10.1061/JITSE4.ISENG-2614

ePrints DOI: 10.57711/sn32-bk50

Data Access Statement: Some or all data, models, or code used during the study were provided by Network Rail. Direct requests for these materials may be made to the provider as indicated in the Acknowledgements.


Altmetrics

Altmetrics provided by Altmetric


Funding

Funder referenceFunder name
920035 (Centre for Smart Infrastructure and Construction)
Engineering and Physical Science Research Council (EPSRC)
EP/Y024257/1
EP/N021614/1
Innovate UK

Share