
Predicting adsorption capacities of pharmaceutical pollutants using chemoinformatics and machine learning techniques

Lookup NU author(s): Maamar Laidi, Dr Jie Zhang

Downloads

Full text for this publication is not currently held within this repository. Alternative links are provided below where available.


Abstract

© The Author(s), under exclusive licence to Springer Nature B.V. 2025. Pharmaceutical pollutants are increasingly recognized as emerging contaminants in aquatic environments. Their persistence, bioactivity, and resistance to conventional treatment processes raise ecological and human health concerns, including the spread of antimicrobial resistance. Adsorption has emerged as a promising polishing step for their removal, but adsorption capacity (Qe, mg/g) varies widely depending on molecular structure and operational conditions, making predictive modeling essential. In this work, we developed machine learning models to predict adsorption capacities for Aspirin, Caffeine, Carbamazepine, Ketoprofen, Sulfamethoxazole, Nimesulide, and Paracetamol using chemoinformatics descriptors derived from SMILES strings and experimental inputs, including equilibrium concentration (Ce), initial concentration (C0), temperature, and contact time. Feature reduction with LassoCV and multicollinearity analysis yielded a compact, chemically interpretable descriptor set. Support Vector Regression (SVR), Extreme Gradient Boosting (XGB), and Artificial Neural Networks (ANN) were optimized with Optuna and evaluated using cross-validation. XGB delivered the best predictive performance (R² = 0.997, RMSE = 2.62 mg/g), outperforming SVR and ANN. SHAP analysis highlighted the influence of charge-partitioned surface areas and nitro functionalities on adsorption outcomes. The best-performing model was deployed in a Streamlit application, enabling predictions of Qe from SMILES and experimental conditions with built-in applicability-domain checks.
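
For readers unfamiliar with the reported metrics, the following is a minimal sketch of how a coefficient of determination and an RMSE of this kind are typically computed, together with one simple form an applicability-domain check could take. It is not the authors' code: the function names, the range-based domain rule, and all numbers are assumptions chosen for illustration, and the abstract does not state which applicability-domain method the Streamlit application uses.

```python
import math


def rmse(y_true, y_pred):
    """Root-mean-square error, in the units of the target (mg/g for Qe)."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def in_domain(x, train_rows):
    """Hypothetical range-based check (an assumption, not the paper's method):
    a query is in-domain if every feature lies within the training min/max."""
    for j, value in enumerate(x):
        column = [row[j] for row in train_rows]
        if not (min(column) <= value <= max(column)):
            return False
    return True


# Made-up illustrative values, not data from the paper.
qe_measured = [10.0, 20.0, 30.0, 40.0]
qe_predicted = [12.0, 18.0, 33.0, 39.0]
print("RMSE (mg/g):", rmse(qe_measured, qe_predicted))
print("R2:", r2(qe_measured, qe_predicted))

# Hypothetical training features: [initial concentration, temperature in K].
train = [[1.0, 298.0], [5.0, 318.0]]
print(in_domain([3.0, 308.0], train))
print(in_domain([6.0, 308.0], train))
```

Because RMSE carries the units of Qe, it can be read directly against the measured capacities, whereas R² is unitless and depends on the spread of the measured values.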


Publication metadata

Author(s): Bouzemlal H, Hentabli M, Laidi M, Laidani Y, Kouider Amar M, Ibrir A, Zhang J

Publication type: Article

Publication status: Published

Journal: Environmental Geochemistry and Health

Year: 2026

Volume: 48

Online publication date: 10/12/2025

Acceptance date: 03/12/2025

ISSN (print): 0269-4042

ISSN (electronic): 1573-2983

Publisher: Springer Nature

URL: https://doi.org/10.1007/s10653-025-02934-7

DOI: 10.1007/s10653-025-02934-7

PubMed id: 41369995

