Toggle Main Menu Toggle Search

Open Access padlockePrints

Towards Democratising Urban Sustainability Data: An LLM-Enabled Natural Language Interface for Smart-City Air-Quality Decision Support

Lookup NU author(s): Adam Booth, Professor Philip JamesORCiD, Dr Ellis SolaimanORCiD

Downloads


Licence

This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).


Abstract

Urban sustainability management increasingly relies on large volumes of heterogeneous environmental data generated by smart city infrastructures. While these data streams offer significant potential for evidence-informed policymaking, environmental governance, and public engagement, their effective use is often constrained by technical barriers and persistent data-skills gaps among non-specialist stakeholders. Using urban air quality as a policy-relevant and data-rich sustainability domain, this paper presents a proof-of-concept dashboard that investigates how large language model (LLM)-enabled natural language interfaces can lower barriers to querying, analysing, and visualising urban environmental data. The system translates natural language questions into executable database queries and automatically generates visualisations over air-quality datasets. A controlled comparative benchmark of proprietary and open-source LLMs is conducted to assess their suitability for text-to-SQL generation in this application context. In this benchmark, proprietary GPT-based models achieved the highest observed query accuracy and robustness among the evaluated models, highlighting practical trade-offs between performance, transparency, reproducibility, and long-term governance. This paper makes a twofold contribution: First, it demonstrates the technical feasibility of an LLM-enabled natural language access layer for smart-city environmental data. Second, it uses the implemented system as a concrete case through which to analyse the trust, transparency, inclusivity, vendor-dependency, and data-quality challenges that arise when such systems are incorporated into sustainability-oriented decision-support workflows. The study provides a transferable design contribution for urban sustainability data access by showing how natural language interfaces, model benchmarking, automated visualisation, and governance-aware system design can be combined to support more inclusive interaction with complex environmental datasets.


Publication metadata

Author(s): Booth A, James P, Solaiman E

Publication type: Article

Publication status: Published

Journal: Sustainability

Year: 2026

Volume: 18

Issue: 11

Print publication date: 01/06/2026

Online publication date: 01/06/2026

Acceptance date: 26/05/2026

Date deposited: 05/06/2026

ISSN (electronic): 2071-1050

Publisher: MDPI AG

URL: https://doi.org/10.3390/su18115506

DOI: 10.3390/su18115506

Data Access Statement: The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.


Altmetrics

Altmetrics provided by Altmetric


Funding

Funder referenceFunder name
UKRI EPSRC) Centre for Doctoral Training in Geospatial Systems under grant number EP/S023577/1.

Share