Browse by author
Lookup NU author(s): Maymoon Madkhali, Conor Rankine, Professor Thomas Penfold
This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
An important consideration when developing a deep neural network (DNN) for the prediction of molecular properties is the representation of the chemical space. Herein we explore the effect of the representation on the performance of our DNN engineered to predict Fe K-edge X-ray absorption near-edge structure (XANES) spectra, and address the question: How important is the choice of representation for the local environment around an arbitrary Fe absorption site? Using two popular representations of chemical space—the Coulomb matrix (CM) and pair-distribution/radial distribution curve (RDC)—we investigate the effect that the choice of representation has on the performance of our DNN. While CM and RDC featurisation are demonstrably robust descriptors, it is possible to obtain a smaller mean squared error (MSE) between the target and estimated XANES spectra when using RDC featurisation, and converge to this state a) faster and b) using fewer data samples. This is advantageous for future extension of our DNN to other X-ray absorption edges, and for reoptimisation of our DNN to reproduce results from higher levels of theory. In the latter case, dataset sizes will be limited more strongly by the resource-intensive nature of the underlying theoretical calculations.
Author(s): Madkhali MMM, Rankine CD, Penfold TJ
Publication type: Article
Publication status: Published
Journal: Molecules
Year: 2020
Volume: 25
Issue: 11
Online publication date: 11/06/2020
Acceptance date: 08/06/2020
Date deposited: 11/06/2020
ISSN (print): 1431-5165
ISSN (electronic): 1420-3049
Publisher: MDPI AG
URL: https://doi.org/10.3390/molecules25112715
DOI: 10.3390/molecules25112715
Altmetrics provided by Altmetric