Gary Moss g.p.j.moss@keele.ac.uk
Development of a Gaussian Process - feature selection model to characterise (poly)dimethylsiloxane (Silastic (R)) membrane permeation
Moss
Authors
Abstract
Objectives The current study aims to determine the effect of physicochemical descriptor selection on models of polydimethylsiloxane permeation. Methods A total of 2942 descriptors were calculated for a data set of 77 chemicals. Data were processed to remove redundancy, single values, imbalanced and highly correlated data, yielding 1363 relevant descriptors. For four independent test sets, feature selection methods were applied and modelled via a variety of Machine Learning methods. Key findings Two sets of molecular descriptors which can provide improved predictions, compared to existing models, have been identified. Best permeation predictions were found with Gaussian Process methods. The molecular descriptors describe lipophilicity, partial charge and hydrogen bonding as key determinants of PDMS permeation. Conclusions This study highlights important considerations in the development of relevant models and in the construction and use of the data sets used in such studies, particularly that highly correlated descriptors should be removed from data sets. Predictive models are improved by the methodology adopted in this study, notably the systematic evaluation of descriptors, rather than simply using any and all available descriptors, often based empirically on in vitro experiments. Such findings also have clear relevance to a number of other fields.
Citation
Moss. (2020). Development of a Gaussian Process - feature selection model to characterise (poly)dimethylsiloxane (Silastic (R)) membrane permeation. Journal of Pharmacy and Pharmacology, https://doi.org/10.1111/jphp.13263
Acceptance Date | Mar 8, 2020 |
---|---|
Publication Date | Apr 4, 2020 |
Journal | Journal of Pharmacy and Pharmacology |
Publisher | David Publishing |
DOI | https://doi.org/10.1111/jphp.13263 |
Keywords | data set design; feature selection; Gaussian Process Regression; machine learning; polydimethylsiloxane |
Publisher URL | https://onlinelibrary.wiley.com/doi/full/10.1111/jphp.13263 |
Files
jphp.13263.pdf
(3.3 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/
You might also like
Downloadable Citations
About Keele Repository
Administrator e-mail: research.openaccess@keele.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search