Quantitative Structure Activity Relationship (QSAR) analysis was implemented for the prediction of elimination half-life for environmental chemicals. The experimental values of elimination half-life and two sets of molecular descriptors, the Linear Free Energy Relationship (LFER) and the PaDEL descriptors, were collected for 199 environmental chemicals and used as input data for the development of QSAR models. The initial datasets were split to the training and the prediction set. Principal Component Analysis (PCA) was implemented for the distribution of chemical compounds, while a genetic algorithm was then used for the selection of the optimal set of descriptors. The datasets were analyzed using Multiple Linear Regression (MLR) and Artificial Neural Networks (ANN). The fitting performance (R2) of the best models, using LFER and PaDEL descriptors, was 0.80 and 0.87, respectively. The Applicability Domain (AD) of the developed models was determined indicating that there were no outliers and verifying their reliability. The developed QSARs were then applied to several compounds with unknown values of elimination half-life. In conclusion, the proposed models were successfully evaluated for their fitting capacity, their validity and applicability and were found to be capable of predicting the elimination half-life of “data poor” chemical compounds within their applicability domain.

Modeling of the total elimination half life for environmental chemicals

Karakitsios S.;Sarigiannis D
2022-01-01

Abstract

Quantitative Structure Activity Relationship (QSAR) analysis was implemented for the prediction of elimination half-life for environmental chemicals. The experimental values of elimination half-life and two sets of molecular descriptors, the Linear Free Energy Relationship (LFER) and the PaDEL descriptors, were collected for 199 environmental chemicals and used as input data for the development of QSAR models. The initial datasets were split to the training and the prediction set. Principal Component Analysis (PCA) was implemented for the distribution of chemical compounds, while a genetic algorithm was then used for the selection of the optimal set of descriptors. The datasets were analyzed using Multiple Linear Regression (MLR) and Artificial Neural Networks (ANN). The fitting performance (R2) of the best models, using LFER and PaDEL descriptors, was 0.80 and 0.87, respectively. The Applicability Domain (AD) of the developed models was determined indicating that there were no outliers and verifying their reliability. The developed QSARs were then applied to several compounds with unknown values of elimination half-life. In conclusion, the proposed models were successfully evaluated for their fitting capacity, their validity and applicability and were found to be capable of predicting the elimination half-life of “data poor” chemical compounds within their applicability domain.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12076/15043
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact