Read Across Prediction of Estrogenicity for Hindered Phenols 2017 Data
공공데이터포털
Read-across is an important data gap filling technique used within category and analog approaches for regulatory hazard identification and risk assessment. Although much technical guidance is available that describes how to develop category/analog approaches, practical principles to evaluate and substantiate analog validity (suitability) are still lacking. This case study uses hindered phenols as an example chemical class to determine: (1) the capability of three structure fingerprint/descriptor methods (PubChem, ToxPrints and MoSS MCSS) to identify analogs for read-across to predict Estrogen Receptor (ER) binding activity and, (2) the utility of data confidence measures, physicochemical properties, and chemical R-group properties as filters to improve ER binding predictions. The training dataset comprised 462 hindered phenols and 257 non- hindered phenols. For each chemical of interest (target), source analogs were identified from two datasets (hindered and non-hindered phenols) that had been characterized by a fingerprint/descriptor method and by two cut-offs: (1) minimum similarity distance (range: 0.1 - 0.9) and, (2) N closest analogs (range: 1 - 10). Analogs were then filtered using: (1) physicochemical properties of the phenol (termed global filtering) and, (2) physicochemical properties of the R-groups neighboring the active hydroxyl group (termed local filtering). A read-across prediction was made for each target chemical on the basis of a majority vote of the N closest analogs. The results demonstrate that: (1) concordance in ER activity increases with structural similarity, regardless of the structure fingerprint/descriptor method, (2) increased data confidence significantly improves read-across predictions, and (3) filtering analogs using global and local properties can help identify more suitable analogs. This case study illustrates that the quality of the underlying experimental data and use of endpoint relevant chemical descriptors to evaluate source analogs are critical to achieving robust read-across predictions. This dataset is associated with the following publication: Pradeep, P., K. Mansouri, G. Patlewicz, and R. Judson. (Computational Toxicology) A systematic evaluation of analogs and automated read-across prediction of estrogenicity: A case study using hindered phenols. Computational Toxicology. Elsevier B.V., Amsterdam, NETHERLANDS, 4: 22-30, (2017).
Generalised Read-Across (GenRA) refinements
공공데이터포털
These new analysis builds on the baseline GenRA approach and presents a proof of concept of how other contexts of similarity namely physchem can be implemented into a search strategy for identification of analogues and how this impacts performance of read-across. Chemicals Involved: Same ToxRef dataset as used in the original GenRA manuscript. This dataset is associated with the following publication: Helman, G., I. Shah, and G. Patlewicz. Extending the Generalised Read-Across approach (GenRA): A systematic analysis of the impact of physicochemical property information on read-across performance. Computational Toxicology. Elsevier B.V., Amsterdam, NETHERLANDS, 8: 34-50, (2018).
Analogue search results for p,p'-DDD
공공데이터포털
The dataset contains the outputs for the analogue searches conducted for the chemical of interest, p,p'-DDD. This dataset is associated with the following publication: Lizarraga, L., J. Dean, J. Kaiser, S. Wesselkamper, J. Lambert, and J. Zhao. A Case Study on the Application of An Expert-driven Read-Across Approach in Support of Quantitative Risk Assessment of p,p’-Dichlorodiphenyldichloroethane. REGULATORY TOXICOLOGY AND PHARMACOLOGY. Elsevier Science Ltd, New York, NY, USA, 103: 301-313, (2019).
Implementing in vitro bioactivity data to modernize priority setting of chemical inventories
공공데이터포털
All of the code used to analyze and report the data as well as build confidence in the approach is available as a supplementary RMarkdown report, and a tool to derive PODBioactivity and PODRead-Across is available as an RShiny web-application. The data used in the workflow are either available on public databases or are included in the supplementary material to allow for reproducibility of results. The results and output of the workflow (i.e., chemical info, PODs, etc.) are provided in the supplementary material (available as a download from the journal article). This dataset is associated with the following publication: Beal, M., M. Gagne, S. Kulkarni, G. Patlewicz, R. Thomas, and T. Barton-Maclaren. Implementing in vitro Bioactivity Data to Modernize Priority Setting of Chemical Inventories. ALTEX. Society ALTEX Edition, Kuesnacht, SWITZERLAND, 39(1): 123-139, (2022).
Generalised Read-Across Prediction using genra-py
공공데이터포털
Read-across (RAX) is a widely used data gap filling approach and the authors have developed a data-driven tool, called GenRA, to support expert-driven RAX. This work describes a stand-alone Python 3 package, called genra-py, which enables end-users to conduct hazard identification and point of departure (POD) estimation using GenRA. This dataset is associated with the following publication: Shah, I., T. Tate, and G. Patlewicz. Generalised Read-Across Prediction using genra-py. BIOINFORMATICS. Oxford University Press, Cary, NC, USA, 37(19): 3380-3381, (2021).
Quantitative Structure-Use Relationship Model thresholds for Model Validation, Domain of Applicability, and Candidate Alternative Selection
공공데이터포털
This file contains value of the model training set confusion matrix, domain of applicability evaluation based on training set to predicted chemicals structural similarity, and 75th percentile bioactivity index values for each QSUR model. This dataset is associated with the following publication: Phillips, K., J. Wambaugh, C. Grulke, K. Dionisio, and K. Isaacs. High-throughput screening of chemicals as functional substitutes using structure-based classification models. GREEN CHEMISTRY. Royal Society of Chemistry, Cambridge, UK, 19: 1063-1074, (2017).
Quantitative Structure-Use Relationship Model Predictions to evaluate Tox21 Chemicals as Functional Substitutes and Candidate Alternatives
공공데이터포털
This dataset provides a prediction for all Tox21 chemicals with available QSUR descriptors across all 41 valid QSUR models developed with FUse. This dataset is associated with the following publication: Phillips, K., J. Wambaugh, C. Grulke, K. Dionisio, and K. Isaacs. High-throughput screening of chemicals as functional substitutes using structure-based classification models. GREEN CHEMISTRY. Royal Society of Chemistry, Cambridge, UK, 19: 1063-1074, (2017).
Predict Organ Toxicity ChemResTox Data
공공데이터포털
We use a supervised machine learning strategy to systematically investigate the relative importance of study type, machine learning algorithm, and type of descriptor on predicting in vivo repeat-dose toxicity at the organ-level. A total of 985 compounds were represented using chemical structural descriptors, ToxPrint chemotype descriptors, and bioactivity descriptors from ToxCast in vitro high-throughput screening assays. Using ToxRefDB, a total of 35 target organ outcomes were identified that contained at least 100 chemicals (50 positive and 50 negative). Supervised machine learning was performed using Naïve Bayes, k-nearest neighbor, random forest, classification and regression trees, and support vector classification approaches. Model performnce was assessed based on F1 scores using five-fold cross-validation with balanced bootstrap replicates. Fixed effects modeling showed the variance in F1 scores was explained mostly by target organ outcome, followed by descriptor type, machine learning algorithm, and interactions between these three factors. A combination of bioactivity and chemical structure or chemotype descriptors were the most predictive. Model performance improved with more chemicals (up to a maximum of 24%) and these gains were correlated (ρ= 0.92) with the number of chemicals. This dataset is associated with the following publication: Liu, J., G. Patlewicz, A. Williams, R. Thomas, and I. Shah. (Chemical Research in Toxicology) Predicting organ toxicity using in vitro bioactivity data and chemical structure. CHEMICAL RESEARCH IN TOXICOLOGY. American Chemical Society, Washington, DC, USA, 30: 2046−2059, (2017).
Towards reproducible structure-based chemical categories for PFAS to inform and evaluate toxicity and toxicokinetic testing
공공데이터포털
Data and code for "Grace Patlewicz, Ann M. Richard, Antony J. Williams, Richard S. Judson, Russell S. Thomas, Towards reproducible structure-based chemical categories for PFAS to inform and evaluate toxicity and toxicokinetic testing, Computational Toxicology, Volume 24, 2022, 100250, ISSN 2468-1113, https://doi.org/10.1016/j.comtox.2022.100250.". This dataset is associated with the following publication: Patlewicz, G., A. Richard, A. Williams, R. Judson, and R. Thomas. Towards reproducible structure-based chemical categories for PFAS to inform and evaluate toxicity and toxicokinetic testing.. Computational Toxicology. Elsevier B.V., Amsterdam, NETHERLANDS, 24: 100250, (2022).