CCPortal
DOI10.1186/s13321-016-0164-0
An ensemble model of QSAR tools for regulatory risk assessment
Pradeep, Prachi1; Povinelli, Richard J.2; White, Shannon4; Merrill, Stephen J.3
发表日期2016-09-22
ISSN1758-2946
卷号8
英文摘要

Quantitative structure activity relationships (QSARs) are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR predictions can be used for chemical risk assessment for protection of human and environmental health, which makes them interesting to regulators, especially in the absence of experimental data. For compatibility with regulatory use, QSAR models should be transparent, reproducible and optimized to minimize the number of false negatives. In silico QSAR tools are gaining wide acceptance as a faster alternative to otherwise time-consuming clinical and animal testing methods. However, different QSAR tools often make conflicting predictions for a given chemical and may also vary in their predictive performance across different chemical datasets. In a regulatory context, conflicting predictions raise interpretation, validation and adequacy concerns. To address these concerns, ensemble learning techniques in the machine learning paradigm can be used to integrate predictions from multiple tools. By leveraging various underlying QSAR algorithms and training datasets, the resulting consensus prediction should yield better overall predictive ability. We present a novel ensemble QSAR model using Bayesian classification. The model allows for varying a cut-off parameter that allows for a selection in the desirable trade-off between model sensitivity and specificity. The predictive performance of the ensemble model is compared with four in silico tools (Toxtree, Lazar, OECD Toolbox, and Danish QSAR) to predict carcinogenicity for a dataset of air toxins (332 chemicals) and a subset of the gold carcinogenic potency database (480 chemicals). Leaveone- out cross validation results show that the ensemble model achieves the best trade-off between sensitivity and specificity (accuracy: 83.8 % and 80.4 %, and balanced accuracy: 80.6 % and 80.8 %) and highest inter-rater agreement [kappa (kappa): 0.63 and 0.62] for both the datasets. The ROC curves demonstrate the utility of the cut-off feature in the predictive ability of the ensemble model. This feature provides an additional control to the regulators in grading a chemical based on the severity of the toxic endpoint under study.


英文关键词Computational toxicology;In silico QSAR tools;Hybrid QSAR models;Ensemble models;Risk assessment
语种英语
WOS记录号WOS:000383831100001
来源期刊JOURNAL OF CHEMINFORMATICS
来源机构美国环保署
文献类型期刊论文
条目标识符http://gcip.llas.ac.cn/handle/2XKMVOVA/59352
作者单位1.US EPA, Natl Ctr Computat Toxicol ORISE Fellow, Res Triangle Pk, NC 27711 USA;
2.Marquette Univ, Dept Elect & Comp Engn, Milwaukee, WI 53233 USA;
3.Marquette Univ, Dept Math Stat & Comp Sci, Milwaukee, WI 53233 USA;
4.Georgetown Univ, Med Ctr, Lombardi Comprehens Canc Ctr, Washington, DC 20007 USA
推荐引用方式
GB/T 7714
Pradeep, Prachi,Povinelli, Richard J.,White, Shannon,et al. An ensemble model of QSAR tools for regulatory risk assessment[J]. 美国环保署,2016,8.
APA Pradeep, Prachi,Povinelli, Richard J.,White, Shannon,&Merrill, Stephen J..(2016).An ensemble model of QSAR tools for regulatory risk assessment.JOURNAL OF CHEMINFORMATICS,8.
MLA Pradeep, Prachi,et al."An ensemble model of QSAR tools for regulatory risk assessment".JOURNAL OF CHEMINFORMATICS 8(2016).
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Pradeep, Prachi]的文章
[Povinelli, Richard J.]的文章
[White, Shannon]的文章
百度学术
百度学术中相似的文章
[Pradeep, Prachi]的文章
[Povinelli, Richard J.]的文章
[White, Shannon]的文章
必应学术
必应学术中相似的文章
[Pradeep, Prachi]的文章
[Povinelli, Richard J.]的文章
[White, Shannon]的文章
相关权益政策
暂无数据
收藏/分享

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。