J Chem Inf Model - GRID-based three-dimensional pharmacophores II: PharmBench, a benchmark data set for evaluating pharmacophore elucidation methods.


{ model(2656) set(1616) predict(1553) }
{ system(1976) rule(880) can(841) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ measur(2081) correl(1212) valu(896) }
{ search(2224) databas(1162) retriev(909) }
{ use(1733) differ(960) four(931) }
{ imag(2830) propos(1344) filter(1198) }
{ take(945) account(800) differ(722) }
{ method(1219) similar(1157) match(930) }
{ extract(1171) text(1153) clinic(932) }
{ featur(1941) imag(1645) propos(1176) }
{ perform(999) metric(946) measur(919) }
{ compound(1573) activ(1297) structur(1058) }
{ first(2504) two(1366) second(1323) }
{ implement(1333) system(1263) develop(1122) }
{ can(774) often(719) complex(702) }
{ studi(2440) review(1878) systemat(933) }
{ model(3480) simul(1196) paramet(876) }
{ health(1844) social(1437) communiti(874) }
{ structur(1116) can(940) graph(676) }
{ method(1969) cluster(1462) data(1082) }
{ patient(2315) diseas(1263) diabet(1191) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ chang(1828) time(1643) increas(1301) }
{ clinic(1479) use(1117) guidelin(835) }
{ method(1557) propos(1049) approach(1037) }
{ design(1359) user(1324) use(1319) }
{ general(901) number(790) one(736) }
{ case(1353) use(1143) diagnosi(1136) }
{ visual(1396) interact(850) tool(830) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ medic(1828) order(1363) alert(1069) }
{ group(2977) signific(1463) compar(1072) }
{ activ(1138) subject(705) human(624) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ estim(2440) model(1874) function(577) }
{ process(1125) use(805) approach(778) }
{ activ(1452) weight(1219) physic(1104) }
{ model(3404) distribut(989) bayesian(671) }
{ imag(1947) propos(1133) code(1026) }
{ data(1737) use(1416) pattern(1282) }
{ inform(2794) health(2639) internet(1427) }
{ imag(1057) registr(996) error(939) }
{ featur(3375) classif(2383) classifi(1994) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ framework(1458) process(801) describ(734) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ learn(2355) train(1041) set(1003) }
{ concept(1167) ontolog(924) domain(897) }
{ algorithm(1844) comput(1787) effici(935) }
{ data(1714) softwar(1251) tool(1186) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ method(984) reconstruct(947) comput(926) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ research(1085) discuss(1038) issu(1018) }
{ system(1050) medic(1026) inform(1018) }
{ import(1318) role(1303) understand(862) }
{ model(2341) predict(2261) use(1141) }
{ perform(1367) use(1326) method(1137) }
{ studi(1119) effect(1106) posit(819) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ sampl(1606) size(1419) use(1276) }
{ gene(2352) biolog(1181) express(1162) }
{ data(3008) multipl(1320) sourc(1022) }
{ intervent(3218) particip(2042) group(1664) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ use(976) code(926) identifi(902) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ survey(1388) particip(1329) question(1065) }
{ decis(3086) make(1611) patient(1517) }
{ method(2212) result(1239) propos(1039) }
{ detect(2391) sensit(1101) algorithm(908) }


To date, published pharmacophore elucidation approaches typically use a handful of data sets for validation: here, we have assembled a data set for 81 targets, containing 960 ligands aligned using their cocrystallized protein targets, to provide the experimental "gold standard". The two-dimensional structures are also assembled to remove conformational bias; an ideal method would be able to take these structures as input, find the common features, and reproduce the bioactive conformations and their alignments to correspond with the X-ray-determined gold standard alignments. Here we present this data set and describe three objective measures to evaluate performance: the ability to identify the bioactive conformation, the ability to identify and correctly align this conformation for 50% of the molecules in each data set, and the pharmacophoric field similarity. We have applied this validation methodology to our pharmacophore elucidation method FLAPpharm, that is published in the first paper of this series and discuss the limitations of the data set and objective success criteria. Starting from two-dimensional structures and producing unbiased models, FLAPpharm was able to identify the bioactive conformations for 67% of the ligands and also to produce successful models according to the second metric for 67% of the Pharmbench data sets. Inspection of the unsuccessful models highlighted the limitation of this root mean square (rms)-derived metric, since many were found to be pharmacophorically reasonable, increasing the overall success rate to 83%. The PharmBench data set is available at http://www.moldiscovery.com/PharmBench , along with a web service to enable users to score model alignments coming from external methods in the same way that we have presented here and, therefore, establishes a pharmacophore elucidation benchmark data set available to be used by the community.

Resumo Limpo

date publish pharmacophor elucid approach typic use hand data set valid assembl data set target contain ligand align use cocrystal protein target provid experiment gold standard twodimension structur also assembl remov conform bias ideal method abl take structur input find common featur reproduc bioactiv conform align correspond xraydetermin gold standard align present data set describ three object measur evalu perform abil identifi bioactiv conform abil identifi correct align conform molecul data set pharmacophor field similar appli valid methodolog pharmacophor elucid method flappharm publish first paper seri discuss limit data set object success criteria start twodimension structur produc unbias model flappharm abl identifi bioactiv conform ligand also produc success model accord second metric pharmbench data set inspect unsuccess model highlight limit root mean squar rmsderiv metric sinc mani found pharmacophor reason increas overal success rate pharmbench data set avail httpwwwmoldiscoverycompharmbench along web servic enabl user score model align come extern method way present therefor establish pharmacophor elucid benchmark data set avail use communiti

Resumos Similares

J Chem Inf Model - Time-split cross-validation as a method for estimating the goodness of prospective prediction. ( 0,82048652424994 )
J Chem Inf Model - Study of chromatographic retention of natural terpenoids by chemoinformatic tools. ( 0,808101929445192 )
Artif Intell Med - Training artificial neural networks directly on the concordance index for censored data using genetic algorithms. ( 0,8008173472685 )
J Chem Inf Model - Does rational selection of training and test sets improve the outcome of QSAR modeling? ( 0,792825092282565 )
J Chem Inf Model - Impact of template choice on homology model efficiency in virtual screening. ( 0,791470066863432 )
J Chem Inf Model - iLOGP: a simple, robust, and efficient description of n-octanol/water partition coefficient for drug design using the GB/SA approach. ( 0,787148397772136 )
AMIA Annu Symp Proc - Effect of data combination on predictive modeling: a study using gene expression data. ( 0,781187230733355 )
J Chem Inf Model - Pharmacophore assessment through 3-D QSAR: evaluation of the predictive ability on new derivatives by the application on a series of antitubercular agents. ( 0,769084127122977 )
J Chem Inf Model - Rank order entropy: why one metric is not enough. ( 0,763304935902436 )
Int J Health Geogr - Incorporating geographical factors with artificial neural networks to predict reference values of erythrocyte sedimentation rate. ( 0,763230991285254 )
J Chem Inf Model - RS-Predictor models augmented with SMARTCyp reactivities: robust metabolic regioselectivity predictions for nine CYP isozymes. ( 0,757451395352544 )
J Chem Inf Model - CSAR data set release 2012: ligands, affinities, complexes, and docking decoys. ( 0,756700211288352 )
J Chem Inf Model - Beyond the scope of Free-Wilson analysis: building interpretable QSAR models with machine learning algorithms. ( 0,754086293623331 )
J Chem Inf Model - Predicting pK(a) values of substituted phenols from atomic charges: comparison of different quantum mechanical methods and charge distribution schemes. ( 0,743448063938211 )
BMC Med Inform Decis Mak - Concordance and predictive value of two adverse drug event data sets. ( 0,73790822837079 )
J. Comput. Biol. - The complexity of the dirichlet model for multiple alignment data. ( 0,726030321219921 )
J Chem Inf Model - Development of novel 3D-QSAR combination approach for screening and optimizing B-Raf inhibitors in silico. ( 0,721364068141851 )
J Chem Inf Model - Leave-cluster-out cross-validation is appropriate for scoring functions derived from diverse protein data sets. ( 0,719793488002839 )
BMC Med Inform Decis Mak - Regression tree construction by bootstrap: model search for DRG-systems applied to Austrian health-data. ( 0,718962370877991 )
J Chem Inf Model - Comparative studies on some metrics for external validation of QSPR models. ( 0,718338751048553 )
Artif Intell Med - Fuzzy model identification of dengue epidemic in Colombia based on multiresolution analysis. ( 0,714915202768695 )
J Chem Inf Model - Robust scoring functions for protein-ligand interactions with quantum chemical charge models. ( 0,710786249195593 )
BMC Med Inform Decis Mak - Modeling healthcare authorization and claim submissions using the openEHR dual-model approach. ( 0,704560113819661 )
J Chem Inf Model - Three useful dimensions for domain applicability in QSAR models using random forest. ( 0,69957674588437 )
AMIA Annu Symp Proc - Advanced proficiency EHR training: effect on physicians' EHR efficiency, EHR satisfaction and job satisfaction. ( 0,698418971353962 )
J Chem Inf Model - Building a three-dimensional model of CYP2C9 inhibition using the Autocorrelator: an autonomous model generator. ( 0,696456127836188 )
J Chem Inf Model - Combined 3D-QSAR, molecular docking, and molecular dynamics study on piperazinyl-glutamate-pyridines/pyrimidines as potent P2Y12 antagonists for inhibition of platelet aggregation. ( 0,696099602451805 )
J Chem Inf Model - Criterion for evaluating the predictive ability of nonlinear regression models without cross-validation. ( 0,69573812405712 )
AMIA Annu Symp Proc - Predicting the dengue incidence in Singapore using univariate time series models. ( 0,694137999597824 )
Comput. Aided Surg. - Evaluation of a computational model to predict elbow range of motion. ( 0,693841458619712 )
J Chem Inf Model - Best of both worlds: combining pharma data and state of the art modeling technology to improve in Silico pKa prediction. ( 0,693481017837667 )
J Chem Inf Model - DrugPred: a structure-based approach to predict protein druggability developed using an extensive nonredundant data set. ( 0,69043930727434 )
BMC Med Inform Decis Mak - Measuring preferences for analgesic treatment for cancer pain: how do African-Americans and Whites perform on choice-based conjoint (CBC) analysis experiments? ( 0,688881139634357 )
J Am Med Inform Assoc - Harvest: an open platform for developing web-based biomedical data discovery and reporting applications. ( 0,68397786424025 )
AMIA Annu Symp Proc - Motivating the additional use of external validity: examining transportability in a model of glioblastoma multiforme. ( 0,683215351688775 )
J Chem Inf Model - Introducing conformal prediction in predictive modeling. A transparent and flexible alternative to applicability domain determination. ( 0,68311981713588 )
J Chem Inf Model - Statistical analysis and compound selection of combinatorial libraries for soluble epoxide hydrolase. ( 0,679466118062881 )
Comput Methods Programs Biomed - Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models. ( 0,673920147104753 )
J Chem Inf Model - Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection. ( 0,668091685335566 )
Comput. Biol. Med. - A prediction model of substrates and non-substrates of breast cancer resistance protein (BCRP) developed by GA-CG-SVM method. ( 0,663724366808509 )
Comput Methods Programs Biomed - Kinetic modelling of haemodialysis removal of myoglobin in rhabdomyolysis patients. ( 0,663070369637904 )
J Biomed Inform - MysiRNA: improving siRNA efficacy prediction using a machine-learning model combining multi-tools and whole stacking energy (G). ( 0,661492913573911 )
J Chem Inf Model - Real external predictivity of QSAR models: how to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient. ( 0,660928541111923 )
Int J Health Geogr - Comparative analysis of remotely-sensed data products via ecological niche modeling of avian influenza case occurrences in Middle Eastern poultry. ( 0,656681613467355 )
Comput Biol Chem - Homology modeling, binding site identification and docking in flavone hydroxylase CYP105P2 in Streptomyces peucetius ATCC 27952. ( 0,655762890676969 )
J Chem Inf Model - A new approach to radial basis function approximation and its application to QSAR. ( 0,655243786748504 )
Int J Comput Assist Radiol Surg - Assessing performance in brain tumor resection using a novel virtual reality simulator. ( 0,654811837820869 )
J Chem Inf Model - Estimation of carcinogenicity using molecular fragments tree. ( 0,653595404188282 )
J Chem Inf Model - Applicability domains for classification problems: Benchmarking of distance to models for Ames mutagenicity set. ( 0,653121028937883 )
Int J Med Inform - Design and implementation of I2Vote--an interactive image-based voting system using windows mobile devices. ( 0,652485951488308 )
Med Decis Making - Developing a tuberculosis transmission model that accounts for changes in population health. ( 0,651051718691999 )
J Chem Inf Model - In silico prediction of aqueous solubility using simple QSPR models: the importance of phenol and phenol-like moieties. ( 0,650460202244954 )
Neural Comput - Molecular diffusion model of neurotransmitter homeostasis around synapses supporting gradients. ( 0,649317352769115 )
J Chem Inf Model - Kinase-kernel models: accurate in silico screening of 4 million compounds across the entire human kinome. ( 0,649172760675474 )
J Chem Inf Model - Hsp90 inhibitors, part 1: definition of 3-D QSAutogrid/R models as a tool for virtual screening. ( 0,648948770753918 )
AMIA Annu Symp Proc - Ontology-based federated data access to human studies information. ( 0,648833348817937 )
J Chem Inf Model - Four-dimensional structure-activity relationship model to predict HIV-1 integrase strand transfer inhibition using LQTA-QSAR methodology. ( 0,648165587283343 )
J Chem Inf Model - In silico prediction of total human plasma clearance. ( 0,64665976291876 )
Med Biol Eng Comput - Optimal design of clinical tests for the identification of physiological models of type 1 diabetes in the presence of model mismatch. ( 0,64251081265142 )
J Am Med Inform Assoc - Choosing blindly but wisely: differentially private solicitation of DNA datasets for disease marker discovery. ( 0,63956229552972 )
Med Biol Eng Comput - Application of the RIMARC algorithm to a large data set of action potentials and clinical parameters for risk prediction of atrial fibrillation. ( 0,636132578548167 )
J Chem Inf Model - Prediction of linear cationic antimicrobial peptides based on characteristics responsible for their interaction with the membranes. ( 0,634586124344359 )
J Chem Inf Model - In silico prediction of chemical Ames mutagenicity. ( 0,633247622863273 )
J. Med. Internet Res. - A case study of the New York City 2012-2013 influenza season with daily geocoded Twitter data from temporal and spatiotemporal perspectives. ( 0,632851706005999 )
J Chem Inf Model - Design of novel FLT-3 inhibitors based on dual-layer 3D-QSAR model and fragment-based compounds in silico. ( 0,623516976538943 )
Comput Methods Programs Biomed - Interstitial insulin kinetic parameters for a 2-compartment insulin model with saturable clearance. ( 0,614736845902562 )
Comput Methods Programs Biomed - A predictive model of longitudinal, patient-specific colonoscopy results. ( 0,614565636863039 )
J Chem Inf Model - Design and synthesis of new antioxidants predicted by the model developed on a set of pulvinic acid derivatives. ( 0,612670048649473 )
J Am Med Inform Assoc - Use of a support vector machine for categorizing free-text notes: assessment of accuracy across two institutions. ( 0,612627713375748 )
J Chem Inf Model - Classification of compounds with distinct or overlapping multi-target activities and diverse molecular mechanisms using emerging chemical patterns. ( 0,612443274215423 )
Comput. Biol. Med. - Quantification of contributions of molecular fragments for eye irritation of organic chemicals using QSAR study. ( 0,611841186135441 )
J Chem Inf Model - Template CoMFA applied to 116 biological targets. ( 0,60987946459993 )
Brief. Bioinformatics - An empirical assessment of validation practices for molecular classifiers. ( 0,608488198712299 )
AMIA Annu Symp Proc - Identifying Deviations from Usual Medical Care using a Statistical Approach. ( 0,604340185595078 )
Comput Math Methods Med - Multiscale autoregressive identification of neuroelectrophysiological systems. ( 0,603376954122318 )
Curr Comput Aided Drug Des - QSAR Models for the Reactivation of Sarin Inhibited AChE by Quaternary Pyridinium Oximes Based on Monte Carlo Method. ( 0,602852993308428 )
Int J Health Geogr - A linear programming model for preserving privacy when disclosing patient spatial information for secondary purposes. ( 0,600865921683512 )
IEEE Trans Image Process - Incremental N-mode SVD for large-scale multilinear generative models. ( 0,600842693327809 )
J Integr Bioinform - Excemplify: a flexible template based solution, parsing and managing data in spreadsheets for experimentalists. ( 0,599550435551267 )
J Chem Inf Model - Applicability Domain ANalysis (ADAN): a robust method for assessing the reliability of drug property predictions. ( 0,599438203236892 )
Spat Spatiotemporal Epidemiol - Spatial modelling of disease using data- and knowledge-driven approaches. ( 0,598247177019609 )
J Chem Inf Model - Optimizing predictive performance of CASE Ultra expert system models using the applicability domains of individual toxicity alerts. ( 0,597106836329264 )
J Chem Inf Model - Applicability domain based on ensemble learning in classification and regression analyses. ( 0,594643327097688 )
Comput. Biol. Med. - Artificial neural network modelling of the results of tympanoplasty in chronic suppurative otitis media patients. ( 0,59407975437763 )
J Chem Inf Model - A comparison of different QSAR approaches to modeling CYP450 1A2 inhibition. ( 0,592813629188742 )
Med Decis Making - Prediction of health preference values from CD4 counts in individuals with HIV. ( 0,590144657090221 )
J Chem Inf Model - Acetylcholinesterase inhibitors: structure based design, synthesis, pharmacophore modeling, and virtual screening. ( 0,590022936875388 )
J Chem Inf Model - Automated building of organometallic complexes from 3D fragments. ( 0,589048256767816 )
J Chem Inf Model - Coping with unbalanced class data sets in oral absorption models. ( 0,588657677729038 )
J Chem Inf Model - Binary classification of a large collection of environmental chemicals from estrogen receptor assays by quantitative structure-activity relationship and machine learning methods. ( 0,588084861935774 )
AMIA Annu Symp Proc - Order sets in computerized physician order entry systems: an analysis of seven sites. ( 0,585528721403761 )
J Chem Inf Model - New strategy for receptor-based pharmacophore query construction: a case study for 5-HT7 receptor ligands. ( 0,584790503079344 )
J Chem Inf Model - A multiscale simulation system for the prediction of drug-induced cardiotoxicity. ( 0,58468669187724 )
Comput Methods Programs Biomed - Bayesian bivariate generalized Lindley model for survival data with a cure fraction. ( 0,583443059282849 )
J Chem Inf Model - Template CoMFA: the 3D-QSAR Grail? ( 0,581839269226443 )
J. Med. Internet Res. - Outsourcing medical data analyses: can technology overcome legal, privacy, and confidentiality issues? ( 0,580866619519395 )
Comput Methods Programs Biomed - Predicting body fat percentage based on gender, age and BMI by using artificial neural networks. ( 0,576261516129535 )
Med Biol Eng Comput - Development of a comprehensive musculoskeletal model of the shoulder and elbow. ( 0,575073415292536 )
Lifetime Data Anal - Analysis of cure rate survival data under proportional odds model. ( 0,57358844741075 )
Med Biol Eng Comput - Accelerometry-based prediction of movement dynamics for balance monitoring. ( 0,572583534403638 )