J Chem Inf Model - The valence state combination model: a generic framework for handling tautomers and protonation states.

Tópicos

{ compound(1573) activ(1297) structur(1058) }
{ take(945) account(800) differ(722) }
{ imag(2830) propos(1344) filter(1198) }
{ result(1111) use(1088) new(759) }
{ data(1737) use(1416) pattern(1282) }
{ framework(1458) process(801) describ(734) }
{ sampl(1606) size(1419) use(1276) }
{ design(1359) user(1324) use(1319) }
{ structur(1116) can(940) graph(676) }
{ use(976) code(926) identifi(902) }
{ can(774) often(719) complex(702) }
{ assess(1506) score(1403) qualiti(1306) }
{ state(1844) use(1261) util(961) }
{ clinic(1479) use(1117) guidelin(835) }
{ case(1353) use(1143) diagnosi(1136) }
{ perform(999) metric(946) measur(919) }
{ first(2504) two(1366) second(1323) }
{ research(1085) discuss(1038) issu(1018) }
{ visual(1396) interact(850) tool(830) }
{ spatial(1525) area(1432) region(1030) }
{ model(2656) set(1616) predict(1553) }
{ detect(2391) sensit(1101) algorithm(908) }
{ model(3404) distribut(989) bayesian(671) }
{ system(1976) rule(880) can(841) }
{ method(1219) similar(1157) match(930) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ extract(1171) text(1153) clinic(932) }
{ model(2220) cell(1177) simul(1124) }
{ general(901) number(790) one(736) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ studi(1410) differ(1259) use(1210) }
{ perform(1367) use(1326) method(1137) }
{ studi(1119) effect(1106) posit(819) }
{ model(3480) simul(1196) paramet(876) }
{ medic(1828) order(1363) alert(1069) }
{ gene(2352) biolog(1181) express(1162) }
{ use(1733) differ(960) four(931) }
{ decis(3086) make(1611) patient(1517) }
{ imag(1947) propos(1133) code(1026) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ featur(3375) classif(2383) classifi(1994) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ treatment(1704) effect(941) patient(846) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ chang(1828) time(1643) increas(1301) }
{ learn(2355) train(1041) set(1003) }
{ concept(1167) ontolog(924) domain(897) }
{ algorithm(1844) comput(1787) effici(935) }
{ method(1557) propos(1049) approach(1037) }
{ data(1714) softwar(1251) tool(1186) }
{ control(1307) perform(991) simul(935) }
{ care(1570) inform(1187) nurs(1089) }
{ featur(1941) imag(1645) propos(1176) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ risk(3053) factor(974) diseas(938) }
{ system(1050) medic(1026) inform(1018) }
{ import(1318) role(1303) understand(862) }
{ model(2341) predict(2261) use(1141) }
{ blood(1257) pressur(1144) flow(957) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ data(3008) multipl(1320) sourc(1022) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ drug(1928) target(777) effect(648) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ process(1125) use(805) approach(778) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }
{ method(2212) result(1239) propos(1039) }

Resumo

The consistent handling of molecules is probably the most basic and important requirement in the field of cheminformatics. Reliable results can only be obtained if the underlying calculations are independent of the specific way molecules are represented in the input data. However, ensuring consistency is a complex task with many pitfalls, an important one being the fact that the same molecule can be represented by different valence bond structures. In order to achieve reliability, a cheminformatics system needs to solve two fundamental problems. First, different choices of valence bond structures must be identified as the same molecule. Second, for each molecule all valence bond structures relevant to the context must be taken into consideration. The latter is especially important with regard to tautomers and protonation states, as these have considerable influence on physicochemical properties of molecules. We present a comprehensive method for the rapid and consistent generation of reasonable tautomers and protonation states for molecules relevant in the context of drug design. This method is based on a generic scheme, the Valence State Combination Model, which has been designed for the enumeration and scoring of valence bond structures in large data sets. In order to ensure our method's consistency, we have developed procedures which can serve as a general validation scheme for similar approaches. The analysis of both the average number of generated structures and the associated runtimes shows that our method is perfectly suited for typical cheminformatics applications. By comparison with frequently used and curated public data sets, we can demonstrate that the tautomers and protonation state produced by our method are chemically reasonable.

Resumo Limpo

consist handl molecul probabl basic import requir field cheminformat reliabl result can obtain under calcul independ specif way molecul repres input data howev ensur consist complex task mani pitfal import one fact molecul can repres differ valenc bond structur order achiev reliabl cheminformat system need solv two fundament problem first differ choic valenc bond structur must identifi molecul second molecul valenc bond structur relev context must taken consider latter especi import regard tautom proton state consider influenc physicochem properti molecul present comprehens method rapid consist generat reason tautom proton state molecul relev context drug design method base generic scheme valenc state combin model design enumer score valenc bond structur larg data set order ensur method consist develop procedur can serv general valid scheme similar approach analysi averag number generat structur associ runtim show method perfect suit typic cheminformat applic comparison frequent use curat public data set can demonstr tautom proton state produc method chemic reason

Resumos Similares

J Chem Inf Model - Knowledge-based libraries for predicting the geometric preferences of druglike molecules. ( 0,735703789065395 )
J Chem Inf Model - Prediction of new bioactive molecules using a Bayesian belief network. ( 0,717965105318753 )
J Chem Inf Model - Biologically relevant chemical space navigator: from patent and structure-activity relationship analysis to library acquisition and design. ( 0,715823671105297 )
J Chem Inf Model - Visual characterization and diversity quantification of chemical libraries: 2. Analysis and selection of size-independent, subspace-specific diversity indices. ( 0,715055294298379 )
J Chem Inf Model - Identification of a new class of FtsZ inhibitors by structure-based design and in vitro screening. ( 0,713146358594212 )
J Chem Inf Model - BioSM: metabolomics tool for identifying endogenous mammalian biochemical structures in chemical structure space. ( 0,711903790656574 )
J Chem Inf Model - Design of multitarget activity landscapes that capture hierarchical activity cliff distributions. ( 0,705036350562083 )
J Chem Inf Model - De novo design of drug-like molecules by a fragment-based molecular evolutionary approach. ( 0,700520289466833 )
J Chem Inf Model - Scaffold diversity of exemplified medicinal chemistry space. ( 0,700314008743454 )
J Chem Inf Model - Scanning structure-activity relationships with structure-activity similarity and related maps: from consensus activity cliffs to selectivity switches. ( 0,699526377172439 )
J Chem Inf Model - Identification of multitarget activity ridges in high-dimensional bioactivity spaces. ( 0,699196433336578 )
J Chem Inf Model - Automated recycling of chemistry for virtual screening and library design. ( 0,698091347306203 )
J Chem Inf Model - Compound optimization through data set-dependent chemical transformations. ( 0,693598673883848 )
J Chem Inf Model - Combining horizontal and vertical substructure relationships in scaffold hierarchies for activity prediction. ( 0,693298329589295 )
J Chem Inf Model - Shaping a screening file for maximal lead discovery efficiency and effectiveness: elimination of molecular redundancy. ( 0,69133876164418 )
J Chem Inf Model - Automated selection of compounds with physicochemical properties to maximize bioavailability and druglikeness. ( 0,689808517195958 )
J Chem Inf Model - Automated design of realistic organometallic molecules from fragments. ( 0,686160121224193 )
J Chem Inf Model - Identification of 1,2,5-oxadiazoles as a new class of SENP2 inhibitors using structure based virtual screening. ( 0,685520497927386 )
J Chem Inf Model - Introduction of target cliffs as a concept to identify and describe complex molecular selectivity patterns. ( 0,683724935074824 )
J Chem Inf Model - Boosting virtual screening enrichments with data fusion: coalescing hits from two-dimensional fingerprints, shape, and docking. ( 0,681964186964646 )
J Chem Inf Model - Hsp90 inhibitors, part 2: combining ligand-based and structure-based approaches for virtual screening application. ( 0,680960990702962 )
J Chem Inf Model - Visual characterization and diversity quantification of chemical libraries: 1. creation of delimited reference chemical subspaces. ( 0,680770255004752 )
J Chem Inf Model - Automatic tailoring and transplanting: a practical method that makes virtual screening more useful. ( 0,680657447669464 )
J Chem Inf Model - Extending the activity cliff concept: structural categorization of activity cliffs and systematic identification of different types of cliffs in the ChEMBL database. ( 0,680145032566412 )
J Chem Inf Model - Best of both worlds: on the complementarity of ligand-based and structure-based virtual screening. ( 0,678079112143698 )
J Chem Inf Model - SABRE: ligand/structure-based virtual screening approach using consensus molecular-shape pattern recognition. ( 0,677877830892699 )
J Chem Inf Model - Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. ( 0,676584556091202 )
J Chem Inf Model - Bioturbo similarity searching: combining chemical and biological similarity to discover structurally diverse bioactive molecules. ( 0,675786532576009 )
J Chem Inf Model - Discovery of novel Pim-1 kinase inhibitors by a hierarchical multistage virtual screening approach based on SVM model, pharmacophore, and molecular docking. ( 0,675751021229503 )
J Chem Inf Model - Navigating high-dimensional activity landscapes: design and application of the ligand-target differentiation map. ( 0,675400355789594 )
J Chem Inf Model - A searchable map of PubChem. ( 0,675269897923234 )
J Chem Inf Model - MQN-mapplet: visualization of chemical space with interactive maps of DrugBank, ChEMBL, PubChem, GDB-11, and GDB-13. ( 0,674920896041951 )
J Chem Inf Model - Polypharmacology directed compound data mining: identification of promiscuous chemotypes with different activity profiles and comparison to approved drugs. ( 0,674461847065775 )
J Chem Inf Model - Comparative analysis of pharmacophore screening tools. ( 0,672743525548288 )
J Chem Inf Model - Harvesting classification trees for drug discovery. ( 0,672587187275235 )
J Chem Inf Model - Mining chemical reactions using neighborhood behavior and condensed graphs of reactions approaches. ( 0,672565747941434 )
J Chem Inf Model - G-protein coupled receptors virtual screening using genetic algorithm focused chemical space. ( 0,671611639791515 )
J Chem Inf Model - Applying medicinal chemistry transformations and multiparameter optimization to guide the search for high-quality leads and candidates. ( 0,67152054177552 )
J Chem Inf Model - Prediction of synthetic accessibility based on commercially available compound databases. ( 0,668420849707545 )
J Chem Inf Model - Selection of in silico drug screening results for G-protein-coupled receptors by using universal active probes. ( 0,667697550401099 )
J Chem Inf Model - Structural determinants of drug partitioning in n-hexadecane/water system. ( 0,666500684582713 )
J Chem Inf Model - From activity cliffs to activity ridges: informative data structures for SAR analysis. ( 0,665209894661697 )
J Chem Inf Model - Design of a three-dimensional multitarget activity landscape. ( 0,664545048800102 )
J Chem Inf Model - Discovery of novel histamine H4 and serotonin transporter ligands using the topological feature tree descriptor. ( 0,66421779487332 )
J Chem Inf Model - Conditional probabilistic analysis for prediction of the activity landscape and relative compound activities. ( 0,663652916806836 )
J Chem Inf Model - SAR monitoring of evolving compound data sets using activity landscapes. ( 0,663487853956481 )
J Chem Inf Model - An integrated virtual screening approach for VEGFR-2 inhibitors. ( 0,663462305222227 )
J Chem Inf Model - Molecular topology analysis of the differences between drugs, clinical candidate compounds, and bioactive molecules. ( 0,663217591789966 )
J Chem Inf Model - Novel method for pharmacophore analysis by examining the joint pharmacophore space. ( 0,662272282677389 )
J Chem Inf Model - Prediction of individual compounds forming activity cliffs using emerging chemical patterns. ( 0,66180465640343 )
J Chem Inf Model - Fragment-based lead discovery and design. ( 0,661477001671303 )
J Chem Inf Model - Target-independent prediction of drug synergies using only drug lipophilicity. ( 0,661368904628821 )
J Chem Inf Model - Structural similarity based kriging for quantitative structure activity and property relationship modeling. ( 0,66105868215054 )
J Chem Inf Model - ColBioS-FlavRC: a collection of bioselective flavonoids and related compounds filtered from high-throughput screening outcomes. ( 0,659455883558352 )
J Chem Inf Model - Identifying compound-target associations by combining bioactivity profile similarity search and public databases mining. ( 0,659430947512067 )
J Chem Inf Model - Application of computer-aided drug repurposing in the search of new cruzipain inhibitors: discovery of amiodarone and bromocriptine inhibitory effects. ( 0,659029437010745 )
J Chem Inf Model - Mining the ChEMBL database: an efficient chemoinformatics workflow for assembling an ion channel-focused screening library. ( 0,657089654211238 )
J Chem Inf Model - Maximum-score diversity selection for early drug discovery. ( 0,656896236452893 )
J Chem Inf Model - Rapid scanning structure-activity relationships in combinatorial data sets: identification of activity switches. ( 0,65668801076104 )
J Chem Inf Model - Library enhancement through the wisdom of crowds. ( 0,656557099987397 )
J Chem Inf Model - Construction and use of fragment-augmented molecular Hasse diagrams. ( 0,656075827227955 )
J Chem Inf Model - Systematic identification of scaffolds representing compounds active against individual targets and single or multiple target families. ( 0,655932996552502 )
J Chem Inf Model - Natural product-like virtual libraries: recursive atom-based enumeration. ( 0,655893551319452 )
J Chem Inf Model - Locating sweet spots for screening hits and evaluating pan-assay interference filters from the performance analysis of two lead-like libraries. ( 0,655769397698616 )
J Chem Inf Model - Characterizing the diversity and biological relevance of the MLPCN assay manifold and screening set. ( 0,654565447314137 )
J Chem Inf Model - How do 2D fingerprints detect structurally diverse active compounds? Revealing compound subset-specific fingerprint features through systematic selection. ( 0,65448999325619 )
J Chem Inf Model - Scaffold-focused virtual screening: prospective application to the discovery of TTK inhibitors. ( 0,654100383525168 )
J Chem Inf Model - Fighting high molecular weight in bioactive molecules with sub-pharmacophore-based virtual screening. ( 0,653317501236834 )
J Chem Inf Model - Discovery and design of tricyclic scaffolds as protein kinase CK2 (CK2) inhibitors through a combination of shape-based virtual screening and structure-based molecular modification. ( 0,65319827219185 )
J Chem Inf Model - Exploring aromatic chemical space with NEAT: novel and electronically equivalent aromatic template. ( 0,653045446240004 )
J Chem Inf Model - How diverse are diversity assessment methods? A comparative analysis and benchmarking of molecular descriptor space. ( 0,652949621479342 )
J Chem Inf Model - Atom pair 2D-fingerprints perceive 3D-molecular shape and pharmacophores for very fast virtual screening of ZINC and GDB-17. ( 0,652637219214883 )
J Chem Inf Model - MMP-Cliffs: systematic identification of activity cliffs on the basis of matched molecular pairs. ( 0,650244883305463 )
J Chem Inf Model - Identification of a novel inhibitor of dengue virus protease through use of a virtual screening drug discovery Web portal. ( 0,649556266938388 )
J Chem Inf Model - Similarity boosted quantitative structure-activity relationship--a systematic study of enhancing structural descriptors by molecular similarity. ( 0,649243826703409 )
J Chem Inf Model - TIN-a combinatorial compound collection of synthetically feasible multicomponent synthesis products. ( 0,648640928898581 )
Curr Comput Aided Drug Des - Development of Chemical Compound Libraries for In Silico Drug Screening. ( 0,648433298755938 )
J Chem Inf Model - Discovery of new selective human aldose reductase inhibitors through virtual screening multiple binding pocket conformations. ( 0,645610227964949 )
J Chem Inf Model - Identification of novel liver X receptor activators by structure-based modeling. ( 0,644384944405943 )
J Chem Inf Model - Compound set enrichment: a novel approach to analysis of primary HTS data. ( 0,64434458792387 )
J Chem Inf Model - Chemical data visualization and analysis with incremental generative topographic mapping: big data challenge. ( 0,643014508450653 )
J Chem Inf Model - In silico enzymatic synthesis of a 400,000 compound biochemical database for nontargeted metabolomics. ( 0,641584583801252 )
J Chem Inf Model - Rationalizing the role of SAR tolerance for ligand-based virtual screening. ( 0,641250456466845 )
J Chem Inf Model - SimG: an alignment based method for evaluating the similarity of small molecules and binding sites. ( 0,641189476027095 )
J Chem Inf Model - Capturing structure-activity relationships from chemogenomic spaces. ( 0,64096441626597 )
J Chem Inf Model - Development of Ecom50 and retention index models for nontargeted metabolomics: identification of 1,3-dicyclohexylurea in human serum by HPLC/mass spectrometry. ( 0,640213947486479 )
J Chem Inf Model - Searching for recursively defined generic chemical patterns in nonenumerated fragment spaces. ( 0,639705073831686 )
J Chem Inf Model - Optimization of molecular representativeness. ( 0,638559376284013 )
J Chem Inf Model - Identification of novel malarial cysteine protease inhibitors using structure-based virtual screening of a focused cysteine protease inhibitor library. ( 0,636832248264687 )
J Chem Inf Model - Freely available conformer generation methods: how good are they? ( 0,636573669215377 )
J Chem Inf Model - SMIfp (SMILES fingerprint) chemical space for virtual screening and visualization of large databases of organic molecules. ( 0,636081995872569 )
J Chem Inf Model - Identification of novel serotonin transporter compounds by virtual screening. ( 0,634973453640998 )
J Chem Inf Model - Novel mycosin protease MycP1 inhibitors identified by virtual screening and 4D fingerprints. ( 0,634481148724305 )
J Chem Inf Model - Application of support vector machine to three-dimensional shape-based virtual screening using comprehensive three-dimensional molecular shape overlay with known inhibitors. ( 0,633998460472544 )
J Chem Inf Model - Integrating medicinal chemistry, organic/combinatorial chemistry, and computational chemistry for the discovery of selective estrogen receptor modulators with Forecaster, a novel platform for drug discovery. ( 0,633088494917981 )
J Chem Inf Model - QSAR modeling of imbalanced high-throughput screening data in PubChem. ( 0,632193988622381 )
J Chem Inf Model - Increasing the coverage of medicinal chemistry-relevant space in commercial fragments screening. ( 0,631982791589906 )
J Chem Inf Model - Computational repositioning and experimental validation of approved drugs for HIF-prolyl hydroxylase inhibition. ( 0,631801109948669 )
J Chem Inf Model - Feasibility of using molecular docking-based virtual screening for searching dual target kinase inhibitors. ( 0,631653849156696 )
J Chem Inf Model - Localized heuristic inverse quantitative structure activity relationship with bulk descriptors using numerical gradients. ( 0,629859619688402 )