J Chem Inf Model - Efficient substructure searching of large chemical libraries: the ABCD chemical cartridge.

Tópicos

{ search(2224) databas(1162) retriev(909) }
{ data(1714) softwar(1251) tool(1186) }
{ algorithm(1844) comput(1787) effici(935) }
{ system(1050) medic(1026) inform(1018) }
{ compound(1573) activ(1297) structur(1058) }
{ first(2504) two(1366) second(1323) }
{ imag(1947) propos(1133) code(1026) }
{ problem(2511) optim(1539) algorithm(950) }
{ time(1939) patient(1703) rate(768) }
{ health(1844) social(1437) communiti(874) }
{ method(1219) similar(1157) match(930) }
{ design(1359) user(1324) use(1319) }
{ general(901) number(790) one(736) }
{ age(1611) year(1155) adult(843) }
{ sampl(1606) size(1419) use(1276) }
{ gene(2352) biolog(1181) express(1162) }
{ cancer(2502) breast(956) screen(824) }
{ use(1733) differ(960) four(931) }
{ survey(1388) particip(1329) question(1065) }
{ detect(2391) sensit(1101) algorithm(908) }
{ system(1976) rule(880) can(841) }
{ sequenc(1873) structur(1644) protein(1328) }
{ imag(2830) propos(1344) filter(1198) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ research(1085) discuss(1038) issu(1018) }
{ model(2341) predict(2261) use(1141) }
{ visual(1396) interact(850) tool(830) }
{ perform(1367) use(1326) method(1137) }
{ spatial(1525) area(1432) region(1030) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ model(2656) set(1616) predict(1553) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ patient(1821) servic(1111) care(1106) }
{ analysi(2126) use(1163) compon(1037) }
{ result(1111) use(1088) new(759) }
{ implement(1333) system(1263) develop(1122) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ process(1125) use(805) approach(778) }
{ model(3404) distribut(989) bayesian(671) }
{ can(774) often(719) complex(702) }
{ data(1737) use(1416) pattern(1282) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ featur(3375) classif(2383) classifi(1994) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ take(945) account(800) differ(722) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ framework(1458) process(801) describ(734) }
{ error(1145) method(1030) estim(1020) }
{ chang(1828) time(1643) increas(1301) }
{ learn(2355) train(1041) set(1003) }
{ concept(1167) ontolog(924) domain(897) }
{ clinic(1479) use(1117) guidelin(835) }
{ extract(1171) text(1153) clinic(932) }
{ method(1557) propos(1049) approach(1037) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ method(984) reconstruct(947) comput(926) }
{ featur(1941) imag(1645) propos(1176) }
{ case(1353) use(1143) diagnosi(1136) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ perform(999) metric(946) measur(919) }
{ import(1318) role(1303) understand(862) }
{ studi(1119) effect(1106) posit(819) }
{ blood(1257) pressur(1144) flow(957) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ ehr(2073) health(1662) electron(1139) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ data(3008) multipl(1320) sourc(1022) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ structur(1116) can(940) graph(676) }
{ high(1669) rate(1365) level(1280) }
{ use(976) code(926) identifi(902) }
{ drug(1928) target(777) effect(648) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }
{ method(2212) result(1239) propos(1039) }

Resumo

Efficient substructure searching is a key requirement for any chemical information management system. In this paper, we describe the substructure search capabilities of ABCD, an integrated drug discovery informatics platform developed at Johnson & Johnson Pharmaceutical Research & Development, L.L.C. The solution consists of several algorithmic components: 1) a pattern mapping algorithm for solving the subgraph isomorphism problem, 2) an indexing scheme that enables very fast substructure searches on large structure files, 3) the incorporation of that indexing scheme into an Oracle cartridge to enable querying large relational databases through SQL, and 4) a cost estimation scheme that allows the Oracle cost-based optimizer to generate a good execution plan when a substructure search is combined with additional constraints in a single SQL query. The algorithm was tested on a public database comprising nearly 1 million molecules using 4,629 substructure queries, the vast majority of which were submitted by discovery scientists over the last 2.5 years of user acceptance testing of ABCD. 80.7% of these queries were completed in less than a second and 96.8% in less than ten seconds on a single CPU, while on eight processing cores these numbers increased to 93.2% and 99.7%, respectively. The slower queries involved extremely generic patterns that returned the entire database as screening hits and required extensive atom-by-atom verification.

Resumo Limpo

effici substructur search key requir chemic inform manag system paper describ substructur search capabl abcd integr drug discoveri informat platform develop johnson johnson pharmaceut research develop llc solut consist sever algorithm compon pattern map algorithm solv subgraph isomorph problem index scheme enabl fast substructur search larg structur file incorpor index scheme oracl cartridg enabl queri larg relat databas sql cost estim scheme allow oracl costbas optim generat good execut plan substructur search combin addit constraint singl sql queri algorithm test public databas compris near million molecul use substructur queri vast major submit discoveri scientist last year user accept test abcd queri complet less second less ten second singl cpu eight process core number increas respect slower queri involv extrem generic pattern return entir databas screen hit requir extens atombyatom verif

Resumos Similares

BMC Med Inform Decis Mak - CDAPubMed: a browser extension to retrieve EHR-based biomedical literature. ( 0,763315491610176 )
Health Info Libr J - Facilitating access to evidence: Primary Health Care Search Filter. ( 0,761110305196304 )
Telemed J E Health - MEDLINE versus EMBASE and CINAHL for telemedicine searches. ( 0,74558274536053 )
Int J Med Inform - An analysis of clinical queries in an electronic health record search utility. ( 0,741491574065571 )
J. Med. Internet Res. - Sensitivity and predictive value of 15 PubMed search strategies to answer clinical questions rated against full systematic reviews. ( 0,737968533119679 )
J. Med. Internet Res. - Retrieving clinical evidence: a comparison of PubMed and Google Scholar for quick clinical searches. ( 0,737533757841662 )
J Am Med Inform Assoc - Retrieval of diagnostic and treatment studies for clinical use through PubMed and PubMed's Clinical Queries filters. ( 0,735993668157673 )
J Biomed Inform - On the query reformulation technique for effective MEDLINE document retrieval. ( 0,733927414910837 )
BMC Med Inform Decis Mak - Glomerular disease search filters for Pubmed, Ovid Medline, and Embase: a development and validation study. ( 0,731889604900729 )
J Integr Bioinform - The LAILAPS search engine: a feature model for relevance ranking in life science databases. ( 0,72898229678534 )
J Integr Bioinform - Classification methods for finding articles describing protein-protein interactions in PubMed. ( 0,728012757668679 )
AMIA Annu Symp Proc - BIOSPIDA: A Relational Database Translator for NCBI. ( 0,727060596877375 )
J Am Med Inform Assoc - Search filters to identify geriatric medicine in Medline. ( 0,715262517190032 )
J Chem Inf Model - Speeding up chemical searches using the inverted index: the convergence of chemoinformatics and text search methods. ( 0,715115448344361 )
AMIA Annu Symp Proc - Does query expansion limit our learning? A comparison of social-based expansion to content-based expansion for medical queries on the internet. ( 0,713822081859414 )
AMIA Annu Symp Proc - Evaluation of automated term groupings for detecting anaphylactic shock signals for drugs. ( 0,711955473536331 )
J Biomed Inform - Development and evaluation of a biomedical search engine using a predicate-based vector space model. ( 0,710902123111662 )
Health Info Libr J - Medical literature searches: a comparison of PubMed and Google Scholar. ( 0,709018861253925 )
J Am Med Inform Assoc - Federated queries of clinical data repositories: the sum of the parts does not equal the whole. ( 0,70897885631806 )
J Med Syst - MIRASS: medical informatics research activity support system using information mashup network. ( 0,70828233898041 )
J Telemed Telecare - How to improve your PubMed/MEDLINE searches: 1. background and basic searching. ( 0,707498074415375 )
J Biomed Inform - MeSHy: Mining unanticipated PubMed information using frequencies of occurrences and concurrences of MeSH terms. ( 0,705044792611193 )
Methods Inf Med - Learning the preferences of physicians for the organization of result lists of medical evidence articles. ( 0,704325809913128 )
J Am Med Inform Assoc - A practical approach to achieve private medical record linkage in light of public resources. ( 0,703804614327243 )
J. Med. Internet Res. - Development and validation of filters for the retrieval of studies of clinical examination from Medline. ( 0,703354700974339 )
Health Info Libr J - Developing a geographic search filter to identify randomised controlled trials in Africa: finding the optimal balance between sensitivity and precision. ( 0,699967061148607 )
J Am Med Inform Assoc - Search terms and a validated brief search filter to retrieve publications on health-related values in Medline: a word frequency analysis study. ( 0,695602992544127 )
J Integr Bioinform - GMB: an efficient query processor for biological data. ( 0,695548934671756 )
AMIA Annu Symp Proc - Search filter precision can be improved by NOTing out irrelevant content. ( 0,693594841564249 )
BMC Med Inform Decis Mak - Boolean versus ranked querying for biomedical systematic reviews. ( 0,690767814218478 )
J Integr Bioinform - The LAILAPS search engine: relevance ranking in life science databases. ( 0,68977930261358 )
J Biomed Inform - Reflective random indexing for semi-automatic indexing of the biomedical literature. ( 0,687322716197205 )
Methods Inf Med - Developing topic-specific search filters for PubMed with click-through data. ( 0,686597068703366 )
BMC Med Inform Decis Mak - Performance evaluation of Unified Medical Language System?'s synonyms expansion to query PubMed. ( 0,683663868777767 )
Brief. Bioinformatics - Fast and efficient searching of biological data resources--using EB-eye. ( 0,683347141333666 )
BMC Med Inform Decis Mak - Publication trends of shared decision making in 15 high impact medical journals: a full-text review with bibliometric analysis. ( 0,681113127642124 )
J Integr Bioinform - A query suggestion workflow for life science IR-systems. ( 0,681069322232874 )
Int J Health Geogr - HEALTH GeoJunction: place-time-concept browsing of health publications. ( 0,679612505161667 )
J Am Med Inform Assoc - A literature search tool for intelligent extraction of disease-associated genes. ( 0,678249402866687 )
IEEE Trans Vis Comput Graph - WORDGRAPH: Keyword-in-Context Visualization for NETSPEAK's Wildcard Search. ( 0,673224781274481 )
J. Med. Internet Res. - Using Internet search engines to obtain medical information: a comparative study. ( 0,673200010418991 )
J Am Med Inform Assoc - MEDLINE clinical queries are robust when searching in recent publishing years. ( 0,672257792358619 )
Health Info Libr J - The performance of adverse effects search filters in MEDLINE and EMBASE. ( 0,671158779056124 )
Health Info Libr J - Utilisation of search filters in systematic reviews of prognosis questions. ( 0,669485315813091 )
J. Med. Internet Res. - Net improvement of correct answers to therapy questions after pubmed searches: pre/post comparison. ( 0,666823373067881 )
BMC Med Inform Decis Mak - BOSS: context-enhanced search for biomedical objects. ( 0,664951941980249 )
Health Info Libr J - Assessment of indexing trends with specific and general terms for herbal medicine. ( 0,662659648302339 )
J Biomed Inform - Small sum privacy and large sum utility in data publishing. ( 0,659418701674997 )
Inform Health Soc Care - Readability of online health information: implications for health literacy. ( 0,65912909606016 )
AMIA Annu Symp Proc - Query log analysis of an electronic health record search engine. ( 0,658521849594197 )
Health Info Libr J - Searching for randomised controlled trials and clinical controlled trials in Thai online bibliographical biomedical databases. ( 0,65638116071744 )
J Chem Inf Model - SymDex: increasing the efficiency of chemical fingerprint similarity searches for comparing large chemical libraries by using query set indexing. ( 0,655208170986746 )
Health Info Libr J - Sensitivity and precision of adverse effects search filters in MEDLINE and EMBASE: a case study of fractures with thiazolidinediones. ( 0,643650304346789 )
J. Med. Internet Res. - Definition of Health 2.0 and Medicine 2.0: a systematic review. ( 0,640532407209656 )
Int J Med Inform - MEDRank: using graph-based concept ranking to index biomedical texts. ( 0,639359860871745 )
J Chem Inf Model - Chemical and biological properties of frequent screening hits. ( 0,639085567620415 )
J. Med. Internet Res. - A search engine to access PubMed monolingual subsets: proof of concept and evaluation in French. ( 0,636089676432912 )
Comput Methods Programs Biomed - RDFBuilder: a tool to automatically build RDF-based interfaces for MAGE-OM microarray data sources. ( 0,629263838651379 )
J Biomed Inform - Predicting microRNA modulation in human prostate cancer using a simple String IDentifier (SID1.0). ( 0,629134357549939 )
Health Info Libr J - Where and how to search for information on the effectiveness of public health interventions - a case study for prevention of cardiovascular disease. ( 0,62907144353938 )
Health Info Libr J - Can we prioritise which databases to search? A case study using a systematic review of frozen shoulder management. ( 0,628159216803921 )
Res Synth Methods - Inquisitio validus Index Medicus: A simple method of validating MEDLINE systematic review searches. ( 0,626240118157452 )
J Biomed Inform - A semi-supervised approach to extract pharmacogenomics-specific drug-gene pairs from biomedical literature for personalized medicine. ( 0,625827868820479 )
Methods Inf Med - A survey on visual information search behavior and requirements of radiologists. ( 0,624640748177068 )
J Am Med Inform Assoc - Improving image retrieval effectiveness via query expansion using MeSH hierarchical structure. ( 0,623562978806442 )
BMC Med Inform Decis Mak - Accessing the public MIMIC-II intensive care relational database for clinical research. ( 0,623431473337729 )
AMIA Annu Symp Proc - Development and evaluation of a prototype search engine to meet public health information needs. ( 0,619552037121878 )
J Am Med Inform Assoc - Directing the public to evidence-based online content. ( 0,619551329393248 )
J Chem Inf Model - Scaffold hopping by fragment replacement. ( 0,619119001367098 )
J Biomed Inform - Knowledge-based personalized search engine for the Web-based Human Musculoskeletal System Resources (HMSR) in biomechanics. ( 0,618903546647263 )
J Biomed Inform - Improving search over Electronic Health Records using UMLS-based query expansion through random walks. ( 0,618887847506721 )
J. Med. Internet Res. - Comparative analysis of online health queries originating from personal computers and smart devices on a consumer health information portal. ( 0,616864920588803 )
Res Synth Methods - Comprehensive computer searches and reporting in systematic reviews. ( 0,616056784592975 )
J. Med. Internet Res. - A study of innovative features in scholarly open access journals. ( 0,614126221692013 )
J Biomed Inform - Using statistical text mining to supplement the development of an ontology. ( 0,610757464463372 )
Perspect Health Inf Manag - Risk factors for bladder cancer: challenges of conducting a literature search using PubMed. ( 0,608506349081456 )
J Biomed Inform - A mutation-centric approach to identifying pharmacogenomic relations in text. ( 0,606880195284515 )
J. Med. Internet Res. - The impact of search engine selection and sorting criteria on vaccination beliefs and attitudes: two experiments manipulating Google output. ( 0,606311331515567 )
J Chem Inf Model - Do not hesitate to use Tversky-and other hints for successful active analogue searches with feature count descriptors. ( 0,605429974931416 )
BMC Med Inform Decis Mak - How are the different specialties represented in the major journals in general medicine? ( 0,60417048337081 )
AMIA Annu Symp Proc - Semantic MEDLINE for discovery browsing: using semantic predications and the literature-based discovery paradigm to elucidate a mechanism for the obesity paradox. ( 0,603480589289792 )
AMIA Annu Symp Proc - Using Co-Authoring and Cross-Referencing Information for MEDLINE Indexing. ( 0,603222979310555 )
J Am Med Inform Assoc - Design and usability study of an iconic user interface to ease information retrieval of medical guidelines. ( 0,601826225080633 )
Int J Med Inform - FindZebra: a search engine for rare diseases. ( 0,600385027429806 )
AMIA Annu Symp Proc - Web to world: predicting transitions from self-diagnosis to the pursuit of local medical assistance in web search. ( 0,600128790770538 )
J Am Med Inform Assoc - Automatically extracting sentences from Medline citations to support clinicians' information needs. ( 0,599233780479436 )
J Biomed Inform - A comparison of evaluation metrics for biomedical journals, articles, and websites in terms of sensitivity to topic. ( 0,598993477743617 )
Telemed J E Health - A web search on environmental topics: what is the role of ranking? ( 0,598575329660412 )
J Biomed Inform - Supporting effective health and biomedical information retrieval and navigation: a novel facet view interface evaluation. ( 0,598074932577863 )
Methods Inf Med - Technology-induced errors. The current use of frameworks and models from the biomedical and life sciences literatures. ( 0,597531354523877 )
Appl Clin Inform - Usability of selected databases for low-resource clinical decision support. ( 0,59709050539934 )
J Biomed Inform - Automatic generation of investigator bibliographies for institutional research networking systems. ( 0,596845689182772 )
AMIA Annu Symp Proc - A bottom-up approach to MEDLINE indexing recommendations. ( 0,596454385042481 )
Int J Med Inform - The publication echo: effects of retrieving literature in PubMed by year of publication. ( 0,59316279278201 )
J Chem Inf Model - Large-scale similarity search profiling of ChEMBL compound data sets. ( 0,591183688094979 )
J. Med. Internet Res. - Searching for truth: internet search patterns as a method of investigating online responses to a Russian illicit drug policy debate. ( 0,589359583022314 )
J Integr Bioinform - On comparison of SimTandem with state-of-the-art peptide identification tools, efficiency of precursor mass filter and dealing with variable modifications. ( 0,587877881805175 )
J Integr Bioinform - Bio-Search Computing: integration and global ranking of bioinformatics search results. ( 0,585648708527181 )
J Am Med Inform Assoc - DCMDSM: a DICOM decomposed storage model. ( 0,585236366657968 )
J. Med. Internet Res. - Clinician search behaviors may be influenced by search engine design. ( 0,58102631339132 )