Comput Methods Programs Biomed - Alternatives to relational database: comparison of NoSQL and XML approaches for clinical data storage.

Tópicos

{ compound(1573) activ(1297) structur(1058) }
{ learn(2355) train(1041) set(1003) }
{ extract(1171) text(1153) clinic(932) }
{ data(1714) softwar(1251) tool(1186) }
{ sampl(1606) size(1419) use(1276) }
{ use(2086) technolog(871) perceiv(783) }
{ implement(1333) system(1263) develop(1122) }
{ method(2212) result(1239) propos(1039) }
{ data(1737) use(1416) pattern(1282) }
{ system(1976) rule(880) can(841) }
{ motion(1329) object(1292) video(1091) }
{ concept(1167) ontolog(924) domain(897) }
{ search(2224) databas(1162) retriev(909) }
{ featur(1941) imag(1645) propos(1176) }
{ howev(809) still(633) remain(590) }
{ record(1888) medic(1808) patient(1693) }
{ method(1219) similar(1157) match(930) }
{ error(1145) method(1030) estim(1020) }
{ chang(1828) time(1643) increas(1301) }
{ algorithm(1844) comput(1787) effici(935) }
{ model(2220) cell(1177) simul(1124) }
{ perform(999) metric(946) measur(919) }
{ model(2341) predict(2261) use(1141) }
{ studi(1119) effect(1106) posit(819) }
{ health(3367) inform(1360) care(1135) }
{ model(2656) set(1616) predict(1553) }
{ time(1939) patient(1703) rate(768) }
{ use(1733) differ(960) four(931) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ method(1969) cluster(1462) data(1082) }
{ model(3404) distribut(989) bayesian(671) }
{ can(774) often(719) complex(702) }
{ imag(1947) propos(1133) code(1026) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ featur(3375) classif(2383) classifi(1994) }
{ imag(2830) propos(1344) filter(1198) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ take(945) account(800) differ(722) }
{ studi(2440) review(1878) systemat(933) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ problem(2511) optim(1539) algorithm(950) }
{ clinic(1479) use(1117) guidelin(835) }
{ method(1557) propos(1049) approach(1037) }
{ design(1359) user(1324) use(1319) }
{ control(1307) perform(991) simul(935) }
{ care(1570) inform(1187) nurs(1089) }
{ general(901) number(790) one(736) }
{ method(984) reconstruct(947) comput(926) }
{ case(1353) use(1143) diagnosi(1136) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ research(1085) discuss(1038) issu(1018) }
{ system(1050) medic(1026) inform(1018) }
{ import(1318) role(1303) understand(862) }
{ visual(1396) interact(850) tool(830) }
{ perform(1367) use(1326) method(1137) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ gene(2352) biolog(1181) express(1162) }
{ data(3008) multipl(1320) sourc(1022) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ patient(1821) servic(1111) care(1106) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ structur(1116) can(940) graph(676) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ use(976) code(926) identifi(902) }
{ decis(3086) make(1611) patient(1517) }
{ process(1125) use(805) approach(778) }
{ activ(1452) weight(1219) physic(1104) }
{ detect(2391) sensit(1101) algorithm(908) }

Resumo

Clinical data are dynamic in nature, often arranged hierarchically and stored as free text and numbers. Effective management of clinical data and the transformation of the data into structured format for data analysis are therefore challenging issues in electronic health records development. Despite the popularity of relational databases, the scalability of the NoSQL database model and the document-centric data structure of XML databases appear to be promising features for effective clinical data management. In this paper, three database approaches--NoSQL, XML-enabled and native XML--are investigated to evaluate their suitability for structured clinical data. The database query performance is reported, together with our experience in the databases development. The results show that NoSQL database is the best choice for query speed, whereas XML databases are advantageous in terms of scalability, flexibility and extensibility, which are essential to cope with the characteristics of clinical data. While NoSQL and XML technologies are relatively new compared to the conventional relational database, both of them demonstrate potential to become a key database technology for clinical data management as the technology further advances.

Resumo Limpo

clinic data dynam natur often arrang hierarch store free text number effect manag clinic data transform data structur format data analysi therefor challeng issu electron health record develop despit popular relat databas scalabl nosql databas model documentcentr data structur xml databas appear promis featur effect clinic data manag paper three databas approachesnosql xmlenabl nativ xmlare investig evalu suitabl structur clinic data databas queri perform report togeth experi databas develop result show nosql databas best choic queri speed wherea xml databas advantag term scalabl flexibl extens essenti cope characterist clinic data nosql xml technolog relat new compar convent relat databas demonstr potenti becom key databas technolog clinic data manag technolog advanc

Resumos Similares

J Chem Inf Model - Prediction of activity cliffs using support vector machines. ( 0,690086162227441 )
J Chem Inf Model - Novel method for pharmacophore analysis by examining the joint pharmacophore space. ( 0,676327828567741 )
J Chem Inf Model - Application of support vector machine to three-dimensional shape-based virtual screening using comprehensive three-dimensional molecular shape overlay with known inhibitors. ( 0,650993227481839 )
J Chem Inf Model - admetSAR: a comprehensive source and free tool for assessment of chemical ADMET properties. ( 0,641574052716712 )
J Chem Inf Model - Modeling and benchmark data set for the inhibition of c-Jun N-terminal kinase-3. ( 0,63564729969493 )
J Chem Inf Model - Natural product-like virtual libraries: recursive atom-based enumeration. ( 0,632024113428207 )
J Biomed Inform - Mining connections between chemicals, proteins, and diseases extracted from Medline annotations. ( 0,629851402746778 )
J Chem Inf Model - In silico target predictions: defining a benchmarking data set and comparison of performance of the multiclass Na?ve Bayes and Parzen-Rosenblatt window. ( 0,625366080343993 )
J Chem Inf Model - Integrating medicinal chemistry, organic/combinatorial chemistry, and computational chemistry for the discovery of selective estrogen receptor modulators with Forecaster, a novel platform for drug discovery. ( 0,624992254783792 )
J Chem Inf Model - SHAFTS: a hybrid approach for 3D molecular similarity calculation. 1. Method and assessment of virtual screening. ( 0,618609056961124 )
J Chem Inf Model - Hit expansion approaches using multiple similarity methods and virtualized query structures. ( 0,613402091331726 )
J Chem Inf Model - Visual characterization and diversity quantification of chemical libraries: 1. creation of delimited reference chemical subspaces. ( 0,611466136347589 )
J Chem Inf Model - Navigating high-dimensional activity landscapes: design and application of the ligand-target differentiation map. ( 0,608160060322919 )
J Chem Inf Model - Systematic assessment of compound series with SAR transfer potential. ( 0,605626447342773 )
AMIA Annu Symp Proc - Active Learning-based corpus annotation--the PathoJen experience. ( 0,605508649931539 )
J Chem Inf Model - Target-specific support vector machine scoring in structure-based virtual screening: computational validation, in vitro testing in kinases, and effects on lung cancer cell proliferation. ( 0,604904637904914 )
J Chem Inf Model - From activity cliffs to activity ridges: informative data structures for SAR analysis. ( 0,604719077187297 )
Curr Comput Aided Drug Des - Development of Chemical Compound Libraries for In Silico Drug Screening. ( 0,603837083455541 )
J Chem Inf Model - Automated recycling of chemistry for virtual screening and library design. ( 0,60375347071478 )
J Chem Inf Model - Improving classical substructure-based virtual screening to handle extrapolation challenges. ( 0,603383248124382 )
J Chem Inf Model - Markov logic networks for optical chemical structure recognition. ( 0,602801776066666 )
J Am Med Inform Assoc - Active learning for clinical text classification: is it better than random sampling? ( 0,602750640652413 )
J Chem Inf Model - Identifying compound-target associations by combining bioactivity profile similarity search and public databases mining. ( 0,601361181731933 )
J Chem Inf Model - Mining the ChEMBL database: an efficient chemoinformatics workflow for assembling an ion channel-focused screening library. ( 0,601184035129417 )
J Chem Inf Model - Small-molecule 3D structure prediction using open crystallography data. ( 0,601125540694352 )
J Chem Inf Model - Target-independent prediction of drug synergies using only drug lipophilicity. ( 0,596266754797314 )
J Chem Inf Model - Conditional probabilistic analysis for prediction of the activity landscape and relative compound activities. ( 0,595882733995377 )
J Chem Inf Model - Multitarget structure-activity relationships characterized by activity-difference maps and consensus similarity measure. ( 0,595407380022205 )
J Chem Inf Model - Locating sweet spots for screening hits and evaluating pan-assay interference filters from the performance analysis of two lead-like libraries. ( 0,593055771202054 )
J Biomed Inform - Applying active learning to assertion classification of concepts in clinical text. ( 0,592044631002509 )
J Chem Inf Model - Automatic tailoring and transplanting: a practical method that makes virtual screening more useful. ( 0,591923795637665 )
J Chem Inf Model - MQN-mapplet: visualization of chemical space with interactive maps of DrugBank, ChEMBL, PubChem, GDB-11, and GDB-13. ( 0,591522244867155 )
J Chem Inf Model - SABRE: ligand/structure-based virtual screening approach using consensus molecular-shape pattern recognition. ( 0,590517388837943 )
IEEE Trans Pattern Anal Mach Intell - A Prototype Learning Framework Using EMD: Application to Complex Scenes Analysis. ( 0,589985325828613 )
J Chem Inf Model - Extraction of discontinuous structure-activity relationships from compound data sets through particle swarm optimization. ( 0,58956844918564 )
J Chem Inf Model - BioSM: metabolomics tool for identifying endogenous mammalian biochemical structures in chemical structure space. ( 0,588909126989938 )
J Chem Inf Model - Fighting high molecular weight in bioactive molecules with sub-pharmacophore-based virtual screening. ( 0,588275690938469 )
J Chem Inf Model - Ligand Classifier of Adaptively Boosting Ensemble Decision Stumps (LiCABEDS) and its application on modeling ligand functionality for 5HT-subtype GPCR families. ( 0,588041608143658 )
J Chem Inf Model - ColBioS-FlavRC: a collection of bioselective flavonoids and related compounds filtered from high-throughput screening outcomes. ( 0,587015999707342 )
J Chem Inf Model - A multivariate chemical similarity approach to search for drugs of potential environmental concern. ( 0,586215384911336 )
J Chem Inf Model - Prediction of individual compounds forming activity cliffs using emerging chemical patterns. ( 0,585216319964337 )
Brief. Bioinformatics - Toward more realistic drug-target interaction predictions. ( 0,584396577782487 )
J Chem Inf Model - Enrichment of chemical libraries docked to protein conformational ensembles and application to aldehyde dehydrogenase 2. ( 0,583962601483231 )
J Chem Inf Model - Discovery of chemical compound groups with common structures by a network analysis approach (affinity prediction method). ( 0,583069628919386 )
J Chem Inf Model - QSAR classification model for antibacterial compounds and its use in virtual screening. ( 0,582335592728987 )
J Chem Inf Model - An unbiased method to build benchmarking sets for ligand-based virtual screening and its application to GPCRs. ( 0,582217170953394 )
J Chem Inf Model - How do 2D fingerprints detect structurally diverse active compounds? Revealing compound subset-specific fingerprint features through systematic selection. ( 0,579368544134311 )
J Integr Bioinform - Database supported candidate search for metabolite identification. ( 0,578969804934107 )
J Am Med Inform Assoc - Evaluating the utility of syndromic surveillance algorithms for screening to detect potentially clonal hospital infection outbreaks. ( 0,578023668269618 )
J Chem Inf Model - Combining horizontal and vertical substructure relationships in scaffold hierarchies for activity prediction. ( 0,577656874599353 )
J Chem Inf Model - TIN-a combinatorial compound collection of synthetically feasible multicomponent synthesis products. ( 0,577559464412364 )
J Chem Inf Model - Evaluation and optimization of virtual screening workflows with DEKOIS 2.0--a public library of challenging docking benchmark sets. ( 0,577122921171679 )
J Chem Inf Model - Identification of 1,2,5-oxadiazoles as a new class of SENP2 inhibitors using structure based virtual screening. ( 0,57702625946453 )
J Chem Inf Model - In silico enzymatic synthesis of a 400,000 compound biochemical database for nontargeted metabolomics. ( 0,576269664711465 )
J Chem Inf Model - Identification of novel androgen receptor antagonists using structure- and ligand-based methods. ( 0,575664853224947 )
J Chem Inf Model - ZINClick: a database of 16 million novel, patentable, and readily synthesizable 1,4-disubstituted triazoles. ( 0,574773509114324 )
J Chem Inf Model - Polypharmacology directed compound data mining: identification of promiscuous chemotypes with different activity profiles and comparison to approved drugs. ( 0,574709703863627 )
J Chem Inf Model - Searching for substructures in fragment spaces. ( 0,574476071436734 )
J Chem Inf Model - Data set modelability by QSAR. ( 0,574122061701415 )
J Chem Inf Model - Identification of novel liver X receptor activators by structure-based modeling. ( 0,573819874685206 )
J Chem Inf Model - Exploring polypharmacology using a ROCS-based target fishing approach. ( 0,572671595162107 )
J Chem Inf Model - Similarity boosted quantitative structure-activity relationship--a systematic study of enhancing structural descriptors by molecular similarity. ( 0,572495614586348 )
J Chem Inf Model - Automated information extraction and structure-activity relationship analysis of cytochrome P450 substrates. ( 0,572429652624029 )
J Chem Inf Model - Knowledge-based libraries for predicting the geometric preferences of druglike molecules. ( 0,571421631379892 )
J Chem Inf Model - Extending the activity cliff concept: structural categorization of activity cliffs and systematic identification of different types of cliffs in the ChEMBL database. ( 0,571392001769385 )
J Chem Inf Model - Atom pair 2D-fingerprints perceive 3D-molecular shape and pharmacophores for very fast virtual screening of ZINC and GDB-17. ( 0,57117323191914 )
J Chem Inf Model - G-protein coupled receptors virtual screening using genetic algorithm focused chemical space. ( 0,570886370347888 )
J Chem Inf Model - Freely available conformer generation methods: how good are they? ( 0,570864685647768 )
J Chem Inf Model - Library enhancement through the wisdom of crowds. ( 0,570257294266629 )
Comput Methods Programs Biomed - Multistage approach for clustering and classification of ECG data. ( 0,569884841232574 )
J Chem Inf Model - Note on naive Bayes based on binary descriptors in cheminformatics. ( 0,569691241578666 )
IEEE Trans Neural Netw Learn Syst - Adaptive Batch Mode Active Learning. ( 0,569170884821968 )
J Chem Inf Model - Searching for recursively defined generic chemical patterns in nonenumerated fragment spaces. ( 0,567384833497853 )
J Biomed Inform - A method for determining the number of documents needed for a gold standard corpus. ( 0,567310471706229 )
Sci Data - Quantum chemistry structures and properties of 134 kilo molecules. ( 0,566689483767401 )
J Chem Inf Model - Prediction of new bioactive molecules using a Bayesian belief network. ( 0,566489423612979 )
J Chem Inf Model - Comparison of confirmed inactive and randomly selected compounds as negative training examples in support vector machine-based virtual screening. ( 0,566118462249226 )
J Chem Inf Model - Capturing structure-activity relationships from chemogenomic spaces. ( 0,564623339444257 )
J Chem Inf Model - Molecular topology analysis of the differences between drugs, clinical candidate compounds, and bioactive molecules. ( 0,563989733750001 )
J Chem Inf Model - Design of a three-dimensional multitarget activity landscape. ( 0,563798652207617 )
J Chem Inf Model - Computational repositioning and experimental validation of approved drugs for HIF-prolyl hydroxylase inhibition. ( 0,56376450387207 )
J Chem Inf Model - Improved chemical text mining of patents with infinite dictionaries and automatic spelling correction. ( 0,563538625385895 )
J Chem Inf Model - How diverse are diversity assessment methods? A comparative analysis and benchmarking of molecular descriptor space. ( 0,561993427699051 )
J Am Med Inform Assoc - 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. ( 0,561677921871439 )
J Chem Inf Model - A system for encoding and searching Markush structures. ( 0,561629093140557 )
J Biomed Inform - The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions. ( 0,560400512887614 )
J Chem Inf Model - Discovery of a7-nicotinic receptor ligands by virtual screening of the chemical universe database GDB-13. ( 0,560171413751961 )
J Chem Inf Model - Similarity searching for potent compounds using feature selection. ( 0,560131634739226 )
J Chem Inf Model - Discovery of novel acetohydroxyacid synthase inhibitors as active agents against Mycobacterium tuberculosis by virtual screening and bioassay. ( 0,559146441405599 )
J Chem Inf Model - Design of multitarget activity landscapes that capture hierarchical activity cliff distributions. ( 0,558929948161902 )
J Chem Inf Model - Identification of multitarget activity ridges in high-dimensional bioactivity spaces. ( 0,558892060923201 )
J Chem Inf Model - Searching for closely related ligands with different mechanisms of action using machine learning and mapping algorithms. ( 0,558023073196104 )
J Chem Inf Model - Neighborhood-based prediction of novel active compounds from SAR matrices. ( 0,556877144096725 )
J Chem Inf Model - Identification of a new class of FtsZ inhibitors by structure-based design and in vitro screening. ( 0,556748151265011 )
J Chem Inf Model - GPU accelerated chemical similarity calculation for compound library comparison. ( 0,556368250861487 )
J Chem Inf Model - Systematic identification of scaffolds representing compounds active against individual targets and single or multiple target families. ( 0,556292655823388 )
J Chem Inf Model - Novel mycosin protease MycP1 inhibitors identified by virtual screening and 4D fingerprints. ( 0,555693659422888 )
J Chem Inf Model - Harvesting classification trees for drug discovery. ( 0,553955182554791 )
J Chem Inf Model - Improving the use of ranking in virtual screening against HIV-1 integrase with triangular numbers and including ligand profiling with antitargets. ( 0,553902276564026 )
J Am Med Inform Assoc - Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010. ( 0,552776094927071 )