J Biomed Inform - A knowledge-driven conditional approach to extract pharmacogenomics specific drug-gene relationships from free text.

Topics

{ data(1737) use(1416) pattern(1282) }
{ extract(1171) text(1153) clinic(932) }
{ gene(2352) biolog(1181) express(1162) }
{ method(1557) propos(1049) approach(1037) }
{ search(2224) databas(1162) retriev(909) }
{ use(2086) technolog(871) perceiv(783) }
{ can(774) often(719) complex(702) }
{ studi(2440) review(1878) systemat(933) }
{ learn(2355) train(1041) set(1003) }
{ drug(1928) target(777) effect(648) }
{ bind(1733) structur(1185) ligand(1036) }
{ blood(1257) pressur(1144) flow(957) }
{ research(1218) medic(880) student(794) }
{ result(1111) use(1088) new(759) }
{ general(901) number(790) one(736) }
{ research(1085) discuss(1038) issu(1018) }
{ structur(1116) can(940) graph(676) }
{ method(2212) result(1239) propos(1039) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ sequenc(1873) structur(1644) protein(1328) }
{ method(1219) similar(1157) match(930) }
{ imag(2830) propos(1344) filter(1198) }
{ take(945) account(800) differ(722) }
{ care(1570) inform(1187) nurs(1089) }
{ method(984) reconstruct(947) comput(926) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ import(1318) role(1303) understand(862) }
{ compound(1573) activ(1297) structur(1058) }
{ studi(1119) effect(1106) posit(819) }
{ health(3367) inform(1360) care(1135) }
{ model(3480) simul(1196) paramet(876) }
{ group(2977) signific(1463) compar(1072) }
{ data(3008) multipl(1320) sourc(1022) }
{ model(3404) distribut(989) bayesian(671) }
{ imag(1947) propos(1133) code(1026) }
{ system(1976) rule(880) can(841) }
{ imag(1057) registr(996) error(939) }
{ featur(3375) classif(2383) classifi(1994) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ chang(1828) time(1643) increas(1301) }
{ concept(1167) ontolog(924) domain(897) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ data(1714) softwar(1251) tool(1186) }
{ design(1359) user(1324) use(1319) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ featur(1941) imag(1645) propos(1176) }
{ case(1353) use(1143) diagnosi(1136) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ perform(999) metric(946) measur(919) }
{ system(1050) medic(1026) inform(1018) }
{ model(2341) predict(2261) use(1141) }
{ visual(1396) interact(850) tool(830) }
{ perform(1367) use(1326) method(1137) }
{ spatial(1525) area(1432) region(1030) }
{ record(1888) medic(1808) patient(1693) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ state(1844) use(1261) util(961) }
{ patient(2837) hospit(1953) medic(668) }
{ model(2656) set(1616) predict(1553) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ sampl(1606) size(1419) use(1276) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ use(976) code(926) identifi(902) }
{ use(1733) differ(960) four(931) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ process(1125) use(805) approach(778) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }
{ detect(2391) sensit(1101) algorithm(908) }

Abstract

An important task in pharmacogenomics (PGx) studies is to identify genetic variants that may affect drug response. Many systematic and integrative computational approaches to PGx depend on accurate, comprehensive, and machine-understandable knowledge bases of drug-gene relationships. The scientific literature is one of the most comprehensive sources of PGx-specific drug-gene relationships, but this knowledge is buried in a large volume of free text with limited machine understandability. There is therefore a need for automatic approaches that extract structured PGx-specific drug-gene relationships from unstructured free-text literature. In this study, we developed a conditional relationship extraction approach that extracts PGx-specific drug-gene pairs from 20 million MEDLINE abstracts, using known drug-gene pairs as prior knowledge. The conditional approach significantly improves precision and F1 measure over the unconditioned approach, at the cost of lower recall (precision: 0.345 vs. 0.11; recall: 0.481 vs. 1.00; F1: 0.402 vs. 0.201). We use a co-occurrence-based method as the underlying relationship extraction method because of its simplicity. It can be replaced by, or combined with, more advanced methods such as machine learning or natural language processing to further improve the extraction of drug-gene relationships from free text. Our method is not limited to drug-gene relationships; it can be generalized to extract other types of relationships when related background knowledge bases exist.
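
The abstract gives no implementation details, so the following is only a minimal sketch of how co-occurrence extraction conditioned on known pairs might look. The conditioning rule used here (keep an abstract only if it mentions at least one known drug-gene pair) is an assumption made for illustration, not the authors' documented procedure, and the lexicons, abstracts, and gold standard are invented toy data.

```python
import re
from itertools import product


def cooccurring_pairs(text, drugs, genes):
    """Return every (drug, gene) pair whose names both occur in the text."""
    tokens = set(re.findall(r"[a-z0-9]+", text.lower()))
    return set(product(drugs & tokens, genes & tokens))


def extract(abstracts, drugs, genes, known_pairs=None):
    """Co-occurrence extraction over a list of abstracts.

    If known_pairs is given, only abstracts that mention at least one known
    pair contribute (ASSUMED conditioning rule, for illustration only).
    """
    extracted = set()
    for text in abstracts:
        pairs = cooccurring_pairs(text, drugs, genes)
        if known_pairs is not None and not (pairs & known_pairs):
            continue  # abstract shows no prior-knowledge evidence; skip it
        extracted |= pairs
    return extracted


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def evaluate(extracted, gold):
    """Return (precision, recall, F1) of extracted pairs against a gold set."""
    true_positives = len(extracted & gold)
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall, f1_score(precision, recall)


# Hypothetical toy data (not taken from the study).
ABSTRACTS = [
    "Warfarin dose is influenced by CYP2C9 and VKORC1 variants.",
    "Aspirin was compared with warfarin; TP53 expression was measured.",
    "Clopidogrel response depends on CYP2C19 genotype.",
]
DRUGS = {"warfarin", "aspirin", "clopidogrel"}
GENES = {"cyp2c9", "vkorc1", "tp53", "cyp2c19"}
KNOWN = {("warfarin", "cyp2c9")}
GOLD = {("warfarin", "cyp2c9"), ("warfarin", "vkorc1"), ("clopidogrel", "cyp2c19")}

unconditioned = extract(ABSTRACTS, DRUGS, GENES)
conditioned = extract(ABSTRACTS, DRUGS, GENES, known_pairs=KNOWN)

print("unconditioned:", evaluate(unconditioned, GOLD))
print("conditioned:  ", evaluate(conditioned, GOLD))
```

On this toy input the conditioned run discards the abstracts that contain no known pair, which removes spurious pairs but also loses one true pair. That mirrors the direction of the reported trade-off: higher precision, lower recall. The `f1_score` helper can also be applied to the reported figures: `f1_score(0.345, 0.481)` rounds to 0.402, matching the abstract.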

Cleaned Abstract

import task pharmacogenom pgx studi identifi genet variant may impact drug respons success mani systemat integr comput approach pgx studi depend avail accur comprehens machin understand druggen relationship knowledg base scientif literatur one comprehens knowledg sourc pgxspecif druggen relationship howev major barrier access inform knowledg buri larg amount free text limit machin understand therefor need develop automat approach extract structur pgxspecif druggen relationship unstructur free text literatur studi develop condit relationship extract approach extract pgxspecif druggen pair million medlin abstract use known druggen pair prior knowledg demonstr condit druggen relationship extract approach signific improv precis f measur compar uncondit approach precis vs recal vs f vs studi method base cooccurr use under relationship extract method simplic can replac combin advanc method machin learn natur languag process approach improv perform druggen relationship extract free text method limit extract druggen relationship can general extract type relationship relat background knowledg base exist

Similar Abstracts

Artif Intell Med - Development of traditional Chinese medicine clinical data warehouse for medical knowledge discovery and decision support. ( 0,827282591197731 )
J Biomed Inform - Sequential patterns mining and gene sequence visualization to discover novelty from microarray data. ( 0,808128495996284 )
Artif Intell Med - Discovering metric temporal constraint networks on temporal databases. ( 0,793863804422073 )
J Med Syst - Data mining in healthcare and biomedicine: a survey of the literature. ( 0,784936386019628 )
J Med Syst - Discovering medical knowledge using association rule mining in young adults with acute myocardial infarction. ( 0,770586795483454 )
AMIA Annu Symp Proc - Linked data and online classifications to organise mined patterns in patient data. ( 0,767814660419046 )
J Biomed Inform - Mining association language patterns using a distributional semantic model for negative life event classification. ( 0,767016349468667 )
J Biomed Inform - Unraveling complex temporal associations in cellular systems across multiple time-series microarray datasets. ( 0,74899306326974 )
IEEE Trans Vis Comput Graph - Abstracting Attribute Space for Transfer Function Exploration and Design. ( 0,741977408439357 )
J Biomed Inform - Automatic construction of a large-scale and accurate drug-side-effect association knowledge base from biomedical literature. ( 0,730008196500334 )
J Am Med Inform Assoc - Modeling temporal relationships in large scale clinical associations. ( 0,711837087042427 )
J Am Med Inform Assoc - Web-scale pharmacovigilance: listening to signals from the crowd. ( 0,703259105077885 )
AMIA Annu Symp Proc - Drug repositioning using disease associated biological processes and network analysis of drug targets. ( 0,701716063871542 )
J Biomed Inform - Exploring the ncRNA-ncRNA patterns based on bridging rules. ( 0,699126110716546 )
IEEE Trans Vis Comput Graph - Splatterplots: Overcoming Overdraw in Scatter Plots. ( 0,687658285510548 )
J Biomed Inform - Discovering discovery patterns with Predication-based Semantic Indexing. ( 0,683747124974142 )
Inform Health Soc Care - A methodological framework for the analysis of highly intensive, multimodal and heterogeneous data in the context of health-enabling technologies and ambient-assisted living. ( 0,679192018494142 )
BMC Med Inform Decis Mak - Detecting causality from online psychiatric texts using inter-sentential language patterns. ( 0,676788707994274 )
J Am Med Inform Assoc - The TOKEn project: knowledge synthesis for in silico science. ( 0,66915881337236 )
J Integr Bioinform - Integrated simultaneous analysis of different biomedical data types with exact weighted bi-cluster editing. ( 0,667082100753383 )
Artif Intell Med - On mining clinical pathway patterns from medical behaviors. ( 0,662021912696102 )
Med Biol Eng Comput - Targeting an efficient target-to-target interval for P300 speller brain-computer interfaces. ( 0,656903084565825 )
Brief. Bioinformatics - Discriminative pattern mining and its applications in bioinformatics. ( 0,648395551266149 )
J Biomed Inform - Summarizing clinical pathways from event logs. ( 0,644552557772815 )
J Integr Bioinform - Prognostic prediction through biclustering-based classification of clinical gene expression time series. ( 0,644155779418378 )
AMIA Annu Symp Proc - Concordance of Electronic Health Record (EHR) Data Describing Delirium at a VA Hospital. ( 0,643307989777794 )
Comput Methods Programs Biomed - A novel data mining mechanism considering bio-signal and environmental data with applications on asthma monitoring. ( 0,642674540109214 )
J Am Med Inform Assoc - Using information mining of the medical literature to improve drug safety. ( 0,639330955992291 )
J Biomed Inform - Text mining for traditional Chinese medical knowledge discovery: a survey. ( 0,638404247102609 )
J Biomed Inform - Biomedical text mining and its applications in cancer research. ( 0,636948844260407 )
AMIA Annu Symp Proc - Exploring generalized association rule mining for disease co-occurrences. ( 0,634271404173763 )
Brief. Bioinformatics - Combining literature text mining with microarray data: advances for system biology modeling. ( 0,632963988150724 )
AMIA Annu Symp Proc - Differences in nationwide cohorts of acupuncture users identified using structured and free text medical records. ( 0,632885534542262 )
BMC Med Inform Decis Mak - Discovering context-specific relationships from biological literature by using multi-level context terms. ( 0,632260109093944 )
AMIA Annu Symp Proc - PubMedMiner: Mining and Visualizing MeSH-based Associations in PubMed. ( 0,631918567572397 )
J Biomed Inform - Time motion studies in healthcare: what are we talking about? ( 0,631296934132741 )
J Chem Inf Model - Emerging pattern mining to aid toxicological knowledge discovery. ( 0,627805678399114 )
J Biomed Inform - A methodology for interactive mining and visual analysis of clinical event patterns using electronic health record data. ( 0,627419504366098 )
Methods Inf Med - Health level seven interoperability strategy: big data, incrementally structured. ( 0,625684158046302 )
Comput. Biol. Med. - Using positive and negative patterns to extract information from journal articles regarding the regulation of a target gene by a transcription factor. ( 0,618247293412762 )
J Am Med Inform Assoc - Using ontology-based annotation to profile disease research. ( 0,618097571559582 )
Comput Math Methods Med - Development of the complex general linear model in the Fourier domain: application to fMRI multiple input-output evoked responses for single subjects. ( 0,61760174027488 )
Int J Health Geogr - Developing GIS-based eastern equine encephalitis vector-host models in Tuskegee, Alabama. ( 0,617186519804501 )
J Biomed Inform - Interestingness measures and strategies for mining multi-ontology multi-level association rules from gene ontology annotations for the discovery of new GO relationships. ( 0,612577785709663 )
J Am Med Inform Assoc - A vector space model approach to identify genetically related diseases. ( 0,610752808478597 )
J Biomed Inform - Is standard multivariate analysis sufficient in clinical and epidemiological studies? ( 0,608338424504229 )
J Med Syst - Characterizing mammography reports for health analytics. ( 0,60728739818626 )
Methods Inf Med - Adaptive semantic tag mining from heterogeneous clinical research texts. ( 0,59547284714534 )
Comput Methods Programs Biomed - Multimodal fusion of biomedical data at different temporal and dimensional scales. ( 0,594499675610263 )
BMC Med Inform Decis Mak - Combining classifiers for robust PICO element detection. ( 0,592602457752339 )
J. Med. Internet Res. - Missing data approaches in eHealth research: simulation study and a tutorial for nonmathematically inclined researchers. ( 0,59231201712813 )
J Am Med Inform Assoc - Role of genetic heterogeneity and epistasis in bladder cancer susceptibility and outcome: a learning classifier system approach. ( 0,591714206991599 )
Comput. Biol. Med. - Monitoring care processes in the gynecologic oncology department. ( 0,591545949333447 )
J Biomed Inform - Identifying direct miRNA-mRNA causal regulatory relationships in heterogeneous data. ( 0,587276659432284 )
AMIA Annu Symp Proc - Semantic annotation of clinical events for generating a problem list. ( 0,583592165758894 )
J Am Med Inform Assoc - A la Recherche du Temps Perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge. ( 0,580862354302833 )
Artif Intell Med - Employing heat maps to mine associations in structured routine care data. ( 0,57626487716109 )
Artif Intell Med - Multi-way association extraction and visualization from biological text documents using hyper-graphs: applications to genetic association studies for diseases. ( 0,573802310941005 )
Int J Med Inform - A methodology to enhance spatial understanding of disease outbreak events reported in news articles. ( 0,573541868850104 )
Comput Math Methods Med - Ranking biomedical annotations with annotator's semantic relevancy. ( 0,573230672374648 )
Brief. Bioinformatics - Modern bioinformatics meets traditional Chinese medicine. ( 0,572974746087601 )
Methods Inf Med - Mining health care administrative data with temporal association rules on hybrid events. ( 0,569840480788089 )
Comput Methods Programs Biomed - Structural identifiability and indistinguishability analyses of the minimal model and a euglycemic hyperinsulinemic clamp model for glucose-insulin dynamics. ( 0,569516765812806 )
AMIA Annu Symp Proc - It's about this and that: a description of anaphoric expressions in clinical text. ( 0,568785514015258 )
J Am Med Inform Assoc - Exploiting time in electronic health record correlations. ( 0,568452121831492 )
AMIA Annu Symp Proc - Survival association rule mining towards type 2 diabetes risk assessment. ( 0,567758873090395 )
J Biomed Inform - Determining the difficulty of Word Sense Disambiguation. ( 0,563885932473297 )
J Integr Bioinform - Mining and analysing spatio-temporal patterns of gene expression in an integrative database framework. ( 0,5627566295084 )
J Med Syst - Data mining techniques for assisting the diagnosis of pressure ulcer development in surgical patients. ( 0,559355760000728 )
J Biomed Inform - Incorporating temporal EHR data in predictive models for risk stratification of renal function deterioration. ( 0,559094671856695 )
IEEE J Biomed Health Inform - Big heart data: advancing health informatics through data sharing in cardiovascular imaging. ( 0,553944225179263 )
J Biomed Inform - Selecting information in electronic health records for knowledge acquisition. ( 0,551551628200353 )
AMIA Annu Symp Proc - Improving perceived and actual text difficulty for health information consumers using semi-automated methods. ( 0,551295626674534 )
Int J Health Geogr - Free and simple GIS as appropriate for health mapping in a low resource setting: a case study in eastern Indonesia. ( 0,550558652580747 )
Comput. Biol. Med. - Frequent patterns mining in multiple biological sequences. ( 0,549635965326161 )
Brief. Bioinformatics - A primer to frequent itemset mining for bioinformatics. ( 0,548120842460353 )
J Biomed Inform - Querying temporal clinical databases on granular trends. ( 0,546596814960683 )
J Am Med Inform Assoc - Induced lexico-syntactic patterns improve information extraction from online medical forums. ( 0,544918855278355 )
J Biomed Inform - Knowledge based word-concept model estimation and refinement for biomedical text mining. ( 0,544161383157351 )
Comput Biol Chem - Deciphering histone code of transcriptional regulation in malaria parasites by large-scale data mining. ( 0,54204437455078 )
Comput. Biol. Med. - Evaluation of a Teleform-based data collection system: a multi-center obesity research case study. ( 0,541083174862481 )
Comput. Biol. Med. - THEME: a web tool for loop-design microarray data analysis. ( 0,541002237381154 )
J Biomed Inform - Discovery of clinical pathway patterns from event logs using probabilistic topic models. ( 0,540763788031316 )
Comput Biol Chem - Computational identification and characterization of primate-specific microRNAs in human genome. ( 0,540005100417895 )
J Med Syst - Latent treatment pattern discovery for clinical processes. ( 0,539683987947953 )
IEEE Trans Image Process - Max-margin multiattribute learning with low-rank constraint. ( 0,539011473695272 )
Comput. Biol. Med. - Identifying high-cost patients using data mining techniques and a small set of non-trivial attributes. ( 0,537902428736174 )
AMIA Annu Symp Proc - Desiderata for healthcare integrated data repositories based on architectural comparison of three public repositories. ( 0,537358324301429 )
BMC Med Inform Decis Mak - Dynamic summarization of bibliographic-based data. ( 0,536972893765291 )
Neural Comput - Pavlov's dog associative learning demonstrated on synaptic-like organic transistors. ( 0,53682204045783 )
Med Decis Making - Creating compact comparative health care information: what are the key quality attributes to present for cataract and total hip or knee replacement surgery? ( 0,536604039330837 )
Comput Methods Programs Biomed - Micro-Analyzer: automatic preprocessing of Affymetrix microarray data. ( 0,534877566474628 )
J Am Med Inform Assoc - Don't take your EHR to heaven, donate it to science: legal and research policies for EHR post mortem. ( 0,534637154386641 )
J Chem Inf Model - Improved chemical text mining of patents with infinite dictionaries and automatic spelling correction. ( 0,534163385046293 )
J Biomed Inform - Where we stand, where we are moving: Surveying computational techniques for identifying miRNA genes and uncovering their regulatory role. ( 0,532820911825521 )
Methods Inf Med - Prioritising lexical patterns to increase axiomatisation in biomedical ontologies. The role of localisation and modularity. ( 0,532444525048354 )
Wiley Interdiscip Rev Syst Biol Med - Mediators and dynamics of DNA methylation. ( 0,531587538399339 )
J Biomed Inform - A data recipient centered de-identification method to retain statistical attributes. ( 0,529099458284317 )
Brief. Bioinformatics - Literature-aided interpretation of gene expression data with the weighted global test. ( 0,528795268423665 )
IEEE Trans Vis Comput Graph - The Topological Effects of Smoothing. ( 0,528750701086042 )