J Am Med Inform Assoc - Feature engineering combined with machine learning and rule-based methods for structured information extraction from narrative clinical discharge summaries.

Tópicos

{ extract(1171) text(1153) clinic(932) }
{ featur(3375) classif(2383) classifi(1994) }
{ use(1733) differ(960) four(931) }
{ assess(1506) score(1403) qualiti(1306) }
{ method(2212) result(1239) propos(1039) }
{ concept(1167) ontolog(924) domain(897) }
{ method(1557) propos(1049) approach(1037) }
{ system(1976) rule(880) can(841) }
{ general(901) number(790) one(736) }
{ treatment(1704) effect(941) patient(846) }
{ learn(2355) train(1041) set(1003) }
{ model(2656) set(1616) predict(1553) }
{ first(2504) two(1366) second(1323) }
{ imag(2830) propos(1344) filter(1198) }
{ system(1050) medic(1026) inform(1018) }
{ analysi(2126) use(1163) compon(1037) }
{ detect(2391) sensit(1101) algorithm(908) }
{ imag(1947) propos(1133) code(1026) }
{ imag(1057) registr(996) error(939) }
{ patient(2315) diseas(1263) diabet(1191) }
{ data(1714) softwar(1251) tool(1186) }
{ model(2220) cell(1177) simul(1124) }
{ howev(809) still(633) remain(590) }
{ compound(1573) activ(1297) structur(1058) }
{ perform(1367) use(1326) method(1137) }
{ activ(1138) subject(705) human(624) }
{ structur(1116) can(940) graph(676) }
{ cancer(2502) breast(956) screen(824) }
{ process(1125) use(805) approach(778) }
{ model(3404) distribut(989) bayesian(671) }
{ can(774) often(719) complex(702) }
{ data(1737) use(1416) pattern(1282) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ method(1219) similar(1157) match(930) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ take(945) account(800) differ(722) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ chang(1828) time(1643) increas(1301) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ design(1359) user(1324) use(1319) }
{ control(1307) perform(991) simul(935) }
{ care(1570) inform(1187) nurs(1089) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ featur(1941) imag(1645) propos(1176) }
{ case(1353) use(1143) diagnosi(1136) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ perform(999) metric(946) measur(919) }
{ research(1085) discuss(1038) issu(1018) }
{ import(1318) role(1303) understand(862) }
{ model(2341) predict(2261) use(1141) }
{ visual(1396) interact(850) tool(830) }
{ studi(1119) effect(1106) posit(819) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ sampl(1606) size(1419) use(1276) }
{ gene(2352) biolog(1181) express(1162) }
{ data(3008) multipl(1320) sourc(1022) }
{ intervent(3218) particip(2042) group(1664) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ health(1844) social(1437) communiti(874) }
{ high(1669) rate(1365) level(1280) }
{ use(976) code(926) identifi(902) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }

Resumo

JECTIVE: A system that translates narrative text in the medical domain into structured representation is in great demand. The system performs three sub-tasks: concept extraction, assertion classification, and relation identification.DESIGN: The overall system consists of five steps: (1) pre-processing sentences, (2) marking noun phrases (NPs) and adjective phrases (APs), (3) extracting concepts that use a dosage-unit dictionary to dynamically switch two models based on Conditional Random Fields (CRF), (4) classifying assertions based on voting of five classifiers, and (5) identifying relations using normalized sentences with a set of effective discriminating features.MEASUREMENTS: Macro-averaged and micro-averaged precision, recall and F-measure were used to evaluate results.RESULTS: The performance is competitive with the state-of-the-art systems with micro-averaged F-measure of 0.8489 for concept extraction, 0.9392 for assertion classification and 0.7326 for relation identification.CONCLUSIONS: The system exploits an array of common features and achieves state-of-the-art performance. Prudent feature engineering sets the foundation of our systems. In concept extraction, we demonstrated that switching models, one of which is especially designed for telegraphic sentences, improved extraction of the treatment concept significantly. In assertion classification, a set of features derived from a rule-based classifier were proven to be effective for the classes such as conditional and possible. These classes would suffer from data scarcity in conventional machine-learning methods. In relation identification, we use two-staged architecture, the second of which applies pairwise classifiers to possible candidate classes. This architecture significantly improves performance.

Resumo Limpo

jectiv system translat narrat text medic domain structur represent great demand system perform three subtask concept extract assert classif relat identificationdesign overal system consist five step preprocess sentenc mark noun phrase nps adject phrase ap extract concept use dosageunit dictionari dynam switch two model base condit random field crf classifi assert base vote five classifi identifi relat use normal sentenc set effect discrimin featuresmeasur macroaverag microaverag precis recal fmeasur use evalu resultsresult perform competit stateoftheart system microaverag fmeasur concept extract assert classif relat identificationconclus system exploit array common featur achiev stateoftheart perform prudent featur engin set foundat system concept extract demonstr switch model one especi design telegraph sentenc improv extract treatment concept signific assert classif set featur deriv rulebas classifi proven effect class condit possibl class suffer data scarciti convent machinelearn method relat identif use twostag architectur second appli pairwis classifi possibl candid class architectur signific improv perform

Resumos Similares

AMIA Annu Symp Proc - Word Sense Disambiguation of clinical abbreviations with hyperdimensional computing. ( 0,823809379149593 )
J Biomed Inform - Ontology-guided feature engineering for clinical text classification. ( 0,818166962504772 )
J Am Med Inform Assoc - Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection. ( 0,808453080387664 )
J Am Med Inform Assoc - A comprehensive study of named entity recognition in Chinese clinical text. ( 0,798485689394319 )
J Am Med Inform Assoc - Assessing the role of a medication-indication resource in the treatment relation extraction from clinical text. ( 0,780533778531252 )
AMIA Annu Symp Proc - Extracting semantic lexicons from discharge summaries using machine learning and the C-Value method. ( 0,759985802326739 )
Artif Intell Med - Document classification for mining host pathogen protein-protein interactions. ( 0,758817442252282 )
AMIA Annu Symp Proc - Automatically pairing measured findings across narrative abdomen CT reports. ( 0,75879963310457 )
J Biomed Inform - Enhancing clinical concept extraction with distributional semantics. ( 0,756804444993732 )
AMIA Annu Symp Proc - Automatic acquisition of sublanguage semantic schema: towards the word sense disambiguation of clinical narratives. ( 0,752188068830148 )
J Am Med Inform Assoc - Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010. ( 0,747672750022584 )
J Am Med Inform Assoc - Comprehensive temporal information detection from clinical text: medical events, time, and TLINK identification. ( 0,746537613476943 )
J Am Med Inform Assoc - MITRE system for clinical assertion status classification. ( 0,746067994006147 )
J Am Med Inform Assoc - An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge. ( 0,744233431322853 )
AMIA Annu Symp Proc - TagLine: Information Extraction for Semi-Structured Text in Medical Progress Notes. ( 0,742936412908133 )
J Am Med Inform Assoc - Eventual situations for timeline extraction from clinical reports. ( 0,737926314151108 )
Artif Intell Med - Biomedical events extraction using the hidden vector state model. ( 0,737168110841227 )
J Biomed Inform - Towards generating a patient's timeline: extracting temporal relationships from clinical notes. ( 0,736565192990601 )
J Biomed Inform - Extraction of events and temporal expressions from clinical narratives. ( 0,736520280683073 )
J Am Med Inform Assoc - A flexible framework for recognizing events, temporal expressions, and temporal relations in clinical text. ( 0,734258872888986 )
J Am Med Inform Assoc - Automatic discourse connective detection in biomedical text. ( 0,726970675487111 )
J. Med. Internet Res. - Web 2.0-based crowdsourcing for high-quality gold standard development in clinical natural language processing. ( 0,724699985493588 )
J Am Med Inform Assoc - Functional evaluation of out-of-the-box text-mining tools for data-mining tasks. ( 0,722250507832567 )
AMIA Annu Symp Proc - A cloud-based approach to medical NLP. ( 0,721999639763016 )
AMIA Annu Symp Proc - Semantic processing to identify adverse drug event information from black box warnings. ( 0,721584736751297 )
AMIA Annu Symp Proc - Throw the bath water out, keep the baby: keeping medically-relevant terms for text mining. ( 0,721563219182432 )
AMIA Annu Symp Proc - Detecting abbreviations in discharge summaries using machine learning methods. ( 0,721182461099246 )
J Biomed Inform - Lessons learnt from the DDIExtraction-2013 Shared Task. ( 0,718501935123087 )
Int J Med Inform - Detecting temporal expressions in medical narratives. ( 0,717171742004961 )
AMIA Annu Symp Proc - Automatically classifying the role of citations in biomedical articles. ( 0,715306535102273 )
J Am Med Inform Assoc - Anaphoric relations in the clinical narrative: corpus creation. ( 0,714994958567649 )
AMIA Annu Symp Proc - Identifying discourse connectives in biomedical text. ( 0,711905731714378 )
J Am Med Inform Assoc - Pneumonia identification using statistical feature selection. ( 0,709135904896764 )
J Am Med Inform Assoc - Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification. ( 0,707835246167168 )
J Biomed Inform - The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions. ( 0,704693898502267 )
J Am Med Inform Assoc - Vaccine adverse event text mining system for extracting features from vaccine safety reports. ( 0,70411120244829 )
Appl Clin Inform - Representation of information about family relatives as structured data in electronic health records. ( 0,703025164601777 )
J Biomed Inform - Dynamic categorization of clinical research eligibility criteria by hierarchical clustering. ( 0,698738138567895 )
J Biomed Inform - UMLS content views appropriate for NLP processing of the biomedical literature vs. clinical text. ( 0,698102847939509 )
J Am Med Inform Assoc - Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives. ( 0,697881428272346 )
J Biomed Inform - Lexical patterns, features and knowledge resources for coreference resolution in clinical notes. ( 0,695046783504647 )
J Am Med Inform Assoc - Learning regular expressions for clinical text classification. ( 0,694689409460538 )
J Biomed Inform - MedTime: a temporal information extraction system for clinical narratives. ( 0,693865209386319 )
J Biomed Inform - Unsupervised biomedical named entity recognition: experiments with clinical and biological texts. ( 0,693316577067586 )
J Am Med Inform Assoc - Evaluating temporal relations in clinical text: 2012 i2b2 Challenge. ( 0,691336941764894 )
J Biomed Inform - A concept-driven biomedical knowledge extraction and visualization framework for conceptualization of text corpora. ( 0,691190434335905 )
J Am Med Inform Assoc - A hybrid system for temporal information extraction from clinical text. ( 0,688817724172154 )
J Am Med Inform Assoc - A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries. ( 0,688794675565262 )
J Biomed Inform - Desiderata for ontologies to be used in semantic annotation of biomedical documents. ( 0,68766288471949 )
J Biomed Inform - A new clustering method for detecting rare senses of abbreviations in clinical notes. ( 0,686881043240308 )
J Am Med Inform Assoc - Induced lexico-syntactic patterns improve information extraction from online medical forums. ( 0,684773382120997 )
J Am Med Inform Assoc - A classification approach to coreference in discharge summaries: 2011 i2b2 challenge. ( 0,683951635824737 )
J Med Syst - Redactable signatures for signed CDA Documents. ( 0,682131387881535 )
AMIA Annu Symp Proc - Inter-annotator reliability of medical events, coreferences and temporal relations in clinical narratives by annotators with varying levels of clinical expertise. ( 0,681420224534752 )
J Biomed Inform - Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text. ( 0,679565752500752 )
J Am Med Inform Assoc - A la Recherche du Temps Perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge. ( 0,678760323152778 )
AMIA Annu Symp Proc - Building gold standard corpora for medical natural language processing tasks. ( 0,676723422212048 )
J Am Med Inform Assoc - Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification. ( 0,675103485633528 )
J Am Med Inform Assoc - A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources. ( 0,67371759613672 )
J Biomed Inform - Anaphoric reference in clinical reports: characteristics of an annotated corpus. ( 0,673337047757789 )
J Biomed Inform - Evaluating the effects of machine pre-annotation and an interactive annotation interface on manual de-identification of clinical text. ( 0,670340563146237 )
Comput Math Methods Med - Ranking biomedical annotations with annotator's semantic relevancy. ( 0,670284084378173 )
J Biomed Inform - Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives. ( 0,669145373866614 )
J Biomed Inform - Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction. ( 0,668845930992389 )
Int J Med Inform - Detection of infectious symptoms from VA emergency department and primary care clinical documentation. ( 0,668242432844885 )
J Integr Bioinform - Automatic extraction of microorganisms and their habitats from free text using text mining workflows. ( 0,667940439593646 )
J Am Med Inform Assoc - Towards comprehensive syntactic and semantic annotations of the clinical narrative. ( 0,66772300495769 )
J Biomed Inform - A biological continuum based approach for efficient clinical classification. ( 0,667206362839234 )
J Biomed Inform - An enhanced CRFs-based system for information extraction from radiology reports. ( 0,666710710313969 )
AMIA Annu Symp Proc - Automatic identification of critical follow-up recommendation sentences in radiology reports. ( 0,665631752728074 )
J Am Med Inform Assoc - 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. ( 0,664333970598155 )
AMIA Annu Symp Proc - Mapping annotations with textual evidence using an scLDA model. ( 0,663631925772142 )
Artif Intell Med - Noninvasive evaluation of mental stress using by a refined rough set technique based on biomedical signals. ( 0,663331998615408 )
AMIA Annu Symp Proc - Document clustering of clinical narratives: a systematic study of clinical sublanguages. ( 0,662630587939025 )
Comput Methods Programs Biomed - BioAnnote: a software platform for annotating biomedical documents with application in medical learning environments. ( 0,662597887158452 )
J Am Med Inform Assoc - MedXN: an open source medication extraction and normalization tool for clinical text. ( 0,661759053764915 )
AMIA Annu Symp Proc - Qualitative analysis of workflow modifications used to generate the reference standard for the 2010 i2b2/VA challenge. ( 0,660924300511774 )
Artif Intell Med - Conceptual-driven classification for coding advise in health insurance reimbursement. ( 0,658825586214665 )
J Biomed Inform - Automatically extracting information needs from complex clinical questions. ( 0,658456484834655 )
J Biomed Inform - Identifying non-elliptical entity mentions in a coordinated NP with ellipses. ( 0,657621933481067 )
J Med Syst - A new approach for concealed information identification based on ERP assessment. ( 0,65450227319055 )
AMIA Annu Symp Proc - Generalizability and comparison of automatic clinical text de-identification methods and resources. ( 0,650034939172088 )
J Am Med Inform Assoc - Automated clinical trial eligibility prescreening: increasing the efficiency of patient identification for clinical trials in the emergency department. ( 0,649435051940868 )
J Am Med Inform Assoc - Automated concept-level information extraction to reduce the need for custom software and rules development. ( 0,648455974809882 )
Int J Med Inform - Bootstrapping a de-identification system for narrative patient records: cost-performance tradeoffs. ( 0,647245776001952 )
J Am Med Inform Assoc - Assisted annotation of medical free text using RapTAT. ( 0,646247402399274 )
AMIA Annu Symp Proc - Natural language processing to extract follow-up provider information from hospital discharge summaries. ( 0,644855927901498 )
AMIA Annu Symp Proc - Using ontology network structure in text mining. ( 0,644187708290709 )
J Am Med Inform Assoc - A sequence labeling approach to link medications and their attributes in clinical notes and clinical trial announcements for information extraction. ( 0,64399273576612 )
AMIA Annu Symp Proc - Natural language processing for lines and devices in portable chest x-rays. ( 0,643546683739676 )
BMC Med Inform Decis Mak - Recognition of medication information from discharge summaries using ensembles of classifiers. ( 0,643205401080799 )
AMIA Annu Symp Proc - Extracting Concepts Related to Homelessness from the Free Text of VA Electronic Medical Records. ( 0,643009164439986 )
Comput. Biol. Med. - A P300-based brain computer interface system for words typing. ( 0,642652086624656 )
J Biomed Inform - Common data model for natural language processing based on two existing standard information models: CDA+GrAF. ( 0,641874575564559 )
J Chem Inf Model - Automated information extraction and structure-activity relationship analysis of cytochrome P450 substrates. ( 0,641542680118961 )
AMIA Annu Symp Proc - Pharmacovigilance on twitter? Mining tweets for adverse drug reactions. ( 0,641021170260871 )
J Am Med Inform Assoc - Improving performance of natural language processing part-of-speech tagging on clinical narratives through domain adaptation. ( 0,638859912679339 )
AMIA Annu Symp Proc - Towards a semantic lexicon for clinical natural language processing. ( 0,638373688673844 )
Int J Med Inform - A methodology to enhance spatial understanding of disease outbreak events reported in news articles. ( 0,637535284021577 )
J Biomed Inform - Boosting performance of gene mention tagging system by hybrid methods. ( 0,637425972342358 )