J Biomed Inform - Identifying non-elliptical entity mentions in a coordinated NP with ellipses.

Tópicos

{ extract(1171) text(1153) clinic(932) }
{ can(774) often(719) complex(702) }
{ system(1976) rule(880) can(841) }
{ control(1307) perform(991) simul(935) }
{ perform(1367) use(1326) method(1137) }
{ problem(2511) optim(1539) algorithm(950) }
{ patient(2837) hospit(1953) medic(668) }
{ perform(999) metric(946) measur(919) }
{ framework(1458) process(801) describ(734) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ structur(1116) can(940) graph(676) }
{ use(976) code(926) identifi(902) }
{ detect(2391) sensit(1101) algorithm(908) }
{ data(1737) use(1416) pattern(1282) }
{ measur(2081) correl(1212) valu(896) }
{ featur(3375) classif(2383) classifi(1994) }
{ treatment(1704) effect(941) patient(846) }
{ case(1353) use(1143) diagnosi(1136) }
{ spatial(1525) area(1432) region(1030) }
{ model(3480) simul(1196) paramet(876) }
{ age(1611) year(1155) adult(843) }
{ group(2977) signific(1463) compar(1072) }
{ use(1733) differ(960) four(931) }
{ method(2212) result(1239) propos(1039) }
{ model(3404) distribut(989) bayesian(671) }
{ imag(1947) propos(1133) code(1026) }
{ inform(2794) health(2639) internet(1427) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ method(1219) similar(1157) match(930) }
{ imag(2830) propos(1344) filter(1198) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ take(945) account(800) differ(722) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ error(1145) method(1030) estim(1020) }
{ chang(1828) time(1643) increas(1301) }
{ learn(2355) train(1041) set(1003) }
{ concept(1167) ontolog(924) domain(897) }
{ method(1557) propos(1049) approach(1037) }
{ data(1714) softwar(1251) tool(1186) }
{ design(1359) user(1324) use(1319) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ general(901) number(790) one(736) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ featur(1941) imag(1645) propos(1176) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ research(1085) discuss(1038) issu(1018) }
{ system(1050) medic(1026) inform(1018) }
{ import(1318) role(1303) understand(862) }
{ model(2341) predict(2261) use(1141) }
{ visual(1396) interact(850) tool(830) }
{ compound(1573) activ(1297) structur(1058) }
{ studi(1119) effect(1106) posit(819) }
{ blood(1257) pressur(1144) flow(957) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ model(2656) set(1616) predict(1553) }
{ data(2317) use(1299) case(1017) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ sampl(1606) size(1419) use(1276) }
{ gene(2352) biolog(1181) express(1162) }
{ data(3008) multipl(1320) sourc(1022) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ process(1125) use(805) approach(778) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }

Resumo

Named entities in the biomedical domain are often written using a Noun Phrase (NP) along with a coordinating conjunction such as 'and' and 'or'. In addition, repeated words among named entity mentions are frequently omitted. It is often difficult to identify named entities. Although various Named Entity Recognition (NER) methods have tried to solve this problem, these methods can only deal with relatively simple elliptical patterns in coordinated NPs. We propose a new NER method for identifying non-elliptical entity mentions with simple or complex ellipses using linguistic rules and an entity mention dictionary. The GENIA and CRAFT corpora were used to evaluate the performance of the proposed system. The GENIA corpus was used to evaluate the performance of the system according to the quality of the dictionary. The GENIA corpus comprises 3434 non-elliptical entity mentions in 1585 coordinated NPs with ellipses. The system achieves 92.11% precision, 95.20% recall, and 93.63% F-score in identification of non-elliptical entity mentions in coordinated NPs. The accuracy of the system in resolving simple and complex ellipses is 94.54% and 91.95%, respectively. The CRAFT corpus was used to evaluate the performance of the system under realistic conditions. The system achieved 78.47% precision, 67.10% recall, and 72.34% F-score in coordinated NPs. The performance evaluations of the system show that it efficiently solves the problem caused by ellipses, and improves NER performance. The algorithm is implemented in PHP and the code can be downloaded from https://code.google.com/p/medtextmining/.

Resumo Limpo

name entiti biomed domain often written use noun phrase np along coordin conjunct addit repeat word among name entiti mention frequent omit often difficult identifi name entiti although various name entiti recognit ner method tri solv problem method can deal relat simpl ellipt pattern coordin nps propos new ner method identifi nonellipt entiti mention simpl complex ellips use linguist rule entiti mention dictionari genia craft corpora use evalu perform propos system genia corpus use evalu perform system accord qualiti dictionari genia corpus compris nonellipt entiti mention coordin nps ellips system achiev precis recal fscore identif nonellipt entiti mention coordin nps accuraci system resolv simpl complex ellips respect craft corpus use evalu perform system realist condit system achiev precis recal fscore coordin nps perform evalu system show effici solv problem caus ellips improv ner perform algorithm implement php code can download httpscodegooglecompmedtextmin

Resumos Similares

J Biomed Inform - MedTime: a temporal information extraction system for clinical narratives. ( 0,871226539926316 )
J Am Med Inform Assoc - MedXN: an open source medication extraction and normalization tool for clinical text. ( 0,869993348906515 )
J Am Med Inform Assoc - Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives. ( 0,866629623644624 )
J Am Med Inform Assoc - A hybrid system for temporal information extraction from clinical text. ( 0,864489763952306 )
J. Med. Internet Res. - Web 2.0-based crowdsourcing for high-quality gold standard development in clinical natural language processing. ( 0,853968312171359 )
Artif Intell Med - Biomedical events extraction using the hidden vector state model. ( 0,850189886563402 )
J Biomed Inform - A concept-driven biomedical knowledge extraction and visualization framework for conceptualization of text corpora. ( 0,847455933788494 )
J Am Med Inform Assoc - Eventual situations for timeline extraction from clinical reports. ( 0,847194326372193 )
J Am Med Inform Assoc - An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge. ( 0,843290584600002 )
J Biomed Inform - Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives. ( 0,843095212345282 )
J Biomed Inform - Anaphoric reference in clinical reports: characteristics of an annotated corpus. ( 0,841888902658174 )
J Am Med Inform Assoc - Automatic discourse connective detection in biomedical text. ( 0,841687243150408 )
AMIA Annu Symp Proc - Inter-annotator reliability of medical events, coreferences and temporal relations in clinical narratives by annotators with varying levels of clinical expertise. ( 0,835771483059027 )
J Am Med Inform Assoc - Assisted annotation of medical free text using RapTAT. ( 0,8347718610798 )
J Am Med Inform Assoc - Assessing the role of a medication-indication resource in the treatment relation extraction from clinical text. ( 0,833735913113488 )
J Biomed Inform - Extraction of events and temporal expressions from clinical narratives. ( 0,829281860267067 )
J Biomed Inform - Lexical patterns, features and knowledge resources for coreference resolution in clinical notes. ( 0,829259431132708 )
Int J Med Inform - Bootstrapping a de-identification system for narrative patient records: cost-performance tradeoffs. ( 0,827695011983688 )
J Am Med Inform Assoc - Anaphoric relations in the clinical narrative: corpus creation. ( 0,825225039055952 )
J Am Med Inform Assoc - A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources. ( 0,824034686587445 )
AMIA Annu Symp Proc - Semantic processing to identify adverse drug event information from black box warnings. ( 0,823028584891337 )
J Biomed Inform - Towards generating a patient's timeline: extracting temporal relationships from clinical notes. ( 0,822914433889998 )
J Biomed Inform - Evaluating the effects of machine pre-annotation and an interactive annotation interface on manual de-identification of clinical text. ( 0,822591395235935 )
Comput Math Methods Med - Ranking biomedical annotations with annotator's semantic relevancy. ( 0,822208194677893 )
J Am Med Inform Assoc - Using rule-based natural language processing to improve disease normalization in biomedical text. ( 0,821946506373651 )
Int J Med Inform - Detection of infectious symptoms from VA emergency department and primary care clinical documentation. ( 0,821504385167781 )
Appl Clin Inform - Representation of information about family relatives as structured data in electronic health records. ( 0,817219000819313 )
J Am Med Inform Assoc - Evaluating temporal relations in clinical text: 2012 i2b2 Challenge. ( 0,81689782916254 )
J Biomed Inform - Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction. ( 0,815573017527376 )
AMIA Annu Symp Proc - A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries. ( 0,814364135799317 )
J Am Med Inform Assoc - A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries. ( 0,814090898541988 )
Int J Med Inform - Detecting temporal expressions in medical narratives. ( 0,813017531789587 )
J Am Med Inform Assoc - Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification. ( 0,811539393698115 )
J Biomed Inform - A new clustering method for detecting rare senses of abbreviations in clinical notes. ( 0,811338584305301 )
AMIA Annu Symp Proc - Throw the bath water out, keep the baby: keeping medically-relevant terms for text mining. ( 0,809041656128564 )
J Am Med Inform Assoc - Evaluating the impact of pre-annotation on annotation speed and potential bias: natural language processing gold standard development for clinical named entity recognition in clinical trial announcements. ( 0,808544149856838 )
BMC Med Inform Decis Mak - Detecting causality from online psychiatric texts using inter-sentential language patterns. ( 0,806458329920669 )
AMIA Annu Symp Proc - Extracting semantic lexicons from discharge summaries using machine learning and the C-Value method. ( 0,805141337655527 )
AMIA Annu Symp Proc - Natural language processing for lines and devices in portable chest x-rays. ( 0,803162377669822 )
J Med Syst - Redactable signatures for signed CDA Documents. ( 0,798583327331618 )
J Biomed Inform - Text de-identification for privacy protection: a study of its impact on clinical text information content. ( 0,790072413419569 )
AMIA Annu Symp Proc - Natural language processing to extract follow-up provider information from hospital discharge summaries. ( 0,788307324223101 )
J Am Med Inform Assoc - A flexible framework for recognizing events, temporal expressions, and temporal relations in clinical text. ( 0,786994865128514 )
Sci Data - Building the graph of medicine from millions of clinical narratives. ( 0,786911263222139 )
AMIA Annu Symp Proc - Automatically pairing measured findings across narrative abdomen CT reports. ( 0,78102322527429 )
J Biomed Inform - Desiderata for ontologies to be used in semantic annotation of biomedical documents. ( 0,780877832720662 )
AMIA Annu Symp Proc - Mapping annotations with textual evidence using an scLDA model. ( 0,779508655813824 )
AMIA Annu Symp Proc - Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations. ( 0,778676335984849 )
J Am Med Inform Assoc - Syntactic parsing of clinical text: guideline and corpus development with handling ill-formed sentences. ( 0,777488681957175 )
J Am Med Inform Assoc - 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. ( 0,775219593523934 )
AMIA Annu Symp Proc - Extracting Concepts Related to Homelessness from the Free Text of VA Electronic Medical Records. ( 0,774261218408508 )
Comput Methods Programs Biomed - BioAnnote: a software platform for annotating biomedical documents with application in medical learning environments. ( 0,773112450883826 )
J Biomed Inform - Enhancing clinical concept extraction with distributional semantics. ( 0,770661901007698 )
J Integr Bioinform - Automatic extraction of microorganisms and their habitats from free text using text mining workflows. ( 0,770201490647543 )
J Biomed Inform - Using an ensemble system to improve concept extraction from clinical records. ( 0,767617474076047 )
J Biomed Inform - Lessons learnt from the DDIExtraction-2013 Shared Task. ( 0,765205870124378 )
AMIA Annu Symp Proc - Generalizability and comparison of automatic clinical text de-identification methods and resources. ( 0,764017966309114 )
AMIA Annu Symp Proc - Detecting abbreviations in discharge summaries using machine learning methods. ( 0,763491601819277 )
AMIA Annu Symp Proc - Extracting patient demographics and personal medical information from online health forums. ( 0,76269629380751 )
J Biomed Inform - UMLS content views appropriate for NLP processing of the biomedical literature vs. clinical text. ( 0,762338682569302 )
AMIA Annu Symp Proc - Pharmacovigilance on twitter? Mining tweets for adverse drug reactions. ( 0,760925396890137 )
AMIA Annu Symp Proc - Towards a semantic lexicon for clinical natural language processing. ( 0,759637332664191 )
AMIA Annu Symp Proc - Sophia: A Expedient UMLS Concept Extraction Annotator. ( 0,759471086459613 )
J Biomed Inform - Annotating temporal information in clinical narratives. ( 0,758456241294023 )
J Am Med Inform Assoc - MITRE system for clinical assertion status classification. ( 0,756473946517082 )
J Biomed Inform - NCBI disease corpus: a resource for disease name recognition and concept normalization. ( 0,755339632155495 )
AMIA Annu Symp Proc - Building gold standard corpora for medical natural language processing tasks. ( 0,7550808762527 )
J Am Med Inform Assoc - Comparison of a semi-automatic annotation tool and a natural language processing application for the generation of clinical statement entries. ( 0,754221377654582 )
BMC Med Inform Decis Mak - Text summarization as a decision support aid. ( 0,752233765865527 )
J Biomed Inform - Ontology modularization to improve semantic medical image annotation. ( 0,74928505007275 )
J Am Med Inform Assoc - Large-scale evaluation of automated clinical note de-identification and its impact on information extraction. ( 0,748926779824307 )
J Biomed Inform - Semantator: semantic annotator for converting biomedical text to linked data. ( 0,748614503089415 )
J Am Med Inform Assoc - Comprehensive temporal information detection from clinical text: medical events, time, and TLINK identification. ( 0,747636017374049 )
J Biomed Inform - The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions. ( 0,746870338913342 )
Int J Med Inform - A methodology to enhance spatial understanding of disease outbreak events reported in news articles. ( 0,745151347898605 )
J Am Med Inform Assoc - Vaccine adverse event text mining system for extracting features from vaccine safety reports. ( 0,745003328182813 )
J Am Med Inform Assoc - Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification. ( 0,743673028856836 )
AMIA Annu Symp Proc - Automated illustration of patients instructions. ( 0,742023925810267 )
AMIA Annu Symp Proc - TagLine: Information Extraction for Semi-Structured Text in Medical Progress Notes. ( 0,741064712628263 )
J Biomed Inform - Degree centrality for semantic abstraction summarization of therapeutic studies. ( 0,738883701416268 )
AMIA Annu Symp Proc - It's about this and that: a description of anaphoric expressions in clinical text. ( 0,738109384560418 )
AMIA Annu Symp Proc - Voice-dictated versus typed-in clinician notes: linguistic properties and the potential implications on natural language processing. ( 0,736446929138031 )
J Biomed Inform - Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text. ( 0,735989391523537 )
Brief. Bioinformatics - A survey on annotation tools for the biomedical literature. ( 0,735413066564572 )
J Am Med Inform Assoc - The effect of word familiarity on actual and perceived text difficulty. ( 0,734969553728697 )
J Am Med Inform Assoc - Automatic abstraction of imaging observations with their characteristics from mammography reports. ( 0,734919861630466 )
J Am Med Inform Assoc - Towards comprehensive syntactic and semantic annotations of the clinical narrative. ( 0,731847835818765 )
AMIA Annu Symp Proc - A cloud-based approach to medical NLP. ( 0,731139633082389 )
AMIA Annu Symp Proc - Critical finding capture in the impression section of radiology reports. ( 0,730181298432871 )
J Biomed Inform - Automatically extracting information needs from complex clinical questions. ( 0,728709589870602 )
AMIA Annu Symp Proc - Developing a section labeler for clinical documents. ( 0,72754562582698 )
J Biomed Inform - Text summarization in the biomedical domain: a systematic review of recent research. ( 0,723977358794612 )
J Biomed Inform - Ontology-guided feature engineering for clinical text classification. ( 0,723081825028584 )
J Am Med Inform Assoc - Extracting drug indication information from structured product labels using natural language processing. ( 0,721289898558202 )
AMIA Annu Symp Proc - Application of a temporal reasoning framework tool in analysis of medical device adverse events. ( 0,718907870114282 )
J Am Med Inform Assoc - ccML, a new mark-up language to improve ISO/EN 13606-based electronic health record extracts practical edition. ( 0,717940853659235 )
J Am Med Inform Assoc - A rule based solution to co-reference resolution in clinical text. ( 0,715852427435192 )
J Am Med Inform Assoc - Deriving comorbidities from medical records using natural language processing. ( 0,71396445370183 )
IEEE Trans Pattern Anal Mach Intell - Toward Integrated Scene Text Reading. ( 0,713726290096571 )
J Am Med Inform Assoc - Joint segmentation and named entity recognition using dual decomposition in Chinese discharge summaries. ( 0,713230451738054 )