AMIA Annu Symp Proc - Building gold standard corpora for medical natural language processing tasks.

Tópicos

{ extract(1171) text(1153) clinic(932) }
{ case(1353) use(1143) diagnosi(1136) }
{ system(1050) medic(1026) inform(1018) }
{ assess(1506) score(1403) qualiti(1306) }
{ concept(1167) ontolog(924) domain(897) }
{ data(3963) clinic(1234) research(1004) }
{ data(2317) use(1299) case(1017) }
{ learn(2355) train(1041) set(1003) }
{ compound(1573) activ(1297) structur(1058) }
{ inform(2794) health(2639) internet(1427) }
{ studi(2440) review(1878) systemat(933) }
{ treatment(1704) effect(941) patient(846) }
{ visual(1396) interact(850) tool(830) }
{ sampl(1606) size(1419) use(1276) }
{ drug(1928) target(777) effect(648) }
{ implement(1333) system(1263) develop(1122) }
{ process(1125) use(805) approach(778) }
{ model(3404) distribut(989) bayesian(671) }
{ can(774) often(719) complex(702) }
{ imag(1947) propos(1133) code(1026) }
{ data(1737) use(1416) pattern(1282) }
{ system(1976) rule(880) can(841) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ method(1219) similar(1157) match(930) }
{ featur(3375) classif(2383) classifi(1994) }
{ imag(2830) propos(1344) filter(1198) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ take(945) account(800) differ(722) }
{ motion(1329) object(1292) video(1091) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ chang(1828) time(1643) increas(1301) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ method(1557) propos(1049) approach(1037) }
{ data(1714) softwar(1251) tool(1186) }
{ design(1359) user(1324) use(1319) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ general(901) number(790) one(736) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ featur(1941) imag(1645) propos(1176) }
{ howev(809) still(633) remain(590) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ perform(999) metric(946) measur(919) }
{ research(1085) discuss(1038) issu(1018) }
{ import(1318) role(1303) understand(862) }
{ model(2341) predict(2261) use(1141) }
{ perform(1367) use(1326) method(1137) }
{ studi(1119) effect(1106) posit(819) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ model(2656) set(1616) predict(1553) }
{ age(1611) year(1155) adult(843) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ gene(2352) biolog(1181) express(1162) }
{ data(3008) multipl(1320) sourc(1022) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ structur(1116) can(940) graph(676) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ use(976) code(926) identifi(902) }
{ use(1733) differ(960) four(931) }
{ result(1111) use(1088) new(759) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }
{ method(2212) result(1239) propos(1039) }
{ detect(2391) sensit(1101) algorithm(908) }

Resumo

We present the construction of three annotated corpora to serve as gold standards for medical natural language processing (NLP) tasks. Clinical notes from the medical record, clinical trial announcements, and FDA drug labels are annotated. We report high inter-annotator agreements (overall F-measures between 0.8467 and 0.9176) for the annotation of Personal Health Information (PHI) elements for a de-identification task and of medications, diseases/disorders, and signs/symptoms for information extraction (IE) task. The annotated corpora of clinical trials and FDA labels will be publicly released and to facilitate translational NLP tasks that require cross-corpora interoperability (e.g. clinical trial eligibility screening) their annotation schemas are aligned with a large scale, NIH-funded clinical text annotation project.

Resumo Limpo

present construct three annot corpora serv gold standard medic natur languag process nlp task clinic note medic record clinic trial announc fda drug label annot report high interannot agreement overal fmeasur annot person health inform phi element deidentif task medic diseasesdisord signssymptom inform extract ie task annot corpora clinic trial fda label will public releas facilit translat nlp task requir crosscorpora interoper eg clinic trial elig screen annot schema align larg scale nihfund clinic text annot project

Resumos Similares

J Biomed Inform - NCBI disease corpus: a resource for disease name recognition and concept normalization. ( 0,863224691369964 )
J Am Med Inform Assoc - Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification. ( 0,862069408879295 )
Appl Clin Inform - Representation of information about family relatives as structured data in electronic health records. ( 0,860641718001924 )
J Am Med Inform Assoc - Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives. ( 0,849789157044062 )
J Am Med Inform Assoc - An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge. ( 0,844706553635591 )
J. Med. Internet Res. - Web 2.0-based crowdsourcing for high-quality gold standard development in clinical natural language processing. ( 0,844458533724931 )
AMIA Annu Symp Proc - Inter-annotator reliability of medical events, coreferences and temporal relations in clinical narratives by annotators with varying levels of clinical expertise. ( 0,840276224061199 )
J Am Med Inform Assoc - A hybrid system for temporal information extraction from clinical text. ( 0,834489252343946 )
J Med Syst - Redactable signatures for signed CDA Documents. ( 0,834186678399151 )
J Biomed Inform - MedTime: a temporal information extraction system for clinical narratives. ( 0,832807492034822 )
J Biomed Inform - Anaphoric reference in clinical reports: characteristics of an annotated corpus. ( 0,831654360532035 )
J Biomed Inform - Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives. ( 0,830114236117602 )
AMIA Annu Symp Proc - Throw the bath water out, keep the baby: keeping medically-relevant terms for text mining. ( 0,828806808802661 )
AMIA Annu Symp Proc - Towards a semantic lexicon for clinical natural language processing. ( 0,828277734243568 )
J Am Med Inform Assoc - MITRE system for clinical assertion status classification. ( 0,827801952411281 )
Artif Intell Med - Biomedical events extraction using the hidden vector state model. ( 0,827535135111296 )
J Am Med Inform Assoc - Evaluating the impact of pre-annotation on annotation speed and potential bias: natural language processing gold standard development for clinical named entity recognition in clinical trial announcements. ( 0,826732141578071 )
AMIA Annu Symp Proc - Natural language processing for lines and devices in portable chest x-rays. ( 0,823668933328193 )
J Biomed Inform - Automatically extracting information needs from complex clinical questions. ( 0,822233220303746 )
J Biomed Inform - Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction. ( 0,822032124411448 )
J Chem Inf Model - Automated extraction of information on chemical-P-glycoprotein interactions from the literature. ( 0,819652159724837 )
J Am Med Inform Assoc - Anaphoric relations in the clinical narrative: corpus creation. ( 0,818873957936322 )
J Biomed Inform - Towards generating a patient's timeline: extracting temporal relationships from clinical notes. ( 0,816565544770225 )
J Am Med Inform Assoc - A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources. ( 0,815313367655881 )
J Am Med Inform Assoc - Assessing the role of a medication-indication resource in the treatment relation extraction from clinical text. ( 0,81474616450673 )
Comput Methods Programs Biomed - BioAnnote: a software platform for annotating biomedical documents with application in medical learning environments. ( 0,814745735817037 )
AMIA Annu Symp Proc - Detecting abbreviations in discharge summaries using machine learning methods. ( 0,813531950855211 )
AMIA Annu Symp Proc - A Knowledge Intensive Approach to Mapping Clinical Narrative to LOINC. ( 0,813388324454491 )
AMIA Annu Symp Proc - Automatically pairing measured findings across narrative abdomen CT reports. ( 0,81192930190631 )
Comput Math Methods Med - Ranking biomedical annotations with annotator's semantic relevancy. ( 0,811894666618513 )
J Am Med Inform Assoc - Eventual situations for timeline extraction from clinical reports. ( 0,806211872889339 )
J Biomed Inform - A concept-driven biomedical knowledge extraction and visualization framework for conceptualization of text corpora. ( 0,803166762136178 )
J Biomed Inform - Evaluating the effects of machine pre-annotation and an interactive annotation interface on manual de-identification of clinical text. ( 0,800730777576583 )
J Am Med Inform Assoc - 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. ( 0,798158479328904 )
J Am Med Inform Assoc - Evaluating temporal relations in clinical text: 2012 i2b2 Challenge. ( 0,797984062077018 )
J Am Med Inform Assoc - Automatic discourse connective detection in biomedical text. ( 0,797411264821948 )
J Biomed Inform - Text summarization in the biomedical domain: a systematic review of recent research. ( 0,795350735886963 )
Int J Med Inform - Detection of infectious symptoms from VA emergency department and primary care clinical documentation. ( 0,795093959407871 )
AMIA Annu Symp Proc - Semantic processing to identify adverse drug event information from black box warnings. ( 0,794590309245179 )
AMIA Annu Symp Proc - Extracting Concepts Related to Homelessness from the Free Text of VA Electronic Medical Records. ( 0,793338633192534 )
J Biomed Inform - UMLS content views appropriate for NLP processing of the biomedical literature vs. clinical text. ( 0,79280321616287 )
AMIA Annu Symp Proc - A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries. ( 0,786109530799488 )
Perspect Health Inf Manag - A comparison of two approaches to text processing: facilitating chart reviews of radiology reports in electronic medical records. ( 0,785049918248219 )
AMIA Annu Symp Proc - Natural language processing to extract follow-up provider information from hospital discharge summaries. ( 0,783347435300973 )
J Am Med Inform Assoc - Assisted annotation of medical free text using RapTAT. ( 0,782765969928454 )
AMIA Annu Symp Proc - A cloud-based approach to medical NLP. ( 0,775968847330282 )
AMIA Annu Symp Proc - Mapping annotations with textual evidence using an scLDA model. ( 0,774678801576108 )
J Biomed Inform - Extraction of events and temporal expressions from clinical narratives. ( 0,774355667285708 )
J Biomed Inform - Text de-identification for privacy protection: a study of its impact on clinical text information content. ( 0,773695077558248 )
J Biomed Inform - Desiderata for ontologies to be used in semantic annotation of biomedical documents. ( 0,773159057907418 )
J Am Med Inform Assoc - A flexible framework for recognizing events, temporal expressions, and temporal relations in clinical text. ( 0,773106583034311 )
J Biomed Inform - Lessons learnt from the DDIExtraction-2013 Shared Task. ( 0,773073496244901 )
J Am Med Inform Assoc - MedXN: an open source medication extraction and normalization tool for clinical text. ( 0,770180075531125 )
BMC Med Inform Decis Mak - Text summarization as a decision support aid. ( 0,769883460378176 )
Int J Med Inform - Detecting temporal expressions in medical narratives. ( 0,768974454544714 )
J Am Med Inform Assoc - Extracting drug indication information from structured product labels using natural language processing. ( 0,767610713246914 )
J Biomed Inform - The EU-ADR corpus: annotated drugs, diseases, targets, and their relationships. ( 0,767539727406084 )
AMIA Annu Symp Proc - Pharmacovigilance on twitter? Mining tweets for adverse drug reactions. ( 0,767403762293868 )
AMIA Annu Symp Proc - Extracting temporal information from electronic patient records. ( 0,764232734003171 )
J Am Med Inform Assoc - Evaluating the state of the art in disorder recognition and normalization of the clinical narrative. ( 0,763884302976774 )
J Am Med Inform Assoc - Towards comprehensive syntactic and semantic annotations of the clinical narrative. ( 0,763379486992727 )
Brief. Bioinformatics - A survey on annotation tools for the biomedical literature. ( 0,76182492243217 )
J Am Med Inform Assoc - Comprehensive temporal information detection from clinical text: medical events, time, and TLINK identification. ( 0,761821665761096 )
J Biomed Inform - Semantator: semantic annotator for converting biomedical text to linked data. ( 0,760646476022634 )
J Integr Bioinform - Automatic extraction of microorganisms and their habitats from free text using text mining workflows. ( 0,756858605334884 )
J Biomed Inform - A new clustering method for detecting rare senses of abbreviations in clinical notes. ( 0,756166505005434 )
J Biomed Inform - Degree centrality for semantic abstraction summarization of therapeutic studies. ( 0,755819858785049 )
J Biomed Inform - Identifying non-elliptical entity mentions in a coordinated NP with ellipses. ( 0,7550808762527 )
AMIA Annu Symp Proc - TagLine: Information Extraction for Semi-Structured Text in Medical Progress Notes. ( 0,754905353663759 )
Int J Med Inform - Bootstrapping a de-identification system for narrative patient records: cost-performance tradeoffs. ( 0,754293651028305 )
J Am Med Inform Assoc - The effect of word familiarity on actual and perceived text difficulty. ( 0,753120867939886 )
AMIA Annu Symp Proc - Application of a temporal reasoning framework tool in analysis of medical device adverse events. ( 0,751480971940346 )
AMIA Annu Symp Proc - Developing a section labeler for clinical documents. ( 0,751319096890225 )
AMIA Annu Symp Proc - Automated illustration of patients instructions. ( 0,749797447869449 )
AMIA Annu Symp Proc - Extracting semantic lexicons from discharge summaries using machine learning and the C-Value method. ( 0,749234612025098 )
J Biomed Inform - Ontology modularization to improve semantic medical image annotation. ( 0,748703631814284 )
J Biomed Inform - The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions. ( 0,747973578860219 )
J Biomed Inform - Enhancing clinical concept extraction with distributional semantics. ( 0,746801065422307 )
AMIA Annu Symp Proc - Active Learning-based corpus annotation--the PathoJen experience. ( 0,745188794383795 )
J Biomed Inform - Annotating temporal information in clinical narratives. ( 0,744747203612065 )
J Am Med Inform Assoc - Capturing patient information at nursing shift changes: methodological evaluation of speech recognition and information extraction. ( 0,744601522422664 )
AMIA Annu Symp Proc - Generalizability and comparison of automatic clinical text de-identification methods and resources. ( 0,743479258859262 )
J Biomed Inform - Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text. ( 0,742857421943761 )
Int J Med Inform - A methodology to enhance spatial understanding of disease outbreak events reported in news articles. ( 0,740353798263678 )
J Am Med Inform Assoc - Syntactic parsing of clinical text: guideline and corpus development with handling ill-formed sentences. ( 0,739011271699265 )
J Biomed Inform - Lexical patterns, features and knowledge resources for coreference resolution in clinical notes. ( 0,738529966382549 )
J Am Med Inform Assoc - Using rule-based natural language processing to improve disease normalization in biomedical text. ( 0,738500594360247 )
AMIA Annu Symp Proc - Semantic processing to identify adverse drug event information from black box warnings. ( 0,737824575775017 )
J Am Med Inform Assoc - Automated clinical trial eligibility prescreening: increasing the efficiency of patient identification for clinical trials in the emergency department. ( 0,736134860037835 )
J Am Med Inform Assoc - Large-scale evaluation of automated clinical note de-identification and its impact on information extraction. ( 0,735838656230402 )
BMC Med Inform Decis Mak - The freetext matching algorithm: a computer program to extract diagnoses and causes of death from unstructured text in electronic health records. ( 0,735692236970687 )
AMIA Annu Symp Proc - Semantic characteristics of NLP-extracted concepts in clinical notes vs. biomedical literature. ( 0,73516255748833 )
J Biomed Inform - Disambiguation of ambiguous biomedical terms using examples generated from the UMLS Metathesaurus. ( 0,733656163417318 )
Health Informatics J - University of California, Irvine-Pathology Extraction Pipeline: the pathology extraction pipeline for information extraction from pathology reports. ( 0,733511236365441 )
Sci Data - Building the graph of medicine from millions of clinical narratives. ( 0,732498503750269 )
J. Med. Internet Res. - Evaluating a web-based clinical decision support system for language disorders screening in a nursery school. ( 0,732351541884428 )
BMC Med Inform Decis Mak - A framework for enhancing spatial and temporal granularity in report-based health surveillance systems. ( 0,730598333484738 )
AMIA Annu Symp Proc - Sophia: A Expedient UMLS Concept Extraction Annotator. ( 0,730068715779571 )
J Am Med Inform Assoc - Automatic abstraction of imaging observations with their characteristics from mammography reports. ( 0,725118154687614 )
J Biomed Inform - Unsupervised biomedical named entity recognition: experiments with clinical and biological texts. ( 0,724901019627125 )