J Am Med Inform Assoc - Large-scale evaluation of automated clinical note de-identification and its impact on information extraction.

Tópicos

{ extract(1171) text(1153) clinic(932) }
{ method(1969) cluster(1462) data(1082) }
{ detect(2391) sensit(1101) algorithm(908) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ search(2224) databas(1162) retriev(909) }
{ perform(1367) use(1326) method(1137) }
{ activ(1138) subject(705) human(624) }
{ studi(2440) review(1878) systemat(933) }
{ data(1714) softwar(1251) tool(1186) }
{ model(2220) cell(1177) simul(1124) }
{ risk(3053) factor(974) diseas(938) }
{ system(1050) medic(1026) inform(1018) }
{ record(1888) medic(1808) patient(1693) }
{ data(3008) multipl(1320) sourc(1022) }
{ first(2504) two(1366) second(1323) }
{ method(2212) result(1239) propos(1039) }
{ imag(2830) propos(1344) filter(1198) }
{ clinic(1479) use(1117) guidelin(835) }
{ visual(1396) interact(850) tool(830) }
{ state(1844) use(1261) util(961) }
{ patient(2837) hospit(1953) medic(668) }
{ model(2656) set(1616) predict(1553) }
{ age(1611) year(1155) adult(843) }
{ sampl(1606) size(1419) use(1276) }
{ patient(1821) servic(1111) care(1106) }
{ high(1669) rate(1365) level(1280) }
{ use(1733) differ(960) four(931) }
{ result(1111) use(1088) new(759) }
{ activ(1452) weight(1219) physic(1104) }
{ data(1737) use(1416) pattern(1282) }
{ sequenc(1873) structur(1644) protein(1328) }
{ imag(2675) segment(2577) method(1081) }
{ take(945) account(800) differ(722) }
{ treatment(1704) effect(941) patient(846) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ chang(1828) time(1643) increas(1301) }
{ learn(2355) train(1041) set(1003) }
{ concept(1167) ontolog(924) domain(897) }
{ algorithm(1844) comput(1787) effici(935) }
{ method(1557) propos(1049) approach(1037) }
{ design(1359) user(1324) use(1319) }
{ control(1307) perform(991) simul(935) }
{ care(1570) inform(1187) nurs(1089) }
{ general(901) number(790) one(736) }
{ method(984) reconstruct(947) comput(926) }
{ featur(1941) imag(1645) propos(1176) }
{ case(1353) use(1143) diagnosi(1136) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ perform(999) metric(946) measur(919) }
{ research(1085) discuss(1038) issu(1018) }
{ import(1318) role(1303) understand(862) }
{ model(2341) predict(2261) use(1141) }
{ compound(1573) activ(1297) structur(1058) }
{ studi(1119) effect(1106) posit(819) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ health(3367) inform(1360) care(1135) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ research(1218) medic(880) student(794) }
{ data(2317) use(1299) case(1017) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ gene(2352) biolog(1181) express(1162) }
{ intervent(3218) particip(2042) group(1664) }
{ time(1939) patient(1703) rate(768) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ structur(1116) can(940) graph(676) }
{ cancer(2502) breast(956) screen(824) }
{ use(976) code(926) identifi(902) }
{ drug(1928) target(777) effect(648) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ process(1125) use(805) approach(778) }
{ model(3404) distribut(989) bayesian(671) }
{ can(774) often(719) complex(702) }
{ imag(1947) propos(1133) code(1026) }
{ system(1976) rule(880) can(841) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ method(1219) similar(1157) match(930) }
{ featur(3375) classif(2383) classifi(1994) }
{ network(2748) neural(1063) input(814) }
{ patient(2315) diseas(1263) diabet(1191) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }

Resumo

JECTIVE: (1) To evaluate a state-of-the-art natural language processing (NLP)-based approach to automatically de-identify a large set of diverse clinical notes. (2) To measure the impact of de-identification on the performance of information extraction algorithms on the de-identified documents.MATERIAL AND METHODS: A cross-sectional study that included 3503 stratified, randomly selected clinical notes (over 22 note types) from five million documents produced at one of the largest US pediatric hospitals. Sensitivity, precision, F value of two automated de-identification systems for removing all 18 HIPAA-defined protected health information elements were computed. Performance was assessed against a manually generated 'gold standard'. Statistical significance was tested. The automated de-identification performance was also compared with that of two humans on a 10% subsample of the gold standard. The effect of de-identification on the performance of subsequent medication extraction was measured.RESULTS: The gold standard included 30815 protected health information elements and more than one million tokens. The most accurate NLP method had 91.92% sensitivity (R) and 95.08% precision (P) overall. The performance of the system was indistinguishable from that of human annotators (annotators' performance was 92.15%(R)/93.95%(P) and 94.55%(R)/88.45%(P) overall while the best system obtained 92.91%(R)/95.73%(P) on same text). The impact of automated de-identification was minimal on the utility of the narrative notes for subsequent information extraction as measured by the sensitivity and precision of medication name extraction.DISCUSSION AND CONCLUSION: NLP-based de-identification shows excellent performance that rivals the performance of human annotators. Furthermore, unlike manual de-identification, the automated approach scales up to millions of documents quickly and inexpensively.

Resumo Limpo

jectiv evalu stateoftheart natur languag process nlpbase approach automat deidentifi larg set divers clinic note measur impact deidentif perform inform extract algorithm deidentifi documentsmateri method crosssect studi includ stratifi random select clinic note note type five million document produc one largest us pediatr hospit sensit precis f valu two autom deidentif system remov hipaadefin protect health inform element comput perform assess manual generat gold standard statist signific test autom deidentif perform also compar two human subsampl gold standard effect deidentif perform subsequ medic extract measuredresult gold standard includ protect health inform element one million token accur nlp method sensit r precis p overal perform system indistinguish human annot annot perform rp rp overal best system obtain rp text impact autom deidentif minim util narrat note subsequ inform extract measur sensit precis medic name extractiondiscuss conclus nlpbase deidentif show excel perform rival perform human annot furthermor unlik manual deidentif autom approach scale million document quick inexpens

Resumos Similares

J Am Med Inform Assoc - A flexible framework for recognizing events, temporal expressions, and temporal relations in clinical text. ( 0,88425840681297 )
J Biomed Inform - A new clustering method for detecting rare senses of abbreviations in clinical notes. ( 0,878533545996148 )
AMIA Annu Symp Proc - Natural language processing for lines and devices in portable chest x-rays. ( 0,829456681476584 )
J Am Med Inform Assoc - Eventual situations for timeline extraction from clinical reports. ( 0,825354229219762 )
J Am Med Inform Assoc - Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives. ( 0,821534495326053 )
J Biomed Inform - MedTime: a temporal information extraction system for clinical narratives. ( 0,821208492968168 )
J Am Med Inform Assoc - An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge. ( 0,819249500273616 )
BMC Med Inform Decis Mak - Text summarization as a decision support aid. ( 0,817716136569727 )
Artif Intell Med - Biomedical events extraction using the hidden vector state model. ( 0,805221340075533 )
J Am Med Inform Assoc - A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources. ( 0,802231254830438 )
J. Med. Internet Res. - Web 2.0-based crowdsourcing for high-quality gold standard development in clinical natural language processing. ( 0,801899185248023 )
J Am Med Inform Assoc - Assessing the role of a medication-indication resource in the treatment relation extraction from clinical text. ( 0,80034817442347 )
AMIA Annu Symp Proc - Throw the bath water out, keep the baby: keeping medically-relevant terms for text mining. ( 0,798375210248086 )
J Am Med Inform Assoc - MedXN: an open source medication extraction and normalization tool for clinical text. ( 0,796615340785737 )
Appl Clin Inform - Representation of information about family relatives as structured data in electronic health records. ( 0,793752891327257 )
AMIA Annu Symp Proc - Inter-annotator reliability of medical events, coreferences and temporal relations in clinical narratives by annotators with varying levels of clinical expertise. ( 0,791440681379047 )
J Am Med Inform Assoc - Evaluating temporal relations in clinical text: 2012 i2b2 Challenge. ( 0,790928122971854 )
Comput Math Methods Med - Ranking biomedical annotations with annotator's semantic relevancy. ( 0,789258789760893 )
J Biomed Inform - A concept-driven biomedical knowledge extraction and visualization framework for conceptualization of text corpora. ( 0,788611109425729 )
Int J Med Inform - Detection of infectious symptoms from VA emergency department and primary care clinical documentation. ( 0,787346817896426 )
J Am Med Inform Assoc - Automatic discourse connective detection in biomedical text. ( 0,786990399212388 )
J Biomed Inform - Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives. ( 0,784121520736929 )
J Am Med Inform Assoc - A hybrid system for temporal information extraction from clinical text. ( 0,783677248027147 )
J Biomed Inform - Towards generating a patient's timeline: extracting temporal relationships from clinical notes. ( 0,780073818788184 )
J Biomed Inform - Anaphoric reference in clinical reports: characteristics of an annotated corpus. ( 0,77703172993868 )
AMIA Annu Symp Proc - Extracting Concepts Related to Homelessness from the Free Text of VA Electronic Medical Records. ( 0,777025222278372 )
AMIA Annu Symp Proc - Semantic processing to identify adverse drug event information from black box warnings. ( 0,773719539308396 )
J Biomed Inform - Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction. ( 0,773495152133325 )
AMIA Annu Symp Proc - Pharmacovigilance on twitter? Mining tweets for adverse drug reactions. ( 0,770769583611198 )
J Am Med Inform Assoc - Anaphoric relations in the clinical narrative: corpus creation. ( 0,770019779278141 )
Int J Med Inform - Detecting temporal expressions in medical narratives. ( 0,769630849380936 )
J Am Med Inform Assoc - Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification. ( 0,768451794033374 )
Comput Methods Programs Biomed - BioAnnote: a software platform for annotating biomedical documents with application in medical learning environments. ( 0,764353318447382 )
J Med Syst - Redactable signatures for signed CDA Documents. ( 0,759795576194097 )
AMIA Annu Symp Proc - Using ontology network structure in text mining. ( 0,758486542250532 )
J Am Med Inform Assoc - Evaluating the impact of pre-annotation on annotation speed and potential bias: natural language processing gold standard development for clinical named entity recognition in clinical trial announcements. ( 0,758392899362317 )
J Am Med Inform Assoc - Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification. ( 0,757823795176079 )
Sci Data - Building the graph of medicine from millions of clinical narratives. ( 0,757328835919962 )
J Am Med Inform Assoc - Assisted annotation of medical free text using RapTAT. ( 0,750323278626865 )
BMC Med Inform Decis Mak - The freetext matching algorithm: a computer program to extract diagnoses and causes of death from unstructured text in electronic health records. ( 0,748984997609992 )
J Biomed Inform - Identifying non-elliptical entity mentions in a coordinated NP with ellipses. ( 0,748926779824307 )
Int J Med Inform - Bootstrapping a de-identification system for narrative patient records: cost-performance tradeoffs. ( 0,748604728616594 )
AMIA Annu Symp Proc - A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries. ( 0,74809826912045 )
AMIA Annu Symp Proc - Mapping annotations with textual evidence using an scLDA model. ( 0,74760258811004 )
J Biomed Inform - NCBI disease corpus: a resource for disease name recognition and concept normalization. ( 0,745473752071854 )
J Am Med Inform Assoc - Automatic abstraction of imaging observations with their characteristics from mammography reports. ( 0,745170927485658 )
AMIA Annu Symp Proc - Natural language processing to extract follow-up provider information from hospital discharge summaries. ( 0,744600357839893 )
AMIA Annu Symp Proc - Automatically pairing measured findings across narrative abdomen CT reports. ( 0,743927459604553 )
J Biomed Inform - Lessons learnt from the DDIExtraction-2013 Shared Task. ( 0,741521494434027 )
Appl Clin Inform - Comparing the effectiveness of computerized adverse drug event monitoring systems to enhance clinical decision support for hospitalized patients. ( 0,740764301975103 )
J Am Med Inform Assoc - Using rule-based natural language processing to improve disease normalization in biomedical text. ( 0,740478266298412 )
J Biomed Inform - Text de-identification for privacy protection: a study of its impact on clinical text information content. ( 0,738618765138401 )
AMIA Annu Symp Proc - Using UMLS lexical resources to disambiguate abbreviations in clinical text. ( 0,738227471035601 )
J Biomed Inform - Text summarization in the biomedical domain: a systematic review of recent research. ( 0,738048174854545 )
J Biomed Inform - Evaluating the effects of machine pre-annotation and an interactive annotation interface on manual de-identification of clinical text. ( 0,736990995199395 )
AMIA Annu Symp Proc - Building gold standard corpora for medical natural language processing tasks. ( 0,735838656230402 )
J Biomed Inform - Detecting hedge cues and their scope in biomedical text with conditional random fields. ( 0,735726533664723 )
J Biomed Inform - UMLS content views appropriate for NLP processing of the biomedical literature vs. clinical text. ( 0,734584201263717 )
J Am Med Inform Assoc - A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries. ( 0,733642429624249 )
J Am Med Inform Assoc - 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. ( 0,733000709322229 )
J Biomed Inform - Lexical patterns, features and knowledge resources for coreference resolution in clinical notes. ( 0,732556692294078 )
J Biomed Inform - Desiderata for ontologies to be used in semantic annotation of biomedical documents. ( 0,73067853710224 )
J Integr Bioinform - Automatic extraction of microorganisms and their habitats from free text using text mining workflows. ( 0,729776659152677 )
AMIA Annu Symp Proc - Detecting abbreviations in discharge summaries using machine learning methods. ( 0,72948959436033 )
J Biomed Inform - The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions. ( 0,725523046367791 )
J Am Med Inform Assoc - MITRE system for clinical assertion status classification. ( 0,724024110544577 )
AMIA Annu Symp Proc - Towards a semantic lexicon for clinical natural language processing. ( 0,720284641055813 )
J Biomed Inform - Extraction of events and temporal expressions from clinical narratives. ( 0,719812524459582 )
AMIA Annu Symp Proc - Voice-dictated versus typed-in clinician notes: linguistic properties and the potential implications on natural language processing. ( 0,718690238199603 )
J Biomed Inform - Comparison of automated and human assignment of MeSH terms on publicly-available molecular datasets. ( 0,718320291568574 )
J Biomed Inform - Automatically extracting information needs from complex clinical questions. ( 0,716566946726025 )
AMIA Annu Symp Proc - Generalizability and comparison of automatic clinical text de-identification methods and resources. ( 0,71655424876432 )
AMIA Annu Symp Proc - TagLine: Information Extraction for Semi-Structured Text in Medical Progress Notes. ( 0,713021847166236 )
J Biomed Inform - Annotating temporal information in clinical narratives. ( 0,712904431974056 )
Perspect Health Inf Manag - A comparison of two approaches to text processing: facilitating chart reviews of radiology reports in electronic medical records. ( 0,712782494722375 )
AMIA Annu Symp Proc - Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations. ( 0,712721728893629 )
Brief. Bioinformatics - A survey on annotation tools for the biomedical literature. ( 0,712650826968479 )
J Am Med Inform Assoc - Vaccine adverse event text mining system for extracting features from vaccine safety reports. ( 0,712099559896779 )
J Am Med Inform Assoc - Syntactic parsing of clinical text: guideline and corpus development with handling ill-formed sentences. ( 0,707324208985209 )
Health Informatics J - University of California, Irvine-Pathology Extraction Pipeline: the pathology extraction pipeline for information extraction from pathology reports. ( 0,704015286506253 )
J Biomed Inform - Using an ensemble system to improve concept extraction from clinical records. ( 0,702974631794359 )
Int J Med Inform - A methodology to enhance spatial understanding of disease outbreak events reported in news articles. ( 0,702331964883829 )
AMIA Annu Symp Proc - Qualitative analysis of workflow modifications used to generate the reference standard for the 2010 i2b2/VA challenge. ( 0,701605506696182 )
J Am Med Inform Assoc - Towards comprehensive syntactic and semantic annotations of the clinical narrative. ( 0,699788244632993 )
J Biomed Inform - Degree centrality for semantic abstraction summarization of therapeutic studies. ( 0,697209630852991 )
AMIA Annu Symp Proc - Sophia: A Expedient UMLS Concept Extraction Annotator. ( 0,69719607551545 )
J Biomed Inform - Ontology modularization to improve semantic medical image annotation. ( 0,695155555033982 )
J Am Med Inform Assoc - Extracting drug indication information from structured product labels using natural language processing. ( 0,692547655632569 )
AMIA Annu Symp Proc - Extracting semantic lexicons from discharge summaries using machine learning and the C-Value method. ( 0,692503751892898 )
J Am Med Inform Assoc - The effect of word familiarity on actual and perceived text difficulty. ( 0,692181362084721 )
AMIA Annu Symp Proc - Semantic processing to identify adverse drug event information from black box warnings. ( 0,691323388082321 )
J Am Med Inform Assoc - Automated clinical trial eligibility prescreening: increasing the efficiency of patient identification for clinical trials in the emergency department. ( 0,691176838316624 )
BMC Med Inform Decis Mak - Detecting causality from online psychiatric texts using inter-sentential language patterns. ( 0,691144992051068 )
AMIA Annu Symp Proc - Developing a section labeler for clinical documents. ( 0,689862593987881 )
J Am Med Inform Assoc - A comprehensive study of named entity recognition in Chinese clinical text. ( 0,689543591019783 )
J Am Med Inform Assoc - Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection. ( 0,689442531941031 )
AMIA Annu Symp Proc - Extracting patient demographics and personal medical information from online health forums. ( 0,686845142246568 )
J Am Med Inform Assoc - Comparison of a semi-automatic annotation tool and a natural language processing application for the generation of clinical statement entries. ( 0,685223355674581 )
AMIA Annu Symp Proc - It's about this and that: a description of anaphoric expressions in clinical text. ( 0,683718726787088 )
J Biomed Inform - Common data model for natural language processing based on two existing standard information models: CDA+GrAF. ( 0,681443788916735 )