AMIA Annu Symp Proc - Naïve Electronic Health Record phenotype identification for Rheumatoid arthritis.

Topics

{ featur(3375) classif(2383) classifi(1994) }
{ extract(1171) text(1153) clinic(932) }
{ ehr(2073) health(1662) electron(1139) }
{ learn(2355) train(1041) set(1003) }
{ case(1353) use(1143) diagnosi(1136) }
{ sampl(1606) size(1419) use(1276) }
{ gene(2352) biolog(1181) express(1162) }
{ can(774) often(719) complex(702) }
{ error(1145) method(1030) estim(1020) }
{ framework(1458) process(801) describ(734) }
{ design(1359) user(1324) use(1319) }
{ risk(3053) factor(974) diseas(938) }
{ system(1050) medic(1026) inform(1018) }
{ studi(1119) effect(1106) posit(819) }
{ health(3367) inform(1360) care(1135) }
{ research(1218) medic(880) student(794) }
{ data(2317) use(1299) case(1017) }
{ signal(2180) analysi(812) frequenc(800) }
{ result(1111) use(1088) new(759) }
{ process(1125) use(805) approach(778) }
{ model(3404) distribut(989) bayesian(671) }
{ data(1737) use(1416) pattern(1282) }
{ system(1976) rule(880) can(841) }
{ sequenc(1873) structur(1644) protein(1328) }
{ patient(2315) diseas(1263) diabet(1191) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ chang(1828) time(1643) increas(1301) }
{ concept(1167) ontolog(924) domain(897) }
{ method(1557) propos(1049) approach(1037) }
{ data(3963) clinic(1234) research(1004) }
{ research(1085) discuss(1038) issu(1018) }
{ model(2341) predict(2261) use(1141) }
{ compound(1573) activ(1297) structur(1058) }
{ perform(1367) use(1326) method(1137) }
{ spatial(1525) area(1432) region(1030) }
{ model(2656) set(1616) predict(1553) }
{ use(2086) technolog(871) perceiv(783) }
{ high(1669) rate(1365) level(1280) }
{ use(976) code(926) identifi(902) }
{ imag(1947) propos(1133) code(1026) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ method(1219) similar(1157) match(930) }
{ imag(2830) propos(1344) filter(1198) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ take(945) account(800) differ(722) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ problem(2511) optim(1539) algorithm(950) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ data(1714) softwar(1251) tool(1186) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ general(901) number(790) one(736) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ featur(1941) imag(1645) propos(1176) }
{ howev(809) still(633) remain(590) }
{ studi(1410) differ(1259) use(1210) }
{ perform(999) metric(946) measur(919) }
{ import(1318) role(1303) understand(862) }
{ visual(1396) interact(850) tool(830) }
{ blood(1257) pressur(1144) flow(957) }
{ record(1888) medic(1808) patient(1693) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ state(1844) use(1261) util(961) }
{ patient(2837) hospit(1953) medic(668) }
{ age(1611) year(1155) adult(843) }
{ medic(1828) order(1363) alert(1069) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ data(3008) multipl(1320) sourc(1022) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ structur(1116) can(940) graph(676) }
{ cancer(2502) breast(956) screen(824) }
{ use(1733) differ(960) four(931) }
{ drug(1928) target(777) effect(648) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }
{ method(2212) result(1239) propos(1039) }
{ detect(2391) sensit(1101) algorithm(908) }

Abstract

Electronic Health Records (EHRs) provide a real-world patient cohort for clinical and genomic research. Phenotype identification using informatics algorithms has been shown to replicate known genetic associations found in clinical trials and observational cohorts. However, developing accurate phenotype identification methods can be challenging and requires significant time and effort. We applied Support Vector Machines (SVMs) to both naïve (i.e., non-curated) and expert-defined collections of EHR features to identify Rheumatoid Arthritis cases using billing codes, medication exposures, and natural language processing-derived concepts. SVMs trained on naïve and expert-defined data outperformed an existing deterministic algorithm; the best-performing naïve system had a precision of 0.94 and a recall of 0.87, compared to a precision of 0.75 and a recall of 0.51 for the deterministic algorithm. We show that, with an expert-defined feature set, as few as 50-100 training samples are required. This study demonstrates that SVMs operating on non-curated sets of attributes can accurately identify cases from an EHR.
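The abstract compares systems by precision and recall. As a minimal sketch of how those two metrics are computed from case/non-case labels, the Python below uses a small made-up label set (not data from the study) and a hypothetical helper named `precision_recall_f1`. The F1 values at the end are plain arithmetic on the precision and recall figures quoted in the abstract; they are not results reported by the source.

```python
def precision_recall_f1(y_true, y_pred):
    """Return (precision, recall, F1) for binary labels where 1 = case, 0 = non-case."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)  # false negatives
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall, f1_score(precision, recall)


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative labels only: 4 true cases, 4 non-cases.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]
print(precision_recall_f1(y_true, y_pred))  # (0.75, 0.75, 0.75)

# F1 derived from the precision/recall pairs quoted in the abstract.
print(round(f1_score(0.94, 0.87), 3))  # 0.904 (best naive SVM system)
print(round(f1_score(0.75, 0.51), 3))  # 0.607 (deterministic algorithm)
```

Precision penalizes non-cases flagged as cases, while recall penalizes missed cases, so the deterministic algorithm's recall of 0.51 means it missed roughly half of the true cases in the evaluation set.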

Cleaned Abstract

electron health record ehr provid realworld patient cohort clinic genom research phenotyp identif use informat algorithm shown replic known genet associ found clinic trial observ cohort howev develop accur phenotyp identif method can challeng requir signific time effort appli support vector machin svms nave ie noncur expertdefin collect ehr featur identifi rheumatoid arthriti case use bill code medic exposur natur languag processingderiv concept svms train nave expertdefin data outperform exist determinist algorithm best perform nave system precis recal compar precis recal determinist algorithm show expert defin featur set train sampl requir studi demonstr svms oper noncur set attribut can accur identifi case ehr
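The cleaned text above looks like the result of lowercasing, dropping digits and punctuation (so "real-world" becomes "realworld" and the precision and recall figures disappear), removing stopwords, and stemming. The source does not describe the actual pipeline, so the following Python is only a sketch of the first three steps under those assumptions. The function name `clean_abstract` and the stopword list are hypothetical, and stemming is deliberately left out because the stemmer used is not stated.

```python
import re

# Small illustrative stopword list (an assumption; the real list is not given).
# "can" is left out because it survives in the cleaned text above.
STOPWORDS = {
    "a", "an", "and", "as", "been", "for", "from", "has", "in",
    "of", "on", "that", "the", "to", "we", "with",
}


def clean_abstract(text: str) -> str:
    """Lowercase, keep only ASCII letters and whitespace, then drop stopwords."""
    lowered = text.lower()
    # Removing (not replacing) other characters joins hyphenated words
    # and deletes numbers entirely.
    letters_only = re.sub(r"[^a-z\s]", "", lowered)
    tokens = [tok for tok in letters_only.split() if tok not in STOPWORDS]
    return " ".join(tokens)


print(clean_abstract("We applied SVMs to na?ve (i.e., non-curated) EHR features."))
# applied svms nave ie noncurated ehr features
```

Under these assumptions the mis-encoded "na?ve" collapses to "nave" and "i.e." to "ie", which matches the tokens seen in the cleaned text; a stemming step would then shorten words such as "noncurated" to "noncur".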

Similar Abstracts

J. Med. Internet Res. - Web-based newborn screening system for metabolic diseases: machine learning versus clinicians. ( 0,708402387132416 )
J Am Med Inform Assoc - A comprehensive study of named entity recognition in Chinese clinical text. ( 0,697707023005605 )
J Am Med Inform Assoc - Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010. ( 0,683079169526614 )
J Med Syst - A new approach for concealed information identification based on ERP assessment. ( 0,664423382759398 )
J Am Med Inform Assoc - Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection. ( 0,660945979623587 )
J Am Med Inform Assoc - A system for coreference resolution for the clinical narrative. ( 0,656652081703045 )
AMIA Annu Symp Proc - Word Sense Disambiguation of clinical abbreviations with hyperdimensional computing. ( 0,65378059290643 )
BMC Med Inform Decis Mak - Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features. ( 0,651451698889617 )
J Am Med Inform Assoc - Learning regular expressions for clinical text classification. ( 0,651024031638643 )
J Biomed Inform - A biological continuum based approach for efficient clinical classification. ( 0,639584293744339 )
J Am Med Inform Assoc - Practical implementation of an existing smoking detection pipeline and reduced support vector machine training corpus requirements. ( 0,628104246484763 )
Brief. Bioinformatics - Class-imbalanced classifiers for high-dimensional data. ( 0,625266274615992 )
Comput. Biol. Med. - Decision forest for classification of gene expression data. ( 0,618239538346248 )
Comput. Biol. Med. - Gene expression data classification using locally linear discriminant embedding. ( 0,610842753591457 )
Artif Intell Med - Conceptual-driven classification for coding advise in health insurance reimbursement. ( 0,608770593264075 )
J Biomed Inform - Enhancing clinical concept extraction with distributional semantics. ( 0,608658853879737 )
Comput Methods Programs Biomed - Computer-aided diagnosis system: a Bayesian hybrid classification method. ( 0,607259540182109 )
J Med Syst - Luminance sticker based facial expression recognition using discrete wavelet transform for physically disabled persons. ( 0,602811621491833 )
J Biomed Inform - Temporal relation discovery between events and temporal expressions identified in clinical narrative. ( 0,600866670101042 )
Comput. Biol. Med. - A method of tumor classification based on wavelet packet transforms and neighborhood rough set. ( 0,60080016507311 )
J Am Med Inform Assoc - Using statistical text classification to identify health information technology incidents. ( 0,599512791066722 )
J Am Med Inform Assoc - A sequence labeling approach to link medications and their attributes in clinical notes and clinical trial announcements for information extraction. ( 0,597853217871861 )
AMIA Annu Symp Proc - Automatic identification of critical follow-up recommendation sentences in radiology reports. ( 0,597403432782189 )
Comput Biol Chem - Information-theoretic approaches to SVM feature selection for metagenome read classification. ( 0,596515491266665 )
J Am Med Inform Assoc - Pneumonia identification using statistical feature selection. ( 0,596491674687643 )
J Am Med Inform Assoc - MITRE system for clinical assertion status classification. ( 0,588564704457323 )
IEEE J Biomed Health Inform - Support Vector Feature Selection for Early Detection of Anastomosis Leakage from Bag-of-Words in Electronic Health Records. ( 0,587386111826474 )
AMIA Annu Symp Proc - Automated non-alphanumeric symbol resolution in clinical texts. ( 0,586065000709259 )
J Biomed Inform - Dynamic categorization of clinical research eligibility criteria by hierarchical clustering. ( 0,58387586163092 )
J Am Med Inform Assoc - Capturing patient information at nursing shift changes: methodological evaluation of speech recognition and information extraction. ( 0,583364940927033 )
AMIA Annu Symp Proc - Automatically Detecting Acute Myocardial Infarction Events from EHR Text: A Preliminary Study. ( 0,582531449376625 )
J Am Med Inform Assoc - Functional evaluation of out-of-the-box text-mining tools for data-mining tasks. ( 0,582230504847126 )
Artif Intell Med - Supervised machine learning-based classification of oral malodor based on the microbiota in saliva samples. ( 0,579632757496355 )
J Biomed Inform - Relational machine learning for electronic health record-driven phenotyping. ( 0,579604052047925 )
Comput. Biol. Med. - A hybrid feature selection method for DNA microarray data. ( 0,579178231415799 )
Appl Clin Inform - The contribution of the vaccine adverse event text mining system to the classification of possible Guillain-Barré syndrome reports. ( 0,578551797072318 )
AMIA Annu Symp Proc - Identifying discourse connectives in biomedical text. ( 0,578023708174196 )
AMIA Annu Symp Proc - Automated disambiguation of acronyms and abbreviations in clinical texts: window and training size considerations. ( 0,575684154465087 )
AMIA Annu Symp Proc - TagLine: Information Extraction for Semi-Structured Text in Medical Progress Notes. ( 0,574034146995096 )
AMIA Annu Symp Proc - Detecting abbreviations in discharge summaries using machine learning methods. ( 0,573301703113571 )
Artif Intell Med - Selective voting in convex-hull ensembles improves classification accuracy. ( 0,573093551830189 )
Med Biol Eng Comput - Evaluation of feature extraction methods for EEG-based brain-computer interfaces in terms of robustness to slight changes in electrode locations. ( 0,571919579733882 )
BMC Med Inform Decis Mak - Recognition of medication information from discharge summaries using ensembles of classifiers. ( 0,571720858311881 )
J Integr Bioinform - On the parameter optimization of Support Vector Machines for binary classification. ( 0,571344161563574 )
Brief. Bioinformatics - Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle? ( 0,57081185003741 )
J Biomed Inform - A simulation to analyze feature selection methods utilizing gene ontology for gene expression classification. ( 0,569664288848226 )
IEEE Trans Image Process - A unified feature and instance selection framework using optimum experimental design. ( 0,569166666962581 )
Artif Intell Med - Kernel machines for epilepsy diagnosis via EEG signal classification: a comparative study. ( 0,569014673211414 )
Comput Biol Chem - CE-PLoc: an ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition. ( 0,568798665748904 )
J Biomed Inform - An enhanced CRFs-based system for information extraction from radiology reports. ( 0,568711124591671 )
AMIA Annu Symp Proc - It's about this and that: a description of anaphoric expressions in clinical text. ( 0,566838750508861 )
Methods Inf Med - Correlation-based gene selection and classification using Taguchi-BPSO. ( 0,56452480237225 )
AMIA Annu Symp Proc - Developing a section labeler for clinical documents. ( 0,56428655959455 )
J Med Syst - Enhanced cancer recognition system based on random forests feature elimination algorithm. ( 0,563931692609711 )
Comput. Biol. Med. - Computer-aided diagnosis system for the Acute Respiratory Distress Syndrome from chest radiographs. ( 0,563582955328814 )
J Am Med Inform Assoc - N-gram support vector machines for scalable procedure and diagnosis classification, with applications to clinical free text data from the intensive care unit. ( 0,562968191343672 )
J Am Med Inform Assoc - A la Recherche du Temps Perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge. ( 0,562745548486772 )
J Biomed Inform - An efficient statistical feature selection approach for classification of gene expression data. ( 0,56258645091955 )
Comput Methods Programs Biomed - A random forest classifier for lymph diseases. ( 0,561846657173039 )
Comput Methods Programs Biomed - Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis. ( 0,561531893636624 )
AMIA Annu Symp Proc - Predicting discharge mortality after acute ischemic stroke using balanced data. ( 0,55978755384638 )
J Am Med Inform Assoc - Influenza detection from emergency department reports using natural language processing and Bayesian network classifiers. ( 0,55961637892433 )
AMIA Annu Symp Proc - Improving predictions in imbalanced data using Pairwise Expanded Logistic Regression. ( 0,559087568183734 )
J Med Syst - A software framework for building biomedical machine learning classifiers through grid computing resources. ( 0,557816730384875 )
Comput Math Methods Med - Recursive feature selection with significant variables of support vectors. ( 0,557679725922089 )
J Am Med Inform Assoc - Discovering body site and severity modifiers in clinical texts. ( 0,557043567839624 )
J Am Med Inform Assoc - A flexible framework for recognizing events, temporal expressions, and temporal relations in clinical text. ( 0,552314970957999 )
J Med Syst - Symptomatic vs. asymptomatic plaque classification in carotid ultrasound. ( 0,550792226105333 )
Artif Intell Med - Document classification for mining host pathogen protein-protein interactions. ( 0,550256823524402 )
J Am Med Inform Assoc - Comprehensive temporal information detection from clinical text: medical events, time, and TLINK identification. ( 0,550075121430067 )
J Am Med Inform Assoc - Feature engineering combined with machine learning and rule-based methods for structured information extraction from narrative clinical discharge summaries. ( 0,547268860921729 )
J Integr Bioinform - Improving imbalanced scientific text classification using sampling strategies and dictionaries. ( 0,545762845710951 )
Comput Biol Chem - A novel divide-and-merge classification for high dimensional datasets. ( 0,545383184554942 )
J Am Med Inform Assoc - Vaccine adverse event text mining system for extracting features from vaccine safety reports. ( 0,544306556053384 )
J Am Med Inform Assoc - Automatic identification of methotrexate-induced liver toxicity in patients with rheumatoid arthritis from the electronic medical record. ( 0,543848662939971 )
J Am Med Inform Assoc - Automatic discourse connective detection in biomedical text. ( 0,543630333368961 )
Comput. Biol. Med. - Classification of EMG signals using PSO optimized SVM for diagnosis of neuromuscular disorders. ( 0,543417563706239 )
J Med Syst - A three-stage expert system based on support vector machines for thyroid disease diagnosis. ( 0,541823734429984 )
J Am Med Inform Assoc - Diagnosis code assignment: models and evaluation metrics. ( 0,541503033760023 )
Int J Neural Syst - Improved adaptive splitting and selection: the hybrid training method of a classifier based on a feature space partitioning. ( 0,539574605867348 )
J Integr Bioinform - Evaluating the effect of unbalanced data in biomedical document classification. ( 0,539229144722115 )
J Biomed Inform - Ontology-guided feature engineering for clinical text classification. ( 0,539059695793714 )
Med Decis Making - Automatically annotating topics in transcripts of patient-provider interactions via machine learning. ( 0,538478389517424 )
J Biomed Inform - Disambiguation of ambiguous biomedical terms using examples generated from the UMLS Metathesaurus. ( 0,538318270750005 )
Comput. Biol. Med. - On the relevance of automatically selected single-voxel MRS and multimodal MRI and MRSI features for brain tumour differentiation. ( 0,537515724989086 )
Int J Comput Assist Radiol Surg - Optimization of breast mass classification using sequential forward floating selection (SFFS) and a support vector machine (SVM) model. ( 0,536972582747505 )
Artif Intell Med - Figure classification in biomedical literature to elucidate disease mechanisms, based on pathways. ( 0,536962599147546 )
AMIA Annu Symp Proc - Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations. ( 0,536748213761387 )
Comput Methods Programs Biomed - Complex extreme learning machine applications in terahertz pulsed signals feature sets. ( 0,5365643248736 )
J Biomed Inform - Using PharmGKB to train text mining approaches for identifying potential gene targets for pharmacogenomic studies. ( 0,53557411519639 )
Perspect Health Inf Manag - Adding a genomic healthcare component to a health information management curriculum. ( 0,535448964548711 )
J Chem Inf Model - Classifying molecules using a sparse probabilistic kernel binary classifier. ( 0,535315558510874 )
Comput. Biol. Med. - A novel class dependent feature selection method for cancer biomarker discovery. ( 0,53527742875557 )
Comput. Biol. Med. - Computerized system for recognition of autism on the basis of gene expression microarray data. ( 0,535261644857542 )
Artif Intell Med - Classification of small lesions on dynamic breast MRI: Integrating dimension reduction and out-of-sample extension into CADx methodology. ( 0,535116668286285 )
Artif Intell Med - Computer-aided diagnosis of pulmonary nodules using a two-step approach for feature selection and classifier ensemble construction. ( 0,533887292516482 )
Comput Biol Chem - Derivation of an artificial gene to improve classification accuracy upon gene selection. ( 0,531936037290847 )
Comput. Biol. Med. - An ensemble system for automatic sleep stage classification using single channel EEG signal. ( 0,531364626005038 )
J Biomed Inform - Complex epilepsy phenotype extraction from narrative clinical discharge summaries. ( 0,531191839743207 )
Comput Biol Chem - Compact cancer biomarkers discovery using a swarm intelligence feature selection algorithm. ( 0,530803596965095 )