J Am Med Inform Assoc - A benchmark comparison of deterministic and probabilistic methods for defining manual review datasets in duplicate records reconciliation.

Tópicos

{ detect(2391) sensit(1101) algorithm(908) }
{ perform(1367) use(1326) method(1137) }
{ perform(999) metric(946) measur(919) }
{ sampl(1606) size(1419) use(1276) }
{ model(3404) distribut(989) bayesian(671) }
{ featur(3375) classif(2383) classifi(1994) }
{ imag(2675) segment(2577) method(1081) }
{ error(1145) method(1030) estim(1020) }
{ activ(1452) weight(1219) physic(1104) }
{ can(774) often(719) complex(702) }
{ model(3480) simul(1196) paramet(876) }
{ use(976) code(926) identifi(902) }
{ take(945) account(800) differ(722) }
{ problem(2511) optim(1539) algorithm(950) }
{ control(1307) perform(991) simul(935) }
{ studi(1410) differ(1259) use(1210) }
{ record(1888) medic(1808) patient(1693) }
{ model(2656) set(1616) predict(1553) }
{ data(3008) multipl(1320) sourc(1022) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ system(1976) rule(880) can(841) }
{ method(1219) similar(1157) match(930) }
{ network(2748) neural(1063) input(814) }
{ chang(1828) time(1643) increas(1301) }
{ learn(2355) train(1041) set(1003) }
{ extract(1171) text(1153) clinic(932) }
{ general(901) number(790) one(736) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ howev(809) still(633) remain(590) }
{ model(2341) predict(2261) use(1141) }
{ studi(1119) effect(1106) posit(819) }
{ group(2977) signific(1463) compar(1072) }
{ use(2086) technolog(871) perceiv(783) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ use(1733) differ(960) four(931) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ survey(1388) particip(1329) question(1065) }
{ imag(1947) propos(1133) code(1026) }
{ data(1737) use(1416) pattern(1282) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ imag(2830) propos(1344) filter(1198) }
{ patient(2315) diseas(1263) diabet(1191) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ concept(1167) ontolog(924) domain(897) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ method(1557) propos(1049) approach(1037) }
{ data(1714) softwar(1251) tool(1186) }
{ design(1359) user(1324) use(1319) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ featur(1941) imag(1645) propos(1176) }
{ case(1353) use(1143) diagnosi(1136) }
{ data(3963) clinic(1234) research(1004) }
{ risk(3053) factor(974) diseas(938) }
{ research(1085) discuss(1038) issu(1018) }
{ system(1050) medic(1026) inform(1018) }
{ import(1318) role(1303) understand(862) }
{ visual(1396) interact(850) tool(830) }
{ compound(1573) activ(1297) structur(1058) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ health(3367) inform(1360) care(1135) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ gene(2352) biolog(1181) express(1162) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ health(1844) social(1437) communiti(874) }
{ structur(1116) can(940) graph(676) }
{ implement(1333) system(1263) develop(1122) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ process(1125) use(805) approach(778) }
{ method(1969) cluster(1462) data(1082) }
{ method(2212) result(1239) propos(1039) }

Resumo

TRODUCTION: Clinical databases require accurate entity resolution (ER). One approach is to use algorithms that assign questionable cases to manual review. Few studies have compared the performance of common algorithms for such a task. Furthermore, previous work has been limited by a lack of objective methods for setting algorithm parameters. We compared the performance of common ER algorithms: using algorithmic optimization, rather than manual parameter tuning, and on two-threshold classification (match/manual review/non-match) as well as single-threshold (match/non-match).METHODS: We manually reviewed 20,000 randomly selected, potential duplicate record-pairs to identify matches (10,000 training set, 10,000 test set). We evaluated the probabilistic expectation maximization, simple deterministic and fuzzy inference engine (FIE) algorithms. We used particle swarm to optimize algorithm parameters for a single and for two thresholds. We ran 10 iterations of optimization using the training set and report averaged performance against the test set.RESULTS: The overall estimated duplicate rate was 6%. FIE and simple deterministic algorithms allowed a lower manual review set compared to the probabilistic method (FIE 1.9%, simple deterministic 2.5%, probabilistic 3.6%; p<0.001). For a single threshold, the simple deterministic algorithm performed better than the probabilistic method (positive predictive value 0.956 vs 0.887, sensitivity 0.985 vs 0.887, p<0.001). ER with FIE classifies 98.1% of record-pairs correctly (1/10,000 error rate), assigning the remainder to manual review.CONCLUSIONS: Optimized deterministic algorithms outperform the probabilistic method. There is a strong case for considering optimized deterministic methods for ER.

Resumo Limpo

troduct clinic databas requir accur entiti resolut er one approach use algorithm assign question case manual review studi compar perform common algorithm task furthermor previous work limit lack object method set algorithm paramet compar perform common er algorithm use algorithm optim rather manual paramet tune twothreshold classif matchmanu reviewnonmatch well singlethreshold matchnonmatchmethod manual review random select potenti duplic recordpair identifi match train set test set evalu probabilist expect maxim simpl determinist fuzzi infer engin fie algorithm use particl swarm optim algorithm paramet singl two threshold ran iter optim use train set report averag perform test setresult overal estim duplic rate fie simpl determinist algorithm allow lower manual review set compar probabilist method fie simpl determinist probabilist p singl threshold simpl determinist algorithm perform better probabilist method posit predict valu vs sensit vs p er fie classifi recordpair correct error rate assign remaind manual reviewconclus optim determinist algorithm outperform probabilist method strong case consid optim determinist method er

Resumos Similares

IEEE Trans Pattern Anal Mach Intell - Robust Text Detection in Natural Scene Images. ( 0,719295294459181 )
Int J Neural Syst - Automated seizure detection using EKG. ( 0,714037154479903 )
Med Biol Eng Comput - Quasi real-time gait event detection using shank-attached gyroscopes. ( 0,711372572031976 )
Comput Math Methods Med - Bayesian method with spatial constraint for retinal vessel segmentation. ( 0,707439272662954 )
IEEE J Biomed Health Inform - Automatic identification and classification of muscle spasms in long-term EMG recordings. ( 0,693899806815492 )
Comput Methods Programs Biomed - Unsupervised skin lesions border detection via two-dimensional image analysis. ( 0,683483280524915 )
Comput Methods Programs Biomed - Automated pulmonary nodule detection based on three-dimensional shape-based feature descriptor. ( 0,680475414530137 )
Comput. Biol. Med. - Myocardial border detection from ventriculograms using support vector machines and real-coded genetic algorithms. ( 0,667960521926902 )
Int J Neural Syst - Kernel collaborative representation-based automatic seizure detection in intracranial EEG. ( 0,667625070751293 )
Comput Methods Programs Biomed - Blood vessel segmentation methodologies in retinal images--a survey. ( 0,66297171065221 )
Int J Comput Assist Radiol Surg - Hybrid method for the detection of pulmonary nodules using positron emission tomography/computed tomography: a preliminary study. ( 0,658895661251433 )
J Clin Monit Comput - Detection of endobronchial intubation by monitoring the CO2 level above the endotracheal cuff. ( 0,656674462101597 )
IEEE Trans Pattern Anal Mach Intell - Automatic and Accurate Shadow Detection using Near-Infrared Information. ( 0,655339380391442 )
J Am Med Inform Assoc - Adjusting outbreak detection algorithms for surveillance during epidemic and non-epidemic periods. ( 0,655011188659823 )
Comput. Biol. Med. - A user-operated test of suprathreshold acuity in noise for adult hearing screening: The SUN (Speech Understanding in Noise) test. ( 0,650725248689979 )
J. Comput. Biol. - Feature detection with controlled error rates in LC/MS images. ( 0,645715180984616 )
Int J Comput Assist Radiol Surg - Automated measurement of mandibular cortical width on dental panoramic radiographs. ( 0,642841024771025 )
Comput Methods Programs Biomed - Automated detection of exudates and macula for grading of diabetic macular edema. ( 0,636925868965831 )
J Am Med Inform Assoc - Syndromic surveillance for health information system failures: a feasibility study. ( 0,628446737624632 )
IEEE Trans Image Process - Lightweight detection of additive watermarking in the DWT-domain. ( 0,628421190044931 )
Comput Math Methods Med - Automatic segmentation and measurement of vasculature in retinal fundus images using probabilistic formulation. ( 0,627130983098102 )
BMC Med Inform Decis Mak - Outbreak detection algorithms for seasonal disease data: a case study using Ross River virus disease. ( 0,625155463550907 )
J. Med. Internet Res. - FluBreaks: early epidemic detection from Google flu trends. ( 0,62347881297176 )
Int J Comput Assist Radiol Surg - Detection and quantification of intracerebral and intraventricular hemorrhage from computed tomography images with adaptive thresholding and case-based reasoning. ( 0,622322168547988 )
Med Biol Eng Comput - Abnormal localization of immature precursors (ALIP) detection for early prediction of acute myelocytic leukemia (AML) relapse. ( 0,61807739637494 )
Comput. Biol. Med. - Automated detection of the osseous acetabular rim using three-dimensional models of the pelvis. ( 0,616134490428596 )
AMIA Annu Symp Proc - Optimized dual threshold entity resolution for electronic health record databases--training set size and active learning. ( 0,615394457050662 )
Comput Math Methods Med - New estimators and guidelines for better use of fetal heart rate estimators with Doppler ultrasound devices. ( 0,615336299468238 )
Comput Math Methods Med - Smart spotting of pulmonary TB cavities using CT images. ( 0,608741544001466 )
IEEE J Biomed Health Inform - Electrocardiogram classification using reservoir computing with logistic regression. ( 0,608524555193523 )
Comput Methods Programs Biomed - Debris removal in Pap-smear images. ( 0,604064034351917 )
BMC Med Inform Decis Mak - Improving sensitivity of machine learning methods for automated case identification from free-text electronic medical records. ( 0,603682866376144 )
Int J Comput Assist Radiol Surg - Computer-aided focal liver lesion detection. ( 0,603297111000381 )
J Am Med Inform Assoc - Phenotyping for patient safety: algorithm development for electronic health record based automated adverse event and medical error detection in neonatal intensive care. ( 0,602177858490715 )
J Am Med Inform Assoc - A simple heuristic for blindfolded record linkage. ( 0,600327401379815 )
J Am Med Inform Assoc - Use of computerized algorithm to identify individuals in need of testing for celiac disease. ( 0,600015789654923 )
IEEE Trans Image Process - Towards online iris and periocular recognition under relaxed imaging constraints. ( 0,598015641854522 )
AMIA Annu Symp Proc - Computer surveillance of patients at high risk for and with venous thromboembolism. ( 0,597205473962382 )
Comput. Biol. Med. - Automatic identification of fetal breathing movements in fetal RR interval time series. ( 0,596006222522974 )
Int J Neural Syst - Multi-instance dictionary learning for detecting abnormal events in surveillance videos. ( 0,595323512073349 )
J Clin Monit Comput - Pulse oximetry saturation patterns detect repetitive reductions in airflow. ( 0,592408777853142 )
Comput. Biol. Med. - Investigating the performance improvement of HRV Indices in CHF using feature selection methods based on backward elimination and statistical significance. ( 0,591564101823776 )
Comput. Biol. Med. - A correlation analysis-based detection and delineation of ECG characteristic events using template waveforms extracted by ensemble averaging of clustered heart cycles. ( 0,590692467917689 )
Comput. Biol. Med. - An image feature approach for computer-aided detection of ischemic stroke. ( 0,59049223721718 )
Comput Methods Programs Biomed - Automated detection of microaneurysms using scale-adapted blob analysis and semi-supervised learning. ( 0,58833187330807 )
J Med Syst - Automated screening of arrhythmia using wavelet based machine learning techniques. ( 0,587858915120195 )
Comput Math Methods Med - Higuchi fractal properties of onset epilepsy electroencephalogram. ( 0,586520354171208 )
Med Biol Eng Comput - Automatic breath-to-breath analysis of nocturnal polysomnographic recordings. ( 0,586484857546268 )
Int J Comput Assist Radiol Surg - Toward embedded detection of polyps in WCE images for early diagnosis of colorectal cancer. ( 0,585463380964682 )
Med Biol Eng Comput - Analysis of retinal fundus images for grading of diabetic retinopathy severity. ( 0,584147638000803 )
J Med Syst - Field programmable gate array based fuzzy neural signal processing system for differential diagnosis of QRS complex tachycardia and tachyarrhythmia in noisy ECG signals. ( 0,581771469254705 )
Comput. Biol. Med. - Region based stellate features combined with variable selection using AdaBoost learning in mammographic computer-aided detection. ( 0,579038492205568 )
Comput Biol Chem - ProteinLasso: A Lasso regression approach to protein inference problem in shotgun proteomics. ( 0,578785818753195 )
Comput Methods Programs Biomed - Influence of QRS complex detection errors on entropy algorithms. Application to heart rate variability discrimination. ( 0,578521857352329 )
Med Biol Eng Comput - GPU-based real-time detection and analysis of biological targets using solid-state nanopores. ( 0,574470212057268 )
Res Synth Methods - Methods for the joint meta-analysis of multiple tests. ( 0,57175703615929 )
Appl Clin Inform - Retrospective derivation and validation of a search algorithm to identify emergent endotracheal intubations in the intensive care unit. ( 0,571574871705342 )
Methods Inf Med - Central sleep apnea detection from ECG-derived respiratory signals. Application of multivariate recurrence plot analysis. ( 0,56746595282093 )
Comput Methods Programs Biomed - Accurate detection of blood vessels improves the detection of exudates in color fundus images. ( 0,567455849603051 )
Int J Comput Assist Radiol Surg - Diffusion tensor tractography of normal facial and vestibulocochlear nerves. ( 0,567139077575954 )
J Biomed Inform - Controlling false match rates in record linkage using extreme value theory. ( 0,56586432863773 )
BMC Med Inform Decis Mak - Evaluation of syndromic algorithms for detecting patients with potentially transmissible infectious diseases based on computerised emergency-department data. ( 0,565441571931705 )
Comput Methods Programs Biomed - Virtual colon flattening method based on colonic outer surface. ( 0,564868854638684 )
Med Decis Making - Detecting blood laboratory errors using a Bayesian network: an evaluation on liver enzyme tests. ( 0,563776024379597 )
Comput Math Methods Med - Hypovigilance detection for UCAV operators based on a hidden Markov model. ( 0,562911718876597 )
J Biomed Inform - Automation of a high risk medication regime algorithm in a home health care population. ( 0,562706513623486 )
Comput. Biol. Med. - Event-based progression detection strategies using scanning laser polarimetry images of the human retina. ( 0,562695360593833 )
J Med Syst - Inter-greedy technique for fusion of different segmentation strategies leading to high-performance carotid IMT measurement in ultrasound images. ( 0,56231994455449 )
Comput Math Methods Med - Automatic detection and quantification of WBCs and RBCs using iterative structured circle detection algorithm. ( 0,560697897944394 )
Int J Comput Assist Radiol Surg - Combination of computer-aided detection algorithms for automatic lung nodule identification. ( 0,560051311578239 )
IEEE J Biomed Health Inform - An automated screening system for tuberculosis. ( 0,55774666421359 )
Comput. Biol. Med. - An IVUS image-based approach for improvement of coronary plaque characterization. ( 0,556794289055109 )
Med Biol Eng Comput - A robust method for online heart sound localization in respiratory sound based on temporal fuzzy c-means. ( 0,554306184783742 )
Comput Methods Programs Biomed - An automated decision-support system for non-proliferative diabetic retinopathy disease based on MAs and HAs detection. ( 0,554154434141074 )
Health Info Libr J - Where and how to search for information on the effectiveness of public health interventions - a case study for prevention of cardiovascular disease. ( 0,553183962245132 )
J Chem Inf Model - A multiscale simulation system for the prediction of drug-induced cardiotoxicity. ( 0,552637732086858 )
Appl Clin Inform - Towards prevention of acute syndromes: electronic identification of at-risk patients during hospital admission. ( 0,552415326482586 )
IEEE Trans Pattern Anal Mach Intell - Online Learning and Sequential Anomaly Detection in Trajectories. ( 0,551251996646136 )
J Med Syst - Automatic detection of the existence of subarachnoid hemorrhage from clinical CT images. ( 0,55052527228851 )
AMIA Annu Symp Proc - Methods for identifying suicide or suicidal ideation in EHRs. ( 0,550477187201271 )
J Biomed Inform - PICO element detection in medical text without metadata: are first sentences enough? ( 0,550366026896578 )
Comput Methods Programs Biomed - A method for corneal nerves automatic segmentation and morphometric analysis. ( 0,550220336179006 )
IEEE J Biomed Health Inform - Robust and sensitive video motion detection for sleep analysis. ( 0,546956219464124 )
Comput Methods Programs Biomed - A review of thresholding strategies applied to human chromosome segmentation. ( 0,545737602950446 )
Comput. Biol. Med. - Automatic exudate detection by fusing multiple active contours and regionwise classification. ( 0,545110611019623 )
J Med Syst - Three-dimensional SVM with latent variable: application for detection of lung lesions in CT images. ( 0,543524634110786 )
Artif Intell Med - Asynchronous gaze-independent event-related potential-based brain-computer interface. ( 0,541875165047227 )
IEEE Trans Image Process - Chromaticity space for illuminant invariant recognition. ( 0,540116247595796 )
BMC Med Inform Decis Mak - Manual and automated methods for identifying potentially preventable readmissions: a comparison in a large healthcare system. ( 0,539725447425719 )
BMC Med Inform Decis Mak - Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers. ( 0,538922535770065 )
Comput. Biol. Med. - Novel technique for ST-T interval characterization in patients with acute myocardial ischemia. ( 0,538663441824794 )
Comput. Biol. Med. - Tumor segmentation from computed tomography image data using a probabilistic pixel selection approach. ( 0,538280963072687 )
BMC Med Inform Decis Mak - Adverse drug events with hyperkalaemia during inpatient stays: evaluation of an automated method for retrospective detection in hospital databases. ( 0,537299846819462 )
Comput. Biol. Med. - Early detection of epileptic seizures based on parameter identification of neural mass model. ( 0,536540402167774 )
Comput. Biol. Med. - Real-time electrocardiogram P-QRS-T detection-delineation algorithm based on quality-supported analysis of characteristic templates. ( 0,536456112262449 )
BMC Med Inform Decis Mak - Genotypic tropism testing by massively parallel sequencing: qualitative and quantitative analysis. ( 0,535822210512439 )
Int J Comput Assist Radiol Surg - Object-based analysis of CT images for automatic detection and segmentation of hypodense liver lesions. ( 0,534044908511269 )
Comput Methods Programs Biomed - Automatic model-based tracing algorithm for vessel segmentation and diameter estimation. ( 0,533635958119567 )
Int J Comput Assist Radiol Surg - Automated MR morphometry to predict Alzheimer's disease in mild cognitive impairment. ( 0,53314174242239 )
IEEE Trans Image Process - Detection of dynamic background due to swaying movements from motion features. ( 0,532138495042245 )