J Biomed Inform - A semantic framework to protect the privacy of electronic health records with non-numerical attributes.

Topics

{ method(1969) cluster(1462) data(1082) }
{ data(1737) use(1416) pattern(1282) }
{ framework(1458) process(801) describ(734) }
{ system(1050) medic(1026) inform(1018) }
{ ehr(2073) health(1662) electron(1139) }
{ data(3008) multipl(1320) sourc(1022) }
{ can(774) often(719) complex(702) }
{ method(2212) result(1239) propos(1039) }
{ monitor(1329) mobil(1314) devic(1160) }
{ patient(1821) servic(1111) care(1106) }
{ concept(1167) ontolog(924) domain(897) }
{ method(1557) propos(1049) approach(1037) }
{ perform(999) metric(946) measur(919) }
{ health(3367) inform(1360) care(1135) }
{ age(1611) year(1155) adult(843) }
{ detect(2391) sensit(1101) algorithm(908) }
{ sequenc(1873) structur(1644) protein(1328) }
{ take(945) account(800) differ(722) }
{ treatment(1704) effect(941) patient(846) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ featur(1941) imag(1645) propos(1176) }
{ case(1353) use(1143) diagnosi(1136) }
{ studi(1410) differ(1259) use(1210) }
{ record(1888) medic(1808) patient(1693) }
{ result(1111) use(1088) new(759) }
{ inform(2794) health(2639) internet(1427) }
{ featur(3375) classif(2383) classifi(1994) }
{ network(2748) neural(1063) input(814) }
{ data(1714) softwar(1251) tool(1186) }
{ control(1307) perform(991) simul(935) }
{ howev(809) still(633) remain(590) }
{ model(2341) predict(2261) use(1141) }
{ compound(1573) activ(1297) structur(1058) }
{ perform(1367) use(1326) method(1137) }
{ studi(1119) effect(1106) posit(819) }
{ state(1844) use(1261) util(961) }
{ signal(2180) analysi(812) frequenc(800) }
{ sampl(1606) size(1419) use(1276) }
{ gene(2352) biolog(1181) express(1162) }
{ first(2504) two(1366) second(1323) }
{ health(1844) social(1437) communiti(874) }
{ cancer(2502) breast(956) screen(824) }
{ use(976) code(926) identifi(902) }
{ model(3404) distribut(989) bayesian(671) }
{ imag(1947) propos(1133) code(1026) }
{ system(1976) rule(880) can(841) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ method(1219) similar(1157) match(930) }
{ imag(2830) propos(1344) filter(1198) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ chang(1828) time(1643) increas(1301) }
{ learn(2355) train(1041) set(1003) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ extract(1171) text(1153) clinic(932) }
{ design(1359) user(1324) use(1319) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ general(901) number(790) one(736) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ data(3963) clinic(1234) research(1004) }
{ risk(3053) factor(974) diseas(938) }
{ research(1085) discuss(1038) issu(1018) }
{ import(1318) role(1303) understand(862) }
{ visual(1396) interact(850) tool(830) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ model(3480) simul(1196) paramet(876) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ model(2656) set(1616) predict(1553) }
{ data(2317) use(1299) case(1017) }
{ medic(1828) order(1363) alert(1069) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ structur(1116) can(940) graph(676) }
{ high(1669) rate(1365) level(1280) }
{ use(1733) differ(960) four(931) }
{ drug(1928) target(777) effect(648) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ process(1125) use(805) approach(778) }
{ activ(1452) weight(1219) physic(1104) }

Abstract

Structured patient data such as Electronic Health Records (EHRs) are a valuable source for clinical research. However, the sensitive nature of this information requires that an anonymisation procedure be applied before the data are released to third parties. Several studies have shown that removing identifying attributes, such as the Social Security Number, is not enough to obtain an anonymous data file, since unique combinations of other attributes, for example rare diagnoses and personalised treatments, may lead to disclosure of the patient's identity. To tackle this problem, Statistical Disclosure Control (SDC) methods have been proposed to mask sensitive attributes while preserving, to a certain degree, the utility of the anonymised data. Most of these methods focus on continuous-scale numerical data. Because part of the clinical data found in EHRs is expressed with non-numerical attributes (diagnoses, symptoms, procedures, etc.), applying these methods to EHRs produces far from optimal results. In this paper, we propose a general framework to enable the accurate application of SDC methods to non-numerical clinical data, with a focus on the preservation of semantics. To do so, we exploit structured medical knowledge bases such as SNOMED CT to propose semantically grounded operators to compare, aggregate and sort non-numerical terms. Our framework has been applied to several well-known SDC methods and evaluated using a real clinical dataset with non-numerical attributes. Results show that the exploitation of medical semantics produces anonymised datasets that better preserve the utility of EHRs.
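To make the idea of operators that compare, aggregate and sort non-numerical terms more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a tiny hand-written is-a taxonomy (the `PARENT` table is hypothetical and is not real SNOMED CT content) and uses the number of is-a edges between two terms as the semantic distance, which is only one of several possible measures. The function names `distance`, `centroid` and `semantic_sort` are illustrative.

```python
# Toy is-a taxonomy (child -> parent). Hypothetical example data,
# NOT taken from SNOMED CT; the root has parent None.
PARENT = {
    "disease": None,
    "respiratory disease": "disease",
    "cardiovascular disease": "disease",
    "asthma": "respiratory disease",
    "bronchitis": "respiratory disease",
    "hypertension": "cardiovascular disease",
    "angina": "cardiovascular disease",
}


def ancestors(term):
    """Return the path from a term up to the root, the term itself included."""
    path = []
    while term is not None:
        path.append(term)
        term = PARENT[term]
    return path


def distance(a, b):
    """Compare: count the is-a edges on the path between a and b
    through their closest common ancestor (an assumed measure)."""
    steps_from_b = {t: i for i, t in enumerate(ancestors(b))}
    for steps_from_a, t in enumerate(ancestors(a)):
        if t in steps_from_b:
            return steps_from_a + steps_from_b[t]
    raise ValueError("terms share no common ancestor")


def centroid(terms):
    """Aggregate: pick the taxonomy concept with the smallest summed
    distance to the given terms (ties broken alphabetically)."""
    return min(sorted(PARENT), key=lambda c: sum(distance(c, t) for t in terms))


def semantic_sort(terms, reference):
    """Sort: order terms by their distance from a reference concept."""
    return sorted(terms, key=lambda t: (distance(reference, t), t))


if __name__ == "__main__":
    group = ["asthma", "bronchitis", "hypertension"]
    print(distance("asthma", "bronchitis"))
    print(centroid(group))
    print(semantic_sort(group, "asthma"))
```

In a masking method such as microaggregation, a group of records could have its diagnoses replaced by the group's `centroid`, so that the released value stays semantically close to the originals. The alphabetical tie-break is an arbitrary choice made only to keep the sketch deterministic.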

Cleaned Abstract

structur patient data like electron health record ehr valuabl sourc clinic research howev sensit natur inform requir anonymis procedur appli releas data third parti sever studi shown remov identifi attribut like social secur number enough obtain anonym data file sinc uniqu combin attribut exampl rare diagnos personalis treatment may lead patient ident disclosur tackl problem statist disclosur control sdc method propos mask sensit attribut preserv certain degre util anonymis data method focus continuousscal numer data consid part clinic data found ehr express nonnumer attribut exampl diagnos symptom procedur etc applic ehr produc far optim result paper propos general framework enabl accur applic sdc method nonnumer clinic data focus preserv semant exploit structur medic knowledg base like snome ct propos semanticallyground oper compar aggreg sort nonnumer term framework appli sever wellknown sdc method evalu use real clinic dataset nonnumer attribut result show exploit medic semant produc anonymis dataset better preserv util ehr

Similar Abstracts

IEEE Trans Pattern Anal Mach Intell - A Link-Based Approach to the Cluster Ensemble Problem. ( 0,629010225217973 )
Int J Health Geogr - Detecting activity locations from raw GPS data: a novel kernel-based algorithm. ( 0,607072468639369 )
IEEE Trans Neural Netw Learn Syst - Improved Fault Classification in Series Compensated Transmission Line: Comparative Evaluation of Chebyshev Neural Network Training Algorithms. ( 0,60627066367111 )
J Integr Bioinform - Clustering of gene expression profiles: creating initialization-independent clusterings by eliminating unstable genes. ( 0,605881895186771 )
J Med Syst - Application of attribute weighting method based on clustering centers to discrimination of linearly non-separable medical datasets. ( 0,599576879128751 )
Int J Health Geogr - A binary-based approach for detecting irregularly shaped clusters. ( 0,599380349842438 )
Med Decis Making - Multiple imputation methods for handling missing data in cost-effectiveness analyses that use data from hierarchical studies: an application to cluster randomized trials. ( 0,595425997364772 )
IEEE Trans Pattern Anal Mach Intell - Semi-Supervised Kernel Mean Shift Clustering. ( 0,593958454257881 )
IEEE Trans Vis Comput Graph - GPU-based Multilevel Clustering. ( 0,59379607184634 )
Spat Spatiotemporal Epidemiol - Optimal selection of the spatial scan parameters for cluster detection: a simulation study. ( 0,59242820586458 )
AMIA Annu Symp Proc - Using hierarchical mixture of experts model for fusion of outbreak detection methods. ( 0,59193887358612 )
J Am Med Inform Assoc - Privacy-preserving heterogeneous health data sharing. ( 0,58792530643142 )
J. Med. Internet Res. - Security analysis and improvements to the PsychoPass method. ( 0,581272842054109 )
Health Info Libr J - A bibliometric approach demonstrates the impact of a social care data set on research and policy. ( 0,569941241752874 )
Methods Inf Med - A pictorial schema for a comprehensive user-oriented identification of medical Apps. ( 0,569090450280153 )
Int J Neural Syst - Adaptive k-means algorithm for overlapped graph clustering. ( 0,567202777316623 )
Comput Methods Programs Biomed - Fuzzy and hard clustering analysis for thyroid disease. ( 0,5652452822063 )
J Biomed Inform - Average correlation clustering algorithm (ACCA) for grouping of co-regulated genes with similar pattern of variation in their expression values. ( 0,564851926634181 )
Neural Comput - Spontaneous clustering via minimum γ-divergence. ( 0,563073265596647 )
J Biomed Inform - Clustering clinical models from local electronic health records based on semantic similarity. ( 0,562178667058497 )
Int J Health Geogr - Detection of arbitrarily-shaped clusters using a neighbor-expanding approach: a case study on murine typhus in south Texas. ( 0,562118569961746 )
J Integr Bioinform - An evolutionary and visual framework for clustering of DNA microarray data. ( 0,556905685754336 )
Methods Inf Med - Multidimensional point transform for public health practice. ( 0,555639505813224 )
IEEE Trans Vis Comput Graph - Point-Based Visualization for Large Hierarchies. ( 0,553399335735193 )
Comput Math Methods Med - Localizing true brain interactions from EEG and MEG data with subspace methods and modified beamformers. ( 0,549217880789588 )
Med Biol Eng Comput - A mathematical method for constraint-based cluster analysis towards optimized constrictive diameter smoothing of saphenous vein grafts. ( 0,548519505510815 )
J. Comput. Biol. - A geometric clustering algorithm with applications to structural data. ( 0,547067567480176 )
J Biomed Inform - Use patterns of health information exchange through a multidimensional lens: conceptual framework and empirical validation. ( 0,543283257493523 )
J Biomed Inform - A data recipient centered de-identification method to retain statistical attributes. ( 0,542655805756604 )
BMC Med Inform Decis Mak - Efficient algorithms for fast integration on large data sets from multiple sources. ( 0,54210539246602 )
IEEE Trans Image Process - Maximum a posteriori video super-resolution using a new multichannel image prior. ( 0,541675025913666 )
Comput Math Methods Med - Liver segmentation based on Snakes Model and improved GrowCut algorithm in abdominal CT image. ( 0,541574258387481 )
AMIA Annu Symp Proc - Anomaly and signature filtering improve classifier performance for detection of suspicious access to EHRs. ( 0,539210610742807 )
J Biomed Inform - Quantifying the determinants of outbreak detection performance through simulation and machine learning. ( 0,537606542498837 )
Artif Intell Med - Weighted spherical 1-mean with phase shift and its application in electrocardiogram discord detection. ( 0,537569612989791 )
Int J Health Geogr - Detection of clusters of a rare disease over a large territory: performance of cluster detection methods. ( 0,535782350686184 )
J Biomed Inform - Predictive combinations of monitor alarms preceding in-hospital code blue events. ( 0,535466721599137 )
BMC Med Inform Decis Mak - Development and evaluation of a de-identification procedure for a case register sourced from mental health electronic records. ( 0,535147082214786 )
J Am Med Inform Assoc - Structural models used in real-time biosurveillance outbreak detection and outbreak curve isolation from noisy background morbidity levels. ( 0,535016104548306 )
Comput. Biol. Med. - A methodology to identify consensus classes from clustering algorithms applied to immunohistochemical data from breast cancer patients. ( 0,534038607177028 )
J Chem Inf Model - Metabolism site prediction based on xenobiotic structural formulas and PASS prediction algorithm. ( 0,532547503585754 )
J Am Med Inform Assoc - A collaborative approach to developing an electronic health record phenotyping algorithm for drug-induced liver injury. ( 0,531651373705127 )
Med Decis Making - Cost-saving tree-structured survival analysis for hip fracture of study of osteoporotic fractures data. ( 0,52950977164717 )
Brief. Bioinformatics - Batch effect removal methods for microarray gene expression data integration: a survey. ( 0,52721255412188 )
Med Decis Making - Developing appropriate methods for cost-effectiveness analysis of cluster randomized trials. ( 0,527179293493841 )
Comput Methods Programs Biomed - Aneurysm identification by analysis of the blood-vessel skeleton. ( 0,526330525499243 )
J Med Syst - Employing post-DEA cross-evaluation and cluster analysis in a sample of Greek NHS hospitals. ( 0,525882523020977 )
J Chem Inf Model - Investigation of the use of spectral clustering for the analysis of molecular data. ( 0,525690634539021 )
J Am Med Inform Assoc - Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. ( 0,52465604157443 )
J. Med. Internet Res. - Missing data approaches in eHealth research: simulation study and a tutorial for nonmathematically inclined researchers. ( 0,524038458018328 )
J. Comput. Biol. - Inconsistent Denoising and Clustering Algorithms for Amplicon Sequence Data. ( 0,52311907295939 )
Methods Inf Med - A database de-identification framework to enable direct queries on medical data for secondary use. ( 0,522512580419939 )
J Biomed Inform - Design patterns for the development of electronic health record-driven phenotype extraction algorithms. ( 0,521559468236902 )
Comput Methods Programs Biomed - Improvements on a privacy-protection algorithm for DNA sequences with generalization lattices. ( 0,519377120958307 )
Comput. Biol. Med. - CAM: a web tool for combining array CGH and microarray gene expression data from multiple samples. ( 0,518943147012413 )
IEEE Trans Pattern Anal Mach Intell - Iterative Discovery of Multiple Alternative Clustering Views. ( 0,517238874223183 )
J Biomed Inform - Mining association language patterns using a distributional semantic model for negative life event classification. ( 0,516186571993325 )
J Chem Inf Model - Benchmark data sets for structure-based computational target prediction. ( 0,515324695033509 )
J Biomed Inform - Statistical file matching of flow cytometry data. ( 0,515148886964746 )
Comput Math Methods Med - A wavelet relational fuzzy C-means algorithm for 2D gel image segmentation. ( 0,51357348917354 )
IEEE Trans Image Process - Entropy-functional-based online adaptive decision fusion framework with application to wildfire detection in video. ( 0,511244022977796 )
J Chem Inf Model - Digital data repositories in chemistry and their integration with journals and electronic notebooks. ( 0,511190911823186 )
IEEE Trans Image Process - Video keyframe analysis using a segment-based statistical metric in a visually sensitive parametric space. ( 0,51074731338049 )
IEEE Trans Image Process - Self-adaptively Weighted Co-saliency Detection via Rank Constraint. ( 0,510448867438152 )
Comput Math Methods Med - A study of rough set approach in gastroenterology. ( 0,508697056734877 )
J Chem Inf Model - Comparison of combinatorial clustering methods on pharmacological data sets represented by machine learning-selected real molecular descriptors. ( 0,508580402010525 )
Comput Math Methods Med - Group factor analysis for Alzheimer's disease. ( 0,508463441541507 )
Comput Math Methods Med - A robust rerank approach for feature selection and its application to pooling-based GWA studies. ( 0,508096296156892 )
IEEE J Biomed Health Inform - The effects of lossy compression on diagnostically relevant seizure information in EEG signals. ( 0,505567668018563 )
Comput Math Methods Med - Recent progress on the factorization method for electrical impedance tomography. ( 0,505454645264442 )
AMIA Annu Symp Proc - Mining the human phenome using semantic web technologies: a case study for Type 2 Diabetes. ( 0,504022964982261 )
J Chem Inf Model - Consensus methods for combining multiple clusterings of chemical structures. ( 0,503895440071787 )
J Med Syst - Evaluating cluster preservation in frequent itemset integration for distributed databases. ( 0,503861803854644 )
Comput. Biol. Med. - Identifying patients in target customer segments using a two-stage clustering-classification approach: a hospital-based assessment. ( 0,503248294023553 )
J Biomed Inform - Complementary ensemble clustering of biomedical data. ( 0,502677865396582 )
Spat Spatiotemporal Epidemiol - Performance of cancer cluster Q-statistics for case-control residential histories. ( 0,502481808894701 )
Artif Intell Med - Vicinal support vector classifier using supervised kernel-based clustering. ( 0,501147039359677 )
BMC Med Inform Decis Mak - Evaluating quality of care for patients with type 2 diabetes using electronic health record information in Mexico. ( 0,500002795151716 )
Int J Health Geogr - Using statistical methods and genotyping to detect tuberculosis outbreaks. ( 0,49925849816019 )
Comput. Aided Surg. - The Equidistant Method - a novel hip joint simulation algorithm for detection of femoroacetabular impingement. ( 0,499206330854715 )
IEEE Trans Image Process - Linear discriminant analysis based on L1-norm maximization. ( 0,498584918345752 )
IEEE Trans Pattern Anal Mach Intell - A Minimum Volume Covering Approach With a Set of Ellipsoids. ( 0,496904072455304 )
Comput Math Methods Med - Decimative spectral estimation with unconstrained model order. ( 0,496602465491246 )
Brief. Bioinformatics - A large-scale benchmark study of existing algorithms for taxonomy-independent microbial community analysis. ( 0,495865553069212 )
J. Comput. Biol. - EDAR: an efficient error detection and removal algorithm for next generation sequencing data. ( 0,495733129514255 )
J Med Syst - Data mining in healthcare and biomedicine: a survey of the literature. ( 0,494407067066081 )
J Biomed Inform - Exploring the ncRNA-ncRNA patterns based on bridging rules. ( 0,494091743938315 )
AMIA Annu Symp Proc - Patient clustering with uncoded text in electronic medical records. ( 0,492299582805417 )
Methods Inf Med - Semantic interoperability adheres to proper models and code systems. A detailed examination of different approaches for score systems. ( 0,491920804938955 )
Methods Inf Med - Application of microarray analysis on computer cluster and cloud platforms. ( 0,491404191808839 )
J. Comput. Biol. - Fast geometric consensus approach for protein model quality assessment. ( 0,491224374373112 )
Appl Clin Inform - Interest in health information exchange in ambulatory care: a statewide survey. ( 0,490809538348708 )
Int J Health Geogr - Interactive web-based mapping: bridging technology and data for health. ( 0,490117598311269 )
IEEE Trans Image Process - Orientation imaging microscopy with optimized convergence angle using CBED patterns in TEMs. ( 0,48906943226905 )
Comput. Biol. Med. - Analysis of adductors angle measurement in Hammersmith infant neurological examinations using mean shift segmentation and feature point based object tracking. ( 0,488679530446552 )
Comput. Biol. Med. - Using partial decision trees to predict Parkinson's symptoms: a new approach for diagnosis and therapy in patients suffering from Parkinson's disease. ( 0,48866880883667 )
J. Comput. Biol. - Biological cluster evaluation for gene function prediction. ( 0,488107772322209 )
Comput Methods Programs Biomed - Generating correlated discrete ordinal data using R and SAS IML. ( 0,48680790977138 )
AMIA Annu Symp Proc - A healthcare utilization analysis framework for hot spotting and contextual anomaly detection. ( 0,486425516989226 )
Int J Neural Syst - A genetic graph-based approach for partitional clustering. ( 0,486352861763197 )