J Am Med Inform Assoc - Choosing blindly but wisely: differentially private solicitation of DNA datasets for disease marker discovery.

Tópicos

{ model(2656) set(1616) predict(1553) }
{ system(1050) medic(1026) inform(1018) }
{ method(1557) propos(1049) approach(1037) }
{ drug(1928) target(777) effect(648) }
{ analysi(2126) use(1163) compon(1037) }
{ gene(2352) biolog(1181) express(1162) }
{ case(1353) use(1143) diagnosi(1136) }
{ algorithm(1844) comput(1787) effici(935) }
{ estim(2440) model(1874) function(577) }
{ state(1844) use(1261) util(961) }
{ howev(809) still(633) remain(590) }
{ studi(2440) review(1878) systemat(933) }
{ cost(1906) reduc(1198) effect(832) }
{ monitor(1329) mobil(1314) devic(1160) }
{ imag(2830) propos(1344) filter(1198) }
{ system(1976) rule(880) can(841) }
{ measur(2081) correl(1212) valu(896) }
{ decis(3086) make(1611) patient(1517) }
{ cancer(2502) breast(956) screen(824) }
{ data(2317) use(1299) case(1017) }
{ research(1218) medic(880) student(794) }
{ compound(1573) activ(1297) structur(1058) }
{ research(1085) discuss(1038) issu(1018) }
{ method(984) reconstruct(947) comput(926) }
{ general(901) number(790) one(736) }
{ design(1359) user(1324) use(1319) }
{ extract(1171) text(1153) clinic(932) }
{ concept(1167) ontolog(924) domain(897) }
{ assess(1506) score(1403) qualiti(1306) }
{ model(3404) distribut(989) bayesian(671) }
{ can(774) often(719) complex(702) }
{ imag(1947) propos(1133) code(1026) }
{ data(1737) use(1416) pattern(1282) }
{ inform(2794) health(2639) internet(1427) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ method(1219) similar(1157) match(930) }
{ detect(2391) sensit(1101) algorithm(908) }
{ method(2212) result(1239) propos(1039) }
{ method(1969) cluster(1462) data(1082) }
{ activ(1452) weight(1219) physic(1104) }
{ process(1125) use(805) approach(778) }
{ survey(1388) particip(1329) question(1065) }
{ implement(1333) system(1263) develop(1122) }
{ result(1111) use(1088) new(759) }
{ use(1733) differ(960) four(931) }
{ use(976) code(926) identifi(902) }
{ high(1669) rate(1365) level(1280) }
{ structur(1116) can(940) graph(676) }
{ health(1844) social(1437) communiti(874) }
{ can(981) present(881) function(850) }
{ use(2086) technolog(871) perceiv(783) }
{ patient(1821) servic(1111) care(1106) }
{ time(1939) patient(1703) rate(768) }
{ activ(1138) subject(705) human(624) }
{ intervent(3218) particip(2042) group(1664) }
{ first(2504) two(1366) second(1323) }
{ data(3008) multipl(1320) sourc(1022) }
{ sampl(1606) size(1419) use(1276) }
{ group(2977) signific(1463) compar(1072) }
{ signal(2180) analysi(812) frequenc(800) }
{ medic(1828) order(1363) alert(1069) }
{ age(1611) year(1155) adult(843) }
{ patient(2837) hospit(1953) medic(668) }
{ ehr(2073) health(1662) electron(1139) }
{ model(3480) simul(1196) paramet(876) }
{ health(3367) inform(1360) care(1135) }
{ record(1888) medic(1808) patient(1693) }
{ spatial(1525) area(1432) region(1030) }
{ blood(1257) pressur(1144) flow(957) }
{ studi(1119) effect(1106) posit(819) }
{ perform(1367) use(1326) method(1137) }
{ visual(1396) interact(850) tool(830) }
{ model(2341) predict(2261) use(1141) }
{ import(1318) role(1303) understand(862) }
{ perform(999) metric(946) measur(919) }
{ risk(3053) factor(974) diseas(938) }
{ studi(1410) differ(1259) use(1210) }
{ data(3963) clinic(1234) research(1004) }
{ featur(1941) imag(1645) propos(1176) }
{ search(2224) databas(1162) retriev(909) }
{ care(1570) inform(1187) nurs(1089) }
{ model(2220) cell(1177) simul(1124) }
{ control(1307) perform(991) simul(935) }
{ data(1714) softwar(1251) tool(1186) }
{ clinic(1479) use(1117) guidelin(835) }
{ learn(2355) train(1041) set(1003) }
{ chang(1828) time(1643) increas(1301) }
{ error(1145) method(1030) estim(1020) }
{ problem(2511) optim(1539) algorithm(950) }
{ framework(1458) process(801) describ(734) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ treatment(1704) effect(941) patient(846) }
{ motion(1329) object(1292) video(1091) }
{ take(945) account(800) differ(722) }
{ patient(2315) diseas(1263) diabet(1191) }
{ imag(2675) segment(2577) method(1081) }
{ network(2748) neural(1063) input(814) }
{ featur(3375) classif(2383) classifi(1994) }

Resumo

JECTIVE: To propose a new approach to privacy preserving data selection, which helps the data users access human genomic datasets efficiently without undermining patients' privacy.METHODS: Our idea is to let each data owner publish a set of differentially-private pilot data, on which a data user can test-run arbitrary association-test algorithms, including those not known to the data owner a priori. We developed a suite of new techniques, including a pilot-data generation approach that leverages the linkage disequilibrium in the human genome to preserve both the utility of the data and the privacy of the patients, and a utility evaluation method that helps the user assess the value of the real data from its pilot version with high confidence.RESULTS: We evaluated our approach on real human genomic data using four popular association tests. Our study shows that the proposed approach can help data users make the right choices in most cases.CONCLUSIONS: Even though the pilot data cannot be directly used for scientific discovery, it provides a useful indication of which datasets are more likely to be useful to data users, who can therefore approach the appropriate data owners to gain access to the data.

Resumo Limpo

jectiv propos new approach privaci preserv data select help data user access human genom dataset effici without undermin patient privacymethod idea let data owner publish set differentiallypriv pilot data data user can testrun arbitrari associationtest algorithm includ known data owner priori develop suit new techniqu includ pilotdata generat approach leverag linkag disequilibrium human genom preserv util data privaci patient util evalu method help user assess valu real data pilot version high confidenceresult evalu approach real human genom data use four popular associ test studi show propos approach can help data user make right choic casesconclus even though pilot data direct use scientif discoveri provid use indic dataset like use data user can therefor approach appropri data owner gain access data

Resumos Similares

J Chem Inf Model - Beyond the scope of Free-Wilson analysis: building interpretable QSAR models with machine learning algorithms. ( 0,734035035405618 )
J Biomed Inform - Quantifying the costs and benefits of privacy-preserving health data publishing. ( 0,732285080190593 )
J Chem Inf Model - Study of chromatographic retention of natural terpenoids by chemoinformatic tools. ( 0,724269769811408 )
AMIA Annu Symp Proc - Effect of data combination on predictive modeling: a study using gene expression data. ( 0,710038827009629 )
J. Med. Internet Res. - Outsourcing medical data analyses: can technology overcome legal, privacy, and confidentiality issues? ( 0,698299385813245 )
Artif Intell Med - Training artificial neural networks directly on the concordance index for censored data using genetic algorithms. ( 0,696866814552267 )
J Chem Inf Model - Time-split cross-validation as a method for estimating the goodness of prospective prediction. ( 0,694001208738203 )
J Chem Inf Model - RS-Predictor models augmented with SMARTCyp reactivities: robust metabolic regioselectivity predictions for nine CYP isozymes. ( 0,686821406971343 )
J Chem Inf Model - iLOGP: a simple, robust, and efficient description of n-octanol/water partition coefficient for drug design using the GB/SA approach. ( 0,682902768175732 )
J Chem Inf Model - Predicting pK(a) values of substituted phenols from atomic charges: comparison of different quantum mechanical methods and charge distribution schemes. ( 0,681768279904378 )
J Chem Inf Model - Does rational selection of training and test sets improve the outcome of QSAR modeling? ( 0,67069642241569 )
J Chem Inf Model - Pharmacophore assessment through 3-D QSAR: evaluation of the predictive ability on new derivatives by the application on a series of antitubercular agents. ( 0,670469467932117 )
BMC Med Inform Decis Mak - Concordance and predictive value of two adverse drug event data sets. ( 0,66856767368761 )
Int J Comput Assist Radiol Surg - Assessing performance in brain tumor resection using a novel virtual reality simulator. ( 0,660167095498854 )
Comput. Biol. Med. - A prediction model of substrates and non-substrates of breast cancer resistance protein (BCRP) developed by GA-CG-SVM method. ( 0,658176786243164 )
J Chem Inf Model - Best of both worlds: combining pharma data and state of the art modeling technology to improve in Silico pKa prediction. ( 0,653643986598878 )
J Chem Inf Model - GRID-based three-dimensional pharmacophores II: PharmBench, a benchmark data set for evaluating pharmacophore elucidation methods. ( 0,63956229552972 )
Comput Math Methods Med - Multiscale autoregressive identification of neuroelectrophysiological systems. ( 0,633229185831594 )
J Am Med Inform Assoc - Harvest: an open platform for developing web-based biomedical data discovery and reporting applications. ( 0,631658927050671 )
AMIA Annu Symp Proc - Advanced proficiency EHR training: effect on physicians' EHR efficiency, EHR satisfaction and job satisfaction. ( 0,630418453008515 )
J Biomed Inform - Selection of interdependent genes via dynamic relevance analysis for cancer diagnosis. ( 0,629365161896215 )
Artif Intell Med - Fuzzy model identification of dengue epidemic in Colombia based on multiresolution analysis. ( 0,627196861985259 )
AMIA Annu Symp Proc - Motivating the additional use of external validity: examining transportability in a model of glioblastoma multiforme. ( 0,623948278328915 )
Int J Health Geogr - Incorporating geographical factors with artificial neural networks to predict reference values of erythrocyte sedimentation rate. ( 0,614167380078838 )
BMC Med Inform Decis Mak - Regression tree construction by bootstrap: model search for DRG-systems applied to Austrian health-data. ( 0,60761091075176 )
J Chem Inf Model - Rank order entropy: why one metric is not enough. ( 0,605997235196886 )
J Chem Inf Model - Applicability domain based on ensemble learning in classification and regression analyses. ( 0,59490364258109 )
Spat Spatiotemporal Epidemiol - Spatial modelling of disease using data- and knowledge-driven approaches. ( 0,593423736312332 )
J Chem Inf Model - Comparative studies on some metrics for external validation of QSPR models. ( 0,593308708742859 )
Comput. Aided Surg. - Evaluation of a computational model to predict elbow range of motion. ( 0,591986609152379 )
Comput Methods Programs Biomed - Kinetic modelling of haemodialysis removal of myoglobin in rhabdomyolysis patients. ( 0,591920793468244 )
J Chem Inf Model - Criterion for evaluating the predictive ability of nonlinear regression models without cross-validation. ( 0,591751780422001 )
J Biomed Inform - Scalable privacy-preserving data sharing methodology for genome-wide association studies. ( 0,591380973825902 )
Med Biol Eng Comput - Share and enjoy: anatomical models database--generating and sharing cardiovascular model data using web services. ( 0,587563099222636 )
IEEE Trans Image Process - Incremental N-mode SVD for large-scale multilinear generative models. ( 0,585692755443349 )
J Am Med Inform Assoc - Reconciliation of the cloud computing model with US federal electronic health record regulations. ( 0,580273008809746 )
Med Biol Eng Comput - Application of the RIMARC algorithm to a large data set of action potentials and clinical parameters for risk prediction of atrial fibrillation. ( 0,580068266936471 )
J Chem Inf Model - Applicability domains for classification problems: Benchmarking of distance to models for Ames mutagenicity set. ( 0,577967865366326 )
J Chem Inf Model - Real external predictivity of QSAR models: how to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient. ( 0,577686813134172 )
J Chem Inf Model - Three useful dimensions for domain applicability in QSAR models using random forest. ( 0,577041990077336 )
Int J Health Geogr - Comparative analysis of remotely-sensed data products via ecological niche modeling of avian influenza case occurrences in Middle Eastern poultry. ( 0,575249498680981 )
J Biomed Inform - MysiRNA: improving siRNA efficacy prediction using a machine-learning model combining multi-tools and whole stacking energy (G). ( 0,574269039122362 )
Comput. Biol. Med. - Artificial neural network modelling of the results of tympanoplasty in chronic suppurative otitis media patients. ( 0,571122810334604 )
J Chem Inf Model - A critical assessment of combined ligand- and structure-based approaches to HERG channel blocker modeling. ( 0,569448040077292 )
IEEE Trans Pattern Anal Mach Intell - Specificity: A Graph-Based Estimator of Divergence. ( 0,569425827888025 )
Med Decis Making - Developing a tuberculosis transmission model that accounts for changes in population health. ( 0,569315577293061 )
AMIA Annu Symp Proc - Predicting the dengue incidence in Singapore using univariate time series models. ( 0,567632034604775 )
J. Med. Internet Res. - A case study of the New York City 2012-2013 influenza season with daily geocoded Twitter data from temporal and spatiotemporal perspectives. ( 0,567384745134322 )
Comput Biol Chem - Prediction of white cabbage (Brassica oleracea var. capitata) self-incompatibility based on neural network and discriminant analysis of complex electrophoretic patterns. ( 0,563957140885877 )
BMC Med Inform Decis Mak - Measuring preferences for analgesic treatment for cancer pain: how do African-Americans and Whites perform on choice-based conjoint (CBC) analysis experiments? ( 0,563872307042039 )
J Chem Inf Model - Impact of template choice on homology model efficiency in virtual screening. ( 0,563223657375147 )
J. Comput. Biol. - The complexity of the dirichlet model for multiple alignment data. ( 0,560999985768314 )
Comput Methods Programs Biomed - A predictive model of longitudinal, patient-specific colonoscopy results. ( 0,560254130197441 )
Int J Health Geogr - A linear programming model for preserving privacy when disclosing patient spatial information for secondary purposes. ( 0,557147027009707 )
J Chem Inf Model - Oversampling to overcome overfitting: exploring the relationship between data set composition, molecular descriptors, and predictive modeling methods. ( 0,555160676365376 )
J Chem Inf Model - Predicting myelosuppression of drugs from in silico models. ( 0,552807130382988 )
J Chem Inf Model - Prediction of linear cationic antimicrobial peptides based on characteristics responsible for their interaction with the membranes. ( 0,552089274408628 )
Comput Biol Chem - Predicting essential genes for identifying potential drug targets in Aspergillus fumigatus. ( 0,547226235338647 )
J Chem Inf Model - In silico prediction of aqueous solubility using simple QSPR models: the importance of phenol and phenol-like moieties. ( 0,54652049526656 )
Lifetime Data Anal - Analysis of cure rate survival data under proportional odds model. ( 0,545726864811161 )
IEEE Trans Image Process - Accurate multiple view 3D reconstruction using patch-based stereo for large-scale scenes. ( 0,543719163762441 )
J Chem Inf Model - Leave-cluster-out cross-validation is appropriate for scoring functions derived from diverse protein data sets. ( 0,54278394097479 )
J Chem Inf Model - Introducing conformal prediction in predictive modeling. A transparent and flexible alternative to applicability domain determination. ( 0,542759097090601 )
J Chem Inf Model - Statistical analysis and compound selection of combinatorial libraries for soluble epoxide hydrolase. ( 0,542178232139268 )
J Chem Inf Model - Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection. ( 0,540573279965352 )
J Chem Inf Model - Building a three-dimensional model of CYP2C9 inhibition using the Autocorrelator: an autonomous model generator. ( 0,53908327836007 )
Brief. Bioinformatics - Protein mass spectra data analysis for clinical biomarker discovery: a global review. ( 0,538906851805648 )
Int J Med Inform - Design and implementation of I2Vote--an interactive image-based voting system using windows mobile devices. ( 0,537011992640684 )
J Chem Inf Model - Estimation of carcinogenicity using molecular fragments tree. ( 0,533817855045522 )
J. Comput. Biol. - Boolean models can explain bistability in the lac operon. ( 0,533242443083733 )
J Chem Inf Model - Applicability Domain ANalysis (ADAN): a robust method for assessing the reliability of drug property predictions. ( 0,532968714474665 )
J Integr Bioinform - A new approach for modelling gene regulatory networks using fuzzy petri nets. ( 0,528111228135738 )
Methods Inf Med - Supporting regenerative medicine by integrative dimensionality reduction. ( 0,527753694581549 )
J Chem Inf Model - Four-dimensional structure-activity relationship model to predict HIV-1 integrase strand transfer inhibition using LQTA-QSAR methodology. ( 0,526825697226325 )
J Chem Inf Model - Classification of compounds with distinct or overlapping multi-target activities and diverse molecular mechanisms using emerging chemical patterns. ( 0,526655077031509 )
J Chem Inf Model - Kinase-kernel models: accurate in silico screening of 4 million compounds across the entire human kinome. ( 0,525416094930016 )
Comput Methods Programs Biomed - A 5-component mathematical model for salt-induced hypertension in Dahl-S and Dahl-R rats. ( 0,52514968096706 )
J Chem Inf Model - In silico prediction of total human plasma clearance. ( 0,523212671872747 )
Med Biol Eng Comput - Accelerometry-based prediction of movement dynamics for balance monitoring. ( 0,52305830052752 )
Brief. Bioinformatics - An empirical assessment of validation practices for molecular classifiers. ( 0,523033870104853 )
AMIA Annu Symp Proc - Ontology-based federated data access to human studies information. ( 0,522485409202495 )
Comput Methods Programs Biomed - Bayesian bivariate generalized Lindley model for survival data with a cure fraction. ( 0,522241788639832 )
Brief. Bioinformatics - Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies. ( 0,521435901889229 )
J Chem Inf Model - Using molecular features of xenobiotics to predict hepatic gene expression response. ( 0,520879295118847 )
J Chem Inf Model - Combined 3D-QSAR, molecular docking, and molecular dynamics study on piperazinyl-glutamate-pyridines/pyrimidines as potent P2Y12 antagonists for inhibition of platelet aggregation. ( 0,520556218136513 )
J Chem Inf Model - Development of novel 3D-QSAR combination approach for screening and optimizing B-Raf inhibitors in silico. ( 0,518181036685135 )
Sci Data - Comprehensive RNA-Seq transcriptomic profiling across 11 organs, 4 ages, and 2 sexes of Fischer 344 rats. ( 0,517978379268982 )
J Chem Inf Model - Beyond the scope of free-Wilson analysis. 2: Can distance encoded R-group fingerprints provide interpretable nonlinear models? ( 0,514067925077609 )
J Chem Inf Model - Coping with unbalanced class data sets in oral absorption models. ( 0,510510575196082 )
Med Biol Eng Comput - Optimal design of clinical tests for the identification of physiological models of type 1 diabetes in the presence of model mismatch. ( 0,510084652449468 )
J Am Med Inform Assoc - Use of a support vector machine for categorizing free-text notes: assessment of accuracy across two institutions. ( 0,506639657743835 )
J Chem Inf Model - In silico prediction of chemical Ames mutagenicity. ( 0,506196704925715 )
J Biomed Inform - Using PharmGKB to train text mining approaches for identifying potential gene targets for pharmacogenomic studies. ( 0,505190990419083 )
J Chem Inf Model - Robust scoring functions for protein-ligand interactions with quantum chemical charge models. ( 0,504430179260452 )
AMIA Annu Symp Proc - Selecting cases for whom additional tests can improve prognostication. ( 0,504233451100527 )
Comput. Biol. Med. - Quantification of contributions of molecular fragments for eye irritation of organic chemicals using QSAR study. ( 0,503551156870558 )
Brief. Bioinformatics - A dynamic framework for quantifying the genetic architecture of phenotypic plasticity. ( 0,503512380175771 )
J Chem Inf Model - Hsp90 inhibitors, part 1: definition of 3-D QSAutogrid/R models as a tool for virtual screening. ( 0,503185758863248 )
Comput. Biol. Med. - Comprehension of drug toxicity: software and databases. ( 0,502579227260302 )
J Biomed Inform - A framework to preserve the privacy of electronic health data streams. ( 0,50212845114293 )