J. Med. Internet Res. - P2P watch: personal health information detection in peer-to-peer file-sharing networks.

Tópicos

{ data(1714) softwar(1251) tool(1186) }
{ use(976) code(926) identifi(902) }
{ extract(1171) text(1153) clinic(932) }
{ ehr(2073) health(1662) electron(1139) }
{ signal(2180) analysi(812) frequenc(800) }
{ health(1844) social(1437) communiti(874) }
{ risk(3053) factor(974) diseas(938) }
{ gene(2352) biolog(1181) express(1162) }
{ imag(2675) segment(2577) method(1081) }
{ record(1888) medic(1808) patient(1693) }
{ model(2656) set(1616) predict(1553) }
{ process(1125) use(805) approach(778) }
{ model(3404) distribut(989) bayesian(671) }
{ system(1050) medic(1026) inform(1018) }
{ medic(1828) order(1363) alert(1069) }
{ error(1145) method(1030) estim(1020) }
{ design(1359) user(1324) use(1319) }
{ model(2220) cell(1177) simul(1124) }
{ general(901) number(790) one(736) }
{ perform(1367) use(1326) method(1137) }
{ studi(1119) effect(1106) posit(819) }
{ group(2977) signific(1463) compar(1072) }
{ system(1976) rule(880) can(841) }
{ sequenc(1873) structur(1644) protein(1328) }
{ featur(3375) classif(2383) classifi(1994) }
{ patient(2315) diseas(1263) diabet(1191) }
{ take(945) account(800) differ(722) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ chang(1828) time(1643) increas(1301) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ care(1570) inform(1187) nurs(1089) }
{ case(1353) use(1143) diagnosi(1136) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ visual(1396) interact(850) tool(830) }
{ compound(1573) activ(1297) structur(1058) }
{ blood(1257) pressur(1144) flow(957) }
{ sampl(1606) size(1419) use(1276) }
{ detect(2391) sensit(1101) algorithm(908) }
{ can(774) often(719) complex(702) }
{ imag(1947) propos(1133) code(1026) }
{ data(1737) use(1416) pattern(1282) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ method(1219) similar(1157) match(930) }
{ imag(2830) propos(1344) filter(1198) }
{ network(2748) neural(1063) input(814) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ framework(1458) process(801) describ(734) }
{ problem(2511) optim(1539) algorithm(950) }
{ learn(2355) train(1041) set(1003) }
{ concept(1167) ontolog(924) domain(897) }
{ method(1557) propos(1049) approach(1037) }
{ control(1307) perform(991) simul(935) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ featur(1941) imag(1645) propos(1176) }
{ studi(1410) differ(1259) use(1210) }
{ perform(999) metric(946) measur(919) }
{ research(1085) discuss(1038) issu(1018) }
{ import(1318) role(1303) understand(862) }
{ model(2341) predict(2261) use(1141) }
{ spatial(1525) area(1432) region(1030) }
{ health(3367) inform(1360) care(1135) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ cost(1906) reduc(1198) effect(832) }
{ data(3008) multipl(1320) sourc(1022) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ structur(1116) can(940) graph(676) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ use(1733) differ(960) four(931) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }
{ method(2212) result(1239) propos(1039) }

Resumo

CKGROUND: Users of peer-to-peer (P2P) file-sharing networks risk the inadvertent disclosure of personal health information (PHI). In addition to potentially causing harm to the affected individuals, this can heighten the risk of data breaches for health information custodians. Automated PHI detection tools that crawl the P2P networks can identify PHI and alert custodians. While there has been previous work on the detection of personal information in electronic health records, there has been a dearth of research on the automated detection of PHI in heterogeneous user files.OBJECTIVE: To build a system that accurately detects PHI in files sent through P2P file-sharing networks. The system, which we call P2P Watch, uses a pipeline of text processing techniques to automatically detect PHI in files exchanged through P2P networks. P2P Watch processes unstructured texts regardless of the file format, document type, and content.METHODS: We developed P2P Watch to extract and analyze PHI in text files exchanged on P2P networks. We labeled texts as PHI if they contained identifiable information about a person (eg, name and date of birth) and specifics of the person's health (eg, diagnosis, prescriptions, and medical procedures). We evaluated the system's performance through its efficiency and effectiveness on 3924 files gathered from three P2P networks.RESULTS: P2P Watch successfully processed 3924 P2P files of unknown content. A manual examination of 1578 randomly selected files marked by the system as non-PHI confirmed that these files indeed did not contain PHI, making the false-negative detection rate equal to zero. Of 57 files marked by the system as PHI, all contained both personally identifiable information and health information: 11 files were PHI disclosures, and 46 files contained organizational materials such as unfilled insurance forms, job applications by medical professionals, and essays.CONCLUSIONS: PHI can be successfully detected in free-form textual files exchanged through P2P networks. Once the files with PHI are detected, affected individuals or data custodians can be alerted to take remedial action.

Resumo Limpo

ckground user peertop pp fileshar network risk inadvert disclosur person health inform phi addit potenti caus harm affect individu can heighten risk data breach health inform custodian autom phi detect tool crawl pp network can identifi phi alert custodian previous work detect person inform electron health record dearth research autom detect phi heterogen user filesobject build system accur detect phi file sent pp fileshar network system call pp watch use pipelin text process techniqu automat detect phi file exchang pp network pp watch process unstructur text regardless file format document type contentmethod develop pp watch extract analyz phi text file exchang pp network label text phi contain identifi inform person eg name date birth specif person health eg diagnosi prescript medic procedur evalu system perform effici effect file gather three pp networksresult pp watch success process pp file unknown content manual examin random select file mark system nonphi confirm file inde contain phi make falseneg detect rate equal zero file mark system phi contain person identifi inform health inform file phi disclosur file contain organiz materi unfil insur form job applic medic profession essaysconclus phi can success detect freeform textual file exchang pp network file phi detect affect individu data custodian can alert take remedi action

Resumos Similares

J Integr Bioinform - PathJam: a new service for integrating biological pathway information. ( 0,718550518184493 )
J Am Med Inform Assoc - Implementation and management of a biomedical observation dictionary in a large healthcare information system. ( 0,685081533292016 )
J Biomed Inform - Common data model for natural language processing based on two existing standard information models: CDA+GrAF. ( 0,652621431372319 )
IEEE J Biomed Health Inform - Service for the pseudonymization of electronic healthcare records based on ISO/EN 13606 for the secondary use of information. ( 0,648913794891271 )
J Biomed Inform - Using the ResearchEHR platform to facilitate the practical application of the EHR standards. ( 0,646781562047162 )
BMC Med Inform Decis Mak - Measuring diversity in medical reports based on categorized attributes and international classification systems. ( 0,638203104023511 )
Int J Med Inform - The MITRE Identification Scrubber Toolkit: design, training, and assessment. ( 0,634980469030533 )
J Am Med Inform Assoc - Diagnosis code assignment: models and evaluation metrics. ( 0,621262406245458 )
J Med Syst - A generative tool for building health applications driven by ISO 13606 archetypes. ( 0,609271711549259 )
J Chem Inf Model - CYANOS: a data management system for natural product drug discovery efforts using cultured microorganisms. ( 0,606496526109924 )
J. Med. Internet Res. - How strong are passwords used to protect personal health information in clinical trials? ( 0,602985970129083 )
J Am Med Inform Assoc - Towards comprehensive syntactic and semantic annotations of the clinical narrative. ( 0,600427537682819 )
AMIA Annu Symp Proc - Measuring the Information Gain of Diagnosis vs. Diagnosis Category Coding. ( 0,586714599943816 )
J Am Med Inform Assoc - Taking advantage of continuity of care documents to populate a research repository. ( 0,585230215873797 )
J Am Med Inform Assoc - Developing and evaluating an automated appendicitis risk stratification algorithm for pediatric patients in the emergency department. ( 0,583343396335736 )
Comput Math Methods Med - A mixture modeling framework for differential analysis of high-throughput data. ( 0,578177060305177 )
AMIA Annu Symp Proc - Tools for improving the characterization and visualization of changes in neuro-oncology patients. ( 0,575525112587923 )
J Chem Inf Model - 3-D QSAutogrid/R: an alternative procedure to build 3-D QSAR models. Methodologies and applications. ( 0,574977875889772 )
J Am Med Inform Assoc - Exploiting domain information for Word Sense Disambiguation of medical documents. ( 0,574768910416956 )
AMIA Annu Symp Proc - Informing standard development and understanding user needs with omaha system signs and symptoms text entries in community-based care settings. ( 0,571861989905435 )
Comput Methods Programs Biomed - PKSolver: An add-in program for pharmacokinetic and pharmacodynamic data analysis in Microsoft Excel. ( 0,56367320188716 )
Artif Intell Med - Conceptual-driven classification for coding advise in health insurance reimbursement. ( 0,561880625808619 )
J Am Med Inform Assoc - Bionimbus: a cloud for managing, analyzing and sharing large genomics datasets. ( 0,558351525612397 )
Int J Health Geogr - Open-Source web-based Geographical Information System for health exposure assessment. ( 0,554308451933502 )
AMIA Annu Symp Proc - Impact of selective mapping strategies on automated laboratory result notification to public health authorities. ( 0,5526260654711 )
Comput Methods Programs Biomed - BioAnnote: a software platform for annotating biomedical documents with application in medical learning environments. ( 0,551861720038338 )
Methods Inf Med - Frequency analysis of medical concepts in clinical trials and their coverage in MeSH and SNOMED-CT. ( 0,551516004993432 )
Int J Med Inform - A review of ECG storage formats. ( 0,547457491135845 )
Methods Inf Med - Secure Secondary Use of Clinical Data with Cloud-based NLP Services. Towards a Highly Scalable Research Infrastructure. ( 0,544246573860318 )
Appl Clin Inform - Structuring clinical workflows for diabetes care: an overview of the OntoHealth approach. ( 0,543642966203921 )
AMIA Annu Symp Proc - The SHARPn project on secondary use of Electronic Medical Record data: progress, plans, and possibilities. ( 0,543575756072189 )
AMIA Annu Symp Proc - Improving Search for Evidence-based Practice using Information Extraction. ( 0,540130238882804 )
J Biomed Inform - Using an ensemble system to improve concept extraction from clinical records. ( 0,536373428979033 )
Brief. Bioinformatics - Bioinformatics tools and database resources for systems genetics analysis in mice--a short review and an evaluation of future needs. ( 0,535710050377914 )
J Am Med Inform Assoc - A corpus-based approach for automated LOINC mapping. ( 0,534105909978644 )
Comput Methods Programs Biomed - Social Web mining and exploitation for serious applications: Technosocial Predictive Analytics and related technologies for public health, environmental and national security surveillance. ( 0,533995617570453 )
Appl Clin Inform - Design and multicentric implementation of a generic software architecture for patient recruitment systems re-using existing HIS tools and routine patient data. ( 0,53324785095958 )
AMIA Annu Symp Proc - Discovering peripheral arterial disease cases from radiology notes using natural language processing. ( 0,53263780428484 )
Sci Data - Building the graph of medicine from millions of clinical narratives. ( 0,531947813698304 )
J Integr Bioinform - Automatic extraction of microorganisms and their habitats from free text using text mining workflows. ( 0,531835554551239 )
Methods Inf Med - Piloting the EHR4CR feasibility platform across Europe. ( 0,530952389862756 )
Int J Med Inform - Detecting temporal expressions in medical narratives. ( 0,528461341301423 )
J Med Syst - A data types profile suitable for use with ISO EN 13606. ( 0,525441074350069 )
J Am Med Inform Assoc - MITRE system for clinical assertion status classification. ( 0,523394963764119 )
Brief. Bioinformatics - Architecture for interoperable software in biology. ( 0,523334843765156 )
J Chem Inf Model - hERG me out. ( 0,521894578844343 )
Methods Inf Med - LOINC in prehospital emergency medicine in Germany - experience of the `DIRK?-project. ( 0,521271203837445 )
BMC Med Inform Decis Mak - Applying representational state transfer (REST) architecture to archetype-based electronic health record systems. ( 0,521102664389889 )
J Am Med Inform Assoc - Mapping local laboratory interface terms to LOINC at a German university hospital using RELMA V.5: a semi-automated approach. ( 0,520837799129447 )
Methods Inf Med - Development of ICF code selection tools for mental health care. ( 0,520714508010198 )
Appl Clin Inform - Implementing SNOMED CT for Quality Reporting: Avoiding Pitfalls. ( 0,520476378012998 )
J Med Syst - Impact of the Patient-Reported Outcomes Management Information System (PROMIS) upon the design and operation of multi-center clinical trials: a qualitative research study. ( 0,519760451838074 )
J Biomed Inform - A concept-driven biomedical knowledge extraction and visualization framework for conceptualization of text corpora. ( 0,519252368377362 )
J Am Med Inform Assoc - A nationwide medication incidents reporting system in The Netherlands. ( 0,516940155779259 )
J Am Med Inform Assoc - Automatic discourse connective detection in biomedical text. ( 0,516470815424969 )
J Biomed Inform - The Analytic Information Warehouse (AIW): a platform for analytics using electronic health record data. ( 0,515853666199048 )
Sci Data - Metabolic differences in ripening of Solanum lycopersicum 'Ailsa Craig' and three monogenic mutants. ( 0,513795221544612 )
AMIA Annu Symp Proc - Generalizability and comparison of automatic clinical text de-identification methods and resources. ( 0,51280865565253 )
J Am Med Inform Assoc - Extracting drug indication information from structured product labels using natural language processing. ( 0,512406976803494 )
Appl Clin Inform - Unlocking Data for Clinical Research - The German i2b2 Experience. ( 0,512057287020665 )
J Am Med Inform Assoc - Identifying clinical/translational research cohorts: ascertainment via querying an integrated multi-source database. ( 0,511240044087109 )
J Biomed Inform - Using patient lists to add value to integrated data repositories. ( 0,510890337487356 )
J Biomed Inform - An approach to improve LOINC mapping through augmentation of local test names. ( 0,510157522416952 )
J Med Syst - Redactable signatures for signed CDA Documents. ( 0,509658358513122 )
AMIA Annu Symp Proc - Developing a section labeler for clinical documents. ( 0,509235363182889 )
AMIA Annu Symp Proc - Problem management module: an innovative system to improve problem list workflow. ( 0,509192145343497 )
J Integr Bioinform - BacillOndex: an integrated data resource for systems and synthetic biology. ( 0,508903341576879 )
Brief. Bioinformatics - Translational research platforms integrating clinical and omics data: a review of publicly available solutions. ( 0,508794815359717 )
J Am Med Inform Assoc - ccML, a new mark-up language to improve ISO/EN 13606-based electronic health record extracts practical edition. ( 0,50873786124692 )
J Biomed Inform - Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses. ( 0,508728764550126 )
J Am Med Inform Assoc - Development and evaluation of an ensemble resource linking medications to their indications. ( 0,508679218953948 )
AMIA Annu Symp Proc - It's about this and that: a description of anaphoric expressions in clinical text. ( 0,50836861903684 )
Comput Biol Chem - Circular code motifs in transfer RNAs. ( 0,508249341452094 )
AMIA Annu Symp Proc - Automatically detecting problem list omissions of type 2 diabetes cases using electronic medical records. ( 0,50769651877882 )
J Biomed Inform - Building a robust, scalable and standards-driven infrastructure for secondary use of EHR data: the SHARPn project. ( 0,506956180813609 )
J Chem Inf Model - ZINC: a free tool to discover chemistry for biology. ( 0,50639881862721 )
Med Biol Eng Comput - The Ornstein-Uhlenbeck third-order Gaussian process (OUGP) applied directly to the un-resampled heart rate variability (HRV) tachogram for detrending and low-pass filtering. ( 0,505368322506059 )
Brief. Bioinformatics - The NGS WikiBook: a dynamic collaborative online training effort with long-term sustainability. ( 0,505169309051159 )
Comput Methods Programs Biomed - Raw data extraction from electrocardiograms with Portable Document Format. ( 0,505083520018923 )
AMIA Annu Symp Proc - Enabling cross-platform clinical decision support through Web-based decision support in commercial electronic health record systems: proposal and evaluation of initial prototype implementations. ( 0,504598857313871 )
AMIA Annu Symp Proc - Extracting patient demographics and personal medical information from online health forums. ( 0,504100408515055 )
J Chem Inf Model - Chemical name to structure: OPSIN, an open source solution. ( 0,503980602406612 )
BMC Med Inform Decis Mak - Text data extraction for a prospective, research-focused data mart: implementation and validation. ( 0,503797009597924 )
Brief. Bioinformatics - Probe mapping across multiple microarray platforms. ( 0,502936885355826 )
AMIA Annu Symp Proc - Hedging their mets: the use of uncertainty terms in clinical documents and its potential implications when sharing the documents with patients. ( 0,502472803657632 )
BMC Med Inform Decis Mak - GenDrux: a biomedical literature search system to identify gene expression-based drug sensitivity in breast cancer. ( 0,502359676426383 )
AMIA Annu Symp Proc - ASLForm: an adaptive self learning medical form generating system. ( 0,501941717090402 )
J Biomed Inform - Lexical patterns, features and knowledge resources for coreference resolution in clinical notes. ( 0,501829852205013 )
J Biomed Inform - Deriving consumer-facing disease concepts for family health histories using multi-source sampling. ( 0,501511989195752 )
J Biomed Inform - The clinician in the Driver's Seat: part 1 - a drag/drop user-composable electronic health record platform. ( 0,501438458278549 )
J Am Med Inform Assoc - Eventual situations for timeline extraction from clinical reports. ( 0,501428785491301 )
AMIA Annu Symp Proc - A cloud-based approach to medical NLP. ( 0,500769280596593 )
J Biomed Inform - Clustering-based methodology for analyzing near-miss reports and identifying risks in healthcare delivery. ( 0,500433462552601 )
J Am Med Inform Assoc - MedXN: an open source medication extraction and normalization tool for clinical text. ( 0,500282026035076 )
BMC Med Inform Decis Mak - XML-BSPM: an XML format for storing Body Surface Potential Map recordings. ( 0,500003132616173 )
Methods Inf Med - A characterization of local LOINC mapping for laboratory tests in three large institutions. ( 0,499932148203339 )
Appl Clin Inform - Representation of information about family relatives as structured data in electronic health records. ( 0,499611416424405 )
BMC Med Inform Decis Mak - The freetext matching algorithm: a computer program to extract diagnoses and causes of death from unstructured text in electronic health records. ( 0,499160691522594 )
J Biomed Inform - Automatic recognition of disorders, findings, pharmaceuticals and body structures from clinical text: an annotation and machine learning study. ( 0,49872213769208 )
J Chem Inf Model - Managing the computational chemistry big data problem: the ioChem-BD platform. ( 0,49858023511833 )