J Biomed Inform - PREDOSE: a semantic web platform for drug abuse epidemiology using social media.


{ extract(1171) text(1153) clinic(932) }
{ health(1844) social(1437) communiti(874) }
{ drug(1928) target(777) effect(648) }
{ research(1218) medic(880) student(794) }
{ concept(1167) ontolog(924) domain(897) }
{ design(1359) user(1324) use(1319) }
{ spatial(1525) area(1432) region(1030) }
{ medic(1828) order(1363) alert(1069) }
{ chang(1828) time(1643) increas(1301) }
{ data(2317) use(1299) case(1017) }
{ process(1125) use(805) approach(778) }
{ data(1737) use(1416) pattern(1282) }
{ analysi(2126) use(1163) compon(1037) }
{ general(901) number(790) one(736) }
{ system(1050) medic(1026) inform(1018) }
{ health(3367) inform(1360) care(1135) }
{ studi(2440) review(1878) systemat(933) }
{ research(1085) discuss(1038) issu(1018) }
{ model(2656) set(1616) predict(1553) }
{ import(1318) role(1303) understand(862) }
{ structur(1116) can(940) graph(676) }
{ featur(3375) classif(2383) classifi(1994) }
{ treatment(1704) effect(941) patient(846) }
{ method(1557) propos(1049) approach(1037) }
{ perform(1367) use(1326) method(1137) }
{ studi(1119) effect(1106) posit(819) }
{ state(1844) use(1261) util(961) }
{ patient(1821) servic(1111) care(1106) }
{ estim(2440) model(1874) function(577) }
{ model(3404) distribut(989) bayesian(671) }
{ system(1976) rule(880) can(841) }
{ method(1219) similar(1157) match(930) }
{ imag(2830) propos(1344) filter(1198) }
{ imag(2675) segment(2577) method(1081) }
{ motion(1329) object(1292) video(1091) }
{ framework(1458) process(801) describ(734) }
{ data(1714) softwar(1251) tool(1186) }
{ featur(1941) imag(1645) propos(1176) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ visual(1396) interact(850) tool(830) }
{ patient(2837) hospit(1953) medic(668) }
{ gene(2352) biolog(1181) express(1162) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ use(976) code(926) identifi(902) }
{ result(1111) use(1088) new(759) }
{ survey(1388) particip(1329) question(1065) }
{ decis(3086) make(1611) patient(1517) }
{ can(774) often(719) complex(702) }
{ imag(1947) propos(1133) code(1026) }
{ inform(2794) health(2639) internet(1427) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ network(2748) neural(1063) input(814) }
{ patient(2315) diseas(1263) diabet(1191) }
{ take(945) account(800) differ(722) }
{ assess(1506) score(1403) qualiti(1306) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ learn(2355) train(1041) set(1003) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ case(1353) use(1143) diagnosi(1136) }
{ risk(3053) factor(974) diseas(938) }
{ perform(999) metric(946) measur(919) }
{ model(2341) predict(2261) use(1141) }
{ compound(1573) activ(1297) structur(1058) }
{ blood(1257) pressur(1144) flow(957) }
{ record(1888) medic(1808) patient(1693) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ age(1611) year(1155) adult(843) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ sampl(1606) size(1419) use(1276) }
{ data(3008) multipl(1320) sourc(1022) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ use(1733) differ(960) four(931) }
{ implement(1333) system(1263) develop(1122) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }
{ method(2212) result(1239) propos(1039) }
{ detect(2391) sensit(1101) algorithm(908) }


JECTIVES: The role of social media in biomedical knowledge mining, including clinical, medical and healthcare informatics, prescription drug abuse epidemiology and drug pharmacology, has become increasingly significant in recent years. Social media offers opportunities for people to share opinions and experiences freely in online communities, which may contribute information beyond the knowledge of domain professionals. This paper describes the development of a novel semantic web platform called PREDOSE (PREscription Drug abuse Online Surveillance and Epidemiology), which is designed to facilitate the epidemiologic study of prescription (and related) drug abuse practices using social media. PREDOSE uses web forum posts and domain knowledge, modeled in a manually created Drug Abuse Ontology (DAO--pronounced dow), to facilitate the extraction of semantic information from User Generated Content (UGC), through combination of lexical, pattern-based and semantics-based techniques. In a previous study, PREDOSE was used to obtain the datasets from which new knowledge in drug abuse research was derived. Here, we report on various platform enhancements, including an updated DAO, new components for relationship and triple extraction, and tools for content analysis, trend detection and emerging patterns exploration, which enhance the capabilities of the PREDOSE platform. Given these enhancements, PREDOSE is now more equipped to impact drug abuse research by alleviating traditional labor-intensive content analysis tasks.METHODS: Using custom web crawlers that scrape UGC from publicly available web forums, PREDOSE first automates the collection of web-based social media content for subsequent semantic annotation. The annotation scheme is modeled in the DAO, and includes domain specific knowledge such as prescription (and related) drugs, methods of preparation, side effects, and routes of administration. The DAO is also used to help recognize three types of data, namely: (1) entities, (2) relationships and (3) triples. PREDOSE then uses a combination of lexical and semantic-based techniques to extract entities and relationships from the scraped content, and a top-down approach for triple extraction that uses patterns expressed in the DAO. In addition, PREDOSE uses publicly available lexicons to identify initial sentiment expressions in text, and then a probabilistic optimization algorithm (from related research) to extract the final sentiment expressions. Together, these techniques enable the capture of fine-grained semantic information, which facilitate search, trend analysis and overall content analysis using social media on prescription drug abuse. Moreover, extracted data are also made available to domain experts for the creation of training and test sets for use in evaluation and refinements in information extraction techniques.RESULTS: A recent evaluation of the information extraction techniques applied in the PREDOSE platform indicates 85% precision and 72% recall in entity identification, on a manually created gold standard dataset. In another study, PREDOSE achieved 36% precision in relationship identification and 33% precision in triple extraction, through manual evaluation by domain experts. Given the complexity of the relationship and triple extraction tasks and the abstruse nature of social media texts, we interpret these as favorable initial results. Extracted semantic information is currently in use in an online discovery support system, by prescription drug abuse researchers at the Center for Interventions, Treatment and Addictions Research (CITAR) at Wright State University.CONCLUSION: A comprehensive platform for entity, relationship, triple and sentiment extraction from such abstruse texts has never been developed for drug abuse research. PREDOSE has already demonstrated the importance of mining social media by providing data from which new findings in drug abuse research were uncovered. Given the recent platform enhancements, including the refined DAO, components for relationship and triple extraction, and tools for content, trend and emerging pattern analysis, it is expected that PREDOSE will play a significant role in advancing drug abuse epidemiology in future.

Resumo Limpo

jectiv role social media biomed knowledg mine includ clinic medic healthcar informat prescript drug abus epidemiolog drug pharmacolog becom increas signific recent year social media offer opportun peopl share opinion experi freeli onlin communiti may contribut inform beyond knowledg domain profession paper describ develop novel semant web platform call predos prescript drug abus onlin surveil epidemiolog design facilit epidemiolog studi prescript relat drug abus practic use social media predos use web forum post domain knowledg model manual creat drug abus ontolog daopronounc dow facilit extract semant inform user generat content ugc combin lexic patternbas semanticsbas techniqu previous studi predos use obtain dataset new knowledg drug abus research deriv report various platform enhanc includ updat dao new compon relationship tripl extract tool content analysi trend detect emerg pattern explor enhanc capabl predos platform given enhanc predos now equip impact drug abus research allevi tradit laborintens content analysi tasksmethod use custom web crawler scrape ugc public avail web forum predos first autom collect webbas social media content subsequ semant annot annot scheme model dao includ domain specif knowledg prescript relat drug method prepar side effect rout administr dao also use help recogn three type data name entiti relationship tripl predos use combin lexic semanticbas techniqu extract entiti relationship scrape content topdown approach tripl extract use pattern express dao addit predos use public avail lexicon identifi initi sentiment express text probabilist optim algorithm relat research extract final sentiment express togeth techniqu enabl captur finegrain semant inform facilit search trend analysi overal content analysi use social media prescript drug abus moreov extract data also made avail domain expert creation train test set use evalu refin inform extract techniquesresult recent evalu inform extract techniqu appli predos platform indic precis recal entiti identif manual creat gold standard dataset anoth studi predos achiev precis relationship identif precis tripl extract manual evalu domain expert given complex relationship tripl extract task abstrus natur social media text interpret favor initi result extract semant inform current use onlin discoveri support system prescript drug abus research center intervent treatment addict research citar wright state universityconclus comprehens platform entiti relationship tripl sentiment extract abstrus text never develop drug abus research predos alreadi demonstr import mine social media provid data new find drug abus research uncov given recent platform enhanc includ refin dao compon relationship tripl extract tool content trend emerg pattern analysi expect predos will play signific role advanc drug abus epidemiolog futur

Resumos Similares

AMIA Annu Symp Proc - Extracting patient demographics and personal medical information from online health forums. ( 0,832716636156885 )
AMIA Annu Symp Proc - Pharmacovigilance on twitter? Mining tweets for adverse drug reactions. ( 0,769380502834624 )
AMIA Annu Symp Proc - Pattern mining for extraction of mentions of Adverse Drug Reactions from user comments. ( 0,753989709597112 )
J. Med. Internet Res. - Biomedical informatics techniques for processing and analyzing web blogs of military service members. ( 0,751754251210652 )
AMIA Annu Symp Proc - Evaluating health interest profiles extracted from patient-generated data. ( 0,714325481474722 )
J Am Med Inform Assoc - A classification approach to coreference in discharge summaries: 2011 i2b2 challenge. ( 0,706001347767993 )
J Am Med Inform Assoc - MITRE system for clinical assertion status classification. ( 0,692160480240734 )
J Am Med Inform Assoc - Extracting drug indication information from structured product labels using natural language processing. ( 0,690620668746717 )
AMIA Annu Symp Proc - Inter-annotator reliability of medical events, coreferences and temporal relations in clinical narratives by annotators with varying levels of clinical expertise. ( 0,6888058748638 )
AMIA Annu Symp Proc - A cloud-based approach to medical NLP. ( 0,687737182613306 )
J Am Med Inform Assoc - Anaphoric relations in the clinical narrative: corpus creation. ( 0,684751785064092 )
AMIA Annu Symp Proc - The cellular generation and a new risk environment: implications for texting-based sexual health promotion interventions among minority young men who have sex with men. ( 0,68011843407112 )
Int J Med Inform - A methodology to enhance spatial understanding of disease outbreak events reported in news articles. ( 0,678218589782311 )
Inform Health Soc Care - Language use in an internet support group for smoking cessation: development of sense of community. ( 0,677856145989661 )
J Biomed Inform - Approaches to verb subcategorization for biomedicine. ( 0,676632014348794 )
AMIA Annu Symp Proc - Semantic processing to identify adverse drug event information from black box warnings. ( 0,676391106563888 )
J Am Med Inform Assoc - Towards comprehensive syntactic and semantic annotations of the clinical narrative. ( 0,674785215853063 )
J Biomed Inform - Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives. ( 0,671601790463346 )
J Am Med Inform Assoc - Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification. ( 0,67046205519959 )
Appl Clin Inform - Representation of information about family relatives as structured data in electronic health records. ( 0,669364306303407 )
J Biomed Inform - Towards generating a patient's timeline: extracting temporal relationships from clinical notes. ( 0,668160721927478 )
J Biomed Inform - UMLS content views appropriate for NLP processing of the biomedical literature vs. clinical text. ( 0,666682681182226 )
J Biomed Inform - NCBI disease corpus: a resource for disease name recognition and concept normalization. ( 0,663683802278704 )
J. Med. Internet Res. - Web 2.0-based crowdsourcing for high-quality gold standard development in clinical natural language processing. ( 0,660549851114594 )
J Biomed Inform - Desiderata for ontologies to be used in semantic annotation of biomedical documents. ( 0,659135021896447 )
Comput Methods Programs Biomed - Marky: a tool supporting annotation consistency in multi-user and iterative document annotation projects. ( 0,658748787811147 )
Comput Math Methods Med - Ranking biomedical annotations with annotator's semantic relevancy. ( 0,658431737042266 )
J Med Syst - Redactable signatures for signed CDA Documents. ( 0,657180755226175 )
J Biomed Inform - Comparison of automated and human assignment of MeSH terms on publicly-available molecular datasets. ( 0,655711568879218 )
Int J Med Inform - Bootstrapping a de-identification system for narrative patient records: cost-performance tradeoffs. ( 0,652337698492181 )
J Am Med Inform Assoc - Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives. ( 0,651765190601235 )
J Am Med Inform Assoc - A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources. ( 0,649835824656866 )
J Biomed Inform - Anaphoric reference in clinical reports: characteristics of an annotated corpus. ( 0,649106655271762 )
J Biomed Inform - MedTime: a temporal information extraction system for clinical narratives. ( 0,645872773389973 )
BMC Med Inform Decis Mak - Text summarization as a decision support aid. ( 0,645698039075725 )
J Am Med Inform Assoc - An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge. ( 0,644757314820627 )
AMIA Annu Symp Proc - Mapping annotations with textual evidence using an scLDA model. ( 0,64464021141082 )
J Am Med Inform Assoc - Eventual situations for timeline extraction from clinical reports. ( 0,643468388907891 )
BMC Med Inform Decis Mak - Multi-topic assignment for exploratory navigation of consumer health information in NetWellness using formal concept analysis. ( 0,642778005599691 )
AMIA Annu Symp Proc - Throw the bath water out, keep the baby: keeping medically-relevant terms for text mining. ( 0,642460152496314 )
J Biomed Inform - Coreference resolution: a review of general methodologies and applications in the clinical domain. ( 0,641824013940861 )
J Am Med Inform Assoc - Assessing the role of a medication-indication resource in the treatment relation extraction from clinical text. ( 0,641497404895489 )
J Am Med Inform Assoc - Evaluating temporal relations in clinical text: 2012 i2b2 Challenge. ( 0,641082771354279 )
J Am Med Inform Assoc - A hybrid system for temporal information extraction from clinical text. ( 0,640808564736155 )
Health Info Libr J - Assessment of vaccination-related information for consumers available on Facebook. ( 0,639983331163929 )
J Biomed Inform - Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction. ( 0,639433502679957 )
J. Med. Internet Res. - Protected health information on social networking sites: ethical and legal considerations. ( 0,638767282646103 )
J Biomed Inform - A concept-driven biomedical knowledge extraction and visualization framework for conceptualization of text corpora. ( 0,638582875039145 )
J. Med. Internet Res. - Evaluating a web-based clinical decision support system for language disorders screening in a nursery school. ( 0,636195541756174 )
AMIA Annu Symp Proc - Building gold standard corpora for medical natural language processing tasks. ( 0,635618777819158 )
AMIA Annu Symp Proc - Semantic processing to identify adverse drug event information from black box warnings. ( 0,635504253459068 )
J Am Med Inform Assoc - A la Recherche du Temps Perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge. ( 0,634780685540341 )
J Am Med Inform Assoc - The effect of word familiarity on actual and perceived text difficulty. ( 0,633592711252053 )
J Am Med Inform Assoc - A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries. ( 0,631570010840326 )
Artif Intell Med - Biomedical events extraction using the hidden vector state model. ( 0,631234798106159 )
J Biomed Inform - Extraction of events and temporal expressions from clinical narratives. ( 0,631030665339668 )
J Am Med Inform Assoc - Assisted annotation of medical free text using RapTAT. ( 0,630473308143131 )
Brief. Bioinformatics - A survey on annotation tools for the biomedical literature. ( 0,629954013994412 )
J Biomed Inform - Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text. ( 0,629823367441584 )
AMIA Annu Symp Proc - Extracting Concepts Related to Homelessness from the Free Text of VA Electronic Medical Records. ( 0,629769237203387 )
J Integr Bioinform - PathJam: a new service for integrating biological pathway information. ( 0,628159562331245 )
IEEE J Biomed Health Inform - Network-based modeling and intelligent data mining of social media for improving care. ( 0,627965054418511 )
Perspect Health Inf Manag - A comparison of two approaches to text processing: facilitating chart reviews of radiology reports in electronic medical records. ( 0,626657147715259 )
AMIA Annu Symp Proc - Critical finding capture in the impression section of radiology reports. ( 0,626282331748319 )
J Biomed Inform - Unsupervised biomedical named entity recognition: experiments with clinical and biological texts. ( 0,623224100407631 )
J Biomed Inform - Relation mining experiments in the pharmacogenomics domain. ( 0,623040450095881 )
J Biomed Inform - Ontology modularization to improve semantic medical image annotation. ( 0,622493714592442 )
J. Med. Internet Res. - Implementing a virtual community of practice for family physician training: a mixed-methods case study. ( 0,622311187204208 )
AMIA Annu Symp Proc - Extracting semantic lexicons from discharge summaries using machine learning and the C-Value method. ( 0,62221108983876 )
IEEE Trans Vis Comput Graph - Social-Event-Driven Camera Control for Multicharacter Animations. ( 0,621703396725004 )
J. Med. Internet Res. - Use of twitter among local health departments: an analysis of information sharing, engagement, and action. ( 0,620968554794715 )
J Biomed Inform - Semantator: semantic annotator for converting biomedical text to linked data. ( 0,618857278987492 )
J Am Med Inform Assoc - An informatics agenda for public health: summarized recommendations from the 2011 AMIA PHI Conference. ( 0,618614734164468 )
J Biomed Inform - Common data model for natural language processing based on two existing standard information models: CDA+GrAF. ( 0,618304630101981 )
Appl Clin Inform - The use of smartphones on General Internal Medicine wards: a mixed methods study. ( 0,61786170152087 )
J Biomed Inform - Lessons learnt from the DDIExtraction-2013 Shared Task. ( 0,61758412287207 )
J Am Med Inform Assoc - MedXN: an open source medication extraction and normalization tool for clinical text. ( 0,616766917691253 )
J Integr Bioinform - Automatic extraction of microorganisms and their habitats from free text using text mining workflows. ( 0,616562963168778 )
AMIA Annu Symp Proc - A machine learning approach for identifying anatomical locations of actionable findings in radiology reports. ( 0,616076617015223 )
J. Med. Internet Res. - What are health-related users tweeting? A qualitative content analysis of health-related users and their messages on twitter. ( 0,614882368316123 )
J Am Med Inform Assoc - Automatic discourse connective detection in biomedical text. ( 0,614578230454409 )
AMIA Annu Symp Proc - ADESSA: A Real-Time Decision Support Service for Delivery of Semantically Coded Adverse Drug Event Data. ( 0,614504981252297 )
AMIA Annu Symp Proc - Automatically pairing measured findings across narrative abdomen CT reports. ( 0,613891169453476 )
J Am Med Inform Assoc - Evaluating the impact of pre-annotation on annotation speed and potential bias: natural language processing gold standard development for clinical named entity recognition in clinical trial announcements. ( 0,613790007650365 )
J Biomed Inform - The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions. ( 0,613358945745891 )
AMIA Annu Symp Proc - Natural language processing for lines and devices in portable chest x-rays. ( 0,613193997752526 )
J Am Med Inform Assoc - A flexible framework for recognizing events, temporal expressions, and temporal relations in clinical text. ( 0,612042468212177 )
Int J Med Inform - Detecting temporal expressions in medical narratives. ( 0,611904351291191 )
AMIA Annu Symp Proc - Towards a semantic lexicon for clinical natural language processing. ( 0,611595325063401 )
AMIA Annu Symp Proc - TagLine: Information Extraction for Semi-Structured Text in Medical Progress Notes. ( 0,611049800806586 )
AMIA Annu Symp Proc - Sophia: A Expedient UMLS Concept Extraction Annotator. ( 0,61074796138033 )
Comput. Biol. Med. - A P300-based brain computer interface system for words typing. ( 0,610093671589725 )
J. Med. Internet Res. - Would you tell everyone this? Facebook conversations as health promotion interventions. ( 0,609900716720718 )
AMIA Annu Symp Proc - Mining Biomedical Literature for Terms related to Epidemiologic Exposures. ( 0,608575082153512 )
AMIA Annu Symp Proc - Application of a temporal reasoning framework tool in analysis of medical device adverse events. ( 0,608521876708297 )
Inform Health Soc Care - Online conversations among Ontario university students: environmental concerns. ( 0,608343855258326 )
AMIA Annu Symp Proc - Natural language processing to extract follow-up provider information from hospital discharge summaries. ( 0,608249373021584 )
J Biomed Inform - Lexical patterns, features and knowledge resources for coreference resolution in clinical notes. ( 0,60777558887996 )
J. Med. Internet Res. - Patient-centered design of an information management module for a personally controlled health record. ( 0,606882263318168 )
BMC Med Inform Decis Mak - Detecting causality from online psychiatric texts using inter-sentential language patterns. ( 0,606534218865739 )