J. Med. Internet Res. - A case study of the New York City 2012-2013 influenza season with daily geocoded Twitter data from temporal and spatiotemporal perspectives.




BACKGROUND: Twitter has shown some usefulness in predicting influenza cases on a weekly basis in multiple countries and on different geographic scales. Recently, Broniatowski and colleagues suggested Twitter's relevance at the city level for New York City. Here, we look more closely at the case of New York City by analyzing daily Twitter data from temporal and spatiotemporal perspectives. Also, through manual coding of all tweets, we look to gain qualitative insights that can help direct future automated searches.

OBJECTIVE: The intent of the study was first to validate the temporal predictive strength of daily Twitter data for influenza-like illness emergency department (ILI-ED) visits during the New York City 2012-2013 influenza season against other available and established datasets (Google search query, or GSQ), and second, to examine the spatial distribution and the spread of geocoded tweets as proxies for potential cases.

METHODS: From the Twitter Streaming API, 2972 tweets were collected in the New York City region matching the keywords "flu", "influenza", "gripe", and "high fever". The tweets were categorized according to the scheme developed by Lamb et al. A new fourth category was added as an evaluator guess for the probability of the subject(s) being sick, to account for strength of confidence in the validity of the statement. Temporal correlations were made for tweets against daily ILI-ED visits and daily GSQ volume. The best models were used for linear regression for forecasting ILI visits. A weighted, retrospective Poisson model with SaTScan software (n=1484) and a vector map were used for spatiotemporal analysis.

RESULTS: Infection-related tweets (R=.763) correlated better than GSQ time series (R=.683) for the same keywords and had a lower mean average percent error (8.4 vs 11.8) for ILI-ED visit prediction in January, the most volatile month of flu. SaTScan identified a primary outbreak cluster of high-probability infection tweets with a 2.74 relative risk ratio compared to medium-probability infection tweets at P=.001 in Northern Brooklyn, in a radius that includes the Barclays Center and the Atlantic Avenue Terminal.

CONCLUSIONS: While others have looked at weekly regional tweets, this study is the first to stress test Twitter for daily city-level data for New York City. Extraction of personal testimonies of infection-related tweets suggests Twitter's strength both qualitatively and quantitatively for ILI-ED prediction compared to alternative daily datasets mixed with awareness-based data such as GSQ. Additionally, granular Twitter data provide important spatiotemporal insights. A tweet vector map may be useful for visualization of city-level spread when local gold standard data are otherwise unavailable.
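
The temporal part of the METHODS (correlating a daily tweet series with daily ILI-ED visits, fitting a linear regression, and scoring the forecast by percent error) can be illustrated with a minimal sketch. This is not the authors' code. The function names and the daily counts below are hypothetical, and the error metric is assumed to be the usual mean absolute percent error, since the abstract does not define its "mean average percent error".

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def fit_line(xs, ys):
    """Ordinary least-squares fit of ys = slope * xs + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return slope, intercept


def mean_abs_percent_error(actual, predicted):
    """Mean absolute percent error, in percent (assumed metric)."""
    return 100.0 * sum(abs(a - p) / a
                       for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical daily counts, invented purely for illustration.
daily_tweets = [3, 5, 4, 8, 12, 9, 7]
daily_ili_visits = [110, 125, 118, 150, 190, 160, 140]

r = pearson_r(daily_tweets, daily_ili_visits)
slope, intercept = fit_line(daily_tweets, daily_ili_visits)
predicted = [slope * t + intercept for t in daily_tweets]
error = mean_abs_percent_error(daily_ili_visits, predicted)
print(f"R={r:.3f}, MAPE={error:.1f}%")
```

The sketch scores the fit on the same days it was fitted to, which is only meant to show the arithmetic. A forecasting evaluation like the one described in the abstract would score predictions on days held out from the fit.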

Clean Abstract

background twitter shown use predict influenza case week basi multipl countri differ geograph scale recent broniatowski colleagu suggest twitter relev citylevel new york citi look dive deeper case new york citi analyz daili twitter data tempor spatiotempor perspect also manual code tweet look gain qualit insight can help direct futur autom search object intent studi first valid tempor predict strength daili twitter data influenzalik ill emerg depart ili visit new york citi influenza season avail establish dataset googl search queri gsq second examin spatial distribut spread geocod tweet proxi potenti case method twitter stream api tweet collect new york citi region match keyword flu influenza gripe high fever tweet categor accord scheme develop lamb et al new fourth categori ad evalu guess probabl subject sick account strength confid valid statement tempor correl made tweet daili ili visit daili gsq volum best model use linear regress forecast ili visit weight retrospect poisson model satscan softwar n vector map use spatiotempor analysi result infectionrel tweet r correl better gsq time seri r keyword lower mean averag percent error vs ili visit predict januari volatil month flu satscan identifi primari outbreak cluster highprob infect tweet relat risk ratio compar mediumprob infect tweet p northern brooklyn radius includ barclay center atlant avenu termin conclus other look week region tweet studi first stress test twitter daili citylevel data new york citi extract person testimoni infectionrel tweet suggest twitter strength qualit quantit ili predict compar altern daili dataset mix awarenessbas data gsq addit granular twitter data provid import spatiotempor insight tweet vectormap may use visual citylevel spread local gold standard data otherwis unavail

Similar Abstracts

Artif Intell Med - Training artificial neural networks directly on the concordance index for censored data using genetic algorithms. ( 0,73159718222746 )
AMIA Annu Symp Proc - Motivating the additional use of external validity: examining transportability in a model of glioblastoma multiforme. ( 0,728356853524579 )
J Chem Inf Model - Beyond the scope of Free-Wilson analysis: building interpretable QSAR models with machine learning algorithms. ( 0,711698366664964 )
J Chem Inf Model - Predicting pK(a) values of substituted phenols from atomic charges: comparison of different quantum mechanical methods and charge distribution schemes. ( 0,711555827614005 )
BMC Med Inform Decis Mak - Concordance and predictive value of two adverse drug event data sets. ( 0,702792429983698 )
J Am Med Inform Assoc - Harvest: an open platform for developing web-based biomedical data discovery and reporting applications. ( 0,699248711236312 )
J Chem Inf Model - Study of chromatographic retention of natural terpenoids by chemoinformatic tools. ( 0,690506939886877 )
J Chem Inf Model - iLOGP: a simple, robust, and efficient description of n-octanol/water partition coefficient for drug design using the GB/SA approach. ( 0,675039779751835 )
Med Biol Eng Comput - Cardiogoniometric parameters for detection of coronary artery disease at rest as a function of stenosis localization and distribution. ( 0,671409265712645 )
Spat Spatiotemporal Epidemiol - Spatial modelling of disease using data- and knowledge-driven approaches. ( 0,669895050838266 )
J Chem Inf Model - Time-split cross-validation as a method for estimating the goodness of prospective prediction. ( 0,664586179096881 )
J Chem Inf Model - RS-Predictor models augmented with SMARTCyp reactivities: robust metabolic regioselectivity predictions for nine CYP isozymes. ( 0,660850576966992 )
AMIA Annu Symp Proc - Effect of data combination on predictive modeling: a study using gene expression data. ( 0,659985855605048 )
Artif Intell Med - Improving predictive models of glaucoma severity by incorporating quality indicators. ( 0,65719708262475 )
J Chem Inf Model - Does rational selection of training and test sets improve the outcome of QSAR modeling? ( 0,653155468973813 )
Int J Health Geogr - Comparative analysis of remotely-sensed data products via ecological niche modeling of avian influenza case occurrences in Middle Eastern poultry. ( 0,645688607727547 )
Int J Health Geogr - Incorporating geographical factors with artificial neural networks to predict reference values of erythrocyte sedimentation rate. ( 0,644652704499259 )
BMC Med Inform Decis Mak - Measuring preferences for analgesic treatment for cancer pain: how do African-Americans and Whites perform on choice-based conjoint (CBC) analysis experiments? ( 0,642401266673675 )
J Chem Inf Model - GRID-based three-dimensional pharmacophores II: PharmBench, a benchmark data set for evaluating pharmacophore elucidation methods. ( 0,632851706005999 )
Comput Methods Programs Biomed - Kinetic modelling of haemodialysis removal of myoglobin in rhabdomyolysis patients. ( 0,632376412737173 )
J Chem Inf Model - Introducing conformal prediction in predictive modeling. A transparent and flexible alternative to applicability domain determination. ( 0,624218577256393 )
J Chem Inf Model - Comparative studies on some metrics for external validation of QSPR models. ( 0,621345595518111 )
Int J Health Geogr - A linear programming model for preserving privacy when disclosing patient spatial information for secondary purposes. ( 0,61987089131831 )
AMIA Annu Symp Proc - Predicting the dengue incidence in Singapore using univariate time series models. ( 0,61958984166619 )
J Chem Inf Model - Leave-cluster-out cross-validation is appropriate for scoring functions derived from diverse protein data sets. ( 0,617843351864811 )
J Chem Inf Model - Rank order entropy: why one metric is not enough. ( 0,616647205521273 )
J Chem Inf Model - Impact of template choice on homology model efficiency in virtual screening. ( 0,61597665122797 )
J Chem Inf Model - Pharmacophore assessment through 3-D QSAR: evaluation of the predictive ability on new derivatives by the application on a series of antitubercular agents. ( 0,614844530964326 )
J Biomed Inform - MysiRNA: improving siRNA efficacy prediction using a machine-learning model combining multi-tools and whole stacking energy (G). ( 0,608175085318657 )
J Chem Inf Model - Applicability domain based on ensemble learning in classification and regression analyses. ( 0,603483867252829 )
Comput Methods Programs Biomed - A predictive model of longitudinal, patient-specific colonoscopy results. ( 0,600438621559585 )
AMIA Annu Symp Proc - Advanced proficiency EHR training: effect on physicians' EHR efficiency, EHR satisfaction and job satisfaction. ( 0,597879312832315 )
J Chem Inf Model - Three useful dimensions for domain applicability in QSAR models using random forest. ( 0,597868076940874 )
Comput. Biol. Med. - A prediction model of substrates and non-substrates of breast cancer resistance protein (BCRP) developed by GA-CG-SVM method. ( 0,597591107905522 )
Artif Intell Med - A machine learning-based approach to prognostic analysis of thoracic transplantations. ( 0,597208661108041 )
J Chem Inf Model - Building a three-dimensional model of CYP2C9 inhibition using the Autocorrelator: an autonomous model generator. ( 0,596474421860995 )
J Chem Inf Model - Prediction of linear cationic antimicrobial peptides based on characteristics responsible for their interaction with the membranes. ( 0,594122290777169 )
J Chem Inf Model - In silico prediction of aqueous solubility using simple QSPR models: the importance of phenol and phenol-like moieties. ( 0,592219761558116 )
IEEE Trans Image Process - Incremental N-mode SVD for large-scale multilinear generative models. ( 0,59106168629965 )
Int J Comput Assist Radiol Surg - Assessing performance in brain tumor resection using a novel virtual reality simulator. ( 0,586189355938125 )
Comput Methods Programs Biomed - Modeling the glucose regulatory system in extreme preterm infants. ( 0,585261703912628 )
BMC Med Inform Decis Mak - Regression tree construction by bootstrap: model search for DRG-systems applied to Austrian health-data. ( 0,585258933127151 )
BMC Med Inform Decis Mak - Developing an algorithm to identify people with Chronic Obstructive Pulmonary Disease (COPD) using administrative data. ( 0,584349474652618 )
Med Decis Making - Developing a tuberculosis transmission model that accounts for changes in population health. ( 0,581451831730016 )
J Chem Inf Model - Estimation of carcinogenicity using molecular fragments tree. ( 0,578825891757219 )
Brief. Bioinformatics - Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies. ( 0,577127042891101 )
Int J Comput Assist Radiol Surg - Optimized order estimation for autoregressive models to predict respiratory motion. ( 0,576210409048603 )
J Chem Inf Model - Development of novel 3D-QSAR combination approach for screening and optimizing B-Raf inhibitors in silico. ( 0,575301046402022 )
Med Biol Eng Comput - Optimal design of clinical tests for the identification of physiological models of type 1 diabetes in the presence of model mismatch. ( 0,570885006113895 )
J Chem Inf Model - Four-dimensional structure-activity relationship model to predict HIV-1 integrase strand transfer inhibition using LQTA-QSAR methodology. ( 0,570677179963381 )
AMIA Annu Symp Proc - A natural language processing algorithm to define a venous thromboembolism phenotype. ( 0,570278827583276 )
J Am Med Inform Assoc - Choosing blindly but wisely: differentially private solicitation of DNA datasets for disease marker discovery. ( 0,567384745134322 )
J Chem Inf Model - Criterion for evaluating the predictive ability of nonlinear regression models without cross-validation. ( 0,566444054838387 )
Spat Spatiotemporal Epidemiol - Modeling the epidemic waves of AH1N1/09 influenza around the world. ( 0,565318080905939 )
J Am Med Inform Assoc - Use of a support vector machine for categorizing free-text notes: assessment of accuracy across two institutions. ( 0,563769326804134 )
Comput. Biol. Med. - Artificial neural network modelling of the results of tympanoplasty in chronic suppurative otitis media patients. ( 0,562159702450823 )
Med Biol Eng Comput - Application of the RIMARC algorithm to a large data set of action potentials and clinical parameters for risk prediction of atrial fibrillation. ( 0,55998390995624 )
J Chem Inf Model - Robust scoring functions for protein-ligand interactions with quantum chemical charge models. ( 0,559633294394583 )
J Chem Inf Model - Classification of compounds with distinct or overlapping multi-target activities and diverse molecular mechanisms using emerging chemical patterns. ( 0,557910720754741 )
J Chem Inf Model - In silico prediction of total human plasma clearance. ( 0,556914482056666 )
Int J Comput Assist Radiol Surg - Hybrid image visualization tool for 3D integration of CT coronary anatomy and quantitative myocardial perfusion PET. ( 0,55487659953585 )
AMIA Annu Symp Proc - Identifying Deviations from Usual Medical Care using a Statistical Approach. ( 0,553475396001669 )
J Chem Inf Model - Best of both worlds: combining pharma data and state of the art modeling technology to improve in Silico pKa prediction. ( 0,552963051028356 )
Artif Intell Med - Fuzzy model identification of dengue epidemic in Colombia based on multiresolution analysis. ( 0,552063323307473 )
Comput. Aided Surg. - Evaluation of a computational model to predict elbow range of motion. ( 0,548206896284106 )
Telemed J E Health - Measurement adherence in the blood pressure self-measurement room. ( 0,547424356078193 )
Appl Clin Inform - Development of an automated, real time surveillance tool for predicting readmissions at a community hospital. ( 0,547412706114428 )
BMC Med Inform Decis Mak - Sequential detection of influenza epidemics by the Kolmogorov-Smirnov test. ( 0,546235228250624 )
Artif Intell Med - Image partitioning and illumination in image-based pose detection for teleoperated flexible endoscopes. ( 0,544596937740622 )
J Chem Inf Model - Applicability Domain ANalysis (ADAN): a robust method for assessing the reliability of drug property predictions. ( 0,54434667477612 )
J Biomed Inform - Development of reusable logic for determination of statin exposure-time from electronic health records. ( 0,542694071508592 )
J Chem Inf Model - A multiscale simulation system for the prediction of drug-induced cardiotoxicity. ( 0,541517323083972 )
Comput. Biol. Med. - Quantification of contributions of molecular fragments for eye irritation of organic chemicals using QSAR study. ( 0,540828561772571 )
AMIA Annu Symp Proc - Ontology-based federated data access to human studies information. ( 0,539936646961146 )
J Chem Inf Model - QSAR models for P-glycoprotein transport based on a highly consistent data set. ( 0,538602350179805 )
J Biomed Inform - Transfer learning based clinical concept extraction on data from multiple sources. ( 0,536613981737335 )
Med Biol Eng Comput - Accelerometry-based prediction of movement dynamics for balance monitoring. ( 0,535704973643548 )
Comput Math Methods Med - Multiscale autoregressive identification of neuroelectrophysiological systems. ( 0,535010723654371 )
Methods Inf Med - Automating individualized coaching and authentic role-play practice for brief intervention training. ( 0,534717630859389 )
Neural Comput - Input statistics and Hebbian cross-talk effects. ( 0,534663754924812 )
J. Comput. Biol. - The complexity of the dirichlet model for multiple alignment data. ( 0,532948076244378 )
Int J Health Geogr - A validation of ground ambulance pre-hospital times modeled using geographic information systems. ( 0,531400351511008 )
J Chem Inf Model - A new approach to radial basis function approximation and its application to QSAR. ( 0,530336358152258 )
J Chem Inf Model - Statistical analysis and compound selection of combinatorial libraries for soluble epoxide hydrolase. ( 0,530040800268782 )
Spat Spatiotemporal Epidemiol - Spatial approximations of network-based individual level infectious disease models. ( 0,529308731112906 )
J Am Med Inform Assoc - Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure. ( 0,528182366070524 )
J. Comput. Biol. - Boolean models can explain bistability in the lac operon. ( 0,527450915020807 )
Lifetime Data Anal - Analysis of cure rate survival data under proportional odds model. ( 0,527275686011475 )
J Chem Inf Model - Analysis and study of molecule data sets using snowflake diagrams of weighted maximum common subgraph trees. ( 0,526721448546121 )
J Chem Inf Model - Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection. ( 0,526344078563442 )
AMIA Annu Symp Proc - Building and evaluating an ontology-based tool for reasoning about consent permission. ( 0,526275267744688 )
BMC Med Inform Decis Mak - Diagnosis-specific readmission risk prediction using electronic health data: a retrospective cohort study. ( 0,52447812612626 )
BMC Med Inform Decis Mak - Prediction models for short children born small for gestational age (SGA) covering the total growth phase. Analyses based on data from KIGS (Pfizer International Growth Database). ( 0,523571344156678 )
Lifetime Data Anal - Bayesian inference of the fully specified subdistribution model for survival data with competing risks. ( 0,522560731673841 )
J Chem Inf Model - Applicability domains for classification problems: Benchmarking of distance to models for Ames mutagenicity set. ( 0,520990188536085 )
Med Biol Eng Comput - Development of a comprehensive musculoskeletal model of the shoulder and elbow. ( 0,51964959569868 )
J Med Syst - Utilization of electronic medical records to build a detection model for surveillance of healthcare-associated urinary tract infections. ( 0,518966074890219 )
BMC Med Inform Decis Mak - Predicting length of stay from an electronic patient record system: a primary total knee replacement example. ( 0,518666289263764 )
Comput Methods Programs Biomed - Predicting body fat percentage based on gender, age and BMI by using artificial neural networks. ( 0,516092289285764 )
Med Biol Eng Comput - Validating motor unit firing patterns extracted by EMG signal decomposition. ( 0,515664090366963 )