During infectious disease outbreaks, data collected through health institutions and official reporting structures may not be available for weeks, hindering early epidemiologic assessment. By contrast, data from informal media are typically available in near real-time and could provide earlier estimates of epidemic dynamics. We assessed correlation of volume of cholera-related HealthMap news media reports, Twitter postings, and government cholera cases reported in the first 100 days of the 2010 Haitian cholera outbreak. Trends in volume of informal sources significantly correlated in time with official case data and was available up to 2 weeks earlier. Estimates of the reproductive number ranged from 1.54 to 6.89 (informal sources) and 1.27 to 3.72 (official sources) during the initial outbreak growth period, and 1.04 to 1.51 (informal) and 1.06 to 1.73 (official) when Hurricane Tomas afflicted Haiti. Informal data can be used complementarily with official data in an outbreak setting to get timely estimates of disease dynamics.
BACKGROUND: Recent legislation in the US requires that all medical records become electronic over the next decade. In addition, ongoing developments in patient-oriented care, most notably with the advent of health social networking and personal health records, provide a plethora of new information sources for research. CONTENT: Electronic health records (EHRs) show great potential for use in observational studies to examine drug safety via pharmacovigiliance methods that can find adverse drug events as well as expand drug safety profiles. EHRs also show promise for head-to-head comparative effectiveness trials and could play a critical role in secondary and tertiary diabetes prevention efforts. A growing subset of EHRs, personal health records (PHRs), opens up the possibility of engaging patients in their care, as well as new opportunities for participatory research and personalized medicine. Organizations nationwide, from providers to employers, are already investing heavily in PHR systems. Additionally, the explosive use of online social networking sites and mobile technologies will undoubtedly play a role in future research efforts by making available a veritable flood of information, such as real-time exercise monitoring, to health researchers. SUMMARY: The future confluence of health information technologies will enable researchers and clinicians to reveal novel therapies and insights into treatments and disease management, as well as environmental and genomic interactions, at an unprecedented population scale.
In: Bulletin of the World Health Organization: the international journal of public health = Bulletin de l'Organisation Mondiale de la Santé, Band 96, Heft 1, S. 66-68
Background: Alternative data sources are used increasingly to augment traditional public health surveillance systems. Examples include over-the-counter medication sales and school absenteeism. Objective: We sought to determine if an increase in restaurant table availabilities was associated with an increase in disease incidence, specifically influenza-like illness (ILI). Methods: Restaurant table availability was monitored using OpenTable, an online restaurant table reservation site. A daily search was performed for restaurants with available tables for 2 at the hour and at half past the hour for 22 distinct times: between 11:00 am-3:30 pm for lunch and between 6:00-11:30 PM for dinner. In the United States, we examined table availability for restaurants in Boston, Atlanta, Baltimore, and Miami. For Mexico, we studied table availabilities in Cancun, Mexico City, Puebla, Monterrey, and Guadalajara. Time series of restaurant use was compared with Google Flu Trends and ILI at the state and national levels for the United States and Mexico using the cross-correlation function. Results: Differences in restaurant use were observed across sampling times and regions. We also noted similarities in time series trends between data on influenza activity and restaurant use. In some settings, significant correlations greater than 70% were noted between data on restaurant use and ILI trends. Conclusions: This study introduces and demonstrates the potential value of restaurant use data for event surveillance. ; Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center (DoI/NBC) [D12PC000337] ; We thank Sumiko Mekaru for suggestions regarding data analysis. This work is partially supported by a research grant the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center (DoI/NBC) contract number D12PC000337. The US Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of IARPA, DoI/NBC, or the US Government.
Background: Global outbreak detection and reporting have generally improved for a variety of infectious diseases and geographic regions in recent decades. Nevertheless, lags in outbreak reporting remain a threat to the global human health and economy. In the time between first occurrence of a novel disease incident and public notification of an outbreak, infected individuals have a greater possibility of traveling and spreading the pathogen to other nations. Shortening outbreak reporting lags has the potential to improve global health by preventing local outbreaks from escalating into global epidemics. Methods: Reporting lags between the first record and the first public report of an event were calculated for 318 outbreaks occurring 1996-2009. The influence of freedom of the press, Internet usage, per capita health expenditure, and cell phone subscriptions, on the timeliness of outbreak reporting was evaluated. Results: Freer presses and increasing Internet usage correlate with reduced time between the first record of an outbreak and the public report. Increasing Internet usage reduced the expected reporting lag from more than one month in nations without Internet users to one day in those where 75 of 100 people use the Internet. Conclusion: Advances in technology and the emergence of more open and free governments are associated with to improved global infectious disease surveillance.
Dengue is a common and growing problem worldwide, with an estimated 70–140 million cases per year. Traditional, healthcare-based, government-implemented dengue surveillance is resource intensive and slow. As global Internet use has increased, novel, Internet-based disease monitoring tools have emerged. Google Dengue Trends (GDT) uses near real-time search query data to create an index of dengue incidence that is a linear proxy for traditional surveillance. Studies have shown that GDT correlates highly with dengue incidence in multiple countries on a large spatial scale. This study addresses the heterogeneity of GDT at smaller spatial scales, assessing its accuracy at the state-level in Mexico and identifying factors that are associated with its accuracy. We used Pearson correlation to estimate the association between GDT and traditional dengue surveillance data for Mexico at the national level and for 17 Mexican states. Nationally, GDT captured approximately 83% of the variability in reported cases over the 9 study years. The correlation between GDT and reported cases varied from state to state, capturing anywhere from 1% of the variability in Baja California to 88% in Chiapas, with higher accuracy in states with higher dengue average annual incidence. A model including annual average maximum temperature, precipitation, and their interaction accounted for 81% of the variability in GDT accuracy between states. This climate model was the best indicator of GDT accuracy, suggesting that GDT works best in areas with intense transmission, particularly where local climate is well suited for transmission. Internet accessibility (average ∼36%) did not appear to affect GDT accuracy. While GDT seems to be a less robust indicator of local transmission in areas of low incidence and unfavorable climate, it may indicate cases among travelers in those areas. Identifying the strengths and limitations of novel surveillance is critical for these types of data to be used to make public health decisions and forecasting models.
As policies are adjusted throughout the COVID-19 pandemic according to public health best practices, there is a need to balance the importance of social distancing in preventing viral spread with the strain that these governmental public safety mandates put on public mental health. Thus, there is need for continuous observation of public sentiment and deliberation to inform further adaptation of mandated interventions. In this study, we explore how public response may be reflected in Massachusetts (MA) via social media by specifically exploring temporal patterns in Twitter posts (tweets) regarding sentiment and discussion of topics. We employ interrupted time series centered on (1) Massachusetts State of Emergency declaration (March 10), (2) US State of Emergency declaration (March 13) and (3) Massachusetts public school closure (March 17) to explore changes in tweet sentiment polarity (net negative/positive), expressed anxiety and discussion on risk and health topics on a random subset of all tweets coded within Massachusetts and published from January 1 to May 15, 2020 (n=2.8 million). We find significant differences between tweets published before and after mandate enforcement for Massachusetts State of Emergency (increased discussion of risk and health, decreased polarity and increased anxiety expression), US State of Emergency (increased discussion of risk and health, and increased anxiety expression) and Massachusetts public school closure (increased discussion of risk and decreased polarity). Our work further validates that Twitter data is a reasonable way to monitor public sentiment and discourse within a crisis, especially in conjunction with other observation data.
As policies are adjusted throughout the COVID-19 pandemic according to public health best practices, there is a need to balance the importance of social distancing in preventing viral spread with the strain that these governmental public safety mandates put on public mental health. Thus, there is need for continuous observation of public sentiment and deliberation to inform further adaptation of mandated interventions. In this study, we explore how public response may be reflected in Massachusetts (MA) via social media by specifically exploring temporal patterns in Twitter posts (tweets) regarding sentiment and discussion of topics. We employ interrupted time series centered on (1) Massachusetts State of Emergency declaration (March 10), (2) US State of Emergency declaration (March 13) and (3) Massachusetts public school closure (March 17) to explore changes in tweet sentiment polarity (net negative/positive), expressed anxiety and discussion on risk and health topics on a random subset of all tweets coded within Massachusetts and published from January 1 to May 15, 2020 (n = 2.8 million). We find significant differences between tweets published before and after mandate enforcement for Massachusetts State of Emergency (increased discussion of risk and health, decreased polarity and increased anxiety expression), US State of Emergency (increased discussion of risk and health, and increased anxiety expression) and Massachusetts public school closure (increased discussion of risk and decreased polarity). Our work further validates that Twitter data is a reasonable way to monitor public sentiment and discourse within a crisis, especially in conjunction with other observation data.
Widespread availability geographic information systems (GIS) software has facilitated the use health mapping in both academia and government. Maps that display patients as points are often exchanged in public forums (journals, meetings, web). However, even these low resolution maps may reveal confidential patient location information. In this report, we describe a method to test whether privacy is being breached. We reverse geocode from maps with cases and describe the accuracy with which patient addresses can be extracted.
PurposeThe purpose was to use Twitter to conduct online surveillance of negative sentiment towards Mexicans and Hispanics during the 2016 United States presidential election, and to examine its relationship with mental well-being in this targeted group at the population level.MethodsTweets containing the terms Mexican(s) and Hispanic(s) were collected within a 20-week period of the 2016 United States presidential election (November 9th 2016). Sentiment analysis was used to capture percent negative tweets. A time series lag regression model was used to examine the association between percent count of negative tweets mentioning Mexicans and Hispanics and percent count of worry among Hispanic Gallup poll respondents.ResultsOf 2,809,641 tweets containing terms Mexican(s) and Hispanic(s), 687,291 tweets were negative. Among 8,314 Hispanic Gallup respondents, a mean of 33.5% responded to be worried on a daily basis. A significant lead time of 1 week was observed, showing that negative tweets mentioning Mexicans and Hispanics appeared to forecast daily worry among Hispanics by 1 week.ConclusionSurveillance of online negative sentiment towards racially vulnerable population groups can be captured using social media. This has potential to identify early warning signals for symptoms of mental well-being among targeted groups at the population level.
PURPOSE: The purpose was to use Twitter to conduct online surveillance of negative sentiment towards Mexicans and Hispanics during the 2016 United States presidential election, and to examine its relationship with mental well-being in this targeted group at the population level. METHODS: Tweets containing the terms Mexican(s) and Hispanic(s) were collected within a 20-week period of the 2016 United States presidential election (November 9th 2016). Sentiment analysis was used to capture percent negative tweets. A time series lag regression model was used to examine the association between percent count of negative tweets mentioning Mexicans and Hispanics and percent count of worry among Hispanic Gallup poll respondents. RESULTS: Of 2,809,641 tweets containing terms Mexican(s) and Hispanic(s), 687,291 tweets were negative. Among 8,314 Hispanic Gallup respondents, a mean of 33.5% responded to be worried on a daily basis. A significant lead time of 1 week was observed, showing that negative tweets mentioning Mexicans and Hispanics appeared to forecast daily worry among Hispanics by 1 week. CONCLUSION: Surveillance of online negative sentiment towards racially vulnerable population groups can be captured using social media. This has potential to identify early warning signals for symptoms of mental well-being among targeted groups at the population level.
Background: Hurricane Isaac made landfall in southeastern Louisiana in late August 2012, resulting in extensive storm surge and inland flooding. As the lead federal agency responsible for medical and public health response and recovery coordination, the Department of Health and Human Services (HHS) must have situational awareness to prepare for and address state and local requests for assistance following hurricanes. Both traditional and non-traditional data have been used to improve situational awareness in fields like disease surveillance and seismology. This study investigated whether non-traditional data (i.e., tweets and news reports) fill a void in traditional data reporting during hurricane response, as well as whether non-traditional data improve the timeliness for reporting identified HHS Essential Elements of Information (EEI). Methods: HHS EEIs provided the information collection guidance, and when the information indicated there was a potential public health threat, an event was identified and categorized within the larger scope of overall Hurricane Issac situational awareness. Tweets, news reports, press releases, and federal situation reports during Hurricane Isaac response were analyzed for information about EEIs. Data that pertained to the same EEI were linked together and given a unique event identification number to enable more detailed analysis of source content. Reports of sixteen unique events were examined for types of data sources reporting on the event and timeliness of the reports. Results: Of these sixteen unique events identified, six were reported by only a single data source, four were reported by two data sources, four were reported by three data sources, and two were reported by four or more data sources. For five of the events where news tweets were one of multiple sources of information about an event, the tweet occurred prior to the news report, press release, local government\emergency management tweet, and federal situation report. In all circumstances where citizens were ...
To compare the timeliness of nongovernmental and governmental communications of infectious disease outbreaks and evaluate trends for each over time, we investigated the time elapsed from the beginning of an outbreak to public reporting of the event. We found that governmental sources improved the timeliness of public reporting of infectious disease outbreaks during the study period.