Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

On the Ground Validation of Online Diagnosis with Twitter and Medical Records (1404.3026v1)

Published 11 Apr 2014 in cs.SI, cs.CL, and cs.LG

Abstract: Social media has been considered as a data source for tracking disease. However, most analyses are based on models that prioritize strong correlation with population-level disease rates over determining whether or not specific individual users are actually sick. Taking a different approach, we develop a novel system for social-media based disease detection at the individual level using a sample of professionally diagnosed individuals. Specifically, we develop a system for making an accurate influenza diagnosis based on an individual's publicly available Twitter data. We find that about half (17/35 = 48.57%) of the users in our sample that were sick explicitly discuss their disease on Twitter. By developing a meta classifier that combines text analysis, anomaly detection, and social network analysis, we are able to diagnose an individual with greater than 99% accuracy even if she does not discuss her health.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Todd Bodnar (2 papers)
  2. Victoria C Barclay (1 paper)
  3. Nilam Ram (6 papers)
  4. Marcel Salathé (27 papers)
  5. Conrad S Tucker (1 paper)
Citations (24)

Summary

We haven't generated a summary for this paper yet.