Sexy Call Girl Dharmapuri Arshi ๐9058824046๐ Dharmapuri Escort Service
ย
Surveillance of social media: Big data analytics
1. Presented by:
Thomas Otto (Manager Business Intelligence)
Dr. Mehnaz Adnan (Senior Scientist Health Intelligence)
Institute of Environmental Science & Research Ltd.
Credits:
ESR โ Dr. Mehnaz Adnan: Health Intelligence Analytics on Tweets
ESR - Franco Andrews: SAP Data Integration, Modelling, Analytics and
Visualisation
ESR IT: Infrastructure / Firewall
Soltius NZ - Erik Roelofs: Connection Module and SAP Data Services
Syndromic Surveillance of Social
Media - Big Data Analytics
ยฉ ESR 2015
2. Problem statement and hypothesis
โข Individuals disclose a lot of personal information on Social Media
channels (i.e. Facebook, Twitter etc.)
โข Thereโs lots of Social Media Data (SMD) out there and:-
โข It is very noisy
โข It is not verified
โข It needs to be curated (checked by a clinician)
โข Personal information contains location, names and self diagnosed
syndromes
โข SMD could be used to feed an early warning surveillance system
3. 1. How to exploit twitter for public health monitoring
(http://goo.gl/sOx9xo)
2. Digital disease detectionโharnessing the Web for public health
surveillance. (http://goo.gl/fxwoJT)
3. Influenza forecasting with Google flu trends. (http://goo.gl/z7GZco)
Related work
1) Denecke, K., Krieck, M., Otrusina, L., Smrz, P., Dolog, P., Nejdl, W., & Velasco, E. (2013). How to exploit twitter
for public health monitoring. Methods Inf Med, 52(4), 326-339.
2) Brownstein, J. S., Freifeld, C. C., & Madoff, L. C. (2009). Digital disease detectionโharnessing the Web for
public health surveillance. New England Journal of Medicine, 360(21), 2153-2157.
3) Dugas, A. F., Jalalpour, M., Gel, Y., Levin, S., Torcaso, F., Igusa, T., & Rothman, R. E. (2013). Influenza
forecasting with Google flu trends. PloS one, 8(2), e56176.
4. What is Social Media?
Social media refers to the means of interactions among people in which they
create, share, and/or exchange information and ideas in virtual communities
and networksยน.
1 Tufts university, Boston, U.S.A.
2 Social Media Examiner: 2014 Social Media Marketing Industry Report
5. What is Twitter?
Some Twitter statistics
โข 1 billion users registered
โข 255 million users/month
โข 100 million users per day ยณ
1 Wikipedia
2 PEW Research Centre: January 2014
3 DMR: March 2014
4 Twitter Terms of Service as of 24/7/14
6. Overview
โข This is a proof of concept (POC)
โข The POC is not yet used for surveillance or to monitor actual diseases
โข This POC is an experimental application at ESR to understand the
validity of the approach
7. Method of data collection
Commercial / Government clients
Future work
Machine Learning (ML),
Artificial Intelligence
(AI) etc.
Commercial / Government clients
8. โข There was a measles outbreak in 2014
โข We extracted a subset of tweets for the period of Jan 2014 to Dec.
2015 containing the key word โmeaslesโ from our twitter data mart
โข We extracted the number of confirmed measles cases for the period
of Jan 2014 to Dec. 2015 from a national New Zealand surveillance
system (EpiSurv)
โข We performed quantitative data analysis on both data sets
Study Design
10. โข Number of tweets collected for measles: 1408
โข Single keyword-based data curation
โข Usage of free Twitter API 1.1 (volume, timeliness)
Limitations
11. Social Media (Twitter) โ Visualisation / Front-End
Select keyword
Measles
Zoom into WLG
Basic stats by location
Measles Tweets
Current, active
keywords
12. โข We believe that Social Media Data (SMD) is a relevant source
of information
โข Storage is potentially challenging (it has aspects of Big Data)
โข Cleansing (it needs to be curated)
โข A mixed approach between machine automation and
human verification (i.e. clinician)
โข Curated SMD will be the source for down-stream Analytics
and early warning systems (syndromic surveillance)
Conclusion
13. โข Potentially use a Twitter data aggregator, or a paid Twitter API
connection (higher volumes, better timeliness)
โข Adding to the Linguistic Analytical Module applying:-
โข Machine Learning (sentiment analysis, linear regression analysis
etc.)
โข Prediction
โข Evaluate the Deep Dive engine from Stanford University
(http://deepdive.stanford.edu/)
โข Develop ontology for syndromic keywords related to specific diseases
(i.e. spots, rash, itching for measles)
Future work
14. Confidence
Tweet is a real
event.
100 %
0 %
Time
? %
Unverified
Twitter
data
Verify with
Health Line
data
Enrich Twitter data set with
other, verified data (counts,
location, time).
Verify with
Health Stats
data
Verify with
Lab
Information
Data
Verify with
Sentinel
data
Verify with
National
Surveillance
Database
PRO
Nature scientific
journal: There is a close
correlation between
the rates of doctor
visits for flu symptoms,
and the use of flu-like
search terms. NZ Herald 23/7/14
CON
Researchers from Harvard University state:
Google Flu Tracker has overestimated for 100 of
the 108 weeks starting from
August, 2011
source: motherboard.vice.com
VISION - Big Data complements traditional methods
Calibrate social media data with verified and trusted data to identify valid tweets
Risk Opportunity
โข Social Media Data is
validated and a trusted
source of information
โข Maybe used for
indicative, early
warnings of potential
outbreaks?
Google ??
Flu Tracker
NZ
50 %
15. Some Social Media exploration tools
โข Try below tools first and see what benefit they offer
(This is random selection and does not rate or recommend any of the tools in particular)
โข PlusOne Social (http://plusonesocial.com )
โข Microsoft Excel Twitter add-in (http://goo.gl/WBaXt5)
โข Others
โข http://www.razorsocial.com/free-twitter-analytics/
โข http://www.socialmediaexaminer.com/6-twitter-analytics-
tools/
16. Presented by:
Thomas Otto (Manager Business Intelligence)
Dr. Mehnaz Adnan (Senior Scientist Health Intelligence)
Institute of Environmental Science & Research Ltd.
Credits:
ESR โ Dr. Mehnaz Adnan: Health Intelligence Analytics on Tweets
ESR - Franco Andrews: SAP Data Integration, Modelling, Analytics and
Visualisation
ESR IT: Infrastructure / Firewall
Soltius NZ - Erik Roelofs: Connection Module and SAP Data Services
Syndromic Surveillance of Social
Media - Big Data Analytics
ยฉ ESR 2015