SlideShare a Scribd company logo
1 of 24
Processing Social Media Messages in
Mass Emergency: Survey Summary
Muhammad Imran Carlos Castillo
Fernando Diaz Sarah Vieweg
Authors
mimran@hbku.edu.qa chato@acm.org
diazf@acm.org sarahvieweg@gmail.com
Date: 25th April 2018
Overarching Goal
“To extract time-critical information from social media
that is useful for emergency responders, affected
communities, and other concerned population in
disaster situations.”
Urgent help need Urgent aid need
Survey Study Selection
Domain filters Topic filters Data filters
- Humanitarian
- Disaster response
- Mass emergencies
- Computing
- Artificial intelligence
- Machine learning
- Twitter
- Facebook
- Micro-blogging
Keywords
Final selection = 180 published research papers
Domain Topics Data
>700 articles
Duplicate filters
Topics Covered
Humanitarian + Social Media + AI
Volume & Velocity
(~18)
Data acquisition,
storage, and retrieval
Event Detection
(~36)
Topic detection and
tracking
Classification &
Clustering
(~40)
Classification and
clustering
Information
Summarization
~(15)
Abstractive and
Extractive summarization
Semantics and
Crisis Ontologies
(~10)
Semantic enrichment &
Crisis ontologies
Information
Veracity
(~18)
Credibility and
misinformation
Information
Visualization
(~12)
Crisis maps, dashboards
Total ~180
papers surveyed
Volume & Velocity
Twitter Storms during Emergencies
Source: https://www.wsj.com/articles/twitter-storms-can-help-gauge-damage-of-real-storms-and-disasters-study-says-1457722801
(Castillo C, Big Crisis
Data, 2016, Cambridge
University Press)
Volume
Velocity 72k tweets/min
27 million in 3 days
(Yury Kryvasheyeu et al. Sci Adv 2016;2:e1500779)
Blue: represents a location farther from the disaster
Red: represents a location closer to the disaster
Twitter Activity Across Locations
during Disasters
Activity Retweeting
Strong relationship between proximity to Sandy’s path and social media activity
Event Detection
Event Description
• Why to detect events from social media?
– Human sensors report incidents very quickly
– Tweet waves travel faster than earthquake waves
• What is an event?
– Events can be defined as situations, actions or
occurrences that happen in a certain location at a
specific time (Dou et al. 2012)
• An event is generally characterized by: 5W1H
– Who? When? Where? What? Why? How?
Event Detection using
Bursty Behavior
(Liang et al. Quantifying Information Flow During Emergencies, 2014, Nature.)
Event Detection Systems
System Approach Event
types
Real-
time
Query
type
Spatio-
temporal
Sub-
events
Reference
Twitter
Monitor
Burst
detection
Open domain Yes Open No No [Mathioudakis et al. 2010]
TwitInfo Burst
detection
Earthquakes Yes Keyword Spatial Yes [Marcus et al. 2011]
Twevent Burst
detection
Open domain Yes Open No No [Li et al. 2012b]
TEDAS Supervised
classification
Crime/disast
ers
No Keyword Yes No [Li et al. 2012a]
LeadLine Burst
detection
Open domain No Keyword Yes No [Dou et al. 2012]
TwiCal Supervised
classification
Conflicts/poli
tics
Yes Open Temporal No [Ritter et al. 2012]
Tweet4Act Dictionaries Disasters Yes Keyword No No [Chowdhury et al. 2013]
ESA Burst
detection
Open domain Yes Keyword Spatial No [Robinson et al. 2013a]
Challenges and Future Directions
• Inadequate spatial information
– Spatial and temporal information are two integral
components of an event
– Automatic text-based geo-tagging may help
• Mundane events
– #MusicMonday #FollowFriday are misleading
• Describing the events
– Named-entities, tracking, semantic enhancements
Information
Classification and Clustering
By Information Provided
• Caution and advice [Imran et al. 2013b]; warnings [Acar and Muraki 2011];
hazard preparation [Olteanu et al. 2014]; tips [Leavitt and Clark 2014]; advice
[Bruns 2014]; status, protocol [Hughes et al. 2014b]
• Affected or trapped people [Caragea et al. 2011]; casualties, people missing,
found, or seen [Imran et al. 2013b]; self-reports [Acar and Muraki 2011]; injured,
missing, killed [Vieweg et al. 2010]; looking for missing people [Qu et al. 2011]
• Infrastructure/utilities damage [Imran et al. 2013b]; collapsed structure
[Caragea et al. 2011]; built environment [Vieweg et al. 2010]; closure and services
[Hughes et al. 2014b]
• Needs and donations of money, goods, services [Imran et al. 2013b];
food/water shortage [Caragea et al. 2011]; donations or volunteering [Olteanu et
al. 2014]; help requests, relief coordination [Qu et al. 2011]; relief, donations,
resources [Hughes et al. 2014b]; help and fundraising [Bruns 2014]
• Other useful information: hospital/clinic service, water sanitation [Caragea et
al. 2011]; consequences [Olteanu et al. 2014]
By Information Provided
• Caution and advice [Imran et al. 2013b]; warnings [Acar and Muraki 2011];
hazard preparation [Olteanu et al. 2014]; tips [Leavitt and Clark 2014]; advice
[Bruns 2014]; status, protocol [Hughes et al. 2014b]
• Affected or trapped people [Caragea et al. 2011]; casualties, people missing,
found, or seen [Imran et al. 2013b]; self-reports [Acar and Muraki 2011]; injured,
missing, killed [Vieweg et al. 2010]; looking for missing people [Qu et al. 2011]
• Infrastructure/utilities damage [Imran et al. 2013b]; collapsed structure
[Caragea et al. 2011]; built environment [Vieweg et al. 2010]; closure and services
[Hughes et al. 2014b]
• Needs and donations of money, goods, services [Imran et al. 2013b];
food/water shortage [Caragea et al. 2011]; donations or volunteering [Olteanu et
al. 2014]; help requests, relief coordination [Qu et al. 2011]; relief, donations,
resources [Hughes et al. 2014b]; help and fundraising [Bruns 2014]
• Other useful information: hospital/clinic service, water sanitation [Caragea et
al. 2011]; consequences [Olteanu et al. 2014]
- Supervised classification techniques
- Learning algorithms include SVMs, Random
Forest, Ensemble methods, and lately deep
learning e.g., RNN
- Unsupervised: clustering, and LDA for topic modeling
Formal response organizations prefer supervised
classification as most of the times categories are
defined.
Systems for Crisis Data Processing
Twitris [Purohit and Sheth 2013]
Twitter; semantic enrichment, classify automatically, geotag
SensePlace2 [MacEachren et al. 2011]
Twitter; geotag, visualize heat-maps based on geotags
EAIMS Emergency Analysis Identification and Management System [McCreadie et
al. 2016] Twitter; sentiment, alerts, credibility,
ESA Emergency Situation Awareness
[Yin et al. 2012; Power et al. 2014]
Twitter; detect bursts, classify, cluster, geotag
Systems for Crisis Data Processing
Twitcident [Abel et al. 2012]
Twitter and TwitPic; semantic enrichment, classify
CrisisTracker [Rogstadius et al. 2013]
Twitter; cluster, annotate manually
Tweedr [Ashktorab et al. 2014]
Twitter; classify automatically, extract information, geotag
AIDR: Artificial Intelligence for Disaster Response [Imran et al. 2014a]
Twitter & Facebook; annotate manually,
classify automatically (text + image)
Challenges and Future Directions
• Missing actionable insights
– Who and where help is needed
– Automatic extraction of actionable/serviceable msgs
• Labeled data scarcity
– Most of the systems are labeled data hungry
– More robust domain adaption and transfer learning
techniques are required
• Focus on other content type (Images)
– Images contain critical information (e.g., damage)
– More focus on multimodal research is required
Information Summarization
Information Summarization
Tribhuvan international airport closed after the quake
Airport closed after 7.9 Earthquake in Kathmandu
Tribhuvan international airport closed after 7.9 earthquake in
Kathmandu.
Summaries reduce information overload issue
Key Objectives and Challenges
• Information coverage
– Capture most situational updates from data. The summary should be
rich in terms of information coverage
• Less redundant information
– Messages on Twitter contain duplicate information. Produce
summaries with less redundant but important updates
• Readability
– Twitter messages are often noisy, informal, and full of grammatical
mistakes. The aim here is to produce more readable summaries
• Real-time (online/updated summaries)
– The system should not be heavily overloaded with computations
such that by the time the summary is produced, the utility of that
information is marginal
(McCreadie et al. 2013; Aslam et al. 2013;
Nenkova and McKeown 2011; Guo et al. 2013, Rudra et al., 2016)
Crisis Datasets (Labeled + Unlabeled)
CrisisMMD: Multimodal Twitter Datasets from Natural
Disasters
http://CrisisNLP.qcri.org/
http://CrisisLex.org/
Conclusion and Future Directions
• Applied Research at its Best
– Real-world problems and challenges
– Social Media for Social Good
– Decent work on information filtering and classification (last 6-8 years)
• Social media imagery content is another potential source of information
• Labeled data scarcity problem
– No or few labeled data instances (in early hours)
– High diversity among organizations needs
– Information needs change overtime
– Domain adaptation and transfer learning techniques required
• From situational to actionable insights
– Identify requests and needs in real-time
– Triangulate missing information
– Rank them based on their urgency to help responders
Thank you!
Contact me at: mimran@hbku.edu.qa OR @mimran15
For queries, questions, and datasets:
Recommended books:
Processing Social Media Messages in Mass Emergency: A Survey.
ACM Computing Surveys, 2015.
Full survey paper:

More Related Content

What's hot

Automatically Rank Social Media Requests for Emergency Services using Service...
Automatically Rank Social Media Requests for Emergency Services using Service...Automatically Rank Social Media Requests for Emergency Services using Service...
Automatically Rank Social Media Requests for Emergency Services using Service...Hemant Purohit
 
ICT for Disaster Risk Management-Managing Disaster Information-Global Risk Id...
ICT for Disaster Risk Management-Managing Disaster Information-Global Risk Id...ICT for Disaster Risk Management-Managing Disaster Information-Global Risk Id...
ICT for Disaster Risk Management-Managing Disaster Information-Global Risk Id...Global Risk Forum GRFDavos
 
KNO.E.SIS Approach to Impactful Research, Creating Exceptional Careers & Eco...
KNO.E.SIS Approach to Impactful Research,  Creating Exceptional Careers & Eco...KNO.E.SIS Approach to Impactful Research,  Creating Exceptional Careers & Eco...
KNO.E.SIS Approach to Impactful Research, Creating Exceptional Careers & Eco...Amit Sheth
 
Social Media News Mining and Automatic Content Analysis of News
Social Media News Mining and Automatic Content Analysis of NewsSocial Media News Mining and Automatic Content Analysis of News
Social Media News Mining and Automatic Content Analysis of NewsCarlos Castillo (ChaTo)
 
Twitris in Action - a review of its many applications
Twitris in Action - a review of its many applications Twitris in Action - a review of its many applications
Twitris in Action - a review of its many applications Amit Sheth
 
Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...
Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...
Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...Artificial Intelligence Institute at UofSC
 
Helping Crisis Responders Find the Informative Needle in the Tweet Haystack
Helping Crisis Responders Find the Informative Needle in the Tweet HaystackHelping Crisis Responders Find the Informative Needle in the Tweet Haystack
Helping Crisis Responders Find the Informative Needle in the Tweet HaystackCOMRADES project
 
Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...
Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...
Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...Keith Powell
 
CrisisCommons Statement for the Record
CrisisCommons Statement for the RecordCrisisCommons Statement for the Record
CrisisCommons Statement for the RecordHeather Blanchard
 
Leveraging A Wiki To Enhance Virtual Collaboration In The Emergency Domain
Leveraging A Wiki To Enhance Virtual Collaboration In The Emergency DomainLeveraging A Wiki To Enhance Virtual Collaboration In The Emergency Domain
Leveraging A Wiki To Enhance Virtual Collaboration In The Emergency DomainConnie White
 
Emergency Risk Communication
Emergency Risk CommunicationEmergency Risk Communication
Emergency Risk CommunicationHeather Blanchard
 
The Digital Humanitarian Moment: New Practices, Knowledge Politics, and Phila...
The Digital Humanitarian Moment: New Practices, Knowledge Politics, and Phila...The Digital Humanitarian Moment: New Practices, Knowledge Politics, and Phila...
The Digital Humanitarian Moment: New Practices, Knowledge Politics, and Phila...Ryan Burns
 
Disaster data informatics for situation awareness
Disaster data informatics for situation awareness Disaster data informatics for situation awareness
Disaster data informatics for situation awareness Ashutosh Jadhav
 
OCHA Think Brief - Hashtag Standards for emergencies
OCHA Think Brief - Hashtag Standards for emergenciesOCHA Think Brief - Hashtag Standards for emergencies
OCHA Think Brief - Hashtag Standards for emergenciesJan Husar
 
Situational Awareness Workgroup Input
Situational Awareness Workgroup InputSituational Awareness Workgroup Input
Situational Awareness Workgroup InputHeather Blanchard
 
National Preparedness System (NPS) component: TractorFax's Incident Managemen...
National Preparedness System (NPS) component: TractorFax's Incident Managemen...National Preparedness System (NPS) component: TractorFax's Incident Managemen...
National Preparedness System (NPS) component: TractorFax's Incident Managemen...JD Hamilton
 

What's hot (20)

Crisis Computing
Crisis ComputingCrisis Computing
Crisis Computing
 
Automatically Rank Social Media Requests for Emergency Services using Service...
Automatically Rank Social Media Requests for Emergency Services using Service...Automatically Rank Social Media Requests for Emergency Services using Service...
Automatically Rank Social Media Requests for Emergency Services using Service...
 
ICT for Disaster Risk Management-Managing Disaster Information-Global Risk Id...
ICT for Disaster Risk Management-Managing Disaster Information-Global Risk Id...ICT for Disaster Risk Management-Managing Disaster Information-Global Risk Id...
ICT for Disaster Risk Management-Managing Disaster Information-Global Risk Id...
 
KNO.E.SIS Approach to Impactful Research, Creating Exceptional Careers & Eco...
KNO.E.SIS Approach to Impactful Research,  Creating Exceptional Careers & Eco...KNO.E.SIS Approach to Impactful Research,  Creating Exceptional Careers & Eco...
KNO.E.SIS Approach to Impactful Research, Creating Exceptional Careers & Eco...
 
Social Media News Mining and Automatic Content Analysis of News
Social Media News Mining and Automatic Content Analysis of NewsSocial Media News Mining and Automatic Content Analysis of News
Social Media News Mining and Automatic Content Analysis of News
 
Twitris in Action - a review of its many applications
Twitris in Action - a review of its many applications Twitris in Action - a review of its many applications
Twitris in Action - a review of its many applications
 
Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...
Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...
Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...
 
Helping Crisis Responders Find the Informative Needle in the Tweet Haystack
Helping Crisis Responders Find the Informative Needle in the Tweet HaystackHelping Crisis Responders Find the Informative Needle in the Tweet Haystack
Helping Crisis Responders Find the Informative Needle in the Tweet Haystack
 
Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...
Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...
Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...
 
CrisisCommons Statement for the Record
CrisisCommons Statement for the RecordCrisisCommons Statement for the Record
CrisisCommons Statement for the Record
 
Kenya red cross society
Kenya red cross societyKenya red cross society
Kenya red cross society
 
Leveraging A Wiki To Enhance Virtual Collaboration In The Emergency Domain
Leveraging A Wiki To Enhance Virtual Collaboration In The Emergency DomainLeveraging A Wiki To Enhance Virtual Collaboration In The Emergency Domain
Leveraging A Wiki To Enhance Virtual Collaboration In The Emergency Domain
 
2014 may iaem bulletin jifx
2014 may iaem bulletin jifx2014 may iaem bulletin jifx
2014 may iaem bulletin jifx
 
Emergency Risk Communication
Emergency Risk CommunicationEmergency Risk Communication
Emergency Risk Communication
 
The Digital Humanitarian Moment: New Practices, Knowledge Politics, and Phila...
The Digital Humanitarian Moment: New Practices, Knowledge Politics, and Phila...The Digital Humanitarian Moment: New Practices, Knowledge Politics, and Phila...
The Digital Humanitarian Moment: New Practices, Knowledge Politics, and Phila...
 
Sais.34.1
Sais.34.1Sais.34.1
Sais.34.1
 
Disaster data informatics for situation awareness
Disaster data informatics for situation awareness Disaster data informatics for situation awareness
Disaster data informatics for situation awareness
 
OCHA Think Brief - Hashtag Standards for emergencies
OCHA Think Brief - Hashtag Standards for emergenciesOCHA Think Brief - Hashtag Standards for emergencies
OCHA Think Brief - Hashtag Standards for emergencies
 
Situational Awareness Workgroup Input
Situational Awareness Workgroup InputSituational Awareness Workgroup Input
Situational Awareness Workgroup Input
 
National Preparedness System (NPS) component: TractorFax's Incident Managemen...
National Preparedness System (NPS) component: TractorFax's Incident Managemen...National Preparedness System (NPS) component: TractorFax's Incident Managemen...
National Preparedness System (NPS) component: TractorFax's Incident Managemen...
 

Similar to Processing Social Media Messages in Mass Emergency: A Survey

Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis ScenarioSummarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis ScenarioMuhammad Imran
 
Why aren't Evaluators using Digital Media Analytics?
Why aren't Evaluators using Digital Media Analytics?Why aren't Evaluators using Digital Media Analytics?
Why aren't Evaluators using Digital Media Analytics?CesToronto
 
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Farida Vis
 
Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Nicola Osborne
 
Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...ijtsrd
 
Building Social Life Networks 130818
Building Social Life Networks 130818Building Social Life Networks 130818
Building Social Life Networks 130818Ramesh Jain
 
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Carlos Castillo (ChaTo)
 
Multimodal Combination.pdf
Multimodal Combination.pdfMultimodal Combination.pdf
Multimodal Combination.pdfclientmentailai
 
Event detection in twitter using text and image fusion
Event detection in twitter using text and image fusionEvent detection in twitter using text and image fusion
Event detection in twitter using text and image fusioncsandit
 
No Money, No Problem - A Scalable Approach to Social Media Monitoring
No Money, No Problem - A Scalable Approach to Social Media MonitoringNo Money, No Problem - A Scalable Approach to Social Media Monitoring
No Money, No Problem - A Scalable Approach to Social Media MonitoringTamer Hadi
 
Fusing text and image for event
Fusing text and image for eventFusing text and image for event
Fusing text and image for eventijma
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis? Amit Sheth
 
Twitter turns ten: its use to date in disaster management
Twitter turns ten: its use to date in disaster managementTwitter turns ten: its use to date in disaster management
Twitter turns ten: its use to date in disaster managementNeil Dufty
 
Demo 2014 aidr_artificial_intelligence_disaster_response
Demo 2014 aidr_artificial_intelligence_disaster_responseDemo 2014 aidr_artificial_intelligence_disaster_response
Demo 2014 aidr_artificial_intelligence_disaster_responseConor O'Connor
 
CIT in information literacy, ECIL 2016, Sabina Cisek
CIT in information literacy, ECIL 2016, Sabina CisekCIT in information literacy, ECIL 2016, Sabina Cisek
CIT in information literacy, ECIL 2016, Sabina CisekSabina Cisek
 
Social Media as a Record for Public Services and Utilities in a Disaster
Social Media as a Record for Public Services and Utilities in a DisasterSocial Media as a Record for Public Services and Utilities in a Disaster
Social Media as a Record for Public Services and Utilities in a DisasterAndrew Steinitz
 
Establishing Global Rules for the Ethical Use of Artificial Intelligence in H...
Establishing Global Rules for the Ethical Use of Artificial Intelligence in H...Establishing Global Rules for the Ethical Use of Artificial Intelligence in H...
Establishing Global Rules for the Ethical Use of Artificial Intelligence in H...tinokreutzer
 
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...Saurabh Mishra
 

Similar to Processing Social Media Messages in Mass Emergency: A Survey (20)

Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis ScenarioSummarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
 
Why aren't Evaluators using Digital Media Analytics?
Why aren't Evaluators using Digital Media Analytics?Why aren't Evaluators using Digital Media Analytics?
Why aren't Evaluators using Digital Media Analytics?
 
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
 
Akram.pptx
Akram.pptxAkram.pptx
Akram.pptx
 
Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...
 
Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...Expelling Information of Events from Critical Public Space using Social Senso...
Expelling Information of Events from Critical Public Space using Social Senso...
 
Building Social Life Networks 130818
Building Social Life Networks 130818Building Social Life Networks 130818
Building Social Life Networks 130818
 
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
 
Multimodal Combination.pdf
Multimodal Combination.pdfMultimodal Combination.pdf
Multimodal Combination.pdf
 
Event detection in twitter using text and image fusion
Event detection in twitter using text and image fusionEvent detection in twitter using text and image fusion
Event detection in twitter using text and image fusion
 
No Money, No Problem - A Scalable Approach to Social Media Monitoring
No Money, No Problem - A Scalable Approach to Social Media MonitoringNo Money, No Problem - A Scalable Approach to Social Media Monitoring
No Money, No Problem - A Scalable Approach to Social Media Monitoring
 
Introduction to Digital Humanitarians
Introduction to Digital Humanitarians   Introduction to Digital Humanitarians
Introduction to Digital Humanitarians
 
Fusing text and image for event
Fusing text and image for eventFusing text and image for event
Fusing text and image for event
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis?
 
Twitter turns ten: its use to date in disaster management
Twitter turns ten: its use to date in disaster managementTwitter turns ten: its use to date in disaster management
Twitter turns ten: its use to date in disaster management
 
Demo 2014 aidr_artificial_intelligence_disaster_response
Demo 2014 aidr_artificial_intelligence_disaster_responseDemo 2014 aidr_artificial_intelligence_disaster_response
Demo 2014 aidr_artificial_intelligence_disaster_response
 
CIT in information literacy, ECIL 2016, Sabina Cisek
CIT in information literacy, ECIL 2016, Sabina CisekCIT in information literacy, ECIL 2016, Sabina Cisek
CIT in information literacy, ECIL 2016, Sabina Cisek
 
Social Media as a Record for Public Services and Utilities in a Disaster
Social Media as a Record for Public Services and Utilities in a DisasterSocial Media as a Record for Public Services and Utilities in a Disaster
Social Media as a Record for Public Services and Utilities in a Disaster
 
Establishing Global Rules for the Ethical Use of Artificial Intelligence in H...
Establishing Global Rules for the Ethical Use of Artificial Intelligence in H...Establishing Global Rules for the Ethical Use of Artificial Intelligence in H...
Establishing Global Rules for the Ethical Use of Artificial Intelligence in H...
 
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
 

More from Muhammad Imran

Damage Assessment from Social Media Imagery Data During Disasters
Damage Assessment from Social Media Imagery Data During DisastersDamage Assessment from Social Media Imagery Data During Disasters
Damage Assessment from Social Media Imagery Data During DisastersMuhammad Imran
 
Image4Act: Online Social Media Image Processing for Disaster Response
Image4Act: Online Social Media Image Processing for Disaster ResponseImage4Act: Online Social Media Image Processing for Disaster Response
Image4Act: Online Social Media Image Processing for Disaster ResponseMuhammad Imran
 
AIDR Tutorial (Artificial Intelligence for Disaster Response)
AIDR Tutorial (Artificial Intelligence for Disaster Response)AIDR Tutorial (Artificial Intelligence for Disaster Response)
AIDR Tutorial (Artificial Intelligence for Disaster Response)Muhammad Imran
 
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...Muhammad Imran
 
Artificial Intelligence for Disaster Response
Artificial Intelligence for Disaster ResponseArtificial Intelligence for Disaster Response
Artificial Intelligence for Disaster ResponseMuhammad Imran
 
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...Muhammad Imran
 
Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...
Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...
Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...Muhammad Imran
 
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...Muhammad Imran
 
Extracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social MediaExtracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social MediaMuhammad Imran
 
Domain Specific Mashups
Domain Specific MashupsDomain Specific Mashups
Domain Specific MashupsMuhammad Imran
 
Reseval Mashup Platform Talk at SECO
Reseval Mashup Platform Talk at SECOReseval Mashup Platform Talk at SECO
Reseval Mashup Platform Talk at SECOMuhammad Imran
 
ResEval: Resource-oriented Research Impact Evaluation platform
ResEval: Resource-oriented Research Impact Evaluation platformResEval: Resource-oriented Research Impact Evaluation platform
ResEval: Resource-oriented Research Impact Evaluation platformMuhammad Imran
 

More from Muhammad Imran (12)

Damage Assessment from Social Media Imagery Data During Disasters
Damage Assessment from Social Media Imagery Data During DisastersDamage Assessment from Social Media Imagery Data During Disasters
Damage Assessment from Social Media Imagery Data During Disasters
 
Image4Act: Online Social Media Image Processing for Disaster Response
Image4Act: Online Social Media Image Processing for Disaster ResponseImage4Act: Online Social Media Image Processing for Disaster Response
Image4Act: Online Social Media Image Processing for Disaster Response
 
AIDR Tutorial (Artificial Intelligence for Disaster Response)
AIDR Tutorial (Artificial Intelligence for Disaster Response)AIDR Tutorial (Artificial Intelligence for Disaster Response)
AIDR Tutorial (Artificial Intelligence for Disaster Response)
 
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
 
Artificial Intelligence for Disaster Response
Artificial Intelligence for Disaster ResponseArtificial Intelligence for Disaster Response
Artificial Intelligence for Disaster Response
 
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
 
Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...
Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...
Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...
 
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...
 
Extracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social MediaExtracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social Media
 
Domain Specific Mashups
Domain Specific MashupsDomain Specific Mashups
Domain Specific Mashups
 
Reseval Mashup Platform Talk at SECO
Reseval Mashup Platform Talk at SECOReseval Mashup Platform Talk at SECO
Reseval Mashup Platform Talk at SECO
 
ResEval: Resource-oriented Research Impact Evaluation platform
ResEval: Resource-oriented Research Impact Evaluation platformResEval: Resource-oriented Research Impact Evaluation platform
ResEval: Resource-oriented Research Impact Evaluation platform
 

Recently uploaded

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 

Recently uploaded (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 

Processing Social Media Messages in Mass Emergency: A Survey

  • 1. Processing Social Media Messages in Mass Emergency: Survey Summary Muhammad Imran Carlos Castillo Fernando Diaz Sarah Vieweg Authors mimran@hbku.edu.qa chato@acm.org diazf@acm.org sarahvieweg@gmail.com Date: 25th April 2018
  • 2. Overarching Goal “To extract time-critical information from social media that is useful for emergency responders, affected communities, and other concerned population in disaster situations.” Urgent help need Urgent aid need
  • 3. Survey Study Selection Domain filters Topic filters Data filters - Humanitarian - Disaster response - Mass emergencies - Computing - Artificial intelligence - Machine learning - Twitter - Facebook - Micro-blogging Keywords Final selection = 180 published research papers Domain Topics Data >700 articles Duplicate filters
  • 4. Topics Covered Humanitarian + Social Media + AI Volume & Velocity (~18) Data acquisition, storage, and retrieval Event Detection (~36) Topic detection and tracking Classification & Clustering (~40) Classification and clustering Information Summarization ~(15) Abstractive and Extractive summarization Semantics and Crisis Ontologies (~10) Semantic enrichment & Crisis ontologies Information Veracity (~18) Credibility and misinformation Information Visualization (~12) Crisis maps, dashboards Total ~180 papers surveyed
  • 6. Twitter Storms during Emergencies Source: https://www.wsj.com/articles/twitter-storms-can-help-gauge-damage-of-real-storms-and-disasters-study-says-1457722801 (Castillo C, Big Crisis Data, 2016, Cambridge University Press) Volume Velocity 72k tweets/min 27 million in 3 days
  • 7. (Yury Kryvasheyeu et al. Sci Adv 2016;2:e1500779) Blue: represents a location farther from the disaster Red: represents a location closer to the disaster Twitter Activity Across Locations during Disasters Activity Retweeting Strong relationship between proximity to Sandy’s path and social media activity
  • 9. Event Description • Why to detect events from social media? – Human sensors report incidents very quickly – Tweet waves travel faster than earthquake waves • What is an event? – Events can be defined as situations, actions or occurrences that happen in a certain location at a specific time (Dou et al. 2012) • An event is generally characterized by: 5W1H – Who? When? Where? What? Why? How?
  • 10. Event Detection using Bursty Behavior (Liang et al. Quantifying Information Flow During Emergencies, 2014, Nature.)
  • 11. Event Detection Systems System Approach Event types Real- time Query type Spatio- temporal Sub- events Reference Twitter Monitor Burst detection Open domain Yes Open No No [Mathioudakis et al. 2010] TwitInfo Burst detection Earthquakes Yes Keyword Spatial Yes [Marcus et al. 2011] Twevent Burst detection Open domain Yes Open No No [Li et al. 2012b] TEDAS Supervised classification Crime/disast ers No Keyword Yes No [Li et al. 2012a] LeadLine Burst detection Open domain No Keyword Yes No [Dou et al. 2012] TwiCal Supervised classification Conflicts/poli tics Yes Open Temporal No [Ritter et al. 2012] Tweet4Act Dictionaries Disasters Yes Keyword No No [Chowdhury et al. 2013] ESA Burst detection Open domain Yes Keyword Spatial No [Robinson et al. 2013a]
  • 12. Challenges and Future Directions • Inadequate spatial information – Spatial and temporal information are two integral components of an event – Automatic text-based geo-tagging may help • Mundane events – #MusicMonday #FollowFriday are misleading • Describing the events – Named-entities, tracking, semantic enhancements
  • 14. By Information Provided • Caution and advice [Imran et al. 2013b]; warnings [Acar and Muraki 2011]; hazard preparation [Olteanu et al. 2014]; tips [Leavitt and Clark 2014]; advice [Bruns 2014]; status, protocol [Hughes et al. 2014b] • Affected or trapped people [Caragea et al. 2011]; casualties, people missing, found, or seen [Imran et al. 2013b]; self-reports [Acar and Muraki 2011]; injured, missing, killed [Vieweg et al. 2010]; looking for missing people [Qu et al. 2011] • Infrastructure/utilities damage [Imran et al. 2013b]; collapsed structure [Caragea et al. 2011]; built environment [Vieweg et al. 2010]; closure and services [Hughes et al. 2014b] • Needs and donations of money, goods, services [Imran et al. 2013b]; food/water shortage [Caragea et al. 2011]; donations or volunteering [Olteanu et al. 2014]; help requests, relief coordination [Qu et al. 2011]; relief, donations, resources [Hughes et al. 2014b]; help and fundraising [Bruns 2014] • Other useful information: hospital/clinic service, water sanitation [Caragea et al. 2011]; consequences [Olteanu et al. 2014]
  • 15. By Information Provided • Caution and advice [Imran et al. 2013b]; warnings [Acar and Muraki 2011]; hazard preparation [Olteanu et al. 2014]; tips [Leavitt and Clark 2014]; advice [Bruns 2014]; status, protocol [Hughes et al. 2014b] • Affected or trapped people [Caragea et al. 2011]; casualties, people missing, found, or seen [Imran et al. 2013b]; self-reports [Acar and Muraki 2011]; injured, missing, killed [Vieweg et al. 2010]; looking for missing people [Qu et al. 2011] • Infrastructure/utilities damage [Imran et al. 2013b]; collapsed structure [Caragea et al. 2011]; built environment [Vieweg et al. 2010]; closure and services [Hughes et al. 2014b] • Needs and donations of money, goods, services [Imran et al. 2013b]; food/water shortage [Caragea et al. 2011]; donations or volunteering [Olteanu et al. 2014]; help requests, relief coordination [Qu et al. 2011]; relief, donations, resources [Hughes et al. 2014b]; help and fundraising [Bruns 2014] • Other useful information: hospital/clinic service, water sanitation [Caragea et al. 2011]; consequences [Olteanu et al. 2014] - Supervised classification techniques - Learning algorithms include SVMs, Random Forest, Ensemble methods, and lately deep learning e.g., RNN - Unsupervised: clustering, and LDA for topic modeling Formal response organizations prefer supervised classification as most of the times categories are defined.
  • 16. Systems for Crisis Data Processing Twitris [Purohit and Sheth 2013] Twitter; semantic enrichment, classify automatically, geotag SensePlace2 [MacEachren et al. 2011] Twitter; geotag, visualize heat-maps based on geotags EAIMS Emergency Analysis Identification and Management System [McCreadie et al. 2016] Twitter; sentiment, alerts, credibility, ESA Emergency Situation Awareness [Yin et al. 2012; Power et al. 2014] Twitter; detect bursts, classify, cluster, geotag
  • 17. Systems for Crisis Data Processing Twitcident [Abel et al. 2012] Twitter and TwitPic; semantic enrichment, classify CrisisTracker [Rogstadius et al. 2013] Twitter; cluster, annotate manually Tweedr [Ashktorab et al. 2014] Twitter; classify automatically, extract information, geotag AIDR: Artificial Intelligence for Disaster Response [Imran et al. 2014a] Twitter & Facebook; annotate manually, classify automatically (text + image)
  • 18. Challenges and Future Directions • Missing actionable insights – Who and where help is needed – Automatic extraction of actionable/serviceable msgs • Labeled data scarcity – Most of the systems are labeled data hungry – More robust domain adaption and transfer learning techniques are required • Focus on other content type (Images) – Images contain critical information (e.g., damage) – More focus on multimodal research is required
  • 20. Information Summarization Tribhuvan international airport closed after the quake Airport closed after 7.9 Earthquake in Kathmandu Tribhuvan international airport closed after 7.9 earthquake in Kathmandu. Summaries reduce information overload issue
  • 21. Key Objectives and Challenges • Information coverage – Capture most situational updates from data. The summary should be rich in terms of information coverage • Less redundant information – Messages on Twitter contain duplicate information. Produce summaries with less redundant but important updates • Readability – Twitter messages are often noisy, informal, and full of grammatical mistakes. The aim here is to produce more readable summaries • Real-time (online/updated summaries) – The system should not be heavily overloaded with computations such that by the time the summary is produced, the utility of that information is marginal (McCreadie et al. 2013; Aslam et al. 2013; Nenkova and McKeown 2011; Guo et al. 2013, Rudra et al., 2016)
  • 22. Crisis Datasets (Labeled + Unlabeled) CrisisMMD: Multimodal Twitter Datasets from Natural Disasters http://CrisisNLP.qcri.org/ http://CrisisLex.org/
  • 23. Conclusion and Future Directions • Applied Research at its Best – Real-world problems and challenges – Social Media for Social Good – Decent work on information filtering and classification (last 6-8 years) • Social media imagery content is another potential source of information • Labeled data scarcity problem – No or few labeled data instances (in early hours) – High diversity among organizations needs – Information needs change overtime – Domain adaptation and transfer learning techniques required • From situational to actionable insights – Identify requests and needs in real-time – Triangulate missing information – Rank them based on their urgency to help responders
  • 24. Thank you! Contact me at: mimran@hbku.edu.qa OR @mimran15 For queries, questions, and datasets: Recommended books: Processing Social Media Messages in Mass Emergency: A Survey. ACM Computing Surveys, 2015. Full survey paper:

Editor's Notes

  1. Our goal in this paper was to survey systems, techniques, and computational models that help extract time-critical information from social media useful for emergency responders and affected communities. For example, look at these two messages. The message on the left side, which was collected during the recent hurricane Harvey, asks about urgent help for an old person who got trapped. The message on the right side, requests about urgent need of baby food and medicines during a flood situation in Kashmir.
  2. Before start reading the papers, we decided three aspects that influence what papers to select and what not. We formed several keyword searches using domain + topics + data sources. We used several scholarly search engines After getting the results, two of the authors looked at the papers and filter out the ones which were not relevant. Our final set has around 184 papers. ----- Meeting Notes (4/16/18 13:04) ----- - No listing, but the message - Opinions ( -
  3. These are some numbers from a few major past disasters from 2010 to 2013 originally reported in the WSJ. There were 27 million tweets posted in 3 days after the Boston marathon bombing in 2013. How fast these messages arrive? Well, during 2011 Japan earthquake the highest velocity record according the Big Crisis Data book, was 72k. It is not only the velocity is high, actually social media breaks stories faster than traditional channels. When a magnitude-5.8 earthquake hit Virginia in 2011, the first Twitter report from a bystander at the epicenter reached New York about 40 seconds ahead of the quake’s first shock waves. Sourced WSJ
  4. Now with all the big volume and high velocity, the question is whether this Twitter activity indicate anything or is it random? According to this paper published in the Science Journal, there is a strong relationship between disaster proximity and social media activity. “Rapid assessment of disaster damage using social media activity In all charts, the primary plot shows results for messages with keyword “sandy” and the small chart for keyword “weather” to contrast behaviors between event-related and neutral words. Blue represents a location farther from the disaster. Red represents a location closer to the disaster. A: Chart A shows a sharp decline in the activity as the distance between a location and the path of the hurricane increases. B: The chart B shows the activity and retweet fraction. It seems that the retweet rate is inversely related to activity, with affected areas producing more original content. None of the features discussed above are present for neutral words (see the insets in all panels). --Backup— A: After the distance exceeds 1200 to 1500 km, its effect on the strength of response disappears. This trend may be caused by a combination of factors, with direct observation of disaster effects and perception of risk both increasing the tweet activity of the East Coast cities. Anxiety, anticipation, and risk perception evidently contribute to the magnitude of response because many of the communities falling into the decreasing trend were not directly hit or were affected only marginally, whereas New Orleans, for example, shows a significant tweeting level that reflects its historical experience with damaging hurricanes like Katrina. C: The chart C shows content popularity. The popularity of the content created in the disaster area is also higher and therefore increases with activity as well.
  5. Now, with all these huge activity on social media during disasters, can we use it to automatically detection disaster events?
  6. We want to detect events from social media because 1) human sensors are generally fast, 2) we saw that tweet waves travel faster than earthquake waves
  7. According to a study published in Nature on “Quantifying Information flow during emergencies”. The authors used mobile SMS and calls to predict suspicious events. According to this study, the actions and reactions of affected people due to a disaster or due to a non-disaster event are differentiable. Go are users who directly affected by the disaster G1 are users who are contacted by G0 users If you compare, bombing, jet scare, and plane crash with concert event, you notice a consistent pattern in all disaster event which is not visible in the non-disaster event. G0 activity goes up as they hit disaster G1 also go up in the case of emergency, but not really in the case of non-emergency event
  8. Several systems and techniques have been developed in the last couple of years. Here I listed a few important ones with their capabilities e.g, event type, real-time, query type, spatio-temporal, and whether they able to identify sub-events or not. You notice that most of these systems are based on burst detection, which is could be misleading, especially in social media due to mundane events messages. Temporal = able to predict the time of a detected event Spatial = able to predict the location of an event
  9. After an event is detected, the next step is to analyze what the data. Two famous techniques classification and clustering have been used for this purpose.
  10. Here I listed a number of works, with their detailed task.
  11. Here I listed a number of works, with their detailed task.
  12. Unfortunately most of these systems are not developed based on stakeholders needs. Future system should be requirements-driven
  13. Information summarization is another very important step after classification. There are mainly two types of summarization approaches: extractive in which same content as source is used to generate summaries. Abstractive in which new content is used to summarize a set of documents.