SlideShare a Scribd company logo
1 of 23
Download to read offline
TREND
TREND 
DETECTION, TRACKING & TRANSITION 
in Social Networks 
1. Definition & General Idea 
2. Web Samples in Trend Hunting 
3. Detection Approches 
4. Architecture: TwitterMonitor 
5. Detection: MemeTracker 
6. Classification: ExoEndo 
SemioNet: Semantic Social Network Analysis
REFERENCES 
Mathioudakis, Michael, and Nick Koudas. "Twittermonitor: trend 
detection over the twitter stream." Proceedings of the 2010 ACM 
SIGMOD International Conference on Management of data. ACM, 
2010. 
Leskovec, Jure, Lars Backstrom, and Jon Kleinberg. "Meme-tracking 
and the dynamics of the news cycle." Proceedings of the 15th ACM 
SIGKDD international conference on Knowledge discovery and 
data mining. ACM, 2009. 
Naaman, Mor, Hila Becker, and Luis Gravano. "Hip and trendy: 
Characterizing emerging trends on Twitter." Journal of the American 
Society for Information Science and Technology 62.5 (2011): 902- 
918. 
Becker, Hila, Mor Naaman, and Luis Gravano. "Beyond Trending 
Topics: Real-World Event Identification on Twitter." ICWSM 11 (2011): 
438-441.
Trend Analysis 
The Science of Studying 
Changes in Social Patterns, 
Including Fashion, Technology 
& Consumer Behavior 
Horizontal Analysis 
The General Movement 
over TIME of a 
Statistically Detectable 
Change 
Fundamentally, a Method 
for Understanding HOW & 
WHY Things have Changed 
– or will Change – over TIME
APPLICATION
APPROCH 
Text Mining 
Topic Ident. & Clust. 
"Kilroy was here" was a 
piece of graffiti that 
became popular in the 
1940s, and existed under 
various names in 
different countries, 
illustrating how a meme 
can be modified through 
replication 
Memes 
(/ˈmiːm/) is "an idea, behavior, or 
style that spreads from person to person 
within a culture.“ … through writing, 
speech, gestures, rituals, or other 
imitable phenomena with a mimicked 
theme. … cultural analogues to genes in that 
they self-replicate, mutate, and 
respond to selective pressures.
GroupBurst: Assesses Co-occurrences 
One-pass 
of Bursty 
Real-time 
Keyword in Recent Tweets 
Adjustable against spam 
Theoretically sound! 
Adjustable against SPURIOUS Bursts. Coincidental Burst of Keyword over a short period of time 
Context Extraction Algorithms (PCA, 
SVD) & Grapevine’s Entity Extractor 
to Add more 
271 Million Monthly Active Users 
500 Million Tweets (140 ch) Per Day 
78% Active Users on Mobile 
77% Accounts Outside U.S. 
Supports 35+ languages
MemeTracking 
News Cycle 
Tracking News Evolution 
Quotes & Memes 
Integral Part of Journalistic Practice 
Travel Relatively Intact with Mutational Variants 
Clustering by Graph
Item: Each News Article/Blog Post 
Phrase: A Quoted String Occurs in Items 
MemeTracking …
Phrase Graph 
DAG 
|P| < |Q| 
“senseless killing” 
“enough of senseless 
killing” 
“Hear our voice. We have had enough of this 
senseless killing” 
Directed Edit Distance(P, Q) < δ 
Word Consecutive Overlap(P, Q) > k 
P  Q 
푊푃,푄 ∝ 
1 
퐷푖푟푒푐푡푒푑 퐸푑푖푡 퐷푖푠푡푎푛푐푒(푃,푄) 
∝ 푇표푡푎푙 푁푢푚푏푒푟 표푓 푄 푖푛 퐶표푟푝푢푠 
MemeTracking …
Phrase Clusters 
Directed Acyclic Graph (DAG) Partitioning 
Given a Weighted DAG, Delete a Set of Edges of 
Min Total Weight So That Each of the Resulting 
Components is Single-Rooted. 
NP-hard 
Heuristic 
1.Start from the Roots 
2.Down the DAG & greedily Assigns each Node to the Cluster to 
which it has the most Edges 
MemeTracking …
MemeTracking …
Result 
Volume Distribution 
Dataset 
3 Months Aug 1 to Oct 31 2008 
~ 1M Docs per Day from 1.65 Million 
Sites! 
47M Phrases, 22M Distinct 
9H Clustering Process Time 
35, 800 Non-trivial Clusters (at least two phrases) 
MemeTracking …
ThemeRiver 
MemeTracking …
Other Findings 
Time lag between the news media and blogs 
푓 푛푗 훿 푡 − 푡푗 
푛푗 = Number of Item Previously Written for Cluster j 
푡 = 푡ℎ푒 푐푢푟푟푒푛푡 푡푖푚푒 
푡푗 = 푡ℎ푒 푡푖푚푒 푤ℎ푒푛 푗 푤푎푠 푓푖푟푠푡 푝푟표푑푢푐푒푑 
푅푒푐푒푛푐푦 → 훿 푖푠 푚표푛표푡표푛푖푐푎푙푙푦 푑푒푐푟푒푎푠푖푛푔 푖푛 푡 − 푡푗 
퐼푚푖푡푎푡푖표푛 → 푓 푖푠 푚표푛표푡표푛푖푐푎푙푙푦 푖푛푐푟푒푎푠푖푛푔 푖푛 푛푗, 푓(0) > 0 
푡 → 0−: 푎 = 0.076 푡 → 0+: 푎 = 0.092 
푡 → 0−: 푏 = 1.77 푡 → 0+: 푏 = 2.15 
Quotes migrating from blogs to news media: 3.5% 
Each Cluster 
Modeling the news trend 
Imitation≠Recency 
MemeTracking …
Characterizing Trends 
“trends in trend data.”  Meta Trend 
Taxonomy of the trends 
Key Distinguishing Features of Trends 
Not only the Textual Content 
Social Network Structure 
Ties 
Geographic 
Action  Retweet, Reply, Mention, Hashtag
Trends 
Exogenous 
Broadcast-media 
Broadcast of local media 
“fight” (boxing event) 
“Ravens” (football game) 
Broadcast of global/national media 
“Kanye”(KanyeWest acts up at the MTVVideo MusicAwards) 
“Lost Finale” (series finale of Lost). 
Global News 
Breaking 
“earthquake” (Chile earthquake) 
“Tsunami” (HawaiiTsunamiwarning) 
“Beyoncé”(Beyoncé cancels Malaysia concert). 
Nonbreaking 
“HCR” (health care reform) 
“Tiger” (Tiger Woods apologizes) 
“iPad” (toward thelaunch of Apple’s popular device). 
National Holidays & Memorial Days 
“Halloween,” “Valentine’s.” 
Local Participatory & Physical 
Planned 
“marathon,” 
“superbowl” (Super Bowl viewing parties) 
“patrick’s” (St. Patrick’s Day Parade). 
Unplanned 
“rainy,” “snow.” 
Endogenous 
Memes 
#in2010 (in December 2009, users imagine their near future) 
“November” (users marking the beginning of the month on November 1) 
Retweets 
Fan Community Activities 
“2pac” (the anniversary of the death of hip-hop artist Tupac Shakur). 
Characterizing Trends …
Trends from twitter.com 
Trends from Simple Trend Detector 
Trends for Quality Analysis  Supervised Categories 
Trends for Computing Features 
Tquantity 
Ttwitter 
Tterm freq. 
Tquality 
Characterizing Trends …
Content Features 
•Average number of words/characters 
•Proportion of messages with URLs, unique URLs, with hashtags ex/including trend terms 
•Top unique hashtag? 
•Similarity to centroid 
Interaction Features 
• Proportion of retweets, replies, mentions 
Time-based Features 
• Exponential fit head, tail 
• Logarithmic fit head, tail 
Participation Features 
• Messages per author 
• Proportion of messages from top author 
• Proportion of messages from top 10% of authors 
Social Network Features 
•Level of reciprocity 
•Maximal eigenvector centrality 
•Maximal degree centrality 
•Transitivity 
•Density 
•Average component size 
Characterizing Trends …
Content features: Exo higher URLs, smaller hashtags 
Exogenous 
vs. 
Endogenous 
Trends 
Interaction features: Exo fewer 
retweets, similar number of replies 
Time features: Exo different for the 
head period before the trend peak 
but will exhibit similar time features in 
the tail period after the trend peak, 
compared to endogenous trends. 
Social network features: Exo fewer connections, less reciprocity 
1.1 
1.2 
1.3 
1.4 
Characterizing Trends …
TRANSITION 
Alluvial Diagrams
IDEA 
Automatic Categorization of Trends 
Photography Trend  Selfie Image 
Trust Trend  Trustful Users, Trustful Twits 
Untrendy People! Users Counteract the trends

More Related Content

Viewers also liked

Pre colonial literature-1_ - copy
Pre colonial literature-1_ - copyPre colonial literature-1_ - copy
Pre colonial literature-1_ - copyMichelle Celestino
 
Philippine contemporary literature
Philippine contemporary literaturePhilippine contemporary literature
Philippine contemporary literatureschool
 
Pre spanish period in the philippines
Pre spanish period in the philippinesPre spanish period in the philippines
Pre spanish period in the philippinesKate Sevilla
 
PHILIPPINE LITERATURE DURING PRE-COLONIAL PERIOD
PHILIPPINE LITERATURE DURING PRE-COLONIAL PERIODPHILIPPINE LITERATURE DURING PRE-COLONIAL PERIOD
PHILIPPINE LITERATURE DURING PRE-COLONIAL PERIODAnthon Nick Manlangit
 
Humss trends, networks, and critical thinking in the 21st century culture cg 1
Humss trends, networks, and critical thinking in the 21st century culture cg 1Humss trends, networks, and critical thinking in the 21st century culture cg 1
Humss trends, networks, and critical thinking in the 21st century culture cg 1Carie Justine Estrellado
 
Pre colonial philippine literature
Pre colonial philippine literaturePre colonial philippine literature
Pre colonial philippine literatureitsebo
 
What Is The Difference Between A Fad And A Trend
What Is The Difference Between A Fad And A TrendWhat Is The Difference Between A Fad And A Trend
What Is The Difference Between A Fad And A TrendNeil Perkin
 

Viewers also liked (8)

Pre colonial literature-1_ - copy
Pre colonial literature-1_ - copyPre colonial literature-1_ - copy
Pre colonial literature-1_ - copy
 
Philippine contemporary literature
Philippine contemporary literaturePhilippine contemporary literature
Philippine contemporary literature
 
Pre spanish period in the philippines
Pre spanish period in the philippinesPre spanish period in the philippines
Pre spanish period in the philippines
 
Pre colonial-period
Pre colonial-periodPre colonial-period
Pre colonial-period
 
PHILIPPINE LITERATURE DURING PRE-COLONIAL PERIOD
PHILIPPINE LITERATURE DURING PRE-COLONIAL PERIODPHILIPPINE LITERATURE DURING PRE-COLONIAL PERIOD
PHILIPPINE LITERATURE DURING PRE-COLONIAL PERIOD
 
Humss trends, networks, and critical thinking in the 21st century culture cg 1
Humss trends, networks, and critical thinking in the 21st century culture cg 1Humss trends, networks, and critical thinking in the 21st century culture cg 1
Humss trends, networks, and critical thinking in the 21st century culture cg 1
 
Pre colonial philippine literature
Pre colonial philippine literaturePre colonial philippine literature
Pre colonial philippine literature
 
What Is The Difference Between A Fad And A Trend
What Is The Difference Between A Fad And A TrendWhat Is The Difference Between A Fad And A Trend
What Is The Difference Between A Fad And A Trend
 

Similar to Trend Analysis

Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Pe...
Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Pe...Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Pe...
Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Pe...Artificial Intelligence Institute at UofSC
 
Twitris - Web Information System 2011 Course
Twitris - Web Information System 2011 Course Twitris - Web Information System 2011 Course
Twitris - Web Information System 2011 Course Ashutosh Jadhav
 
Social Media and Scientific Research How Semantic Technologies Enhance Colla...
Social Media and Scientific ResearchHow Semantic Technologies Enhance Colla...Social Media and Scientific ResearchHow Semantic Technologies Enhance Colla...
Social Media and Scientific Research How Semantic Technologies Enhance Colla...Darrell W. Gunter
 
"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant Reading"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant ReadingShalin Hai-Jew
 
Enhancing Soft Power: using cyberspace to enhance Soft Power
Enhancing Soft Power: using cyberspace to enhance Soft PowerEnhancing Soft Power: using cyberspace to enhance Soft Power
Enhancing Soft Power: using cyberspace to enhance Soft PowerAmit Sheth
 
EDF2013: Big Data Tutorial: Marko Grobelnik
EDF2013: Big Data Tutorial: Marko GrobelnikEDF2013: Big Data Tutorial: Marko Grobelnik
EDF2013: Big Data Tutorial: Marko GrobelnikEuropean Data Forum
 
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...Amit Sheth
 
Search, Signals & Sense: An Analytics Fueled Vision
Search, Signals & Sense: An Analytics Fueled VisionSearch, Signals & Sense: An Analytics Fueled Vision
Search, Signals & Sense: An Analytics Fueled VisionSeth Grimes
 
Big Data Tutorial - Marko Grobelnik - 25 May 2012
Big Data Tutorial - Marko Grobelnik - 25 May 2012Big Data Tutorial - Marko Grobelnik - 25 May 2012
Big Data Tutorial - Marko Grobelnik - 25 May 2012Marko Grobelnik
 
Hashtag Conversations, Eventgraphs, and User Ego Neighborhoods: Extracting...
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods:  Extracting...Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods:  Extracting...
Hashtag Conversations, Eventgraphs, and User Ego Neighborhoods: Extracting...learjk
 
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods: Extracting So...
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods:  Extracting So...Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods:  Extracting So...
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods: Extracting So...Shalin Hai-Jew
 
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...Artificial Intelligence Institute at UofSC
 
final_nlp
final_nlpfinal_nlp
final_nlpaphex34
 
Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden Benjamin Good
 
KASW'08 - Invited Talk
KASW'08 - Invited TalkKASW'08 - Invited Talk
KASW'08 - Invited TalkRalf Klamma
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Digital Methods Initiative
 
On the Origins of Memes by Means of Fringe Web Communities - Invited talk ta ...
On the Origins of Memes by Means of Fringe Web Communities - Invited talk ta ...On the Origins of Memes by Means of Fringe Web Communities - Invited talk ta ...
On the Origins of Memes by Means of Fringe Web Communities - Invited talk ta ...Savvas Zannettou
 
Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016Stanford University
 
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data TutorialESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorialeswcsummerschool
 

Similar to Trend Analysis (20)

Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Pe...
Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Pe...Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Pe...
Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Pe...
 
Twitris - Web Information System 2011 Course
Twitris - Web Information System 2011 Course Twitris - Web Information System 2011 Course
Twitris - Web Information System 2011 Course
 
Hendrickson data2 2012-gnip
Hendrickson data2 2012-gnipHendrickson data2 2012-gnip
Hendrickson data2 2012-gnip
 
Social Media and Scientific Research How Semantic Technologies Enhance Colla...
Social Media and Scientific ResearchHow Semantic Technologies Enhance Colla...Social Media and Scientific ResearchHow Semantic Technologies Enhance Colla...
Social Media and Scientific Research How Semantic Technologies Enhance Colla...
 
"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant Reading"Mass Surveillance" through Distant Reading
"Mass Surveillance" through Distant Reading
 
Enhancing Soft Power: using cyberspace to enhance Soft Power
Enhancing Soft Power: using cyberspace to enhance Soft PowerEnhancing Soft Power: using cyberspace to enhance Soft Power
Enhancing Soft Power: using cyberspace to enhance Soft Power
 
EDF2013: Big Data Tutorial: Marko Grobelnik
EDF2013: Big Data Tutorial: Marko GrobelnikEDF2013: Big Data Tutorial: Marko Grobelnik
EDF2013: Big Data Tutorial: Marko Grobelnik
 
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
 
Search, Signals & Sense: An Analytics Fueled Vision
Search, Signals & Sense: An Analytics Fueled VisionSearch, Signals & Sense: An Analytics Fueled Vision
Search, Signals & Sense: An Analytics Fueled Vision
 
Big Data Tutorial - Marko Grobelnik - 25 May 2012
Big Data Tutorial - Marko Grobelnik - 25 May 2012Big Data Tutorial - Marko Grobelnik - 25 May 2012
Big Data Tutorial - Marko Grobelnik - 25 May 2012
 
Hashtag Conversations, Eventgraphs, and User Ego Neighborhoods: Extracting...
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods:  Extracting...Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods:  Extracting...
Hashtag Conversations, Eventgraphs, and User Ego Neighborhoods: Extracting...
 
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods: Extracting So...
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods:  Extracting So...Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods:  Extracting So...
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods: Extracting So...
 
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
 
final_nlp
final_nlpfinal_nlp
final_nlp
 
Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden
 
KASW'08 - Invited Talk
KASW'08 - Invited TalkKASW'08 - Invited Talk
KASW'08 - Invited Talk
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
 
On the Origins of Memes by Means of Fringe Web Communities - Invited talk ta ...
On the Origins of Memes by Means of Fringe Web Communities - Invited talk ta ...On the Origins of Memes by Means of Fringe Web Communities - Invited talk ta ...
On the Origins of Memes by Means of Fringe Web Communities - Invited talk ta ...
 
Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016
 
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data TutorialESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
 

More from Hossein Fani

ECIR23: A Streaming Approach to Neural Team Formation Training
ECIR23: A Streaming Approach to Neural Team Formation TrainingECIR23: A Streaming Approach to Neural Team Formation Training
ECIR23: A Streaming Approach to Neural Team Formation TrainingHossein Fani
 
SEKE15: An ontology for describing security events
SEKE15: An ontology for describing security eventsSEKE15: An ontology for describing security events
SEKE15: An ontology for describing security eventsHossein Fani
 
ECIR20: Temporal Latent Space Modeling for Community Prediction
ECIR20: Temporal Latent Space Modeling for Community PredictionECIR20: Temporal Latent Space Modeling for Community Prediction
ECIR20: Temporal Latent Space Modeling for Community PredictionHossein Fani
 
CIKM17: temporally like-minded user community identification through neural ...
CIKM17: temporally like-minded user community identification through  neural ...CIKM17: temporally like-minded user community identification through  neural ...
CIKM17: temporally like-minded user community identification through neural ...Hossein Fani
 
CIKM AnalytiCup 2017: Bagging Model for Product Title Quality with Noise
CIKM AnalytiCup 2017: Bagging Model for Product Title Quality with NoiseCIKM AnalytiCup 2017: Bagging Model for Product Title Quality with Noise
CIKM AnalytiCup 2017: Bagging Model for Product Title Quality with NoiseHossein Fani
 
WSDM16: Temporal Formation and Evolution of Online Communities
WSDM16: Temporal Formation and Evolution of Online CommunitiesWSDM16: Temporal Formation and Evolution of Online Communities
WSDM16: Temporal Formation and Evolution of Online CommunitiesHossein Fani
 
Latent Community Analysis: PhD Proposal
Latent Community Analysis: PhD ProposalLatent Community Analysis: PhD Proposal
Latent Community Analysis: PhD ProposalHossein Fani
 
Moviesion: Content-based Movie Recommender Fueled by Linked Open Data
Moviesion: Content-based Movie Recommender Fueled by Linked Open DataMoviesion: Content-based Movie Recommender Fueled by Linked Open Data
Moviesion: Content-based Movie Recommender Fueled by Linked Open DataHossein Fani
 
Exploratory Social Network Analysis with Pajek: Blockmodels
Exploratory Social Network Analysis with Pajek: BlockmodelsExploratory Social Network Analysis with Pajek: Blockmodels
Exploratory Social Network Analysis with Pajek: BlockmodelsHossein Fani
 
Exploratory Social Network Analysis: Ranking
Exploratory Social Network Analysis: RankingExploratory Social Network Analysis: Ranking
Exploratory Social Network Analysis: RankingHossein Fani
 
Exploratory Social Network Analysis with Pajek: Diffusion
Exploratory Social Network Analysis with Pajek: DiffusionExploratory Social Network Analysis with Pajek: Diffusion
Exploratory Social Network Analysis with Pajek: DiffusionHossein Fani
 
Exploratory Social Network Analysis with Pajek: Center & Periphery
Exploratory Social Network Analysis with Pajek: Center & PeripheryExploratory Social Network Analysis with Pajek: Center & Periphery
Exploratory Social Network Analysis with Pajek: Center & PeripheryHossein Fani
 
Exploratory Social Network Analysis with Pajek: Sentiments & Friendship
Exploratory Social Network Analysis with Pajek: Sentiments & FriendshipExploratory Social Network Analysis with Pajek: Sentiments & Friendship
Exploratory Social Network Analysis with Pajek: Sentiments & FriendshipHossein Fani
 
Exploratory Social Network Analysis with Pajek: Attributes & Relations
Exploratory Social Network Analysis with Pajek: Attributes & RelationsExploratory Social Network Analysis with Pajek: Attributes & Relations
Exploratory Social Network Analysis with Pajek: Attributes & RelationsHossein Fani
 
Ontology Engineering
Ontology EngineeringOntology Engineering
Ontology EngineeringHossein Fani
 
Philosophical Software Developing
Philosophical Software DevelopingPhilosophical Software Developing
Philosophical Software DevelopingHossein Fani
 

More from Hossein Fani (18)

ECIR23: A Streaming Approach to Neural Team Formation Training
ECIR23: A Streaming Approach to Neural Team Formation TrainingECIR23: A Streaming Approach to Neural Team Formation Training
ECIR23: A Streaming Approach to Neural Team Formation Training
 
SEKE15: An ontology for describing security events
SEKE15: An ontology for describing security eventsSEKE15: An ontology for describing security events
SEKE15: An ontology for describing security events
 
ECIR20: Temporal Latent Space Modeling for Community Prediction
ECIR20: Temporal Latent Space Modeling for Community PredictionECIR20: Temporal Latent Space Modeling for Community Prediction
ECIR20: Temporal Latent Space Modeling for Community Prediction
 
CIKM17: temporally like-minded user community identification through neural ...
CIKM17: temporally like-minded user community identification through  neural ...CIKM17: temporally like-minded user community identification through  neural ...
CIKM17: temporally like-minded user community identification through neural ...
 
CIKM AnalytiCup 2017: Bagging Model for Product Title Quality with Noise
CIKM AnalytiCup 2017: Bagging Model for Product Title Quality with NoiseCIKM AnalytiCup 2017: Bagging Model for Product Title Quality with Noise
CIKM AnalytiCup 2017: Bagging Model for Product Title Quality with Noise
 
WSDM16: Temporal Formation and Evolution of Online Communities
WSDM16: Temporal Formation and Evolution of Online CommunitiesWSDM16: Temporal Formation and Evolution of Online Communities
WSDM16: Temporal Formation and Evolution of Online Communities
 
Latent Community Analysis: PhD Proposal
Latent Community Analysis: PhD ProposalLatent Community Analysis: PhD Proposal
Latent Community Analysis: PhD Proposal
 
Moviesion: Content-based Movie Recommender Fueled by Linked Open Data
Moviesion: Content-based Movie Recommender Fueled by Linked Open DataMoviesion: Content-based Movie Recommender Fueled by Linked Open Data
Moviesion: Content-based Movie Recommender Fueled by Linked Open Data
 
Exploratory Social Network Analysis with Pajek: Blockmodels
Exploratory Social Network Analysis with Pajek: BlockmodelsExploratory Social Network Analysis with Pajek: Blockmodels
Exploratory Social Network Analysis with Pajek: Blockmodels
 
Exploratory Social Network Analysis: Ranking
Exploratory Social Network Analysis: RankingExploratory Social Network Analysis: Ranking
Exploratory Social Network Analysis: Ranking
 
Exploratory Social Network Analysis with Pajek: Diffusion
Exploratory Social Network Analysis with Pajek: DiffusionExploratory Social Network Analysis with Pajek: Diffusion
Exploratory Social Network Analysis with Pajek: Diffusion
 
Exploratory Social Network Analysis with Pajek: Center & Periphery
Exploratory Social Network Analysis with Pajek: Center & PeripheryExploratory Social Network Analysis with Pajek: Center & Periphery
Exploratory Social Network Analysis with Pajek: Center & Periphery
 
Exploratory Social Network Analysis with Pajek: Sentiments & Friendship
Exploratory Social Network Analysis with Pajek: Sentiments & FriendshipExploratory Social Network Analysis with Pajek: Sentiments & Friendship
Exploratory Social Network Analysis with Pajek: Sentiments & Friendship
 
Exploratory Social Network Analysis with Pajek: Attributes & Relations
Exploratory Social Network Analysis with Pajek: Attributes & RelationsExploratory Social Network Analysis with Pajek: Attributes & Relations
Exploratory Social Network Analysis with Pajek: Attributes & Relations
 
Temporal Network
Temporal NetworkTemporal Network
Temporal Network
 
Ontology Engineering
Ontology EngineeringOntology Engineering
Ontology Engineering
 
Software Test
Software TestSoftware Test
Software Test
 
Philosophical Software Developing
Philosophical Software DevelopingPhilosophical Software Developing
Philosophical Software Developing
 

Recently uploaded

Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptxAlMamun560346
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...Lokesh Kothari
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
American Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptxAmerican Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptxabhishekdhamu51
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Servicenishacall1
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 

Recently uploaded (20)

Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
American Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptxAmerican Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptx
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 

Trend Analysis

  • 2. TREND DETECTION, TRACKING & TRANSITION in Social Networks 1. Definition & General Idea 2. Web Samples in Trend Hunting 3. Detection Approches 4. Architecture: TwitterMonitor 5. Detection: MemeTracker 6. Classification: ExoEndo SemioNet: Semantic Social Network Analysis
  • 3. REFERENCES Mathioudakis, Michael, and Nick Koudas. "Twittermonitor: trend detection over the twitter stream." Proceedings of the 2010 ACM SIGMOD International Conference on Management of data. ACM, 2010. Leskovec, Jure, Lars Backstrom, and Jon Kleinberg. "Meme-tracking and the dynamics of the news cycle." Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2009. Naaman, Mor, Hila Becker, and Luis Gravano. "Hip and trendy: Characterizing emerging trends on Twitter." Journal of the American Society for Information Science and Technology 62.5 (2011): 902- 918. Becker, Hila, Mor Naaman, and Luis Gravano. "Beyond Trending Topics: Real-World Event Identification on Twitter." ICWSM 11 (2011): 438-441.
  • 4. Trend Analysis The Science of Studying Changes in Social Patterns, Including Fashion, Technology & Consumer Behavior Horizontal Analysis The General Movement over TIME of a Statistically Detectable Change Fundamentally, a Method for Understanding HOW & WHY Things have Changed – or will Change – over TIME
  • 6. APPROCH Text Mining Topic Ident. & Clust. "Kilroy was here" was a piece of graffiti that became popular in the 1940s, and existed under various names in different countries, illustrating how a meme can be modified through replication Memes (/ˈmiːm/) is "an idea, behavior, or style that spreads from person to person within a culture.“ … through writing, speech, gestures, rituals, or other imitable phenomena with a mimicked theme. … cultural analogues to genes in that they self-replicate, mutate, and respond to selective pressures.
  • 7. GroupBurst: Assesses Co-occurrences One-pass of Bursty Real-time Keyword in Recent Tweets Adjustable against spam Theoretically sound! Adjustable against SPURIOUS Bursts. Coincidental Burst of Keyword over a short period of time Context Extraction Algorithms (PCA, SVD) & Grapevine’s Entity Extractor to Add more 271 Million Monthly Active Users 500 Million Tweets (140 ch) Per Day 78% Active Users on Mobile 77% Accounts Outside U.S. Supports 35+ languages
  • 8. MemeTracking News Cycle Tracking News Evolution Quotes & Memes Integral Part of Journalistic Practice Travel Relatively Intact with Mutational Variants Clustering by Graph
  • 9. Item: Each News Article/Blog Post Phrase: A Quoted String Occurs in Items MemeTracking …
  • 10. Phrase Graph DAG |P| < |Q| “senseless killing” “enough of senseless killing” “Hear our voice. We have had enough of this senseless killing” Directed Edit Distance(P, Q) < δ Word Consecutive Overlap(P, Q) > k P  Q 푊푃,푄 ∝ 1 퐷푖푟푒푐푡푒푑 퐸푑푖푡 퐷푖푠푡푎푛푐푒(푃,푄) ∝ 푇표푡푎푙 푁푢푚푏푒푟 표푓 푄 푖푛 퐶표푟푝푢푠 MemeTracking …
  • 11. Phrase Clusters Directed Acyclic Graph (DAG) Partitioning Given a Weighted DAG, Delete a Set of Edges of Min Total Weight So That Each of the Resulting Components is Single-Rooted. NP-hard Heuristic 1.Start from the Roots 2.Down the DAG & greedily Assigns each Node to the Cluster to which it has the most Edges MemeTracking …
  • 13. Result Volume Distribution Dataset 3 Months Aug 1 to Oct 31 2008 ~ 1M Docs per Day from 1.65 Million Sites! 47M Phrases, 22M Distinct 9H Clustering Process Time 35, 800 Non-trivial Clusters (at least two phrases) MemeTracking …
  • 15. Other Findings Time lag between the news media and blogs 푓 푛푗 훿 푡 − 푡푗 푛푗 = Number of Item Previously Written for Cluster j 푡 = 푡ℎ푒 푐푢푟푟푒푛푡 푡푖푚푒 푡푗 = 푡ℎ푒 푡푖푚푒 푤ℎ푒푛 푗 푤푎푠 푓푖푟푠푡 푝푟표푑푢푐푒푑 푅푒푐푒푛푐푦 → 훿 푖푠 푚표푛표푡표푛푖푐푎푙푙푦 푑푒푐푟푒푎푠푖푛푔 푖푛 푡 − 푡푗 퐼푚푖푡푎푡푖표푛 → 푓 푖푠 푚표푛표푡표푛푖푐푎푙푙푦 푖푛푐푟푒푎푠푖푛푔 푖푛 푛푗, 푓(0) > 0 푡 → 0−: 푎 = 0.076 푡 → 0+: 푎 = 0.092 푡 → 0−: 푏 = 1.77 푡 → 0+: 푏 = 2.15 Quotes migrating from blogs to news media: 3.5% Each Cluster Modeling the news trend Imitation≠Recency MemeTracking …
  • 16.
  • 17. Characterizing Trends “trends in trend data.”  Meta Trend Taxonomy of the trends Key Distinguishing Features of Trends Not only the Textual Content Social Network Structure Ties Geographic Action  Retweet, Reply, Mention, Hashtag
  • 18. Trends Exogenous Broadcast-media Broadcast of local media “fight” (boxing event) “Ravens” (football game) Broadcast of global/national media “Kanye”(KanyeWest acts up at the MTVVideo MusicAwards) “Lost Finale” (series finale of Lost). Global News Breaking “earthquake” (Chile earthquake) “Tsunami” (HawaiiTsunamiwarning) “Beyoncé”(Beyoncé cancels Malaysia concert). Nonbreaking “HCR” (health care reform) “Tiger” (Tiger Woods apologizes) “iPad” (toward thelaunch of Apple’s popular device). National Holidays & Memorial Days “Halloween,” “Valentine’s.” Local Participatory & Physical Planned “marathon,” “superbowl” (Super Bowl viewing parties) “patrick’s” (St. Patrick’s Day Parade). Unplanned “rainy,” “snow.” Endogenous Memes #in2010 (in December 2009, users imagine their near future) “November” (users marking the beginning of the month on November 1) Retweets Fan Community Activities “2pac” (the anniversary of the death of hip-hop artist Tupac Shakur). Characterizing Trends …
  • 19. Trends from twitter.com Trends from Simple Trend Detector Trends for Quality Analysis  Supervised Categories Trends for Computing Features Tquantity Ttwitter Tterm freq. Tquality Characterizing Trends …
  • 20. Content Features •Average number of words/characters •Proportion of messages with URLs, unique URLs, with hashtags ex/including trend terms •Top unique hashtag? •Similarity to centroid Interaction Features • Proportion of retweets, replies, mentions Time-based Features • Exponential fit head, tail • Logarithmic fit head, tail Participation Features • Messages per author • Proportion of messages from top author • Proportion of messages from top 10% of authors Social Network Features •Level of reciprocity •Maximal eigenvector centrality •Maximal degree centrality •Transitivity •Density •Average component size Characterizing Trends …
  • 21. Content features: Exo higher URLs, smaller hashtags Exogenous vs. Endogenous Trends Interaction features: Exo fewer retweets, similar number of replies Time features: Exo different for the head period before the trend peak but will exhibit similar time features in the tail period after the trend peak, compared to endogenous trends. Social network features: Exo fewer connections, less reciprocity 1.1 1.2 1.3 1.4 Characterizing Trends …
  • 23. IDEA Automatic Categorization of Trends Photography Trend  Selfie Image Trust Trend  Trustful Users, Trustful Twits Untrendy People! Users Counteract the trends

Editor's Notes

  1. Vertical Analysis: Financial Managers Set One Accounting Item as the Benchmark & Compare other Items with the Numerical Standard In contrast with Horizontal Analysis: Study of Performance Trends over Time Short Intermediate Long Past Now Future
  2. Automatic trend detection over the twitter stream
  3. distinctive phrases that travel relatively intact through on-line text; developing scalable algorithms for clustering textual variants of such phrases, we identify a broad class of memes that exhibit wide spread mutation. As a result, a central computational challenge in this approach is to find robust ways of extracting and identifying all the mutational variants of each of these distinctive phrases, and to group them together.
  4. Words as Tokens This latter dependence is important, since we particularly wish to preserve edges (p, q) when the inclusion of p in q is supported by many occurrences of q.
  5. Collections of Phrases Deemed to be Close Textual Variants of One Another
  6. CCDF: Complementary Cumulative Distribution Function If the quantity of interest is power-law distributed with exponent γ, p(x) ∝ x−γ, then when plotted on log-log axes the CCDF will be a straight line with slope −(γ + 1). the tail is much heavier This means that variants of popular phrases, like “lipstick on a pig,” are much more “stickier” than what would be expected from overall phrase volume distribution. Popular phrases have many variants and each of them appears more frequently than an “average” phrase.
  7. To put a “lipstick on a pig”(does not make it a lady) is a rhetorical expression used to convey the message that making superficial or cosmetic changes is a futile attempt to disguise the true nature of a product اگر زري بپوشي، اگر اطلس بپوشي، همون کنگر فروشي بزک
  8. focus on the 1,000 threads with the largest total volumes (i.e. the largest number of mentions). Thread volume in blogs reaches its peak typically 2.5 hours after the peak thread volume in the news sources. Thread volume in news sources increases slowly but decrease quickly, while in blogs the increase is rapid and decrease much slower.
  9. reflect an ever-updating real-time live image of our society.
  10. Exogenous Trends • Broadcast-media events: ◦ Broadcast of local media events: “fight” (boxing event), “Ravens” (football game). ◦ Broadcast of global/national media events: “Kanye”(KanyeWest acts up at the MTVVideo MusicAwards),“Lost Finale” (series finale of Lost). • Global news events: ◦ Breaking news events: “earthquake” (Chile earthquake),“Tsunami” (HawaiiTsunamiwarning), “Beyoncé”(Beyoncé cancels Malaysia concert). ◦ Nonbreaking news events: “HCR” (health care reform),“Tiger” (Tiger Woods apologizes), “iPad” (toward thelaunch of Apple’s popular device). • National holidays and memorial days: “Halloween,” “Valentine’s.” • Local participatory and physical events: ◦ Planned events: “marathon,” “superbowl” (Super Bowl viewing parties), “patrick’s” (St. Patrick’s Day Parade). ◦ Unplanned events: “rainy,” “snow.” Endogenous Trends • Memes: #in2010 (in December 2009, users imagine their near future), “November” (users marking the beginning of the month on November 1) • Retweets (users “forwarding” en masse a single tweet from a popular user): “determination” (users retweeting LL Cool J’s post about said concept). • Fan community activities: “2pac” (the anniversary of the death of hip-hop artist Tupac Shakur).
  11. Breaking News vs. Other Exogenous Trends H2.1: Interaction features of breaking events will be different than those of other exogenous trends, with more retweets (forwarding), but fewer replies (conversation). H2.2: Time features of breaking events will be different for the head period, showing more rapid growth, and a better fit to the functions’ curve (i.e., less noise) compared to other exogenous trends. H2.3: Social network features of breaking events will be different than those of other exogenous trends. Local Events vs. Other Exogenous Trends H3.1: Content features of local events will be different than those of other exogenous trends. H3.2: Interaction features of local events will be different than those of other exogenous trends; in particular, local events will have more replies (conversation). H3.3: Time features of local events will be different than those of other exogenous trends. H3.4: Social network features of local events will be different than those of other exogenous trends; in particular, local events will have denser networks, more connectivity, and higher reciprocity. Memes vs. Retweet Endogenous Trends H4.1: Content features of memes will be different than those of retweet trends. H4.2: Interaction features of memes will be different than those of retweet trends; in particular, retweet trends will have significantly more retweet (forwarding) messages (this hypothesis is included as a “sanity check” since the retweet trends are defined by having a large proportion of retweets). H4.3: Time features of memes will be different than those of retweet trends. H4.4: Participation features of memes will be different than those of retweet trends. H4.5: Social network features of memes will be different than those of retweet trends; in particular, meme trends will have more connectivity and higher reciprocity than retweet trends.