SlideShare a Scribd company logo
1 of 30
Community Analysis
Using Graph Representation Learning
On Social Networks
Marco Brambilla and Mattia Gasparini
Politecnico di Milano
Introduction
• Development of platforms such as Instagram and
Facebook increased levels of interaction among
people
• Variety of social networks data exploited to map
users behavior
• Graphs perfectly fit for modeling all the
interactions of these users
2
Problem Statement
• Analysis of communities on on-line social
networks, applying machine learning on graphs
• Representation learning is used to extract valuable
information about users inside the community
• Classification of consumer and business users
• Grouping of similar users
3
Representation Learning
• Define a continuos representation for each node of the
graph (embedding) to easily apply machine learning
techniques on graphs
• Embeddings are based on neighbourhood nodes:
4
u
u :
Node2vec
• Emeddings computations performed using
node2vec algorithm[1], included in the Stanford
Network Analysis Platform (SNAP) library
• The algorithm calculates the embeddings solving an
optimization problem:
max
𝑓
𝑢 ∈𝑉
log Pr(𝑁𝑠(𝑢)|𝑓 𝑢 )
5
[1] Grover and Leskovec. 2016. node2vec: Scalable Feature Learning for Networks.
Node2vec
6
OutputInput
Node2vec
algorithms
calculates
embeddings such
that similarities
between graph
nodes and vectors
are preserved.
Case Study
• Emerging Italian fashion brand: Emporio Le Sirenuse
• Products: luxury swimsuits and dresses
• Case study is focused on the brand, its competitors
and their communities, defined as the set of
followers users on social network
7
http://www.fashiondatasensing.polimi.it/
Related Work
• Users’ communities defined using graph’s structural
properties [himelboim2017, deeb2017, guerrero2017]
• Brand-related communities have a specific role,
with business strategies as final target [ramadan2018,
kim2014, campbell2014]
• Fashion brands gain major advantages from social
media [brambilla2017, schmidt2017]
8
Analysis Pipeline
9
The proposed solution defines a method to handle all the steps of the analysis.
1 – Data Collection
• Web scraping of 10 brands and their followers data
from Instagram
• Time window: from 1 𝑠𝑡
January 2017 to 1 𝑠𝑡
November 2017
• Final database : 400K users, 10M posts
10
2 – Graph Construction
• Graphs are built using several entities: users that we
want to analyze (𝑈𝑡), their posts (𝑃), hashtags
referenced in the posts (𝐻) and mentioned users (𝑈 𝑚)
• Symmetrically, three different types of edges are
defined:
o 𝐸 𝑜𝑤𝑛𝑒𝑟 = 𝑒1, 𝑒2 𝑒1 ∈ 𝑈𝑡, 𝑒2 ∈ 𝑃}
o 𝐸𝑡𝑎𝑔 = 𝑒1, 𝑒2 𝑒1 ∈ 𝑃, 𝑒2 ∈ 𝑇}
o 𝐸 𝑚𝑒𝑛𝑡𝑖𝑜𝑛 = 𝑒1, 𝑒2 𝑒1 ∈ 𝑃, 𝑒2 ∈ 𝑈 𝑚}
11
2 – Graph Construction
• Three graph models are used for the analysis:
1. Mixed network: 𝐺 𝑀 = 𝑈, 𝑃, 𝑇 , 𝐸 𝑜𝑤𝑛𝑒𝑟, 𝐸𝑡𝑎𝑔, 𝐸 𝑚𝑒𝑛𝑡𝑖𝑜𝑛
2. Hashtags network: 𝐺ℎ = 𝑈𝑡, 𝑃, 𝑇 , 𝐸 𝑜𝑤𝑛𝑒𝑟, 𝐸𝑡𝑎𝑔
3. Mentions network: 𝐺 𝑚 = 𝑈𝑡, 𝑈 𝑚, 𝑃 , 𝐸 𝑜𝑤𝑛𝑒𝑟, 𝐸 𝑚𝑒𝑛𝑡𝑖𝑜𝑛
• 𝐺ℎ and 𝐺 𝑚 are subgraphs of 𝐺 𝑀: they map the
influence of specific social media aspects
12
Example Hashtags
Network
13
The central part of the graph features
the most connected nodes, which
correspond to the users that
have many hashtags in common.
3 – Graph Reduction
• A reduction process is applied to 𝐺ℎ and 𝐺 𝑚 to obtain «classical» social
networks, where the nodes are the users and the edges are weighted
based on the number of shared entities:
𝑤𝑖𝑗 =
𝑡𝑖 ∩ 𝑡𝑗 , 𝑖𝑓 𝑖, 𝑗 ∈ 𝐺ℎ
𝑚𝑖 ∩ 𝑚𝑗 , 𝑖𝑓 𝑖, 𝑗 ∈ 𝐺 𝑚
where 𝑖, 𝑗 ⊂ 𝑈𝑡, 𝑡𝑖,𝑗 ⊆ 𝑇, 𝑚𝑖,𝑗 ⊆ 𝑈 𝑚
• 𝐺ℎ and 𝐺 𝑚, the reduced hashtags and reduced mentions networks, are
generated
14
Reduced Graph
Example
15
Reduced mentions
network 𝐺 𝑚: edges
are weighted based
on number of
common mentioned
users.
4 – Features Extraction
• Both heterogeneous networks 𝐺ℎ,𝑚 and reduced
networks 𝐺ℎ,𝑚 are used to extract the embeddings
• Feature vectors dimension is fixed for the two types
of networks: 𝑑 𝐺 = 8 and 𝑑 𝐺 = 4, respectively.
• Hyper-parameter tuning for 𝑝 and 𝑞 in supervised
setting
16
5 – Classification
• Domain specific task:
«Discriminate between consumer and non-consumer
users»
• Ground-truth of 351 labelled users defined with
domain experts
• Three features set are tested:
• Social media account data(#followers, #following,
#posts, bio)
• Complete network embeddings
• Reduced network embeddings
17
5 – Classification Experiment
18
Description of the user is valuable if a good fraction of the neighborhood
is exploited, which is not always feasible for complete networks.
5 – Classification Experiment on Reduced Networks
19
Performance and number of classified users increase with the number of user nodes
included in the model, even if they are not classified: they enrich the neighborhood and, by
consequence, the features vector.
6 – Clustering
• Hashtags reduced networks 𝐺ℎ used as proxy to
content-based similarity
• K-means is applied on extracted features vectors
• Focus on 𝐺ℎ of Emporio Le Sirenuse community
20
6 – Clustering
Network Input
21
Hashtags Reduced
Network 𝐺ℎ of
Emporio Le Sirenuse
community.
6 – Clustering Features
22
Embeddings extracted from the
network.
First two features components
are used for visualization.
6 – Clustering Output
23
K selection: plot of inertia
against number of clusters
6 – Output
Network
24
Application of
clustering output to
the reduced network
6 – Cluster Validation: Domain Experts
• Domain experts are provided with a subset of users for each
cluster
• Manual inspection of user profiles, providing feedback
about the patterns present in each cluster
25
6 – Cluster Validation: Experts Feedback
• Cluster 0, 1 and 2 very well defined: professionals
users, such as showrooms and other brands
• Cluster 3 contains regular users that share contents
about holidays in Italy
• Clusters 3, 4, 5 and 6 composed mostly by regular
users, too
26
6 – Cluster Labels
27
Cluster labels extracted using the set of hashtags shared at least by two users inside the
cluster.
28
FOOD
LUXURY
HIPSTER
INTERNATIONAL
INTERIOR
DESIGN
VINTAGE
ITALIAN HOLIDAYS
6 – Final Result
Conclusion
• Results:
• Definition of an effective method to analyze
communities inside social network domain
• Modeling of user similarities through network features
• Detection of content-driven sub-communities
• Future work:
• Inclusion of time variable
29
Questions?
Contacts:
Marco Brambilla: marco.brambilla@polimi.it
Mattia Gasparini: mattia.gasparini@polimi.it
@marcobrambi @datascience_mi
http://www.fashiondatasensing.polimi.it/
http://datascience.deib.polimi.it

More Related Content

What's hot

Mining social data
Mining social dataMining social data
Mining social dataMalk Zameth
 
The Basics of Social Network Analysis
The Basics of Social Network AnalysisThe Basics of Social Network Analysis
The Basics of Social Network AnalysisRory Sie
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisSujoy Bag
 
LAK13 Tutorial Social Network Analysis 4 Learning Analytics
LAK13 Tutorial Social Network Analysis 4 Learning AnalyticsLAK13 Tutorial Social Network Analysis 4 Learning Analytics
LAK13 Tutorial Social Network Analysis 4 Learning Analyticsgoehnert
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Xiaohan Zeng
 
Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011guillaume ereteo
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social NetworksKent State University
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Denis Parra Santander
 
Social Network Visualization 101
Social Network Visualization 101Social Network Visualization 101
Social Network Visualization 101librarianrafia
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...Jeromy Anglim
 
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHIBig Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHIRuchika Sharma
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisFred Stutzman
 
Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...Michael Steurer
 
Data mining for social media
Data mining for social mediaData mining for social media
Data mining for social mediarangesharp
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collectiondnac
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network AnalysisMarc Smith
 
Big social data analytics - social network analysis
Big social data analytics - social network analysis Big social data analytics - social network analysis
Big social data analytics - social network analysis Jari Jussila
 

What's hot (20)

Mining social data
Mining social dataMining social data
Mining social data
 
The Basics of Social Network Analysis
The Basics of Social Network AnalysisThe Basics of Social Network Analysis
The Basics of Social Network Analysis
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
LAK13 Tutorial Social Network Analysis 4 Learning Analytics
LAK13 Tutorial Social Network Analysis 4 Learning AnalyticsLAK13 Tutorial Social Network Analysis 4 Learning Analytics
LAK13 Tutorial Social Network Analysis 4 Learning Analytics
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
 
Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social Networks
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
 
Social Network Visualization 101
Social Network Visualization 101Social Network Visualization 101
Social Network Visualization 101
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...
 
Social Network Analysis (SNA)
Social Network Analysis (SNA)Social Network Analysis (SNA)
Social Network Analysis (SNA)
 
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHIBig Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
 
Social network analysis
Social network analysisSocial network analysis
Social network analysis
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...
 
Data mining for social media
Data mining for social mediaData mining for social media
Data mining for social media
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collection
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis
 
Big social data analytics - social network analysis
Big social data analytics - social network analysis Big social data analytics - social network analysis
Big social data analytics - social network analysis
 

Similar to Community analysis using graph representation learning on social networks

Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...Lippo Group Digital
 
Scalable Similarity-Based Neighborhood Methods with MapReduce
Scalable Similarity-Based Neighborhood Methods with MapReduceScalable Similarity-Based Neighborhood Methods with MapReduce
Scalable Similarity-Based Neighborhood Methods with MapReducesscdotopen
 
Business intelligence and data warehousing
Business intelligence and data warehousingBusiness intelligence and data warehousing
Business intelligence and data warehousingVaishnavi
 
Community Structure-based Audience Expansion for Digital Advertising
Community Structure-based Audience Expansion for Digital AdvertisingCommunity Structure-based Audience Expansion for Digital Advertising
Community Structure-based Audience Expansion for Digital AdvertisingEunjae Kim
 
User Behavior Hashing for Audience Expansion
User Behavior Hashing for Audience ExpansionUser Behavior Hashing for Audience Expansion
User Behavior Hashing for Audience ExpansionDatabricks
 
20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworks20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworkstm1966
 
Social Network Analysis Using Gephi
Social Network Analysis Using Gephi Social Network Analysis Using Gephi
Social Network Analysis Using Gephi Goa App
 
Social Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network ContextSocial Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network ContextIRJET Journal
 
Mohan C R CV
Mohan C R CVMohan C R CV
Mohan C R CVMOHAN C R
 
Provenance Analytics at AAAI Human Computation Conference 2013
Provenance Analytics at AAAI Human Computation Conference 2013Provenance Analytics at AAAI Human Computation Conference 2013
Provenance Analytics at AAAI Human Computation Conference 2013T Dong Huynh
 
Algorithm in Social network of graph and social network analysis
Algorithm in Social network of graph and social network analysisAlgorithm in Social network of graph and social network analysis
Algorithm in Social network of graph and social network analysisoliviaclark2905
 
Online news popularity analysis
Online news popularity analysisOnline news popularity analysis
Online news popularity analysisAnkur Vora
 
Analyzing rich club behavior in open source projects
Analyzing rich club behavior in open source projectsAnalyzing rich club behavior in open source projects
Analyzing rich club behavior in open source projectsMarco Brambilla
 
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Charalampos Chelmis
 
Social media community using optimized algorithm by M. Gomathi / Lecturer
Social media community using optimized algorithm by M. Gomathi / LecturerSocial media community using optimized algorithm by M. Gomathi / Lecturer
Social media community using optimized algorithm by M. Gomathi / Lecturergomathi chlm
 
Machine Learning Applications in Credit Risk
Machine Learning Applications in Credit RiskMachine Learning Applications in Credit Risk
Machine Learning Applications in Credit RiskQuantUniversity
 
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...GUANGYUAN PIAO
 
CAD theory presentation.pptx .
CAD theory presentation.pptx                .CAD theory presentation.pptx                .
CAD theory presentation.pptx .Athar739197
 

Similar to Community analysis using graph representation learning on social networks (20)

Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
Profiler for Smartphone Users Interests Using Modified Hierarchical Agglomera...
 
Scalable Similarity-Based Neighborhood Methods with MapReduce
Scalable Similarity-Based Neighborhood Methods with MapReduceScalable Similarity-Based Neighborhood Methods with MapReduce
Scalable Similarity-Based Neighborhood Methods with MapReduce
 
Business intelligence and data warehousing
Business intelligence and data warehousingBusiness intelligence and data warehousing
Business intelligence and data warehousing
 
Community Structure-based Audience Expansion for Digital Advertising
Community Structure-based Audience Expansion for Digital AdvertisingCommunity Structure-based Audience Expansion for Digital Advertising
Community Structure-based Audience Expansion for Digital Advertising
 
User Behavior Hashing for Audience Expansion
User Behavior Hashing for Audience ExpansionUser Behavior Hashing for Audience Expansion
User Behavior Hashing for Audience Expansion
 
20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworks20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworks
 
Multimedia Mining
Multimedia Mining Multimedia Mining
Multimedia Mining
 
Social Network Analysis Using Gephi
Social Network Analysis Using Gephi Social Network Analysis Using Gephi
Social Network Analysis Using Gephi
 
Social Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network ContextSocial Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network Context
 
Mohan C R CV
Mohan C R CVMohan C R CV
Mohan C R CV
 
Provenance Analytics at AAAI Human Computation Conference 2013
Provenance Analytics at AAAI Human Computation Conference 2013Provenance Analytics at AAAI Human Computation Conference 2013
Provenance Analytics at AAAI Human Computation Conference 2013
 
Algorithm in Social network of graph and social network analysis
Algorithm in Social network of graph and social network analysisAlgorithm in Social network of graph and social network analysis
Algorithm in Social network of graph and social network analysis
 
Online news popularity analysis
Online news popularity analysisOnline news popularity analysis
Online news popularity analysis
 
Analyzing rich club behavior in open source projects
Analyzing rich club behavior in open source projectsAnalyzing rich club behavior in open source projects
Analyzing rich club behavior in open source projects
 
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
 
Social media community using optimized algorithm by M. Gomathi / Lecturer
Social media community using optimized algorithm by M. Gomathi / LecturerSocial media community using optimized algorithm by M. Gomathi / Lecturer
Social media community using optimized algorithm by M. Gomathi / Lecturer
 
Machine Learning Applications in Credit Risk
Machine Learning Applications in Credit RiskMachine Learning Applications in Credit Risk
Machine Learning Applications in Credit Risk
 
ruSMART 2013 presentation
ruSMART 2013 presentationruSMART 2013 presentation
ruSMART 2013 presentation
 
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
 
CAD theory presentation.pptx .
CAD theory presentation.pptx                .CAD theory presentation.pptx                .
CAD theory presentation.pptx .
 

More from Marco Brambilla

M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...Marco Brambilla
 
Thesis Topics and Proposals @ Polimi Data Science Lab - 2023 - prof. Brambill...
Thesis Topics and Proposals @ Polimi Data Science Lab - 2023 - prof. Brambill...Thesis Topics and Proposals @ Polimi Data Science Lab - 2023 - prof. Brambill...
Thesis Topics and Proposals @ Polimi Data Science Lab - 2023 - prof. Brambill...Marco Brambilla
 
Hierarchical Transformers for User Semantic Similarity - ICWE 2023
Hierarchical Transformers for User Semantic Similarity - ICWE 2023Hierarchical Transformers for User Semantic Similarity - ICWE 2023
Hierarchical Transformers for User Semantic Similarity - ICWE 2023Marco Brambilla
 
Exploring the Bi-verse. A trip across the digital and physical ecospheres
Exploring the Bi-verse.A trip across the digital and physical ecospheresExploring the Bi-verse.A trip across the digital and physical ecospheres
Exploring the Bi-verse. A trip across the digital and physical ecospheresMarco Brambilla
 
Trigger.eu: Cocteau game for policy making - introduction and demo
Trigger.eu: Cocteau game for policy making - introduction and demoTrigger.eu: Cocteau game for policy making - introduction and demo
Trigger.eu: Cocteau game for policy making - introduction and demoMarco Brambilla
 
Generation of Realistic Navigation Paths for Web Site Testing using RNNs and ...
Generation of Realistic Navigation Paths for Web Site Testing using RNNs and ...Generation of Realistic Navigation Paths for Web Site Testing using RNNs and ...
Generation of Realistic Navigation Paths for Web Site Testing using RNNs and ...Marco Brambilla
 
Analysis of On-line Debate on Long-Running Political Phenomena. The Brexit C...
Analysis of On-line Debate on Long-Running Political Phenomena.The Brexit C...Analysis of On-line Debate on Long-Running Political Phenomena.The Brexit C...
Analysis of On-line Debate on Long-Running Political Phenomena. The Brexit C...Marco Brambilla
 
Available Data Science M.Sc. Thesis Proposals
Available Data Science M.Sc. Thesis Proposals Available Data Science M.Sc. Thesis Proposals
Available Data Science M.Sc. Thesis Proposals Marco Brambilla
 
Driving Style and Behavior Analysis based on Trip Segmentation over GPS Info...
Driving Style and Behavior Analysis based on Trip Segmentation over GPS  Info...Driving Style and Behavior Analysis based on Trip Segmentation over GPS  Info...
Driving Style and Behavior Analysis based on Trip Segmentation over GPS Info...Marco Brambilla
 
Myths and challenges in knowledge extraction and analysis from human-generate...
Myths and challenges in knowledge extraction and analysis from human-generate...Myths and challenges in knowledge extraction and analysis from human-generate...
Myths and challenges in knowledge extraction and analysis from human-generate...Marco Brambilla
 
Harvesting Knowledge from Social Networks: Extracting Typed Relationships amo...
Harvesting Knowledge from Social Networks: Extracting Typed Relationships amo...Harvesting Knowledge from Social Networks: Extracting Typed Relationships amo...
Harvesting Knowledge from Social Networks: Extracting Typed Relationships amo...Marco Brambilla
 
Model-driven Development of User Interfaces for IoT via Domain-specific Comp...
Model-driven Development of  User Interfaces for IoT via Domain-specific Comp...Model-driven Development of  User Interfaces for IoT via Domain-specific Comp...
Model-driven Development of User Interfaces for IoT via Domain-specific Comp...Marco Brambilla
 
A Model-Based Method for Seamless Web and Mobile Experience. Splash 2016 conf.
A Model-Based Method for  Seamless Web and Mobile Experience. Splash 2016 conf.A Model-Based Method for  Seamless Web and Mobile Experience. Splash 2016 conf.
A Model-Based Method for Seamless Web and Mobile Experience. Splash 2016 conf.Marco Brambilla
 
Big Data and Stream Data Analysis at Politecnico di Milano
Big Data and Stream Data Analysis at Politecnico di MilanoBig Data and Stream Data Analysis at Politecnico di Milano
Big Data and Stream Data Analysis at Politecnico di MilanoMarco Brambilla
 
Web Science. An introduction
Web Science. An introductionWeb Science. An introduction
Web Science. An introductionMarco Brambilla
 
On the Quest for Changing Knowledge. Capturing emerging entities from social ...
On the Quest for Changing Knowledge. Capturing emerging entities from social ...On the Quest for Changing Knowledge. Capturing emerging entities from social ...
On the Quest for Changing Knowledge. Capturing emerging entities from social ...Marco Brambilla
 
Studying Multicultural Diversity of Cities and Neighborhoods through Social M...
Studying Multicultural Diversity of Cities and Neighborhoods through Social M...Studying Multicultural Diversity of Cities and Neighborhoods through Social M...
Studying Multicultural Diversity of Cities and Neighborhoods through Social M...Marco Brambilla
 
Model driven software engineering in practice book - Chapter 9 - Model to tex...
Model driven software engineering in practice book - Chapter 9 - Model to tex...Model driven software engineering in practice book - Chapter 9 - Model to tex...
Model driven software engineering in practice book - Chapter 9 - Model to tex...Marco Brambilla
 
Model driven software engineering in practice book - chapter 7 - Developing y...
Model driven software engineering in practice book - chapter 7 - Developing y...Model driven software engineering in practice book - chapter 7 - Developing y...
Model driven software engineering in practice book - chapter 7 - Developing y...Marco Brambilla
 
Automatic code generation for cross platform, multi-device mobile apps. An in...
Automatic code generation for cross platform, multi-device mobile apps. An in...Automatic code generation for cross platform, multi-device mobile apps. An in...
Automatic code generation for cross platform, multi-device mobile apps. An in...Marco Brambilla
 

More from Marco Brambilla (20)

M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
 
Thesis Topics and Proposals @ Polimi Data Science Lab - 2023 - prof. Brambill...
Thesis Topics and Proposals @ Polimi Data Science Lab - 2023 - prof. Brambill...Thesis Topics and Proposals @ Polimi Data Science Lab - 2023 - prof. Brambill...
Thesis Topics and Proposals @ Polimi Data Science Lab - 2023 - prof. Brambill...
 
Hierarchical Transformers for User Semantic Similarity - ICWE 2023
Hierarchical Transformers for User Semantic Similarity - ICWE 2023Hierarchical Transformers for User Semantic Similarity - ICWE 2023
Hierarchical Transformers for User Semantic Similarity - ICWE 2023
 
Exploring the Bi-verse. A trip across the digital and physical ecospheres
Exploring the Bi-verse.A trip across the digital and physical ecospheresExploring the Bi-verse.A trip across the digital and physical ecospheres
Exploring the Bi-verse. A trip across the digital and physical ecospheres
 
Trigger.eu: Cocteau game for policy making - introduction and demo
Trigger.eu: Cocteau game for policy making - introduction and demoTrigger.eu: Cocteau game for policy making - introduction and demo
Trigger.eu: Cocteau game for policy making - introduction and demo
 
Generation of Realistic Navigation Paths for Web Site Testing using RNNs and ...
Generation of Realistic Navigation Paths for Web Site Testing using RNNs and ...Generation of Realistic Navigation Paths for Web Site Testing using RNNs and ...
Generation of Realistic Navigation Paths for Web Site Testing using RNNs and ...
 
Analysis of On-line Debate on Long-Running Political Phenomena. The Brexit C...
Analysis of On-line Debate on Long-Running Political Phenomena.The Brexit C...Analysis of On-line Debate on Long-Running Political Phenomena.The Brexit C...
Analysis of On-line Debate on Long-Running Political Phenomena. The Brexit C...
 
Available Data Science M.Sc. Thesis Proposals
Available Data Science M.Sc. Thesis Proposals Available Data Science M.Sc. Thesis Proposals
Available Data Science M.Sc. Thesis Proposals
 
Driving Style and Behavior Analysis based on Trip Segmentation over GPS Info...
Driving Style and Behavior Analysis based on Trip Segmentation over GPS  Info...Driving Style and Behavior Analysis based on Trip Segmentation over GPS  Info...
Driving Style and Behavior Analysis based on Trip Segmentation over GPS Info...
 
Myths and challenges in knowledge extraction and analysis from human-generate...
Myths and challenges in knowledge extraction and analysis from human-generate...Myths and challenges in knowledge extraction and analysis from human-generate...
Myths and challenges in knowledge extraction and analysis from human-generate...
 
Harvesting Knowledge from Social Networks: Extracting Typed Relationships amo...
Harvesting Knowledge from Social Networks: Extracting Typed Relationships amo...Harvesting Knowledge from Social Networks: Extracting Typed Relationships amo...
Harvesting Knowledge from Social Networks: Extracting Typed Relationships amo...
 
Model-driven Development of User Interfaces for IoT via Domain-specific Comp...
Model-driven Development of  User Interfaces for IoT via Domain-specific Comp...Model-driven Development of  User Interfaces for IoT via Domain-specific Comp...
Model-driven Development of User Interfaces for IoT via Domain-specific Comp...
 
A Model-Based Method for Seamless Web and Mobile Experience. Splash 2016 conf.
A Model-Based Method for  Seamless Web and Mobile Experience. Splash 2016 conf.A Model-Based Method for  Seamless Web and Mobile Experience. Splash 2016 conf.
A Model-Based Method for Seamless Web and Mobile Experience. Splash 2016 conf.
 
Big Data and Stream Data Analysis at Politecnico di Milano
Big Data and Stream Data Analysis at Politecnico di MilanoBig Data and Stream Data Analysis at Politecnico di Milano
Big Data and Stream Data Analysis at Politecnico di Milano
 
Web Science. An introduction
Web Science. An introductionWeb Science. An introduction
Web Science. An introduction
 
On the Quest for Changing Knowledge. Capturing emerging entities from social ...
On the Quest for Changing Knowledge. Capturing emerging entities from social ...On the Quest for Changing Knowledge. Capturing emerging entities from social ...
On the Quest for Changing Knowledge. Capturing emerging entities from social ...
 
Studying Multicultural Diversity of Cities and Neighborhoods through Social M...
Studying Multicultural Diversity of Cities and Neighborhoods through Social M...Studying Multicultural Diversity of Cities and Neighborhoods through Social M...
Studying Multicultural Diversity of Cities and Neighborhoods through Social M...
 
Model driven software engineering in practice book - Chapter 9 - Model to tex...
Model driven software engineering in practice book - Chapter 9 - Model to tex...Model driven software engineering in practice book - Chapter 9 - Model to tex...
Model driven software engineering in practice book - Chapter 9 - Model to tex...
 
Model driven software engineering in practice book - chapter 7 - Developing y...
Model driven software engineering in practice book - chapter 7 - Developing y...Model driven software engineering in practice book - chapter 7 - Developing y...
Model driven software engineering in practice book - chapter 7 - Developing y...
 
Automatic code generation for cross platform, multi-device mobile apps. An in...
Automatic code generation for cross platform, multi-device mobile apps. An in...Automatic code generation for cross platform, multi-device mobile apps. An in...
Automatic code generation for cross platform, multi-device mobile apps. An in...
 

Recently uploaded

BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 

Recently uploaded (20)

BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 

Community analysis using graph representation learning on social networks

  • 1. Community Analysis Using Graph Representation Learning On Social Networks Marco Brambilla and Mattia Gasparini Politecnico di Milano
  • 2. Introduction • Development of platforms such as Instagram and Facebook increased levels of interaction among people • Variety of social networks data exploited to map users behavior • Graphs perfectly fit for modeling all the interactions of these users 2
  • 3. Problem Statement • Analysis of communities on on-line social networks, applying machine learning on graphs • Representation learning is used to extract valuable information about users inside the community • Classification of consumer and business users • Grouping of similar users 3
  • 4. Representation Learning • Define a continuos representation for each node of the graph (embedding) to easily apply machine learning techniques on graphs • Embeddings are based on neighbourhood nodes: 4 u u :
  • 5. Node2vec • Emeddings computations performed using node2vec algorithm[1], included in the Stanford Network Analysis Platform (SNAP) library • The algorithm calculates the embeddings solving an optimization problem: max 𝑓 𝑢 ∈𝑉 log Pr(𝑁𝑠(𝑢)|𝑓 𝑢 ) 5 [1] Grover and Leskovec. 2016. node2vec: Scalable Feature Learning for Networks.
  • 7. Case Study • Emerging Italian fashion brand: Emporio Le Sirenuse • Products: luxury swimsuits and dresses • Case study is focused on the brand, its competitors and their communities, defined as the set of followers users on social network 7 http://www.fashiondatasensing.polimi.it/
  • 8. Related Work • Users’ communities defined using graph’s structural properties [himelboim2017, deeb2017, guerrero2017] • Brand-related communities have a specific role, with business strategies as final target [ramadan2018, kim2014, campbell2014] • Fashion brands gain major advantages from social media [brambilla2017, schmidt2017] 8
  • 9. Analysis Pipeline 9 The proposed solution defines a method to handle all the steps of the analysis.
  • 10. 1 – Data Collection • Web scraping of 10 brands and their followers data from Instagram • Time window: from 1 𝑠𝑡 January 2017 to 1 𝑠𝑡 November 2017 • Final database : 400K users, 10M posts 10
  • 11. 2 – Graph Construction • Graphs are built using several entities: users that we want to analyze (𝑈𝑡), their posts (𝑃), hashtags referenced in the posts (𝐻) and mentioned users (𝑈 𝑚) • Symmetrically, three different types of edges are defined: o 𝐸 𝑜𝑤𝑛𝑒𝑟 = 𝑒1, 𝑒2 𝑒1 ∈ 𝑈𝑡, 𝑒2 ∈ 𝑃} o 𝐸𝑡𝑎𝑔 = 𝑒1, 𝑒2 𝑒1 ∈ 𝑃, 𝑒2 ∈ 𝑇} o 𝐸 𝑚𝑒𝑛𝑡𝑖𝑜𝑛 = 𝑒1, 𝑒2 𝑒1 ∈ 𝑃, 𝑒2 ∈ 𝑈 𝑚} 11
  • 12. 2 – Graph Construction • Three graph models are used for the analysis: 1. Mixed network: 𝐺 𝑀 = 𝑈, 𝑃, 𝑇 , 𝐸 𝑜𝑤𝑛𝑒𝑟, 𝐸𝑡𝑎𝑔, 𝐸 𝑚𝑒𝑛𝑡𝑖𝑜𝑛 2. Hashtags network: 𝐺ℎ = 𝑈𝑡, 𝑃, 𝑇 , 𝐸 𝑜𝑤𝑛𝑒𝑟, 𝐸𝑡𝑎𝑔 3. Mentions network: 𝐺 𝑚 = 𝑈𝑡, 𝑈 𝑚, 𝑃 , 𝐸 𝑜𝑤𝑛𝑒𝑟, 𝐸 𝑚𝑒𝑛𝑡𝑖𝑜𝑛 • 𝐺ℎ and 𝐺 𝑚 are subgraphs of 𝐺 𝑀: they map the influence of specific social media aspects 12
  • 13. Example Hashtags Network 13 The central part of the graph features the most connected nodes, which correspond to the users that have many hashtags in common.
  • 14. 3 – Graph Reduction • A reduction process is applied to 𝐺ℎ and 𝐺 𝑚 to obtain «classical» social networks, where the nodes are the users and the edges are weighted based on the number of shared entities: 𝑤𝑖𝑗 = 𝑡𝑖 ∩ 𝑡𝑗 , 𝑖𝑓 𝑖, 𝑗 ∈ 𝐺ℎ 𝑚𝑖 ∩ 𝑚𝑗 , 𝑖𝑓 𝑖, 𝑗 ∈ 𝐺 𝑚 where 𝑖, 𝑗 ⊂ 𝑈𝑡, 𝑡𝑖,𝑗 ⊆ 𝑇, 𝑚𝑖,𝑗 ⊆ 𝑈 𝑚 • 𝐺ℎ and 𝐺 𝑚, the reduced hashtags and reduced mentions networks, are generated 14
  • 15. Reduced Graph Example 15 Reduced mentions network 𝐺 𝑚: edges are weighted based on number of common mentioned users.
  • 16. 4 – Features Extraction • Both heterogeneous networks 𝐺ℎ,𝑚 and reduced networks 𝐺ℎ,𝑚 are used to extract the embeddings • Feature vectors dimension is fixed for the two types of networks: 𝑑 𝐺 = 8 and 𝑑 𝐺 = 4, respectively. • Hyper-parameter tuning for 𝑝 and 𝑞 in supervised setting 16
  • 17. 5 – Classification • Domain specific task: «Discriminate between consumer and non-consumer users» • Ground-truth of 351 labelled users defined with domain experts • Three features set are tested: • Social media account data(#followers, #following, #posts, bio) • Complete network embeddings • Reduced network embeddings 17
  • 18. 5 – Classification Experiment 18 Description of the user is valuable if a good fraction of the neighborhood is exploited, which is not always feasible for complete networks.
  • 19. 5 – Classification Experiment on Reduced Networks 19 Performance and number of classified users increase with the number of user nodes included in the model, even if they are not classified: they enrich the neighborhood and, by consequence, the features vector.
  • 20. 6 – Clustering • Hashtags reduced networks 𝐺ℎ used as proxy to content-based similarity • K-means is applied on extracted features vectors • Focus on 𝐺ℎ of Emporio Le Sirenuse community 20
  • 21. 6 – Clustering Network Input 21 Hashtags Reduced Network 𝐺ℎ of Emporio Le Sirenuse community.
  • 22. 6 – Clustering Features 22 Embeddings extracted from the network. First two features components are used for visualization.
  • 23. 6 – Clustering Output 23 K selection: plot of inertia against number of clusters
  • 24. 6 – Output Network 24 Application of clustering output to the reduced network
  • 25. 6 – Cluster Validation: Domain Experts • Domain experts are provided with a subset of users for each cluster • Manual inspection of user profiles, providing feedback about the patterns present in each cluster 25
  • 26. 6 – Cluster Validation: Experts Feedback • Cluster 0, 1 and 2 very well defined: professionals users, such as showrooms and other brands • Cluster 3 contains regular users that share contents about holidays in Italy • Clusters 3, 4, 5 and 6 composed mostly by regular users, too 26
  • 27. 6 – Cluster Labels 27 Cluster labels extracted using the set of hashtags shared at least by two users inside the cluster.
  • 29. Conclusion • Results: • Definition of an effective method to analyze communities inside social network domain • Modeling of user similarities through network features • Detection of content-driven sub-communities • Future work: • Inclusion of time variable 29
  • 30. Questions? Contacts: Marco Brambilla: marco.brambilla@polimi.it Mattia Gasparini: mattia.gasparini@polimi.it @marcobrambi @datascience_mi http://www.fashiondatasensing.polimi.it/ http://datascience.deib.polimi.it

Editor's Notes

  1. Good morning, today I am going to present our research work about community analysis using graph representation learning on social networks.
  2. Starting point is that modern social networks such as Instagram and Facebook increased exponentially the number of interactions among people. That variety of data can be exploited to map user behavior. Data itself can perfectly fit to a graph model, capturing users interactions.
  3. Our purpose is to analyze communities on on-line social networks, applying innovative machine learning techniques on graphs. In the specific, we want to apply representation learning on graphs to describe users inside communities: two main tasks have been developed, one that classifies users in consumers and non-consumers, the other that extracts subgroups of similar users. .
  4. Just a brief mention to the technique: representation learning is a technique that defines a continuos features vector for each node of the graph, referred to as embedding. The embeddings are learnt with different strategies: as one possible example, focusing on a specific node u, we can exploit local neighbors, the blue nodes in the picture, to learn the feature vectors of u.
  5. Many algorithms are able to perform this operation and we chose node2vec that provides a very flexible technique. It computes the embedding f(u) of a node u using the following equation. It maximizes log-probability of observing a network neighborhood 𝑁 𝑠 (𝑢) for a node 𝑢 conditioned on its feature representation, given by 𝑓.
  6. We can see how it works here: the intuition is that nodes near in the graph are also near in vector space.
  7. The scenario that we take as case study is about an emerging Italian fashion brand, Emporio Le Sirenuse: this brand is located in Positano, near Neaples, andit mainly produces women luxury swimsuits and dresses. The work focuses on the community of the brand, defined as the set of its Instagram followers: the idea is that brands can get valuable insights about the specific interests of its followers, and in its way better targeting their products and marketing campaigns.
  8. 1st group: community detection on social networks is quite well-known domain, but network structure is not really exploited. 2° group: Analysis of users reaction to brand network marketing, as well as content sharing indicator of brand turst and community commitment 3° group: Instagram is a visual social network that has high potential for fashion brands, that have in visual aspects their main feature
  9. We defined an analysis pipeline and now I will go into the details of each step.
  10. As step 1, data is gathered using web scraping to collect posts of the brands and their followers from Instagram, in a time window that spans from January 2017 to November 2017. We collected around 400K users and 10M posts.
  11. Second step is the definition of the graph model. We consider as entities users, posts and hashtags and then we define three sets of edges: one connects users to the posts produced, while the other two connect posts to references entities, hashtags and mentions.
  12. The heterogeneous graph 𝐺 𝑀 contains all the entities and relationships. From this graph, two subgraphs are extracted: the hashtags network and the mentions network, that map two important aspects of social media interactions.
  13. As an example, this is a hashtags graph built from gathered data: green nodes are the users (the ones with more connections), blue nodes are posts and hashtags are in orange. The important fact is that users that have many hashtags in common are concetrated in the centre of the network, so their are «near».
  14. In a further step, a graph reduction is applied to previously presented graphs to obtain homogeneous networks, where only users nodes are present. The reduced graphs are weighted as well, where the weight is based on the number of common entities, either hashtags or mentioned users. In this way, 𝐺 ℎ and 𝐺 𝑚 are generated.
  15. In this example, you can see a reduced mentions network: edges connect each user to the ones that he or she mentioned and number of mentions, the weight, is mapped to a color, from low (blue), to high (red).
  16. Embeddings are extracted both for heterogeneous networks and reduced networks. Number of dimensions for the output vectors are fixed a-priori: it is set to 8 for heterogeneous networks (that are bigger) and to 4 for reduced networks. Instead, main parameters of the algorithm p and q are selected via hyper-parameter tuning.
  17. Classification step is defined as to prove the effectiveness of features in our domain: we want to disciminate between consumers and non-consumers users on Instagram. To do so, we manually labelled a set of users with the help of domain experts of Politecnico di Milano fashion department. Then, a classifier is implemented to test three set of features: social media quantitative features are used as a baseline, compared against features extracted from complete and reduced networks.
  18. Results of first experiment are shown in this table: it is possible to see that reduced network features perform better than complete network ones and than the social media baseline, too. This is given by the fact that, given a fixed computational power, reduced networks are smaller and so the neighborhood is easier to be exploited. On the other way, they are able to encode the main dynamics useful for our purpose.
  19. Given first experiment results, we performed a second experiment on reduced networks only. In this experiment, ground truth network is enlarged using a set of additional non-labelled users taken from followers of different brands: results show that the more users are included the richer is the neighborhood of labeled users and so the performance increases.
  20. As the second task, we want to exploit the features to extract new subgroups of users from the community of the brand, defined as the set of its followers. So, focus is on the community of Emporio Sirenuse, using the reduced hashtags graph as a proxy of content description.
  21. This is the real reduced network over which we run our analysis: each node is a user and the edges connect pair of users that shared same hashtags.
  22. We extract the embeddings of this graph: a 2-d visualization, using the first two components, is presented. We use a standard parametrization of the algorithm (p=1, q=0.5), that allows to exploit the local neighborhood.
  23. We run K-means over this set of features: K is selected using inertia as structural validation metric. These are the 7 clusters obtained, as well as the plot of inertia with respect to K.
  24. The output network is presented, with colors associated with clusters.
  25. Clustering needs external measures to validate the results: for this reason, we provided domain experts with a subset of users for each cluster. They manually inspected the social media profile of each user, providing feedback about presence of patterns inside clusters. The lists are ordered by distance from centroid, which is used as similarity quality measure
  26. The insights are simple but quite interesting: Cluster 0, 1 and 2 are users that share very specific contents, such as interior desing or food: they are mainly professional profiles, such as showrooms or brands. Cluster 3 is very well defined, too, but it contains regular users: they share contents about holidays in Italy, which matches with brand identity. Clusters 4, 5 and 6 contain regular users with broader contents.
  27. As additional validation, we provided a way to label each cluster: we compute the list of cluster hashtags T(c) as the set of hashtags shared by at least two users inside the cluster (e.g.: hashtags that increase the weights and/or connections inside the cluster). Then, label is defined as the top 10 hashtags by frequency belonging to this list: these lists are presented in the table, showing a consistent labeling with previous validation (e.g.: cluster 3 use hashtags related to italian vacation, cluster 0 about luxury accessories, cluster 1 about food, …) 𝑇 𝑐 = 𝑢, 𝑣 ∈𝑐 𝑡 𝑢 ∩ 𝑡 𝑣
  28. What we obtain as final result of clustering is a segmentation of users that can be used by brand to better target their marketing campaigns [or to make other collaborations, (e.g.: luxury(0), food (1) and interior design (2) clusters are professionals)].
  29. As final conclusion, in this work we defined an effective method to characterize users inside online communities: users are described using features extracted from their network representation and we are able to use these features to solve domain-specific classification tasks, as well as defining subgroups of users based on shared interests. In this analysis, time variable is missing and graphs are built using a single snapshot of all the data: having time-varying graphs could potentially capture more fine-grained patterns.