SlideShare a Scribd company logo
1 of 19
Inferring Peer Centrality
in Socially-Informed P2P Systems
Nicolas Kourtellis, Adriana Iamnitchi
Department of Computer Science & Engineering
University of South Florida
Tampa, USA
11th IEEE International Conference on Peer-to-Peer Computing
Kyoto, Japan, 2011
Socially-aware Applications
 Applications collect and use social information:
 Location, collocation, history of interactions, etc.
 Build (implicit/explicit) social network of users
 Use: reduce spam, provide recommendations, etc.
 Wide range of system architectures
 How does the social network of users affect the load
in a P2P architecture?
2
Decentralization of user social data
• MobiClique
• Yarta
• ...
• PeerSoN
• LifeSocial.KOM
• Safebook
• Prometheus
• …
Social Graphs & P2P Networks
 Users connected with application-specific edges
 User-contributed peers form a P2P network
 User social graph is partitioned into subgraphs &
stored on peers
Questions:
 How do applications traverse a distributed social graph?
 What does it mean for the P2P routing? 3
 Invite user G’s 2-hop hiking contacts to a trip
 Social graph traversals => many P2P lookups
 Application performance affected by projection
of social graph on peers
Application Example
4
=> 1-hop={B, C, E} 2-hops={A, D, F, I}
 How do the properties of the projection graph compare with
the properties of the social graph projected?
Projection Graph
5
Projection
Graph (PG)
P2P Overlay
Social
Graph (SG)
Projection Graph Model
 Uses:
 Study properties of peers such as centrality
 Study how the social graph topology affects P2P
routing & system performance 6
Social Graph SG = (V,E)
V=set of users, E=set of social edges
Projection Graph PG = (VP
,EP
)
VP
=set of peers, EP
=set of P2P edges
PV
(i) = set of users mapped on peer Pi
, Pi
Î VP
(Pi
,Pj
) Î EP
iff $ a Î PV
(i), $ b Î PV
(j) s.t. (a,b) Î E
w(Pi
,Pj
) = (a,b) Î E |a Î PV
(i), b Î PV
(j){ }
7
Outline
 Motivation
 Projection Graph Model
 Social Network Centrality Metrics
 Degree Centrality
 Node Betweenness Centrality
 Edge Betweenness Centrality
 Centrality Calculation: Limitations
 Experimental Questions
 Experimental Methodology
 Experimental Results
 Impacts on Applications & Systems
A
B
C
D
EF
G
H
IJ
K
L
M
N
O
 Number of edges of a node
 High degree centrality peers: Network Hubs
 Can be targeted to directly influence many other
peers with a message broadcast or distribute a
search query
Degree Centrality
8
A
B
C
D
EF
G
H
IJ
K
M
N
O
Node Betweenness Centrality
 Measures the extent to which a node lies on the
shortest path between two other nodes
 High betweenness centrality peers: Control
communication between distant peers
 Can host data caches for reduced latency to locate
data
9
A
B
CD
EF
G
H
I
J
K
L
M
N
O
Edge Betweenness Centrality
 Measures the extent to which an edge lies on the
shortest path between two nodes
 High betweenness centrality edges: Connect
distant parts of P2P network
 Can be monitored to block malware traffic
10
Calculating Peer Centrality
 Challenging because of:
 Limited access to user data (e.g., privacy settings)
 P2P network scale
 Peer churn
 Through experimental analysis on the social and
projection graph, we investigate how to
circumvent these limitations
11
Experimental Questions
 Can we approximate the centrality of peers using
the centrality scores of their users?
 How does the number of users storing data per
peer affect the centrality scores of their peers?
 Social graph is less dynamic than the P2P network
 Calculate infrequently centrality score of users & use it
to estimate their peer’s centrality
Spoiler Alert!
 [1, ~150] users/peer: Can estimate degree &
betweenness centrality of peers with good
accuracy
 Above 150 users/peer: The projection graph
becomes highly connected => peers do not
differentiate in centrality 12
 Naturally-formed communities offer incentives for resource
sharing  1 community subgraph mapped per peer
 Projection graphs generated from 5 real social graphs
 Communities detected via recursive Louvain algorithm*
 Varied average community size: 5,10,20,…,1000 users/peer
 Calculate correlation of centralities of users and their peers
 Compare average centralities of users and their peers
 Identify top centrality peers from their users’ scores
Experimental Methodology
13
Social Network Users Edges
gnutella04 10,876 39,994
gnutella31 62,561 147,878
enron 33,696 180,811
epinions 75,877 405,739
slashdot 82,168 504,230
*V. D. Blondel et al, “Fast unfolding of communities in large networks”,
Journal of Statistical Mechanics: Theory and Experiment, vol. 10, 2008.
Correlation of Centrality Scores
 [1-150] users/peer:
 Projection graph resembles
closely social graph
 Highest correlation of social &
projection graph metrics
 Degree & node betweenness
estimated from local
information (cumulative scores)
14
0
0.2
0.4
0.6
0.8
1
1 10 100 1000
DegreeCentralityCorrelation
Users/Peer (a)
gnutella04
enron
gnutella31
epinions
slashdot
0
0.2
0.4
0.6
0.8
1
1 10 100 1000NodeBetweennessCentralityCorrelation
Users/Peer (b)
gnutella04
enron
gnutella31
epinions
slashdot
0
0.2
0.4
0.6
0.8
1
1 10 100 1000
EdgeBetweennessCentralityCorrelation
Users/Peer (c)
gnutella04
enron
gnutella31
epinions
slashdot
 After 150 users/peer:
 Projection graph topology
loses social properties
 Highly connected network
 Peers participate equally
in graph traversal
Users/Peer
vs.
Degree
Users/Peer
vs.
Node Betweenness
Users/Peer
vs.
Edge Betweenness
Comparison of Centrality Scores
 Increase number of users/peer  turning point in
projection graph
 More connections with other peers
 increase peer degree & betweenness to maximum
 More social edges within peers
 decrease edge betweenness to minimum
15
1e-05
0.0001
0.001
0.01
0.1
1
1 10 100 1000
DegreeCentrality
Users/Peer (a)
gnutella04_CDCU
gnutella04_DCP
enron_CDCU
enron_DCP
gnutella31_CDCU
gnutella31_DCP
epinions_CDCU
epinions_DCP
slashdot_CDCU
slashdot_DCP
1e-05
0.0001
0.001
0.01
0.1
1
1 10 100 1000
NodeBetweennessCentrality
Users/Peer (b)
gnutella04_CNBCU
gnutella04_NBCP
enron_CNBCU
enron_NBCP
gnutella31_CNBCU
gnutella31_NBCP
epinions_CNBCU
epinions_NBCP
slashdot_CNBCU
slashdot_NBCP
1e-11
1e-10
1e-09
1e-08
1e-07
1e-06
1e-05
0.0001
0.001
0.01
1 10 100 1000
EdgeBetweennessCentrality
Users/Peer (c)
gnutella04_CEBCU
gnutella04_EBCP
enron_CEBCU
enron_EBCP
gnutella31_CEBCU
gnutella31_EBCP
epinions_CEBCU
epinions_EBCP
slashdot_CEBCU
slashdot_EBCP
Users/Peer
vs.
Degree
Users/Peer
vs.
Node Betweenness
Users/Peer
Vs.
Edge Betweenness
Finding High Betweenness Peers
 Placing data caches on high betweenness peers
can reduce latency to locate data
 Can we identify such peers, knowing the top
betweenness users or communities?
 Top 5% betweenness centrality users => top betweenness
centrality peers with 80–90% accuracy 16
0
0.2
0.4
0.6
0.8
1
1 10 100 1000
PeerOverlap
Users/Peer (Method 1)
1%
5%
10%
1 10 100 1000
Users/Peer (Method 2)
1%
5%
10%
Users/Peer Users/Peer
With Top-N% users With Top-N% communities
Summary of Findings
 [1, ~150] users/peer:
 Projection graph resembles closely social graph
 Highest correlation of social & projection graph metrics
 Degree & node betweenness can be estimated from
local information (cumulative scores of users)
 Cannot estimate well edge betweenness
 Above 150 users/peer:
 Projection graph topology loses social properties
 A highly connected projection graph
 No differentiation in peer centrality
 Top betweenness centrality users can pinpoint the top
betweenness centrality peers with good accuracy
 Overall: Applications can calculate infrequently
centrality score of users to estimate peer centrality
 Social graph changes slowly compared to P2P network 17
Impact on Applications & Systems
 Target high degree peers to:
 Decrease search time
 Increase breadth of search and diversity of results
 Target high betweenness peers to:
 Monitor information flow and collect traces
 Place data caches and indexes of data location
 Quarantine malware outbursts
 Disseminate software patches
 Tackle P2P churn
 Predict centrality of peers to allocate resources
 Reduce overlay overhead
 Enhance routing tables with P2P edges for faster &
more secure peer discovery
18
19
Thank you!
This work was supported by NSF Grants:
CNS 0952420 and CNS 0831785
http://www.cse.usf.edu/dsg/
nkourtel@mail.usf.edu

More Related Content

What's hot

Node XL - features and demo
Node XL - features and demoNode XL - features and demo
Node XL - features and demoMayank Mohan
 
Presentation rutgers
Presentation rutgersPresentation rutgers
Presentation rutgersGuido Borà
 
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
Mining and analyzing social media   part 2 - hicss47 tutorial - dave kingMining and analyzing social media   part 2 - hicss47 tutorial - dave king
Mining and analyzing social media part 2 - hicss47 tutorial - dave kingDave King
 
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELINGDYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELINGAndry Alamsyah
 
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPINGIMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPINGijp2p
 
sos a distributed mobile q&a system based on social networks
sos a distributed mobile q&a system based on social networkssos a distributed mobile q&a system based on social networks
sos a distributed mobile q&a system based on social networksswathi78
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisFred Stutzman
 
Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...Michael Steurer
 
Current trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networksCurrent trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networkseSAT Publishing House
 
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKSAPPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKSIJwest
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social NetworksKent State University
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collectiondnac
 
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...syeda yasmeen
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisSujoy Bag
 
Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011guillaume ereteo
 
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...1crore projects
 

What's hot (20)

Node XL - features and demo
Node XL - features and demoNode XL - features and demo
Node XL - features and demo
 
Presentation rutgers
Presentation rutgersPresentation rutgers
Presentation rutgers
 
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
Mining and analyzing social media   part 2 - hicss47 tutorial - dave kingMining and analyzing social media   part 2 - hicss47 tutorial - dave king
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
 
05 Whole Network Descriptive Stats
05 Whole Network Descriptive Stats05 Whole Network Descriptive Stats
05 Whole Network Descriptive Stats
 
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELINGDYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
DYNAMIC LARGE SCALE DATA ON TWITTER USING SENTIMENT ANALYSIS AND TOPIC MODELING
 
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPINGIMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
IMPROVING HYBRID REPUTATION MODEL THROUGH DYNAMIC REGROUPING
 
presentation29
presentation29presentation29
presentation29
 
sos a distributed mobile q&a system based on social networks
sos a distributed mobile q&a system based on social networkssos a distributed mobile q&a system based on social networks
sos a distributed mobile q&a system based on social networks
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...Predicting Social Interactions from Different Sources of Location-based Knowl...
Predicting Social Interactions from Different Sources of Location-based Knowl...
 
Current trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networksCurrent trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networks
 
Social network analysis
Social network analysisSocial network analysis
Social network analysis
 
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKSAPPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
APPLICATION OF CLUSTERING TO ANALYZE ACADEMIC SOCIAL NETWORKS
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social Networks
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collection
 
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
 
Social network analysis
Social network analysisSocial network analysis
Social network analysis
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011Social network analysis course 2010 - 2011
Social network analysis course 2010 - 2011
 
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
Asymmetric Social Proximity Based Private Matching Protocols for Online Socia...
 

Similar to Inferring Peer Centrality in Socially-Informed Peer-to-Peer Systems

Studying user footprints in different online social networks
Studying user footprints in different online social networksStudying user footprints in different online social networks
Studying user footprints in different online social networksIIIT Hyderabad
 
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...reshma reshu
 
Friend Recommendation on Social Network Site Based on Their Life Style
Friend Recommendation on Social Network Site Based on Their Life StyleFriend Recommendation on Social Network Site Based on Their Life Style
Friend Recommendation on Social Network Site Based on Their Life Stylepaperpublications3
 
Service rating prediction by exploring social mobile users’ geographical loca...
Service rating prediction by exploring social mobile users’ geographical loca...Service rating prediction by exploring social mobile users’ geographical loca...
Service rating prediction by exploring social mobile users’ geographical loca...CloudTechnologies
 
Community detection in complex social networks
Community detection in complex social networksCommunity detection in complex social networks
Community detection in complex social networksAboul Ella Hassanien
 
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDAFRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDAJournal For Research
 
A MALICIOUS USERS DETECTING MODEL BASED ON FEEDBACK CORRELATIONS
A MALICIOUS USERS DETECTING MODEL BASED  ON FEEDBACK CORRELATIONSA MALICIOUS USERS DETECTING MODEL BASED  ON FEEDBACK CORRELATIONS
A MALICIOUS USERS DETECTING MODEL BASED ON FEEDBACK CORRELATIONSIJCNC
 
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Editor IJAIEM
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfBalasundaramSr
 
Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums. Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums. Luis Fernandez Luque
 
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...Nicolas Kourtellis
 
Big social data analytics - social network analysis
Big social data analytics - social network analysis Big social data analytics - social network analysis
Big social data analytics - social network analysis Jari Jussila
 
Study on security and quality of service implementations in p2 p overlay netw...
Study on security and quality of service implementations in p2 p overlay netw...Study on security and quality of service implementations in p2 p overlay netw...
Study on security and quality of service implementations in p2 p overlay netw...eSAT Publishing House
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018Arsalan Khan
 
ISDA 2011 Cordoba
ISDA 2011 CordobaISDA 2011 Cordoba
ISDA 2011 CordobaAndrea Zaza
 

Similar to Inferring Peer Centrality in Socially-Informed Peer-to-Peer Systems (20)

Ppt
PptPpt
Ppt
 
Studying user footprints in different online social networks
Studying user footprints in different online social networksStudying user footprints in different online social networks
Studying user footprints in different online social networks
 
Final Report
Final ReportFinal Report
Final Report
 
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
Testing Vitality Ranking and Prediction in Social Networking Services With Dy...
 
Friend Recommendation on Social Network Site Based on Their Life Style
Friend Recommendation on Social Network Site Based on Their Life StyleFriend Recommendation on Social Network Site Based on Their Life Style
Friend Recommendation on Social Network Site Based on Their Life Style
 
Service rating prediction by exploring social mobile users’ geographical loca...
Service rating prediction by exploring social mobile users’ geographical loca...Service rating prediction by exploring social mobile users’ geographical loca...
Service rating prediction by exploring social mobile users’ geographical loca...
 
Community detection in complex social networks
Community detection in complex social networksCommunity detection in complex social networks
Community detection in complex social networks
 
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDAFRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
FRIEND RECOMMENDATION IN ONLINE SOCIAL NETWORKS USING LDA
 
A MALICIOUS USERS DETECTING MODEL BASED ON FEEDBACK CORRELATIONS
A MALICIOUS USERS DETECTING MODEL BASED  ON FEEDBACK CORRELATIONSA MALICIOUS USERS DETECTING MODEL BASED  ON FEEDBACK CORRELATIONS
A MALICIOUS USERS DETECTING MODEL BASED ON FEEDBACK CORRELATIONS
 
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums. Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums.
 
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
Prometheus: User-Controlled P2P Social Data Management for Socially-aware App...
 
Big social data analytics - social network analysis
Big social data analytics - social network analysis Big social data analytics - social network analysis
Big social data analytics - social network analysis
 
18 Diffusion Models and Peer Influence
18 Diffusion Models and Peer Influence18 Diffusion Models and Peer Influence
18 Diffusion Models and Peer Influence
 
09 Diffusion Models & Peer Influence
09 Diffusion Models & Peer Influence09 Diffusion Models & Peer Influence
09 Diffusion Models & Peer Influence
 
Study on security and quality of service implementations in p2 p overlay netw...
Study on security and quality of service implementations in p2 p overlay netw...Study on security and quality of service implementations in p2 p overlay netw...
Study on security and quality of service implementations in p2 p overlay netw...
 
Content-based link prediction
Content-based link predictionContent-based link prediction
Content-based link prediction
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018
 
ISDA 2011 Cordoba
ISDA 2011 CordobaISDA 2011 Cordoba
ISDA 2011 Cordoba
 

More from Nicolas Kourtellis

On managing social data for enabling socially-aware applications and services
On managing social data for enabling socially-aware applications and servicesOn managing social data for enabling socially-aware applications and services
On managing social data for enabling socially-aware applications and servicesNicolas Kourtellis
 
Scalable Online Betweenness Centrality in Evolving Graphs
Scalable Online Betweenness Centrality in Evolving GraphsScalable Online Betweenness Centrality in Evolving Graphs
Scalable Online Betweenness Centrality in Evolving GraphsNicolas Kourtellis
 
Prometheus: Distributed Management of Geo-Social Data
Prometheus: Distributed Management of Geo-Social DataPrometheus: Distributed Management of Geo-Social Data
Prometheus: Distributed Management of Geo-Social DataNicolas Kourtellis
 
Cultures in Community Question Answering
Cultures in Community Question AnsweringCultures in Community Question Answering
Cultures in Community Question AnsweringNicolas Kourtellis
 
Privacy Concerns vs. User Behavior in Community Question Answering
Privacy Concerns vs. User Behavior in Community Question AnsweringPrivacy Concerns vs. User Behavior in Community Question Answering
Privacy Concerns vs. User Behavior in Community Question AnsweringNicolas Kourtellis
 
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)Nicolas Kourtellis
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...Nicolas Kourtellis
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)Nicolas Kourtellis
 

More from Nicolas Kourtellis (8)

On managing social data for enabling socially-aware applications and services
On managing social data for enabling socially-aware applications and servicesOn managing social data for enabling socially-aware applications and services
On managing social data for enabling socially-aware applications and services
 
Scalable Online Betweenness Centrality in Evolving Graphs
Scalable Online Betweenness Centrality in Evolving GraphsScalable Online Betweenness Centrality in Evolving Graphs
Scalable Online Betweenness Centrality in Evolving Graphs
 
Prometheus: Distributed Management of Geo-Social Data
Prometheus: Distributed Management of Geo-Social DataPrometheus: Distributed Management of Geo-Social Data
Prometheus: Distributed Management of Geo-Social Data
 
Cultures in Community Question Answering
Cultures in Community Question AnsweringCultures in Community Question Answering
Cultures in Community Question Answering
 
Privacy Concerns vs. User Behavior in Community Question Answering
Privacy Concerns vs. User Behavior in Community Question AnsweringPrivacy Concerns vs. User Behavior in Community Question Answering
Privacy Concerns vs. User Behavior in Community Question Answering
 
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
VHT: Vertical Hoeffding Tree (IEEE BigData 2016)
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
SAMOA: A Platform for Mining Big Data Streams (Apache BigData North America 2...
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
 

Recently uploaded

TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPirithiRaju
 
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptxSulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptxnoordubaliya2003
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationColumbia Weather Systems
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationColumbia Weather Systems
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 

Recently uploaded (20)

TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdf
 
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptxSulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather Station
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather Station
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 

Inferring Peer Centrality in Socially-Informed Peer-to-Peer Systems

  • 1. Inferring Peer Centrality in Socially-Informed P2P Systems Nicolas Kourtellis, Adriana Iamnitchi Department of Computer Science & Engineering University of South Florida Tampa, USA 11th IEEE International Conference on Peer-to-Peer Computing Kyoto, Japan, 2011
  • 2. Socially-aware Applications  Applications collect and use social information:  Location, collocation, history of interactions, etc.  Build (implicit/explicit) social network of users  Use: reduce spam, provide recommendations, etc.  Wide range of system architectures  How does the social network of users affect the load in a P2P architecture? 2 Decentralization of user social data • MobiClique • Yarta • ... • PeerSoN • LifeSocial.KOM • Safebook • Prometheus • …
  • 3. Social Graphs & P2P Networks  Users connected with application-specific edges  User-contributed peers form a P2P network  User social graph is partitioned into subgraphs & stored on peers Questions:  How do applications traverse a distributed social graph?  What does it mean for the P2P routing? 3
  • 4.  Invite user G’s 2-hop hiking contacts to a trip  Social graph traversals => many P2P lookups  Application performance affected by projection of social graph on peers Application Example 4 => 1-hop={B, C, E} 2-hops={A, D, F, I}
  • 5.  How do the properties of the projection graph compare with the properties of the social graph projected? Projection Graph 5 Projection Graph (PG) P2P Overlay Social Graph (SG)
  • 6. Projection Graph Model  Uses:  Study properties of peers such as centrality  Study how the social graph topology affects P2P routing & system performance 6 Social Graph SG = (V,E) V=set of users, E=set of social edges Projection Graph PG = (VP ,EP ) VP =set of peers, EP =set of P2P edges PV (i) = set of users mapped on peer Pi , Pi Î VP (Pi ,Pj ) Î EP iff $ a Î PV (i), $ b Î PV (j) s.t. (a,b) Î E w(Pi ,Pj ) = (a,b) Î E |a Î PV (i), b Î PV (j){ }
  • 7. 7 Outline  Motivation  Projection Graph Model  Social Network Centrality Metrics  Degree Centrality  Node Betweenness Centrality  Edge Betweenness Centrality  Centrality Calculation: Limitations  Experimental Questions  Experimental Methodology  Experimental Results  Impacts on Applications & Systems
  • 8. A B C D EF G H IJ K L M N O  Number of edges of a node  High degree centrality peers: Network Hubs  Can be targeted to directly influence many other peers with a message broadcast or distribute a search query Degree Centrality 8
  • 9. A B C D EF G H IJ K M N O Node Betweenness Centrality  Measures the extent to which a node lies on the shortest path between two other nodes  High betweenness centrality peers: Control communication between distant peers  Can host data caches for reduced latency to locate data 9
  • 10. A B CD EF G H I J K L M N O Edge Betweenness Centrality  Measures the extent to which an edge lies on the shortest path between two nodes  High betweenness centrality edges: Connect distant parts of P2P network  Can be monitored to block malware traffic 10
  • 11. Calculating Peer Centrality  Challenging because of:  Limited access to user data (e.g., privacy settings)  P2P network scale  Peer churn  Through experimental analysis on the social and projection graph, we investigate how to circumvent these limitations 11
  • 12. Experimental Questions  Can we approximate the centrality of peers using the centrality scores of their users?  How does the number of users storing data per peer affect the centrality scores of their peers?  Social graph is less dynamic than the P2P network  Calculate infrequently centrality score of users & use it to estimate their peer’s centrality Spoiler Alert!  [1, ~150] users/peer: Can estimate degree & betweenness centrality of peers with good accuracy  Above 150 users/peer: The projection graph becomes highly connected => peers do not differentiate in centrality 12
  • 13.  Naturally-formed communities offer incentives for resource sharing  1 community subgraph mapped per peer  Projection graphs generated from 5 real social graphs  Communities detected via recursive Louvain algorithm*  Varied average community size: 5,10,20,…,1000 users/peer  Calculate correlation of centralities of users and their peers  Compare average centralities of users and their peers  Identify top centrality peers from their users’ scores Experimental Methodology 13 Social Network Users Edges gnutella04 10,876 39,994 gnutella31 62,561 147,878 enron 33,696 180,811 epinions 75,877 405,739 slashdot 82,168 504,230 *V. D. Blondel et al, “Fast unfolding of communities in large networks”, Journal of Statistical Mechanics: Theory and Experiment, vol. 10, 2008.
  • 14. Correlation of Centrality Scores  [1-150] users/peer:  Projection graph resembles closely social graph  Highest correlation of social & projection graph metrics  Degree & node betweenness estimated from local information (cumulative scores) 14 0 0.2 0.4 0.6 0.8 1 1 10 100 1000 DegreeCentralityCorrelation Users/Peer (a) gnutella04 enron gnutella31 epinions slashdot 0 0.2 0.4 0.6 0.8 1 1 10 100 1000NodeBetweennessCentralityCorrelation Users/Peer (b) gnutella04 enron gnutella31 epinions slashdot 0 0.2 0.4 0.6 0.8 1 1 10 100 1000 EdgeBetweennessCentralityCorrelation Users/Peer (c) gnutella04 enron gnutella31 epinions slashdot  After 150 users/peer:  Projection graph topology loses social properties  Highly connected network  Peers participate equally in graph traversal Users/Peer vs. Degree Users/Peer vs. Node Betweenness Users/Peer vs. Edge Betweenness
  • 15. Comparison of Centrality Scores  Increase number of users/peer  turning point in projection graph  More connections with other peers  increase peer degree & betweenness to maximum  More social edges within peers  decrease edge betweenness to minimum 15 1e-05 0.0001 0.001 0.01 0.1 1 1 10 100 1000 DegreeCentrality Users/Peer (a) gnutella04_CDCU gnutella04_DCP enron_CDCU enron_DCP gnutella31_CDCU gnutella31_DCP epinions_CDCU epinions_DCP slashdot_CDCU slashdot_DCP 1e-05 0.0001 0.001 0.01 0.1 1 1 10 100 1000 NodeBetweennessCentrality Users/Peer (b) gnutella04_CNBCU gnutella04_NBCP enron_CNBCU enron_NBCP gnutella31_CNBCU gnutella31_NBCP epinions_CNBCU epinions_NBCP slashdot_CNBCU slashdot_NBCP 1e-11 1e-10 1e-09 1e-08 1e-07 1e-06 1e-05 0.0001 0.001 0.01 1 10 100 1000 EdgeBetweennessCentrality Users/Peer (c) gnutella04_CEBCU gnutella04_EBCP enron_CEBCU enron_EBCP gnutella31_CEBCU gnutella31_EBCP epinions_CEBCU epinions_EBCP slashdot_CEBCU slashdot_EBCP Users/Peer vs. Degree Users/Peer vs. Node Betweenness Users/Peer Vs. Edge Betweenness
  • 16. Finding High Betweenness Peers  Placing data caches on high betweenness peers can reduce latency to locate data  Can we identify such peers, knowing the top betweenness users or communities?  Top 5% betweenness centrality users => top betweenness centrality peers with 80–90% accuracy 16 0 0.2 0.4 0.6 0.8 1 1 10 100 1000 PeerOverlap Users/Peer (Method 1) 1% 5% 10% 1 10 100 1000 Users/Peer (Method 2) 1% 5% 10% Users/Peer Users/Peer With Top-N% users With Top-N% communities
  • 17. Summary of Findings  [1, ~150] users/peer:  Projection graph resembles closely social graph  Highest correlation of social & projection graph metrics  Degree & node betweenness can be estimated from local information (cumulative scores of users)  Cannot estimate well edge betweenness  Above 150 users/peer:  Projection graph topology loses social properties  A highly connected projection graph  No differentiation in peer centrality  Top betweenness centrality users can pinpoint the top betweenness centrality peers with good accuracy  Overall: Applications can calculate infrequently centrality score of users to estimate peer centrality  Social graph changes slowly compared to P2P network 17
  • 18. Impact on Applications & Systems  Target high degree peers to:  Decrease search time  Increase breadth of search and diversity of results  Target high betweenness peers to:  Monitor information flow and collect traces  Place data caches and indexes of data location  Quarantine malware outbursts  Disseminate software patches  Tackle P2P churn  Predict centrality of peers to allocate resources  Reduce overlay overhead  Enhance routing tables with P2P edges for faster & more secure peer discovery 18
  • 19. 19 Thank you! This work was supported by NSF Grants: CNS 0952420 and CNS 0831785 http://www.cse.usf.edu/dsg/ nkourtel@mail.usf.edu