SlideShare a Scribd company logo
1 of 19
Download to read offline
KONECT Cloud
Large Scale Network Mining in the Cloud


                  Jérôme Kunegis
  Future SOC Lab Day, 18.04.2012



                                          1
Networks are Everywhere

                                                                       ip
                                                                   r sh
                                                               tho
                                                             Au

                                     ip
                                  dsh
                           Fr ien

                                                                  t
                                                             Trus


                       n
                 tio
         n   i ca
       mu                                                                      e
Co
     m
                                                                         re nc
                                          c              n           r
                                                    c tio        c cu
                                            ter
                                                a
                                                             Co-o
                                          In
Social Networks



                  friend
Trust Networks




                 tru
                    st
Friend/Enemy Network




                       en
        d




                          em
      en




                          y
     fri
Interaction Network




                      listen
KONECT – Koblenz Network Collection

 148      network datasets
       26 are undirected
       38 are directed
       84 are bipartite
       59 have unweighted edges
       77 allow multiple edges
       04 have signed edges
       08 have ratings as edges
       78 have edge arrival times



   konect.uni-koblenz.de
Largest Network


   Directed “who follows who” network


        0 041 652 230 users
        1 468 365 182 edges


  konect.uni-koblenz.de/networks/twitter
148 Network Datasets

     authorship
communication
 co-occurrence
        features
    folksonomy
     interaction
        physical
          ratings
      reference
       semantic
           social
             trust
What We Computed

 Connected components
 Network diameter   ←    at Future SOC Lab
 Clustering coefficients
 Degree distributions
 Spectral distribution
 Eigenvector centrality
 Graph drawing
 Temporal Analysis
 Link prediction
Network Diameter




 6
90 Percentile Effective Diameter




5
90 Percentile Effective Diameter

                  3
90 Percentile Effective Diameter




3.75
Computing the Effective
Diameter

for each node i {                   |V|
   count hops needed to reach 90%   |E|
}


Total runtime:                      |E| × |V|
Graph Sampling



                   Keep
                 X% of edges
Computation

  × 1 000 vertices (sampled)
  × 120 840 391 edges
  × 20 sample sizes (5%, 10%, …, 100%)
  × 50 random samplings

  Evaluation on single machine:
   1 TiB memory
   64 cores
   Matlab 64 bit
Results
Thank You!


                        Dr. Jérôme Kunegis
konect.uni-koblenz.de   kunegis@uni-koblenz.de
                        west.uni-koblenz.de

More Related Content

Similar to KONECT Cloud – Large Scale Network Mining in the Cloud

Similar to KONECT Cloud – Large Scale Network Mining in the Cloud (6)

Recruitment and Social Media | Singapore
Recruitment and Social Media | SingaporeRecruitment and Social Media | Singapore
Recruitment and Social Media | Singapore
 
01 fundamental of multimedia
01 fundamental of multimedia01 fundamental of multimedia
01 fundamental of multimedia
 
Critical path and pert
Critical path and pertCritical path and pert
Critical path and pert
 
Deterministic Ethernet - TTEthernet
Deterministic Ethernet - TTEthernetDeterministic Ethernet - TTEthernet
Deterministic Ethernet - TTEthernet
 
Trends & Momentum in Events - 2013
Trends & Momentum in Events - 2013Trends & Momentum in Events - 2013
Trends & Momentum in Events - 2013
 
Mirage: extreme specialisation of virtual appliances
Mirage: extreme specialisation of virtual appliancesMirage: extreme specialisation of virtual appliances
Mirage: extreme specialisation of virtual appliances
 

More from Jérôme KUNEGIS

Succinct Summarisation of Large Networks via Small Synthetic Representative G...
Succinct Summarisation of Large Networks via Small Synthetic Representative G...Succinct Summarisation of Large Networks via Small Synthetic Representative G...
Succinct Summarisation of Large Networks via Small Synthetic Representative G...Jérôme KUNEGIS
 
Title: What Is the Difference between a Social and a Hyperlink Network? -- Ho...
Title: What Is the Difference between a Social and a Hyperlink Network? -- Ho...Title: What Is the Difference between a Social and a Hyperlink Network? -- Ho...
Title: What Is the Difference between a Social and a Hyperlink Network? -- Ho...Jérôme KUNEGIS
 
Measuring the Conflict in a Social Network with Friends and Foes: A Recent Al...
Measuring the Conflict in a Social Network with Friends and Foes: A Recent Al...Measuring the Conflict in a Social Network with Friends and Foes: A Recent Al...
Measuring the Conflict in a Social Network with Friends and Foes: A Recent Al...Jérôme KUNEGIS
 
Generating Networks with Arbitrary Properties
Generating Networks with Arbitrary PropertiesGenerating Networks with Arbitrary Properties
Generating Networks with Arbitrary PropertiesJérôme KUNEGIS
 
Karriere Lounge – INFORMATIK 2013
Karriere Lounge – INFORMATIK 2013Karriere Lounge – INFORMATIK 2013
Karriere Lounge – INFORMATIK 2013Jérôme KUNEGIS
 
KONECT – The Koblenz Network Collection
KONECT – The Koblenz Network CollectionKONECT – The Koblenz Network Collection
KONECT – The Koblenz Network CollectionJérôme KUNEGIS
 
Preferential Attachment in Online Networks: Measurement and Explanations
Preferential Attachment in Online Networks:  Measurement and ExplanationsPreferential Attachment in Online Networks:  Measurement and Explanations
Preferential Attachment in Online Networks: Measurement and ExplanationsJérôme KUNEGIS
 
Predicting Directed Links using Nondiagonal Matrix Decompositions
Predicting Directed Links using Nondiagonal Matrix DecompositionsPredicting Directed Links using Nondiagonal Matrix Decompositions
Predicting Directed Links using Nondiagonal Matrix DecompositionsJérôme KUNEGIS
 
Online Dating Recommender Systems: The Split-complex Number Approach
Online Dating Recommender Systems: The Split-complex Number ApproachOnline Dating Recommender Systems: The Split-complex Number Approach
Online Dating Recommender Systems: The Split-complex Number ApproachJérôme KUNEGIS
 
Fairness on the Web: Alternatives to the Power Law (WebSci 2012)
Fairness on the Web:  Alternatives to the Power Law (WebSci 2012)Fairness on the Web:  Alternatives to the Power Law (WebSci 2012)
Fairness on the Web: Alternatives to the Power Law (WebSci 2012)Jérôme KUNEGIS
 
Fairness on the Web: Alternatives to the Power Law
Fairness on the Web:  Alternatives to the Power LawFairness on the Web:  Alternatives to the Power Law
Fairness on the Web: Alternatives to the Power LawJérôme KUNEGIS
 
On the Spectral Evolution of Large Networks (PhD Thesis by Jérôme Kunegis)
On the Spectral Evolution of Large Networks (PhD Thesis by Jérôme Kunegis)On the Spectral Evolution of Large Networks (PhD Thesis by Jérôme Kunegis)
On the Spectral Evolution of Large Networks (PhD Thesis by Jérôme Kunegis)Jérôme KUNEGIS
 
Searching Microblogs: Coping with Sparsity and Document Quality
Searching Microblogs: Coping with Sparsity and Document QualitySearching Microblogs: Coping with Sparsity and Document Quality
Searching Microblogs: Coping with Sparsity and Document QualityJérôme KUNEGIS
 
Bad News Travel Fast: A Content-based Analysis of Interestingness on Twitter
Bad News Travel Fast: A Content-based Analysis of Interestingness on TwitterBad News Travel Fast: A Content-based Analysis of Interestingness on Twitter
Bad News Travel Fast: A Content-based Analysis of Interestingness on TwitterJérôme KUNEGIS
 
On the Scalability of Graph Kernels Applied to Collaborative Recommenders
On the Scalability of Graph Kernels Applied to Collaborative RecommendersOn the Scalability of Graph Kernels Applied to Collaborative Recommenders
On the Scalability of Graph Kernels Applied to Collaborative RecommendersJérôme KUNEGIS
 
The Slashdot Zoo: Mining a Social Network with Negative Edges
The Slashdot Zoo:  Mining a Social Network with Negative EdgesThe Slashdot Zoo:  Mining a Social Network with Negative Edges
The Slashdot Zoo: Mining a Social Network with Negative EdgesJérôme KUNEGIS
 
Spectral Analysis of Signed Graphs for Clustering, Prediction and Visualization
Spectral Analysis of Signed Graphs for Clustering, Prediction and VisualizationSpectral Analysis of Signed Graphs for Clustering, Prediction and Visualization
Spectral Analysis of Signed Graphs for Clustering, Prediction and VisualizationJérôme KUNEGIS
 
Network Growth and the Spectral Evolution Model
Network Growth and the Spectral Evolution ModelNetwork Growth and the Spectral Evolution Model
Network Growth and the Spectral Evolution ModelJérôme KUNEGIS
 

More from Jérôme KUNEGIS (19)

Succinct Summarisation of Large Networks via Small Synthetic Representative G...
Succinct Summarisation of Large Networks via Small Synthetic Representative G...Succinct Summarisation of Large Networks via Small Synthetic Representative G...
Succinct Summarisation of Large Networks via Small Synthetic Representative G...
 
Title: What Is the Difference between a Social and a Hyperlink Network? -- Ho...
Title: What Is the Difference between a Social and a Hyperlink Network? -- Ho...Title: What Is the Difference between a Social and a Hyperlink Network? -- Ho...
Title: What Is the Difference between a Social and a Hyperlink Network? -- Ho...
 
Measuring the Conflict in a Social Network with Friends and Foes: A Recent Al...
Measuring the Conflict in a Social Network with Friends and Foes: A Recent Al...Measuring the Conflict in a Social Network with Friends and Foes: A Recent Al...
Measuring the Conflict in a Social Network with Friends and Foes: A Recent Al...
 
Schach und Computer
Schach und ComputerSchach und Computer
Schach und Computer
 
Generating Networks with Arbitrary Properties
Generating Networks with Arbitrary PropertiesGenerating Networks with Arbitrary Properties
Generating Networks with Arbitrary Properties
 
Karriere Lounge – INFORMATIK 2013
Karriere Lounge – INFORMATIK 2013Karriere Lounge – INFORMATIK 2013
Karriere Lounge – INFORMATIK 2013
 
KONECT – The Koblenz Network Collection
KONECT – The Koblenz Network CollectionKONECT – The Koblenz Network Collection
KONECT – The Koblenz Network Collection
 
Preferential Attachment in Online Networks: Measurement and Explanations
Preferential Attachment in Online Networks:  Measurement and ExplanationsPreferential Attachment in Online Networks:  Measurement and Explanations
Preferential Attachment in Online Networks: Measurement and Explanations
 
Predicting Directed Links using Nondiagonal Matrix Decompositions
Predicting Directed Links using Nondiagonal Matrix DecompositionsPredicting Directed Links using Nondiagonal Matrix Decompositions
Predicting Directed Links using Nondiagonal Matrix Decompositions
 
Online Dating Recommender Systems: The Split-complex Number Approach
Online Dating Recommender Systems: The Split-complex Number ApproachOnline Dating Recommender Systems: The Split-complex Number Approach
Online Dating Recommender Systems: The Split-complex Number Approach
 
Fairness on the Web: Alternatives to the Power Law (WebSci 2012)
Fairness on the Web:  Alternatives to the Power Law (WebSci 2012)Fairness on the Web:  Alternatives to the Power Law (WebSci 2012)
Fairness on the Web: Alternatives to the Power Law (WebSci 2012)
 
Fairness on the Web: Alternatives to the Power Law
Fairness on the Web:  Alternatives to the Power LawFairness on the Web:  Alternatives to the Power Law
Fairness on the Web: Alternatives to the Power Law
 
On the Spectral Evolution of Large Networks (PhD Thesis by Jérôme Kunegis)
On the Spectral Evolution of Large Networks (PhD Thesis by Jérôme Kunegis)On the Spectral Evolution of Large Networks (PhD Thesis by Jérôme Kunegis)
On the Spectral Evolution of Large Networks (PhD Thesis by Jérôme Kunegis)
 
Searching Microblogs: Coping with Sparsity and Document Quality
Searching Microblogs: Coping with Sparsity and Document QualitySearching Microblogs: Coping with Sparsity and Document Quality
Searching Microblogs: Coping with Sparsity and Document Quality
 
Bad News Travel Fast: A Content-based Analysis of Interestingness on Twitter
Bad News Travel Fast: A Content-based Analysis of Interestingness on TwitterBad News Travel Fast: A Content-based Analysis of Interestingness on Twitter
Bad News Travel Fast: A Content-based Analysis of Interestingness on Twitter
 
On the Scalability of Graph Kernels Applied to Collaborative Recommenders
On the Scalability of Graph Kernels Applied to Collaborative RecommendersOn the Scalability of Graph Kernels Applied to Collaborative Recommenders
On the Scalability of Graph Kernels Applied to Collaborative Recommenders
 
The Slashdot Zoo: Mining a Social Network with Negative Edges
The Slashdot Zoo:  Mining a Social Network with Negative EdgesThe Slashdot Zoo:  Mining a Social Network with Negative Edges
The Slashdot Zoo: Mining a Social Network with Negative Edges
 
Spectral Analysis of Signed Graphs for Clustering, Prediction and Visualization
Spectral Analysis of Signed Graphs for Clustering, Prediction and VisualizationSpectral Analysis of Signed Graphs for Clustering, Prediction and Visualization
Spectral Analysis of Signed Graphs for Clustering, Prediction and Visualization
 
Network Growth and the Spectral Evolution Model
Network Growth and the Spectral Evolution ModelNetwork Growth and the Spectral Evolution Model
Network Growth and the Spectral Evolution Model
 

Recently uploaded

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Visualising and forecasting stocks using Dash
Visualising and forecasting stocks using DashVisualising and forecasting stocks using Dash
Visualising and forecasting stocks using Dashnarutouzumaki53779
 

Recently uploaded (20)

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Visualising and forecasting stocks using Dash
Visualising and forecasting stocks using DashVisualising and forecasting stocks using Dash
Visualising and forecasting stocks using Dash
 

KONECT Cloud – Large Scale Network Mining in the Cloud