SlideShare a Scribd company logo
1 of 20
Download to read offline
place graphs
        are the new
        social graphs
       Matt Biddulph
       @mattb | matt@hackdiary.com

Every data scientist has their own favourite way of representing their data. For some people
it’s Excel, and they think in rows and columns. For others it’s matrices, and they use linear
algreba to interrogate their data. For me, it’s graphs.
We’re all pretty used to the idea that you can model human relationships in a social graph.
“Social network analysis
        views social relationships in
        terms of network theory
        consisting of nodes and ties.
        Nodes are the individual actors
        within the networks, and ties
        are the relationships between
        the actors.”

There’s a pretty deep area of mathematical study called Social Network Analysis that goes
back at least 20 years. It tries to create insight by analysing the structure of social networks,
and usually doesn’t incorporate any elements of culture or sociology in doing so.
Centrality
                                                               measures




It led to the creation of techniques like centrality measures, that try to find the nodes that are
most central to the network. These might be the kind of people on Twitter who have the
highest chance of being retweeted.
Community
                                                              detection




There are also community detection algorithms that try to find the most tightly-knit
subgraphs and cluster those nodes together. If you ran this over the network of people I
follow on Twitter, it might be able to pick out my work colleagues or the people I socialise
with face-to-face.
People you
                                                            may know




Sites like LinkedIn build almost-telepathic “people you may know” features by walking around
the graph starting at your node and looking for people that show up a lot in your
neighbourhood that you haven’t connected with yet.
But enough mathematics. Let’s talk about Belgium.
Belgium is a country in the northwest of Europe with some unusual cultural qualities. It’s
sandwiched between the Netherlands and France. About half of the country speaks French,
and the other half speaks Dutch. It’d be very interesting to study the patterns of interactions
in this country.
Researchers at Louvain in Belgium were lucky enough to do a joint project with a Belgian
mobile phone company. They had access to anonymised records of 2.6 million phone calls -
the record of which phone called which number when.

http://arxiv.org/pdf/0802.2178v2
Belgian
                                                                        phonecall
                                                                        network




Fast unfolding of communities in large networks, Blondel et al [2008]

They used these calls to construct a “call graph”. They were able to develop a community-
detection algorithm that could detect the two separate clusters of Dutch and French speakers
that were mostly only calling each other. The algorithm achieved this simply by analysing the
shape of the graph. It knew nothing about French, Dutch or phone calls.

http://arxiv.org/pdf/0803.0476
So let’s take a step back and think about what other kinds of graph we could form, from what
kinds of data.
I work in location apps at Nokia, and so I naturally think of places. Wouldn’t it be interesting
to study the connections between cities instead of people? For example, people probably fly
more often between NYC and LA than they do between NYC and New Jersey. We could re-
draw the map based on closeness in the travel network.
I turned to the Hadoop cluster at Nokia and took a sample of several weeks of logs from our
routing servers. These are used every time someone uses our maps application to request a
driving route from one place to another. Every time someone drove from A to B, I made an
edge in a “place graph” from A to B.
I ran the data through Gephi and asked it to cluster it based on the strength of connections
between towns. The result is a not-quite-geographic new map of the world, where two cities
are close to each other if people often drive between them.
UK

                                                            China
                                                               Korea,
                                                             Japan, etc



                Spain                           Most of Europe




                                                                             India
                                                                             Pakistan
             Finland                     Russia

As you’d expect, the UK is an island and so people don’t drive in and out of it very often.
Spain and Portugal are not islands, but they appear separate because they’re attached to the
rest of Europe by a very narrow neck of land. So people are much more likely to fly than drive
out of Spain.
How could we use this data in a practical application? Say I’m coming to New York to attend a
conference on big data. I could choose a hotel near the conference venue, but I’d rather see
more interesting parts of New York.
Where should
                                                         I stay?




If I’ve never been to New York before, I could ask a friend. I could tell them that I like
London’s West End and San Francisco’s downtown.
Times Square = Piccadilly Circus
          New York                London
If they know both towns, they’d probably tell me that Times Square is the Piccadilly Circus of
New York.
What is the Greenwich Village
       of Tokyo?
       ... the Noe Valley of New York?
       ... the Shibuya of Los Angeles?



But if we delve into the place graph, we could answer much more interesting questions, and
create a “neighbourhood isomorphism” from city to city. People who like the Mission in SF
and Shoreditch in London could find out that Williamsberg is probably the best place for
them to stay in New York.
Thank you.
Matt Biddulph
@mattb | matt@hackdiary.com

More Related Content

What's hot

Avoiding the Heron's Way
Avoiding the Heron's WayAvoiding the Heron's Way
Avoiding the Heron's WayDorothea Salo
 
Webstock 2013: An Animating Spark
Webstock 2013: An Animating SparkWebstock 2013: An Animating Spark
Webstock 2013: An Animating SparkTom Coates
 
Property-based Access of RDF Data
Property-based Access of RDF DataProperty-based Access of RDF Data
Property-based Access of RDF DataGerd Groener
 
Data Journalism Workshop - Prague
Data Journalism Workshop - PragueData Journalism Workshop - Prague
Data Journalism Workshop - PragueChristian Kreutz
 
Isle of Man open data overview
Isle of Man open data overviewIsle of Man open data overview
Isle of Man open data overviewChris Taggart
 
Social Networks and the Semantic Web: a retrospective of the past 10 years
Social Networks and the Semantic Web: a retrospective of the past 10 yearsSocial Networks and the Semantic Web: a retrospective of the past 10 years
Social Networks and the Semantic Web: a retrospective of the past 10 yearsPeter Mika
 
Visualizing Networks
Visualizing NetworksVisualizing Networks
Visualizing Networksfreshdatabos
 
Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lora Aroyo
 
BISG WEBCAST -- Between Hype and Hope
BISG WEBCAST -- Between Hype and HopeBISG WEBCAST -- Between Hype and Hope
BISG WEBCAST -- Between Hype and Hopebisg
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic WebJohn Breslin
 
Human-machine Inter-agencies
Human-machine Inter-agenciesHuman-machine Inter-agencies
Human-machine Inter-agenciesmo-seph
 

What's hot (14)

Avoiding the Heron's Way
Avoiding the Heron's WayAvoiding the Heron's Way
Avoiding the Heron's Way
 
Webstock 2013: An Animating Spark
Webstock 2013: An Animating SparkWebstock 2013: An Animating Spark
Webstock 2013: An Animating Spark
 
Property-based Access of RDF Data
Property-based Access of RDF DataProperty-based Access of RDF Data
Property-based Access of RDF Data
 
Data Journalism Workshop - Prague
Data Journalism Workshop - PragueData Journalism Workshop - Prague
Data Journalism Workshop - Prague
 
Urban lenses
Urban lensesUrban lenses
Urban lenses
 
Cyborgs
CyborgsCyborgs
Cyborgs
 
Isle of Man open data overview
Isle of Man open data overviewIsle of Man open data overview
Isle of Man open data overview
 
Social Networks and the Semantic Web: a retrospective of the past 10 years
Social Networks and the Semantic Web: a retrospective of the past 10 yearsSocial Networks and the Semantic Web: a retrospective of the past 10 years
Social Networks and the Semantic Web: a retrospective of the past 10 years
 
Visualizing Networks
Visualizing NetworksVisualizing Networks
Visualizing Networks
 
Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)
 
2014_WWW_BTOR
2014_WWW_BTOR2014_WWW_BTOR
2014_WWW_BTOR
 
BISG WEBCAST -- Between Hype and Hope
BISG WEBCAST -- Between Hype and HopeBISG WEBCAST -- Between Hype and Hope
BISG WEBCAST -- Between Hype and Hope
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic Web
 
Human-machine Inter-agencies
Human-machine Inter-agenciesHuman-machine Inter-agencies
Human-machine Inter-agencies
 

Viewers also liked

How To Write Your First Firefox Extension
How To Write Your First Firefox ExtensionHow To Write Your First Firefox Extension
How To Write Your First Firefox ExtensionRobert Nyman
 
iPhone Coding For Web Developers
iPhone Coding For Web DevelopersiPhone Coding For Web Developers
iPhone Coding For Web DevelopersMatt Biddulph
 
Latin America & Caribbean Regional Outlook June 2013
Latin America & Caribbean Regional Outlook June 2013Latin America & Caribbean Regional Outlook June 2013
Latin America & Caribbean Regional Outlook June 2013WB_Research
 
South Asia Regional Outlook June 2013
South Asia Regional Outlook  June 2013South Asia Regional Outlook  June 2013
South Asia Regional Outlook June 2013WB_Research
 
How to do presentations that don't induce suicide
How to do presentations that don't induce suicideHow to do presentations that don't induce suicide
How to do presentations that don't induce suicideAndy Whitlock
 
Science of Delivery in Education
Science of Delivery in EducationScience of Delivery in Education
Science of Delivery in EducationWB_Research
 
Middle East & North Africa Regional Outlook June 2013
Middle East & North Africa Regional Outlook June 2013Middle East & North Africa Regional Outlook June 2013
Middle East & North Africa Regional Outlook June 2013WB_Research
 
Gep june 2013 regional outlook east asia & pacific
Gep june 2013 regional outlook east asia & pacificGep june 2013 regional outlook east asia & pacific
Gep june 2013 regional outlook east asia & pacificWB_Research
 
Rural-Urban Dynamics and the MDGs
Rural-Urban Dynamics and the MDGsRural-Urban Dynamics and the MDGs
Rural-Urban Dynamics and the MDGsWB_Research
 
Dagstuhl FOAF history talk
Dagstuhl FOAF history talkDagstuhl FOAF history talk
Dagstuhl FOAF history talkDan Brickley
 
Global Economic Prospects Jan 2013
Global Economic Prospects Jan 2013Global Economic Prospects Jan 2013
Global Economic Prospects Jan 2013WB_Research
 

Viewers also liked (20)

How To Write Your First Firefox Extension
How To Write Your First Firefox ExtensionHow To Write Your First Firefox Extension
How To Write Your First Firefox Extension
 
iPhone Coding For Web Developers
iPhone Coding For Web DevelopersiPhone Coding For Web Developers
iPhone Coding For Web Developers
 
Latin America & Caribbean Regional Outlook June 2013
Latin America & Caribbean Regional Outlook June 2013Latin America & Caribbean Regional Outlook June 2013
Latin America & Caribbean Regional Outlook June 2013
 
South Asia Regional Outlook June 2013
South Asia Regional Outlook  June 2013South Asia Regional Outlook  June 2013
South Asia Regional Outlook June 2013
 
How to do presentations that don't induce suicide
How to do presentations that don't induce suicideHow to do presentations that don't induce suicide
How to do presentations that don't induce suicide
 
Quality ed (1)
Quality ed (1)Quality ed (1)
Quality ed (1)
 
Secondary ed
Secondary edSecondary ed
Secondary ed
 
Tertiary ed
Tertiary edTertiary ed
Tertiary ed
 
Literacy ed
Literacy edLiteracy ed
Literacy ed
 
Science of Delivery in Education
Science of Delivery in EducationScience of Delivery in Education
Science of Delivery in Education
 
Middle East & North Africa Regional Outlook June 2013
Middle East & North Africa Regional Outlook June 2013Middle East & North Africa Regional Outlook June 2013
Middle East & North Africa Regional Outlook June 2013
 
Gep june 2013 regional outlook east asia & pacific
Gep june 2013 regional outlook east asia & pacificGep june 2013 regional outlook east asia & pacific
Gep june 2013 regional outlook east asia & pacific
 
Equity ed
Equity edEquity ed
Equity ed
 
Primary ed
Primary edPrimary ed
Primary ed
 
Gender ed
Gender edGender ed
Gender ed
 
Rural-Urban Dynamics and the MDGs
Rural-Urban Dynamics and the MDGsRural-Urban Dynamics and the MDGs
Rural-Urban Dynamics and the MDGs
 
Pre primaryed
Pre primaryedPre primaryed
Pre primaryed
 
Dagstuhl FOAF history talk
Dagstuhl FOAF history talkDagstuhl FOAF history talk
Dagstuhl FOAF history talk
 
Has Off-Grid Electrification Come of Age?
Has Off-Grid Electrification Come of Age? Has Off-Grid Electrification Come of Age?
Has Off-Grid Electrification Come of Age?
 
Global Economic Prospects Jan 2013
Global Economic Prospects Jan 2013Global Economic Prospects Jan 2013
Global Economic Prospects Jan 2013
 

Similar to Place graphs are the new social graphs

Cognitive Cities: City analytics
Cognitive Cities: City analyticsCognitive Cities: City analytics
Cognitive Cities: City analyticsMatt Biddulph
 
Network Mapping & Data Storytelling for Beginners
Network Mapping & Data Storytelling for BeginnersNetwork Mapping & Data Storytelling for Beginners
Network Mapping & Data Storytelling for BeginnersRenaud Clément
 
Harvesting collective intelligence.
Harvesting collective intelligence. Harvesting collective intelligence.
Harvesting collective intelligence. Alberto Cottica
 
Critical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
Critical Network Mapping, Burak Arikan talk at Eyeo2014, MinneapolisCritical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
Critical Network Mapping, Burak Arikan talk at Eyeo2014, MinneapolisBurak Arikan
 
Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3SMCFrance
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSilvia Puglisi
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Doug Needham
 
Placebooks
PlacebooksPlacebooks
PlacebooksTom Pert
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTAMartin Wynne
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchFrancesca Di Donato
 
10 Jahre Web Science
10 Jahre Web Science10 Jahre Web Science
10 Jahre Web ScienceSteffen Staab
 
ViziCities - Lessons Learnt Visualising Real-world Cities in 3D
ViziCities - Lessons Learnt Visualising Real-world Cities in 3DViziCities - Lessons Learnt Visualising Real-world Cities in 3D
ViziCities - Lessons Learnt Visualising Real-world Cities in 3DRobin Hawkes
 
杭州讲座 石田英敬
杭州讲座 石田英敬杭州讲座 石田英敬
杭州讲座 石田英敬luruiyang
 
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...Jonathan Stray
 
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!Tish Shute
 
Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1Lauri Eloranta
 
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc SmithMarc Smith
 

Similar to Place graphs are the new social graphs (20)

Cognitive Cities: City analytics
Cognitive Cities: City analyticsCognitive Cities: City analytics
Cognitive Cities: City analytics
 
Network Mapping & Data Storytelling for Beginners
Network Mapping & Data Storytelling for BeginnersNetwork Mapping & Data Storytelling for Beginners
Network Mapping & Data Storytelling for Beginners
 
Harvesting collective intelligence.
Harvesting collective intelligence. Harvesting collective intelligence.
Harvesting collective intelligence.
 
Critical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
Critical Network Mapping, Burak Arikan talk at Eyeo2014, MinneapolisCritical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
Critical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
 
AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101  AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101
 
Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced information
 
Oulu2
Oulu2Oulu2
Oulu2
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview.
 
Placebooks
PlacebooksPlacebooks
Placebooks
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTA
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for research
 
10 Jahre Web Science
10 Jahre Web Science10 Jahre Web Science
10 Jahre Web Science
 
ViziCities - Lessons Learnt Visualising Real-world Cities in 3D
ViziCities - Lessons Learnt Visualising Real-world Cities in 3DViziCities - Lessons Learnt Visualising Real-world Cities in 3D
ViziCities - Lessons Learnt Visualising Real-world Cities in 3D
 
杭州讲座 石田英敬
杭州讲座 石田英敬杭州讲座 石田英敬
杭州讲座 石田英敬
 
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
 
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
 
Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1
 
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
 
Dh presentation 2018
Dh presentation 2018Dh presentation 2018
Dh presentation 2018
 

More from Matt Biddulph

The IoT Conversation
The IoT ConversationThe IoT Conversation
The IoT ConversationMatt Biddulph
 
EAN’s World of Data: Prototyping apps using real data
EAN’s World of Data: Prototyping apps using real dataEAN’s World of Data: Prototyping apps using real data
EAN’s World of Data: Prototyping apps using real dataMatt Biddulph
 
Monkigras 2012: Networks Of Data
Monkigras 2012: Networks Of DataMonkigras 2012: Networks Of Data
Monkigras 2012: Networks Of DataMatt Biddulph
 
Science Hackday: using visualisation to understand your data
Science Hackday: using visualisation to understand your dataScience Hackday: using visualisation to understand your data
Science Hackday: using visualisation to understand your dataMatt Biddulph
 
Prototyping with data at Nokia
Prototyping with data at NokiaPrototyping with data at Nokia
Prototyping with data at NokiaMatt Biddulph
 
Mobile Social Location (Web Directions @media version)
Mobile Social Location (Web Directions @media version)Mobile Social Location (Web Directions @media version)
Mobile Social Location (Web Directions @media version)Matt Biddulph
 
Tinkering with game controllers
Tinkering with game controllersTinkering with game controllers
Tinkering with game controllersMatt Biddulph
 
SXSW 2008: Creative Collaboration
SXSW 2008: Creative CollaborationSXSW 2008: Creative Collaboration
SXSW 2008: Creative CollaborationMatt Biddulph
 
Coding on the Shoulders of Giants
Coding on the Shoulders of GiantsCoding on the Shoulders of Giants
Coding on the Shoulders of GiantsMatt Biddulph
 
Connecting First And Second Life
Connecting First And Second LifeConnecting First And Second Life
Connecting First And Second LifeMatt Biddulph
 
Coders need to learn hardware hacking NOW
Coders need to learn hardware hacking NOWCoders need to learn hardware hacking NOW
Coders need to learn hardware hacking NOWMatt Biddulph
 

More from Matt Biddulph (12)

The IoT Conversation
The IoT ConversationThe IoT Conversation
The IoT Conversation
 
EAN’s World of Data: Prototyping apps using real data
EAN’s World of Data: Prototyping apps using real dataEAN’s World of Data: Prototyping apps using real data
EAN’s World of Data: Prototyping apps using real data
 
Monkigras 2012: Networks Of Data
Monkigras 2012: Networks Of DataMonkigras 2012: Networks Of Data
Monkigras 2012: Networks Of Data
 
Science Hackday: using visualisation to understand your data
Science Hackday: using visualisation to understand your dataScience Hackday: using visualisation to understand your data
Science Hackday: using visualisation to understand your data
 
Prototyping with data at Nokia
Prototyping with data at NokiaPrototyping with data at Nokia
Prototyping with data at Nokia
 
Mobile Social Location (Web Directions @media version)
Mobile Social Location (Web Directions @media version)Mobile Social Location (Web Directions @media version)
Mobile Social Location (Web Directions @media version)
 
Tinkering with game controllers
Tinkering with game controllersTinkering with game controllers
Tinkering with game controllers
 
The Realtime Web
The Realtime WebThe Realtime Web
The Realtime Web
 
SXSW 2008: Creative Collaboration
SXSW 2008: Creative CollaborationSXSW 2008: Creative Collaboration
SXSW 2008: Creative Collaboration
 
Coding on the Shoulders of Giants
Coding on the Shoulders of GiantsCoding on the Shoulders of Giants
Coding on the Shoulders of Giants
 
Connecting First And Second Life
Connecting First And Second LifeConnecting First And Second Life
Connecting First And Second Life
 
Coders need to learn hardware hacking NOW
Coders need to learn hardware hacking NOWCoders need to learn hardware hacking NOW
Coders need to learn hardware hacking NOW
 

Recently uploaded

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 

Recently uploaded (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 

Place graphs are the new social graphs

  • 1. place graphs are the new social graphs Matt Biddulph @mattb | matt@hackdiary.com Every data scientist has their own favourite way of representing their data. For some people it’s Excel, and they think in rows and columns. For others it’s matrices, and they use linear algreba to interrogate their data. For me, it’s graphs.
  • 2. We’re all pretty used to the idea that you can model human relationships in a social graph.
  • 3. “Social network analysis views social relationships in terms of network theory consisting of nodes and ties. Nodes are the individual actors within the networks, and ties are the relationships between the actors.” There’s a pretty deep area of mathematical study called Social Network Analysis that goes back at least 20 years. It tries to create insight by analysing the structure of social networks, and usually doesn’t incorporate any elements of culture or sociology in doing so.
  • 4. Centrality measures It led to the creation of techniques like centrality measures, that try to find the nodes that are most central to the network. These might be the kind of people on Twitter who have the highest chance of being retweeted.
  • 5. Community detection There are also community detection algorithms that try to find the most tightly-knit subgraphs and cluster those nodes together. If you ran this over the network of people I follow on Twitter, it might be able to pick out my work colleagues or the people I socialise with face-to-face.
  • 6. People you may know Sites like LinkedIn build almost-telepathic “people you may know” features by walking around the graph starting at your node and looking for people that show up a lot in your neighbourhood that you haven’t connected with yet.
  • 7. But enough mathematics. Let’s talk about Belgium.
  • 8. Belgium is a country in the northwest of Europe with some unusual cultural qualities. It’s sandwiched between the Netherlands and France. About half of the country speaks French, and the other half speaks Dutch. It’d be very interesting to study the patterns of interactions in this country.
  • 9. Researchers at Louvain in Belgium were lucky enough to do a joint project with a Belgian mobile phone company. They had access to anonymised records of 2.6 million phone calls - the record of which phone called which number when. http://arxiv.org/pdf/0802.2178v2
  • 10. Belgian phonecall network Fast unfolding of communities in large networks, Blondel et al [2008] They used these calls to construct a “call graph”. They were able to develop a community- detection algorithm that could detect the two separate clusters of Dutch and French speakers that were mostly only calling each other. The algorithm achieved this simply by analysing the shape of the graph. It knew nothing about French, Dutch or phone calls. http://arxiv.org/pdf/0803.0476
  • 11. So let’s take a step back and think about what other kinds of graph we could form, from what kinds of data.
  • 12. I work in location apps at Nokia, and so I naturally think of places. Wouldn’t it be interesting to study the connections between cities instead of people? For example, people probably fly more often between NYC and LA than they do between NYC and New Jersey. We could re- draw the map based on closeness in the travel network.
  • 13. I turned to the Hadoop cluster at Nokia and took a sample of several weeks of logs from our routing servers. These are used every time someone uses our maps application to request a driving route from one place to another. Every time someone drove from A to B, I made an edge in a “place graph” from A to B.
  • 14. I ran the data through Gephi and asked it to cluster it based on the strength of connections between towns. The result is a not-quite-geographic new map of the world, where two cities are close to each other if people often drive between them.
  • 15. UK China Korea, Japan, etc Spain Most of Europe India Pakistan Finland Russia As you’d expect, the UK is an island and so people don’t drive in and out of it very often. Spain and Portugal are not islands, but they appear separate because they’re attached to the rest of Europe by a very narrow neck of land. So people are much more likely to fly than drive out of Spain.
  • 16. How could we use this data in a practical application? Say I’m coming to New York to attend a conference on big data. I could choose a hotel near the conference venue, but I’d rather see more interesting parts of New York.
  • 17. Where should I stay? If I’ve never been to New York before, I could ask a friend. I could tell them that I like London’s West End and San Francisco’s downtown.
  • 18. Times Square = Piccadilly Circus New York London If they know both towns, they’d probably tell me that Times Square is the Piccadilly Circus of New York.
  • 19. What is the Greenwich Village of Tokyo? ... the Noe Valley of New York? ... the Shibuya of Los Angeles? But if we delve into the place graph, we could answer much more interesting questions, and create a “neighbourhood isomorphism” from city to city. People who like the Mission in SF and Shoreditch in London could find out that Williamsberg is probably the best place for them to stay in New York.
  • 20. Thank you. Matt Biddulph @mattb | matt@hackdiary.com