Y.H. Eom (1,4), P. Aragón (2), D. Laniado (2), A. Kaltenbrunner (2), S. Vigna (3), D.L. Shepelyansky (4)
(1) IMT Institute for Advanced Studies Lucca, Italy
(2) Eurecat | Barcelona Media, Spain
(3) Dipartimento di Informatica, Università degli Studi di Milano, Italy
(4) Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, France
NetSci 2015, Jun 5, Zaragoza
Assessing inter-cultural patterns
through ranking biographies
www.eurecat.org
Wikipedia
• The largest global knowledge repository
• The biggest collaborative platform on the Web
• 287 language editions (different descriptions of human knowledge)
Reflected cultural differences across language editions
Since language is one of the primary elements of culture [1], collective cultural biases may be reflected in the content and organization of each Wikipedia language edition.
How can we identify the collective cultural differences reflected across Wikipedia editions?
[Figure: greetings in several languages (Hola, Hello, 您好, नमस्ते, مرحبا, Olá)]
Methodology
1. For each edition → network of articles
2. Ranking based on network structure
3. Identification of biographical articles and extraction of features
4. Analysis of top people
Edition  Language     Articles    Edition  Language     Articles
EN       English     4 212 493    RU       Russian       966 284
NL       Dutch       1 144 615    HE       Hebrew        144 959
DE       German      1 532 978    TR       Turkish       206 311
FR       French      1 352 825    AR       Arabic        203 328
ES       Spanish       974 025    FA       Persian       295 696
IT       Italian     1 017 953    HI       Hindi          96 869
PT       Portuguese    758 227    MS       Malaysian     180 886
EL       Greek          82 563    TH       Thai           78 953
DA       Danish        175 228    VI       Vietnamese    594 089
SV       Swedish       780 872    ZH       Chinese       663 485
PL       Polish        949 153    KO       Korean        231 959
HU       Hungarian     235 212    JA       Japanese      852 087
The selection covers:
• ~ 60% of world population
• ~ 70% of Wikipedia articles
These 24 editions also cover languages that played an important role in human history, including those of Western, Asian and Arabic cultures.
Methodology (step 2: ranking based on network structure)
PageRank
• Highly linked articles are important
• Articles cited by important articles are also important
CheiRank
• Outgoing links can be important
2DRank
• Combines PageRank and CheiRank
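The three rankings can be illustrated with a minimal Python sketch. It assumes a toy link dictionary with hypothetical article names and a damping factor of 0.85. 2DRank is simplified here: nodes are listed in the order in which they enter a growing square in the (PageRank position, CheiRank position) plane, with ties broken in favour of the better PageRank position. The tie-breaking rule and all function names are assumptions and are not taken from the slides.

```python
def pagerank(links, alpha=0.85, iters=100):
    """Power-iteration PageRank on a dict {node: [target nodes]}."""
    nodes = sorted(set(links) | {t for ts in links.values() for t in ts})
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iters):
        # Probability held by nodes without out-links is spread uniformly.
        dangling = sum(rank[v] for v in nodes if not links.get(v))
        new = {v: (1 - alpha) / n + alpha * dangling / n for v in nodes}
        for src, targets in links.items():
            if targets:
                share = alpha * rank[src] / len(targets)
                for t in targets:
                    new[t] += share
        rank = new
    return rank


def cheirank(links, alpha=0.85, iters=100):
    """CheiRank: PageRank computed on the graph with every link reversed."""
    reversed_links = {}
    for src, targets in links.items():
        reversed_links.setdefault(src, [])
        for t in targets:
            reversed_links.setdefault(t, []).append(src)
    return pagerank(reversed_links, alpha, iters)


def positions(scores):
    """Map each node to its rank position (1 = highest score)."""
    ordered = sorted(scores, key=lambda v: (-scores[v], v))
    return {v: i + 1 for i, v in enumerate(ordered)}


def two_d_rank(links):
    """Simplified 2DRank ordering (tie-breaking rule is an assumption)."""
    k = positions(pagerank(links))
    k_star = positions(cheirank(links))
    return sorted(k, key=lambda v: (max(k[v], k_star[v]), k[v]))


# Hypothetical four-article network.
toy_links = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
print(two_d_rank(toy_links))
```

PageRank rewards incoming links, CheiRank rewards outgoing links, and the combined ordering favours articles that do well on both.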
Methodology (step 3: identification of biographical articles and extraction of features)
• 1.1M people are identified in EN-Wikipedia
• Inter-language links: from EN to the other language editions
Extraction of features through DBpedia (manual inspection if missing):
• Birth place: country level
• Birth date: year level
• Gender: male / female
• Culture: the language currently most spoken at that birth place
Methodology (step 4: analysis of top people)
[Panels: EN-Wikipedia network; global network]
$\Theta_A$ is a person's ranking score under algorithm $A$;
$N_A$ is the number of appearances of a given person in the top-100 rankings across all editions.
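The slide does not spell out how the ranking score is computed. The sketch below assumes the form given in the paper these slides are based on, in which a person scores 101 minus their position for every edition whose top-100 list includes them, summed over editions. The function name and the sample lists are hypothetical.

```python
from collections import Counter


def global_scores(top_lists, list_size=100):
    """Return (theta, appearances) from per-edition ranked lists.

    top_lists maps an edition code to its people ordered from rank 1.
    Assumed score: sum over editions of (list_size + 1 - position).
    """
    theta = Counter()
    appearances = Counter()
    for ranked_people in top_lists.values():
        for position, person in enumerate(ranked_people[:list_size], start=1):
            theta[person] += list_size + 1 - position
            appearances[person] += 1
    return theta, appearances


# Hypothetical two-edition example with very short lists.
sample = {"EN": ["person_a", "person_b"], "FR": ["person_b", "person_c"]}
theta, appearances = global_scores(sample)
global_ranking = sorted(theta, key=lambda p: (-theta[p], p))
```

Under this assumption a person climbs the global ranking either by appearing in more editions or by ranking higher within them.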
Top people ranking
Overlap with alternative rankings:
• Hart's most influential persons in history [2]: 43%
• Top 100 people in the MIT Pantheon list [3]: 44%
Not bigger than Jesus, but relevant from an encyclopedic point of view
Assessing cultural differences in top 100 people
• Spatial distribution
• Temporal distribution
• Gender distribution
Results: Spatial distribution (I)
Sum of appearances of historical figures from a given country in the 24 lists of top 100 persons for PageRank.
Color changes from zero (white) to 233 for Germany (black).
Overall Western, although 11 of the 24 editions are not European (AR, FA, HE, HI, JA, KO, MS, TH, TR, VI, ZH)
Results: Spatial distribution (II)
Birth place distribution of top historical figures averaged over 24 Wikipedia editions for (A) PageRank historical figures (71 countries) and (B) 2DRank historical figures (91 countries).
Overall Western
Results: Spatial distribution (III)
For each Wikipedia edition, distributions of (A) PageRank historical figures over 71 countries;
(B) 2DRank historical figures over 91 countries
Local skewness in the spatial distribution of the top historical figures
Results: Temporal distribution (I)
Birth date distribution of (A) PageRank historical figures and (B) 2DRank historical figures.
Most were born after the 17th century.
Peaks in the 5th and 1st centuries BC among top PageRank people
Results: Temporal distribution (II)
For each Wikipedia edition, birth date distributions of (C) PageRank historical figures and (D) 2DRank historical figures.
Most were born after the 17th century
Arabic and Persian editions: from the 6th to the 10th century
Results: Spatial & Temporal distribution
Temporal locality property of cultures, $r_{L,C} = M_{L,C} / N_{L,C}$, for (A) PageRank historical figures and (B) 2DRank historical figures.
$M_{L,C}$ is the number of historical figures born in countries attributed to a given language edition $L$.
$N_{L,C}$ is the total number of historical figures in a given edition $L$.
• Locality in Ancient History: Greek, Italian, Hebrew or Chinese editions
• Locality in Middle Ages: French, Spanish or Portuguese editions
• Locality in Modern History: English edition
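The locality ratio defined above is a per-century share, which a short Python sketch makes concrete. It assumes that the index C denotes the birth century and that each figure is given as a (birth country, birth century) pair. The function name, the country codes and the sample data are hypothetical.

```python
from collections import defaultdict


def locality_ratio(figures, home_countries):
    """Return {century: M / N} for one language edition.

    figures: iterable of (birth_country, birth_century) pairs taken from
    the edition's top list; home_countries: set of countries attributed
    to the edition's language.
    """
    local = defaultdict(int)   # M: figures born in the edition's countries
    total = defaultdict(int)   # N: all figures in the edition
    for country, century in figures:
        total[century] += 1
        if country in home_countries:
            local[century] += 1
    return {century: local[century] / total[century] for century in total}


# Hypothetical top figures of a French-language edition.
sample_figures = [("FR", 18), ("FR", 18), ("DE", 18), ("IT", 15)]
ratios = locality_ratio(sample_figures, home_countries={"FR"})
```

A ratio close to 1 for a century means the edition's top figures from that period were born almost entirely in its own countries.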
Results: Gender distribution (I)
For each Wikipedia edition, number of women among
(A) top PageRank historical figures;
(B) top 2DRank historical figures.
Overall skewed toward men
Rank  TH (Thai edition) PageRank
1 Sirindhorn
2 Bhumibol Adulyadej
3 Sirikit
4 Thaksin Shinawatra
5 Gautama Buddha
6 Adolf Hitler
7 Taksin
8 Queen Victoria
9 Abhisit Vejjajiva
10 Pridi Banomyong
11 Yingluck Shinawatra
12 Galyani Vadhana
13 Srinagarindra
14 J. R. R. Tolkien
15 Samak Sundaravej
Results: Gender distribution (II)
The average female ratio of historical figures in given centuries across 24 Wikipedia editions.
The ratio of women has grown in recent centuries
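A minimal Python sketch of this average follows. It assumes that the ratio is first computed per edition and per century and then averaged over the editions that have at least one figure born in that century. The slide does not state the averaging rule, so this choice, the function name and the sample data are all assumptions.

```python
from collections import defaultdict


def average_female_ratio(editions):
    """editions: {edition: [(gender, birth_century), ...]}.

    Returns {century: mean over editions of (women / all figures)}.
    """
    ratios_by_century = defaultdict(list)
    for people in editions.values():
        women = defaultdict(int)
        total = defaultdict(int)
        for gender, century in people:
            total[century] += 1
            if gender == "female":
                women[century] += 1
        # One ratio per edition and century that actually has figures.
        for century, count in total.items():
            ratios_by_century[century].append(women[century] / count)
    return {c: sum(r) / len(r) for c, r in ratios_by_century.items()}


# Hypothetical data for two editions.
sample_editions = {
    "EN": [("female", 19), ("male", 19)],
    "FR": [("male", 19), ("male", 19), ("male", 20)],
}
female_ratio = average_female_ratio(sample_editions)
```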
Results: Networks of cultures
Network of cultures and corresponding Google Matrix obtained from
24 Wikipedia languages and the remaining world (WR) considering
(A) top PageRank historical figures and (B) 2DRank historical figures.
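Cross-cultural top persons are treated as influence between cultures.
Example: Napoleon, Napoleon III and Descartes appear among the top persons of the EN edition (link of weight 3 from EN to FR); Shakespeare and Elizabeth II appear among the top persons of the FR edition (link of weight 2 from FR to EN).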
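The example weights (3 and 2) can be turned into a small Google matrix with the sketch below. It assumes the standard construction: link weights are normalised per source culture, a culture with no outgoing links points uniformly to all cultures, and a damping factor of 0.85 is applied. The function name and the extra EN to WR weight are hypothetical and serve only to make the normalisation visible.

```python
def google_matrix(weights, cultures, alpha=0.85):
    """Build a column-stochastic Google matrix as nested lists.

    weights[(src, dst)] is the number of top persons of culture dst in
    the list of edition src. G[i][j] is the transition probability from
    cultures[j] to cultures[i].
    """
    n = len(cultures)
    matrix = [[0.0] * n for _ in range(n)]
    for j, src in enumerate(cultures):
        out_weight = sum(w for (s, _), w in weights.items() if s == src)
        for i, dst in enumerate(cultures):
            if out_weight > 0:
                s_ij = weights.get((src, dst), 0) / out_weight
            else:
                s_ij = 1.0 / n  # dangling culture: uniform column
            matrix[i][j] = alpha * s_ij + (1 - alpha) / n
    return matrix


cultures = ["EN", "FR", "WR"]
# 3 and 2 come from the example; the EN -> WR weight is hypothetical.
weights = {("EN", "FR"): 3, ("FR", "EN"): 2, ("EN", "WR"): 1}
G = google_matrix(weights, cultures)
```

Each column of `G` sums to 1, so PageRank and CheiRank of the cultures can be obtained from it in the same way as for articles.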
Findings
Global: the most important historical figures across Wikipedia language editions were:
• from Western countries
• born after the 17th century
• male
Local: each edition highlights historical figures from its own culture
Future work
• Refine the extraction of features through the Wikidata project (instead of DBpedia)
• Compare results with alternative categories of Wikipedia articles (e.g. food)
References
These slides are based on:
Y.-H. Eom, P. Aragón, D. Laniado, A. Kaltenbrunner, S. Vigna, and D. L. Shepelyansky, “Interactions of cultures and top people of Wikipedia from ranking of 24 language editions,” PLoS ONE, vol. 10, p. e0114825, Mar. 2015.
and cite:
[1] UNESCO (2009). UNESCO World Report: Investing in Cultural Diversity and Intercultural Dialogue.
[2] Hart, M. H. (1978). The 100: A Ranking of the Most Influential Persons in History. Citadel Press.
[3] Pantheon, http://pantheon.media.mit.edu/
Thanks for your attention!

More Related Content

What's hot

internet_society_2010
internet_society_2010internet_society_2010
internet_society_2010Craig Bellamy
 
Border Trouble: On the Frontiers of Digital Scholarship
Border Trouble: On the Frontiers of Digital ScholarshipBorder Trouble: On the Frontiers of Digital Scholarship
Border Trouble: On the Frontiers of Digital ScholarshipSpencer Keralis
 
Open Data: Analysis and Visualisation
Open Data: Analysis and VisualisationOpen Data: Analysis and Visualisation
Open Data: Analysis and VisualisationDr Muhammad Adnan
 
RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
 RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
RDA implementation: the new cataloguing standard in Europe - Dilyana DuchevaLISDISConference
 
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...Pablo Aragón
 

What's hot (7)

internet_society_2010
internet_society_2010internet_society_2010
internet_society_2010
 
An Open Data Story
An Open Data StoryAn Open Data Story
An Open Data Story
 
IS_2010
IS_2010IS_2010
IS_2010
 
Border Trouble: On the Frontiers of Digital Scholarship
Border Trouble: On the Frontiers of Digital ScholarshipBorder Trouble: On the Frontiers of Digital Scholarship
Border Trouble: On the Frontiers of Digital Scholarship
 
Open Data: Analysis and Visualisation
Open Data: Analysis and VisualisationOpen Data: Analysis and Visualisation
Open Data: Analysis and Visualisation
 
RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
 RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
 
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
 

Viewers also liked

Graph Visualization Tool for Twittersphere users based on a high-scalable Ext...
Graph Visualization Tool for Twittersphere users based on a high-scalable Ext...Graph Visualization Tool for Twittersphere users based on a high-scalable Ext...
Graph Visualization Tool for Twittersphere users based on a high-scalable Ext...Pablo Aragón
 
UAB: Análisis de redes sociales
UAB: Análisis de redes socialesUAB: Análisis de redes sociales
UAB: Análisis de redes socialesPablo Aragón
 
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide MadridFrom Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide MadridPablo Aragón
 
Tweeting the campaign: Evaluation of the Strategies performed by Spanish Poli...
Tweeting the campaign: Evaluation of the Strategies performed by Spanish Poli...Tweeting the campaign: Evaluation of the Strategies performed by Spanish Poli...
Tweeting the campaign: Evaluation of the Strategies performed by Spanish Poli...Pablo Aragón
 
Desarrollo de una herramienta de planificación social media
Desarrollo de una herramienta de planificación social mediaDesarrollo de una herramienta de planificación social media
Desarrollo de una herramienta de planificación social mediaPablo Aragón
 
Elecciones 20n en Twitter
Elecciones 20n en TwitterElecciones 20n en Twitter
Elecciones 20n en TwitterPablo Aragón
 
Bases de datos temporales
Bases de datos temporalesBases de datos temporales
Bases de datos temporalesPablo Aragón
 
Biographical social networks on Wikipedia
Biographical social networks on WikipediaBiographical social networks on Wikipedia
Biographical social networks on WikipediaPablo Aragón
 
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...Pablo Aragón
 
DatAnalysis15M: Evolución del sistema-red 15m a través de la topología de redes
DatAnalysis15M: Evolución del sistema-red 15m a través de la topología de redesDatAnalysis15M: Evolución del sistema-red 15m a través de la topología de redes
DatAnalysis15M: Evolución del sistema-red 15m a través de la topología de redesPablo Aragón
 
Not all paths lead to Rome: Analysing the network of sister cities
Not all paths lead to Rome: Analysing the network of sister citiesNot all paths lead to Rome: Analysing the network of sister cities
Not all paths lead to Rome: Analysing the network of sister citiesAndreas Kaltenbrunner
 
Who are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsWho are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsRuth Garcia Gavilanes
 
Datactic, Data with Tactics
Datactic, Data with TacticsDatactic, Data with Tactics
Datactic, Data with TacticsPablo Aragón
 
Gestión de instancias en amazon ec2 desde consola
Gestión de instancias en amazon ec2 desde consolaGestión de instancias en amazon ec2 desde consola
Gestión de instancias en amazon ec2 desde consolaPablo Aragón
 
The missing link between Network Science and the Social Media Monitoring indu...
The missing link between Network Science and the Social Media Monitoring indu...The missing link between Network Science and the Social Media Monitoring indu...
The missing link between Network Science and the Social Media Monitoring indu...Pablo Aragón
 
Spanish indignados and the evolution of #15M: Towards networked para-institut...
Spanish indignados and the evolution of #15M: Towards networked para-institut...Spanish indignados and the evolution of #15M: Towards networked para-institut...
Spanish indignados and the evolution of #15M: Towards networked para-institut...Pablo Aragón
 
Análisis de datos para la democracia
Análisis de datos para la democraciaAnálisis de datos para la democracia
Análisis de datos para la democraciaPablo Aragón
 

Viewers also liked (19)

Smmart for partners
Smmart for partnersSmmart for partners
Smmart for partners
 
Graph Visualization Tool for Twittersphere users based on a high-scalable Ext...
Graph Visualization Tool for Twittersphere users based on a high-scalable Ext...Graph Visualization Tool for Twittersphere users based on a high-scalable Ext...
Graph Visualization Tool for Twittersphere users based on a high-scalable Ext...
 
UAB: Análisis de redes sociales
UAB: Análisis de redes socialesUAB: Análisis de redes sociales
UAB: Análisis de redes sociales
 
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide MadridFrom Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
 
Tweeting the campaign: Evaluation of the Strategies performed by Spanish Poli...
Tweeting the campaign: Evaluation of the Strategies performed by Spanish Poli...Tweeting the campaign: Evaluation of the Strategies performed by Spanish Poli...
Tweeting the campaign: Evaluation of the Strategies performed by Spanish Poli...
 
Desarrollo de una herramienta de planificación social media
Desarrollo de una herramienta de planificación social mediaDesarrollo de una herramienta de planificación social media
Desarrollo de una herramienta de planificación social media
 
Elecciones 20n en Twitter
Elecciones 20n en TwitterElecciones 20n en Twitter
Elecciones 20n en Twitter
 
Bases de datos temporales
Bases de datos temporalesBases de datos temporales
Bases de datos temporales
 
3t presentation
3t presentation3t presentation
3t presentation
 
Biographical social networks on Wikipedia
Biographical social networks on WikipediaBiographical social networks on Wikipedia
Biographical social networks on Wikipedia
 
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
 
DatAnalysis15M: Evolución del sistema-red 15m a través de la topología de redes
DatAnalysis15M: Evolución del sistema-red 15m a través de la topología de redesDatAnalysis15M: Evolución del sistema-red 15m a través de la topología de redes
DatAnalysis15M: Evolución del sistema-red 15m a través de la topología de redes
 
Not all paths lead to Rome: Analysing the network of sister cities
Not all paths lead to Rome: Analysing the network of sister citiesNot all paths lead to Rome: Analysing the network of sister cities
Not all paths lead to Rome: Analysing the network of sister cities
 
Who are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsWho are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in Microblogs
 
Datactic, Data with Tactics
Datactic, Data with TacticsDatactic, Data with Tactics
Datactic, Data with Tactics
 
Gestión de instancias en amazon ec2 desde consola
Gestión de instancias en amazon ec2 desde consolaGestión de instancias en amazon ec2 desde consola
Gestión de instancias en amazon ec2 desde consola
 
The missing link between Network Science and the Social Media Monitoring indu...
The missing link between Network Science and the Social Media Monitoring indu...The missing link between Network Science and the Social Media Monitoring indu...
The missing link between Network Science and the Social Media Monitoring indu...
 
Spanish indignados and the evolution of #15M: Towards networked para-institut...
Spanish indignados and the evolution of #15M: Towards networked para-institut...Spanish indignados and the evolution of #15M: Towards networked para-institut...
Spanish indignados and the evolution of #15M: Towards networked para-institut...
 
Análisis de datos para la democracia
Análisis de datos para la democraciaAnálisis de datos para la democracia
Análisis de datos para la democracia
 

Similar to Assessing inter-cultural patterns through ranking biographiesBiographies

#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...DataBeersBCN
 
LIS 653 Posters
LIS 653 PostersLIS 653 Posters
LIS 653 PostersPrattSILS
 
Handbook of bibliography on diaspora and transnationalism 2013
Handbook of bibliography on diaspora and transnationalism 2013Handbook of bibliography on diaspora and transnationalism 2013
Handbook of bibliography on diaspora and transnationalism 2013Diaspora Transnationalism
 
Knowledge maps for libraries and archives - uses and use cases
Knowledge maps for libraries and archives - uses and use casesKnowledge maps for libraries and archives - uses and use cases
Knowledge maps for libraries and archives - uses and use casesAndrea Scharnhorst
 
Semantic Web for Cultural Heritage valorisation
Semantic Web for Cultural Heritage valorisationSemantic Web for Cultural Heritage valorisation
Semantic Web for Cultural Heritage valorisationValentina Carriero
 
Collapse of CivilizationsThe ancient civilizations discussed t.docx
Collapse of CivilizationsThe ancient civilizations discussed t.docxCollapse of CivilizationsThe ancient civilizations discussed t.docx
Collapse of CivilizationsThe ancient civilizations discussed t.docxmccormicknadine86
 
Nigel-West-Historical-Dictionary-of-International-Intelligence.pdf
Nigel-West-Historical-Dictionary-of-International-Intelligence.pdfNigel-West-Historical-Dictionary-of-International-Intelligence.pdf
Nigel-West-Historical-Dictionary-of-International-Intelligence.pdfTeabeAmet
 
Cultural Identities in Wikipedia (Wikimania 2016)
Cultural Identities in Wikipedia (Wikimania 2016)Cultural Identities in Wikipedia (Wikimania 2016)
Cultural Identities in Wikipedia (Wikimania 2016)Marc Miquel
 
From Gypsy Kings to Beggars: Integration and Exclusion of the Romanian Roma
From Gypsy Kings to Beggars: Integration and Exclusion of the Romanian RomaFrom Gypsy Kings to Beggars: Integration and Exclusion of the Romanian Roma
From Gypsy Kings to Beggars: Integration and Exclusion of the Romanian RomaMichael S
 
European librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightEuropean librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightJulien Houssiere
 
Historical Artifact of the Library Card Catalog
Historical Artifact of the Library Card Catalog Historical Artifact of the Library Card Catalog
Historical Artifact of the Library Card Catalog Antoinette Durden
 
It's a Man's Wikipedia?
It's a Man's Wikipedia? It's a Man's Wikipedia?
It's a Man's Wikipedia? Claudia Wagner
 
Essay On Vocational Education.pdf
Essay On Vocational Education.pdfEssay On Vocational Education.pdf
Essay On Vocational Education.pdfHeather Lopez
 
Data Journalism - Storytelling with Data
Data Journalism - Storytelling with DataData Journalism - Storytelling with Data
Data Journalism - Storytelling with DataBahareh Heravi
 
Digital Humanities as Innovation: ‘constant revolution’ or ‘moving to the su...
Digital Humanities as Innovation:  ‘constant revolution’ or ‘moving to the su...Digital Humanities as Innovation:  ‘constant revolution’ or ‘moving to the su...
Digital Humanities as Innovation: ‘constant revolution’ or ‘moving to the su...Andrea Scharnhorst
 
Race, ethnicity and nation international perspectives on social conflict
Race, ethnicity and nation international perspectives on social conflictRace, ethnicity and nation international perspectives on social conflict
Race, ethnicity and nation international perspectives on social conflictyoonshweyee
 

Similar to Assessing inter-cultural patterns through ranking biographiesBiographies (20)

#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
 
LIS 653 Posters
LIS 653 PostersLIS 653 Posters
LIS 653 Posters
 
Handbook of bibliography on diaspora and transnationalism 2013
Handbook of bibliography on diaspora and transnationalism 2013Handbook of bibliography on diaspora and transnationalism 2013
Handbook of bibliography on diaspora and transnationalism 2013
 
Knowledge maps for libraries and archives - uses and use cases
Knowledge maps for libraries and archives - uses and use casesKnowledge maps for libraries and archives - uses and use cases
Knowledge maps for libraries and archives - uses and use cases
 
Semantic Web for Cultural Heritage valorisation
Semantic Web for Cultural Heritage valorisationSemantic Web for Cultural Heritage valorisation
Semantic Web for Cultural Heritage valorisation
 
Historical Dictionary of International Intelligence
Historical Dictionary of International IntelligenceHistorical Dictionary of International Intelligence
Historical Dictionary of International Intelligence
 
Collapse of CivilizationsThe ancient civilizations discussed t.docx
Collapse of CivilizationsThe ancient civilizations discussed t.docxCollapse of CivilizationsThe ancient civilizations discussed t.docx
Collapse of CivilizationsThe ancient civilizations discussed t.docx
 
Nigel-West-Historical-Dictionary-of-International-Intelligence.pdf
Nigel-West-Historical-Dictionary-of-International-Intelligence.pdfNigel-West-Historical-Dictionary-of-International-Intelligence.pdf
Nigel-West-Historical-Dictionary-of-International-Intelligence.pdf
 
Cultural Identities in Wikipedia (Wikimania 2016)
Cultural Identities in Wikipedia (Wikimania 2016)Cultural Identities in Wikipedia (Wikimania 2016)
Cultural Identities in Wikipedia (Wikimania 2016)
 
From Gypsy Kings to Beggars: Integration and Exclusion of the Romanian Roma
From Gypsy Kings to Beggars: Integration and Exclusion of the Romanian RomaFrom Gypsy Kings to Beggars: Integration and Exclusion of the Romanian Roma
From Gypsy Kings to Beggars: Integration and Exclusion of the Romanian Roma
 
European librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightEuropean librarians theatre - Social Media Spotlight
European librarians theatre - Social Media Spotlight
 
Metadata for digital humanities
Metadata for digital humanitiesMetadata for digital humanities
Metadata for digital humanities
 
Historical Artifact of the Library Card Catalog
Historical Artifact of the Library Card Catalog Historical Artifact of the Library Card Catalog
Historical Artifact of the Library Card Catalog
 
It's a Man's Wikipedia?
It's a Man's Wikipedia? It's a Man's Wikipedia?
It's a Man's Wikipedia?
 
Essay On Vocational Education.pdf
Essay On Vocational Education.pdfEssay On Vocational Education.pdf
Essay On Vocational Education.pdf
 
Data Journalism - Storytelling with Data
Data Journalism - Storytelling with DataData Journalism - Storytelling with Data
Data Journalism - Storytelling with Data
 
Digital Humanities as Innovation: ‘constant revolution’ or ‘moving to the su...
Digital Humanities as Innovation:  ‘constant revolution’ or ‘moving to the su...Digital Humanities as Innovation:  ‘constant revolution’ or ‘moving to the su...
Digital Humanities as Innovation: ‘constant revolution’ or ‘moving to the su...
 
Fin De Siecle Europe
Fin De Siecle EuropeFin De Siecle Europe
Fin De Siecle Europe
 
Race, ethnicity and nation international perspectives on social conflict
Race, ethnicity and nation international perspectives on social conflictRace, ethnicity and nation international perspectives on social conflict
Race, ethnicity and nation international perspectives on social conflict
 
Data and science
Data and scienceData and science
Data and science
 

More from Pablo Aragón

A preliminary approach to knowledge integrity risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity risk assessment in Wikipedia p...Pablo Aragón
 
Civic Technologies: Research, Practice, and Open Challenges
Civic Technologies: Research, Practice, and Open ChallengesCivic Technologies: Research, Practice, and Open Challenges
Civic Technologies: Research, Practice, and Open ChallengesPablo Aragón
 
Characterizing Online Participation in Civic Technologies - PhD
Characterizing Online Participation in Civic Technologies - PhDCharacterizing Online Participation in Civic Technologies - PhD
Characterizing Online Participation in Civic Technologies - PhDPablo Aragón
 
The DECODE Ecosystem: Tools for citizens’ data sovereignty in Barcelona
The DECODE Ecosystem: Tools for citizens’ data sovereignty in BarcelonaThe DECODE Ecosystem: Tools for citizens’ data sovereignty in Barcelona
The DECODE Ecosystem: Tools for citizens’ data sovereignty in BarcelonaPablo Aragón
 
Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...
Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...
Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...Pablo Aragón
 
DECODE project: Barcelona pilots
DECODE project: Barcelona pilotsDECODE project: Barcelona pilots
DECODE project: Barcelona pilotsPablo Aragón
 
Generative models of online discussion threads (ASONAM 2018 tutorial)
Generative models of online discussion threads (ASONAM 2018 tutorial)Generative models of online discussion threads (ASONAM 2018 tutorial)
Generative models of online discussion threads (ASONAM 2018 tutorial)Pablo Aragón
 
Online Petitioning Through Data Exploration and What We Found There: A Datase...
Online Petitioning Through Data Exploration and What We Found There: A Datase...Online Petitioning Through Data Exploration and What We Found There: A Datase...
Online Petitioning Through Data Exploration and What We Found There: A Datase...Pablo Aragón
 
Decidim: Visualización de datos para la innovación democrática
Decidim: Visualización de datos para la innovación democráticaDecidim: Visualización de datos para la innovación democrática
Decidim: Visualización de datos para la innovación democráticaPablo Aragón
 
Datos para la participación
Datos para la participaciónDatos para la participación
Datos para la participaciónPablo Aragón
 
Decidim en redes sociales
Decidim en redes socialesDecidim en redes sociales
Decidim en redes socialesPablo Aragón
 
The dynamics of a social convention
The dynamics of a social conventionThe dynamics of a social convention
The dynamics of a social conventionPablo Aragón
 
Discussions and decisions on Decidim Barcelona
Discussions and decisions on Decidim BarcelonaDiscussions and decisions on Decidim Barcelona
Discussions and decisions on Decidim BarcelonaPablo Aragón
 
Computational Framework for the Assessment of New Forms of Political Organiza...
Computational Framework for the Assessment of New Forms of Political Organiza...Computational Framework for the Assessment of New Forms of Political Organiza...
Computational Framework for the Assessment of New Forms of Political Organiza...Pablo Aragón
 
4th Databeers BCN - When a movement becomes a party
4th Databeers BCN - When a movement becomes a party4th Databeers BCN - When a movement becomes a party
4th Databeers BCN - When a movement becomes a partyPablo Aragón
 


Assessing inter-cultural patterns through ranking biographies

  • 1. Assessing inter-cultural patterns through ranking biographies. Y.H. Eom (1, 4), P. Aragon (2), D. Laniado (2), A. Kaltenbrunner (2), S. Vigna (3), D.L. Shepelyansky (4). (1) IMT Institute for Advanced Studies Lucca, Italy; (2) Eurecat | Barcelona Media, Spain; (3) Dipartimento di Informatica, Universita degli Studi di Milano, Italy; (4) Laboratoire de Physique Theorique du CNRS, IRSAMC, Universite de Toulouse, France. NetSci 2015, Jun 5, Zaragoza. www.eurecat.org
  • 2. Wikipedia: the largest global knowledge repository; the biggest collaborative platform on the Web; 287 language editions (different descriptions of human knowledge).
  • 3. Reflected cultural differences across language editions. Since language is one of the primary elements of culture [1], collective cultural biases may be reflected in the contents and organization of each Wikipedia language edition. How can we identify reflected, collective cultural differences across Wikipedia editions? Hola, Hello, 您好, नमस्ते, مرحبا, Olá.
  • 4. Methodology
    1. For each edition: network of articles
    2. Ranking based on network structure
    3. Identification of biographical articles and extraction of features
    4. Analysis of top people
    Edition  Language    Articles     Edition  Language    Articles
    EN       English     4 212 493    RU       Russian     966 284
    NL       Dutch       1 144 615    HE       Hebrew      144 959
    DE       German      1 532 978    TR       Turkish     206 311
    FR       French      1 352 825    AR       Arabic      203 328
    ES       Spanish     974 025      FA       Persian     295 696
    IT       Italian     1 017 953    HI       Hindi       96 869
    PT       Portuguese  758 227      MS       Malaysian   180 886
    EL       Greek       82 563       TH       Thai        78 953
    DA       Danish      175 228      VI       Vietnamese  594 089
    SV       Swedish     780 872      ZH       Chinese     663 485
    PL       Polish      949 153      KO       Korean      231 959
    HU       Hungarian   235 212      JA       Japanese    852 087
    The selection covers ~60% of world population and ~70% of Wikipedia articles. These 24 editions also cover languages which played an important role in human history, including Western, Asian and Arabic cultures.
  • 5. Methodology (the same four steps as on slide 4).
  • 6. Methodology, step 2 (ranking based on network structure). Algorithms: PageRank, CheiRank, 2DRank. Highly linked articles are important; articles cited by important articles are also important; out-going links can be important.
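The ranking step on slide 6 can be illustrated with a minimal sketch. It assumes a plain power-iteration PageRank with a damping factor of 0.85 and computes CheiRank as PageRank on the same network with every link reversed, which is the usual definition of CheiRank. The function names, the damping value, the iteration count and the toy graph are illustrative assumptions, not taken from the slides. 2DRank, which combines the two orderings, is not sketched here.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Power-iteration PageRank.

    links maps each article to the list of articles it links to.
    damping and iterations are illustrative defaults (assumptions).
    """
    nodes = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Articles without out-going links spread their score uniformly.
        dangling = sum(rank[v] for v in nodes if not links.get(v))
        new_rank = {v: (1 - damping) / n + damping * dangling / n for v in nodes}
        for source, targets in links.items():
            if targets:
                share = damping * rank[source] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank
    return rank


def cheirank(links, **kwargs):
    """CheiRank: PageRank computed on the network with all links reversed."""
    reversed_links = {}
    for source, targets in links.items():
        reversed_links.setdefault(source, [])
        for target in targets:
            reversed_links.setdefault(target, []).append(source)
    return pagerank(reversed_links, **kwargs)


# Toy network (hypothetical): "hub" links out a lot, "c" is linked to a lot.
toy_links = {"hub": ["a", "b", "c"], "a": ["c"], "b": ["c"], "c": []}
pr = pagerank(toy_links)
cr = cheirank(toy_links)
print(max(pr, key=pr.get))  # article with the highest PageRank
print(max(cr, key=cr.get))  # article with the highest CheiRank
```

In the toy network the heavily cited article comes first under PageRank, while the article with the most out-going links comes first under CheiRank, matching the intuition stated on the slide.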
  • 7. Methodology, step 3 (identification of biographical articles and extraction of features). 1.1M people in EN-Wikipedia are identified; inter-language links are followed from EN to the other language editions. Features are extracted through DBpedia (manual inspection if missing): birth place (country level); birth date (year level); gender (male / female); culture (current most spoken language at that birth place).
  • 8. Methodology, step 4 (analysis of top people), for the EN-Wikipedia network and the global network. Θ_A is the ranking score of algorithm A; N_A is the number of appearances of a given person in the top 100 rank for all editions.
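The formula for Θ_A is not reproduced in this transcript. The sketch below therefore rests on an assumption: each person receives 101 minus their rank in every edition's top 100 list, summed over editions, which is how the accompanying PLoS ONE paper is understood to define the score. N_A is the count of lists in which the person appears. The function name and toy lists are hypothetical.

```python
from collections import defaultdict


def global_scores(top_lists, list_size=100):
    """Aggregate per-edition top lists into a global score per person.

    top_lists maps an edition code to its ranked list of people (best first).
    ASSUMPTION: the score contribution per list is (list_size + 1 - rank).
    Returns (theta, appearances): summed score and number of lists per person.
    """
    theta = defaultdict(int)
    appearances = defaultdict(int)
    for ranked in top_lists.values():
        for position, person in enumerate(ranked[:list_size], start=1):
            theta[person] += list_size + 1 - position
            appearances[person] += 1
    return dict(theta), dict(appearances)


# Hypothetical two-edition example with very short lists.
theta, appearances = global_scores({"EN": ["A", "B"], "FR": ["B", "C"]})
global_ranking = sorted(theta, key=theta.get, reverse=True)
print(global_ranking)
```

Under this assumed definition, a person who is ranked highly in many editions outranks one who tops a single edition.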
  • 9. Top people ranking. Overlap with alternative rankings: Hart's influential persons in history [2]: 43%; top 100 people in the Pantheon MIT list [3]: 44%. Not bigger than Jesus, but relevant from an encyclopedic point of view.
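The overlap figures on slide 9 can be read as the share of names common to two lists. The slide does not state the exact definition, so the sketch below assumes the percentage is taken relative to the size of the first list; with two lists of 100 names the choice makes no difference. The function name and the toy lists are hypothetical.

```python
def overlap_percent(list_a, list_b):
    """Percentage of distinct names in list_a that also appear in list_b."""
    names_a, names_b = set(list_a), set(list_b)
    return 100.0 * len(names_a & names_b) / len(names_a)


# Hypothetical four-name lists sharing two names.
wiki_top = ["Napoleon", "Jesus", "Aristotle", "Carl Linnaeus"]
other_top = ["Jesus", "Isaac Newton", "Aristotle", "Muhammad"]
print(overlap_percent(wiki_top, other_top))
```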
  • 10. Assessing cultural differences in top 100 people: spatial distribution, temporal distribution, gender distribution.
  • 11. Results: Spatial distribution (I). Sum of appearances of historical figures from a given country in the 24 lists of top 100 persons for PageRank. Color changes from zero (white) to 233 for Germany (black). Overall: Western. 11 editions are not European (AR, FA, HE, HI, JA, KO, MS, TH, TR, VI, ZH).
  • 12. Results: Spatial distribution (II). Birth place distribution of top historical figures averaged over the 24 Wikipedia editions for (A) PageRank historical figures (71 countries) and (B) 2DRank historical figures (91 countries). Overall: Western.
  • 13. Results: Spatial distribution (III). For each Wikipedia edition, distributions of (A) PageRank historical figures over 71 countries and (B) 2DRank historical figures over 91 countries. Local skewness in the spatial distribution of the top historical figures.
  • 14. Results: Temporal distribution (I). Birth date distribution of (A) PageRank historical figures and (B) 2DRank historical figures. Most are born after the 17th century. Peaks in the 5th and 1st centuries BC among top PageRank people.
  • 15. Results: Temporal distribution (II). For each Wikipedia edition, birth date distributions of (C) PageRank historical figures and (D) 2DRank historical figures. Most are born after the 17th century; Arabic and Persian: from the 6th to the 10th century.
  • 16. Results: Spatial & Temporal distribution. Temporal locality property of cultures: r_{L,C} = M_{L,C} / N_{L,C}, for (A) PageRank historical figures and (B) 2DRank historical figures. M_{L,C} is the number of historical figures born in countries attributed to a given language edition L; N_{L,C} is the total number of historical figures in a given edition L. Locality in Ancient History: Greek, Italian, Hebrew or Chinese editions. Locality in Middle Ages: French, Spanish or Portuguese editions. Locality in Modern History: English edition.
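The locality ratio r_{L,C} = M_{L,C} / N_{L,C} on slide 16 can be sketched as follows. The sketch assumes that the index C denotes the century of birth, which the slide title suggests but does not state, and it buckets years with integer division by 100. The function name, the input layout and the toy data are hypothetical.

```python
from collections import Counter


def locality_ratio(figures, edition_countries):
    """Per-century share of an edition's top figures born in its own countries.

    figures: list of (birth_country, birth_year) pairs for one edition L.
    edition_countries: set of countries attributed to that edition.
    ASSUMPTION: C is a century bucket computed as birth_year // 100
    (so 1452 falls in bucket 14 and -470 in bucket -5).
    """
    local = Counter()   # M_{L,C}: figures born in the edition's countries
    total = Counter()   # N_{L,C}: all figures of the edition in that bucket
    for country, year in figures:
        century = year // 100
        total[century] += 1
        if country in edition_countries:
            local[century] += 1
    return {century: local[century] / total[century] for century in total}


# Hypothetical top figures of an Italian-language edition.
toy_figures = [("IT", 1452), ("IT", 1475), ("DE", 1483), ("GR", -470)]
ratios = locality_ratio(toy_figures, {"IT"})
print(ratios)
```

A ratio close to 1 in a given bucket means that the edition's top figures from that period are almost all local.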
  • 17. Results: Gender distribution (I). For each Wikipedia edition, number of females among (A) top PageRank historical figures and (B) top 2DRank historical figures. Overall male skewed. Top 15 of the TH PageRank list: 1. Sirindhorn; 2. Bhumibol Adulyadej; 3. Sirikit; 4. Thaksin Shinawatra; 5. Gautama Buddha; 6. Adolf Hitler; 7. Taksin; 8. Queen Victoria; 9. Abhisit Vejjajiva; 10. Pridi Banomyong; 11. Yingluck Shinawatra; 12. Galyani Vadhana; 13. Srinagarindra; 14. J. R. R. Tolkien; 15. Samak Sundaravej.
  • 18. Results: Gender distribution (II). The average female ratio of historical figures in given centuries across 24 Wikipedia editions. The ratio of women has grown in the last centuries.
  • 19. Results: Networks of cultures. Network of cultures and corresponding Google matrix obtained from 24 Wikipedia languages and the remaining world (WR), considering (A) top PageRank historical figures and (B) 2DRank historical figures. Cross-cultural top persons as influence between cultures. Example: EN edition: Napoleon, Napoleon III, Descartes (3); FR edition: Shakespeare, Elizabeth II (2).
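One way to read the example on slide 19 is that an edition points to another culture with a weight equal to the number of that culture's persons in the edition's top list (3 from EN to FR, 2 from FR to EN). The sketch below builds such weights and a Google matrix from them. Several points are assumptions rather than statements from the slides: persons of the edition's own culture are ignored, the damping factor is 0.85, and cultures without out-going links are treated as linking uniformly to all cultures. All function names are hypothetical.

```python
from collections import Counter


def culture_weights(top_cultures):
    """Count, per edition, how many top persons belong to each other culture.

    top_cultures maps an edition code to the culture codes of its top persons.
    ASSUMPTION: persons of the edition's own culture are not counted.
    """
    return {
        edition: dict(Counter(c for c in cultures if c != edition))
        for edition, cultures in top_cultures.items()
    }


def google_matrix(weights, alpha=0.85):
    """Column-stochastic Google matrix G[target][source] built from weights.

    ASSUMPTIONS: alpha = 0.85; a culture with no out-going weight is
    treated as linking uniformly to every culture.
    """
    nodes = sorted(set(weights) | {c for row in weights.values() for c in row})
    n = len(nodes)
    matrix = {target: {} for target in nodes}
    for source in nodes:
        out = weights.get(source, {})
        total = sum(out.values())
        for target in nodes:
            share = out.get(target, 0) / total if total else 1.0 / n
            matrix[target][source] = alpha * share + (1 - alpha) / n
    return nodes, matrix


# The two-culture example from the slide, with own-culture persons added.
toy = {"EN": ["EN", "FR", "FR", "FR"], "FR": ["FR", "EN", "EN"]}
w = culture_weights(toy)
nodes, G = google_matrix(w)
print(w)
print(G["FR"]["EN"])
```

Ranking the cultures then amounts to applying PageRank or CheiRank to this small matrix in the same way as to the article network.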
  • 20. Findings. Global: the most important historical figures across Wikipedia language editions were from Western countries, born after the 17th century, and male. Local: each edition highlights historical figures from that culture. Future work: refine the extraction of features through the WikiData project (instead of DBpedia); compare results to alternative categories of Wikipedia (e.g. food).
  • 21. References. These slides are based on: Y.-H. Eom, P. Aragón, D. Laniado, A. Kaltenbrunner, S. Vigna, and D. L. Shepelyansky, “Interactions of cultures and top people of wikipedia from ranking of 24 language editions,” PLoS ONE, vol. 10, p. e0114825, 03 2015. They cite: [1] UNESCO World Report (2009). Investing in cultural diversity and intercultural dialogue. [2] Hart, M. H. (1978). The 100: A Ranking of the Most Influential Persons in History. Citadel Press. [3] Pantheon, http://pantheon.media.mit.edu/
  • 22. Thanks for your attention!