SlideShare a Scribd company logo
1 of 81
Download to read offline
The Google Scholar Revolution:
a big data bibliometric tool
Google Scholar Day: Changing current evaluation paradigms
Cybermetrics Lab (IPP–CSIC)
Madrid, 20 February 2017
Enrique Orduña-Malea, Alberto Martín-Martín, Juan M. Ayllón,
Emilio Delgado López-Cózar
EC3 Research Group-Scholar Division
EC3 Research Group – Google Scholar Division
searchengine
Bibliographicsearchtool
Bibliometrictool
The two faces of Google Scholar
The google search familly. Its goal: finding information
Scholar Search
Scholar Citations
Scholar Metrics
Simple
Easy
Fast
Easy to understand and use
Universal, international, global
Multilingual
Free
Why is it successful?
20122011
2004
Google’s incursion in Bibliometrics
7
Studying it from the bibliometric perspective:
EC3-Scholar Division
Opening the academic Pandora’s Box
2008-
9
Journals Authors Publishers
Multifaceted model
Library & Information Sciences (Spain)
http://www.biblioteconomia-documentacion-española.infoec3.es
Bibliometrics & Scientometrics (International)
http://www.scholar-mirrors.infoec3.es
What have we analyzed?
Intensive, extensive, and obsessive work
Conferences
11
12
• Publication data about 49,930 A&H and SS professors working in public
Spanish universities was extracted from Google Scholar in 2012
• Only authors in the first tercile are displayed
• 68 discipline rankings (49 in Social Sciences and Law, 39 in Arts and
Humanities)
13
Indicators
14
Collecting data
Max. number of hits per page
February 2013: from 100 to 20
15
16
17
Sample of highly cited books (top 3%)
published by ~49k A&H and SS
professors working
in public Spanish universities
Data collected from Google Scholar in
2012 (n ~ 7203)
68 discipline rankings
(49 in Social Sciences and Law,
39 in Arts and Humanities)
18
Indicators: Nº of books, and sum
of citations (relative to highest
element in the ranking)
19
Indicators
H5 Index
H5 Median
22
Spanish Journals
± 2.500
SJR-Scopus
506
JCR
114Google Scholar Metrics
1299
23
24
25
26
27
Indicators
Extracted
directly from
Google Scholar
Metrics
Computed using the article and
citation data available in Google
Scholar Metrics
H Index of
documents
published in the
last 5 years
Median of
citation counts
for articles
published in last
5 years
Sum of
citations for
articles above
h5-index
threshold
28
Classification
Core Related
29
Coverage
IMPORTANT: Google Scholar Metrics
only covers journals that are indexed in
Google Scholar, have published at
least 100 articles in the last 5-year
period, and have received at least 1
citation
30
PROCEEDINGS
31
32
Data sources
Citations
Multifaceted model
33
LIS researchers
in Spain
336 authors in GSC
68 not in GSC
Other sources
ResearcherID (WoS)
ResearchGate
Indicators
Sum of citations
H Index
Nº of documents
RG Score
Impact Points
Aggregating data
Highly cited docs (HCD),
% of HCD by journal, book
publisher, and institution
34
The «Mirrors» approach
There are many platforms that reflect (mirror) scientific activity
on the Web. An inclusive study of the impact of scientific
activity must contemplate as many of them as possible.
35
Bibliometric potential of Google Scholar
We have proved
Yes, we can
What do we know about Google Scholar?
Its strengths and weaknesses
38
It’s the most used academic
search engine
Why do we call it a big data bibliometric tool?
Source
COVERAGE
All documents
Geographic
COVERAGE
All countries
Linguistic
COVERAGE
All languages
Discipline
COVERAGE
All of them
GROWTH
Faster than WoS and
Scopus
FAST TRACK
CITATIONS
Newly published
documents indexed in
a matter of days
Big Data
Big Data
SIZE
Largest bibliographic database in the world
The search engine with the largest coverage
Size matters
Orduña-Malea, E., Ayllón, J. M., Martín-Martín, A., Delgado López-Cózar, E.. (2014). About the size of Google
Scholar: playing the numbers. arXiv preprint arXiv:1407.6239. EC3 Working Papers 18
Orduna-Malea, E., Ayllón, J. M., Martín-Martín, A., Delgado López-Cózar, E. Methods for estimating the size of Google
Scholar. Scientometrics 104 (3), 931-949
2015
The search engine with the largest coverage
Size matters
2017
Nº documents
2017
Larger coverage, larger citation graph
0
0,5
1
1,5
2
2,5
3
Art & Humanities Social Sciences Sciences TOTAL
RatioofGSCitationstoWoS
Citations(avg)
Documents published in 2009 Documents published in 2014
Analysis of most documents with a DOI published in 2009 and 2014 covered by
Web of Science (~2.5 million documents)
45
International
46
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
1800
1805
1810
1815
1820
1825
1830
1835
1840
1845
1850
1855
1860
1865
1870
1875
1880
1885
1890
1895
1900
1905
1910
1915
1920
1925
1930
1935
1940
1945
1950
1955
1960
1965
1970
1975
1980
1985
1990
1995
2000
2005
2010
2015
Proportion of documents covered by Google Scholar by language over the years
Turkish
Dutch
Chinese
Polish
Korean
Italian
Japanese
Portuguese
Spanish
French
Simplified
Chinese
German
English
47
Multilingual
Documents covered by Google Scholar, Web of Science & Scopus by 13 languages
(1800-2016)
48
Multilingual
49
Covers all
document
typologies
Thealternative
51
Google Scholar offers a different vision of scientific production
Web of Science documents
(2009/2014) found in Google Scholar
found in GS
96%
not found in
GS
4%
96% of the searched WoS documents were found in GS. 98% if we only consider
journal articles. The rest might have been found as well if alternative search strategies
had been used.
Martín-Martín, A., Orduña-Malea, E., Ayllón, J. M., Delgado López-Cózar, E. (2014). Does Google Scholar contain all highly cited
documents (1950-2013)?. arXiv preprint arXiv:1410.8464.
Highly cited documents in Google Scholar (1950-2013)
Half of them are not in WoS
The ones who are in WoS: very high citation correlation
54
Confirmation
PRELIMINARY RESULTS:
Analysis of most documents with a DOI published in 2009 covered by Web
of Science (~1 million documents)
Citation Index N spearman.cor p.value prop.cited.gs prop.cited.wos ratio of gs_cit to wos_cit (avg)
Sciences 863801 0,94 0,00 0,97 0,95 1,68
Social Sciences 109232 0,90 0,00 0,97 0,94 2,58
Art & Humanities 13487 0,83 0,00 0,84 0,69 2,52
Sciences Social Sciences Art & Humanities
Logical when you see their sources
elsevier.com 7.200.000
wiley.com 4,590,000
springer 3.290.000
taylor and francis 1.440.000
lww.com 1.030.000
sagepub.com 781.000
nature.com 428.000
bmj.com 370.000
Routledge 293.000
karger.com 188.000
degruyter.com 196.000
biomedcentral.com 121.000
liebertpub.com 90.900
emerald 167.000
books.google.com 14.000.000
ieee.org 2.990.000
jstor.org 2.210.000
acs.org 987.000
cambridge.org 893.000
oxfordjournals.org 872.000
acm.org 472.000
iop.org 462000
aip.org 451.000
arxiv.org 355.000
pnas.org 101.000
ams.org 98.000
sciencemag.org 62.600
nber.org 26.900
Scopus 2009
Web of Science
Google Scholar
56
It has a better coverage of areas like Humanities, Social
Sciences, Engineering…
GS measures scientific impact and more
Scientific Professional
Educational Social
What impact does it measure?
A professional journal in Google Scholar
We question ourselves
Drawbacks Google Scholar
61
Lack of transparency
There isn’t a public master list of the sources
Google Scholar indexes (publishers,
repositories, catalogues, bibliographic
databases and repertoires, aggregators,
journals…)
There is no accurate method to estimate the
size of Google Scholar
62
Similarly, even if a document has received more than
1000 citations, only the first 1000 can be displayed
when clicking the “Cited by” link
We have no control over the results we get
Are the relevant results for my needs among those
1000 results?
Usually yes… thanks to the ranking algorithm they use
Only(!) returns 1000 results for any given query
Is this really a bibliographic problem?
Who is interesed in bibliographic searches of that size?
Weaknesses
63
How does Google Scholar rank results?
64
There is no native method to easily extract bibliographic data
massively. Only one by one.
Weaknesses
There is no API. What can we use for large downloads?
A scraper
65
Weaknesses
Limited filtering and sorting options (year and relevance)
≠
66
Weaknesses
The advanced search form is limited to four search dimensions:
keywords, authors, source of publication (journal, conference…),
and year of publication
67
No quality control of sources indexed. Peer-reviewed documents coexist
with documents that haven’t gone through that process.
Is that really
a problem?
But… Google Scholar also shows which documents are covered by the
Web of Science, and which of them are available from your library. YOUR
CHOICE…
Weaknesses
It shows
richness rather
than a flaw
68
Weaknesses
• Institutional affiliation of the authors of the documents is
available (institution, country)
• The language in which documents are written
• The typology of each document is not clear (book, journal
article, conference communication, thesis, report…). Only
books are marked as such, usually when they have been
found on Google Books
• Not all documents have an abstract
• The author-supplied keywords are not available
• The list of cited references in each article is not available
either
It doesn’t offer information regarding
69
Greatest danger: manipulation
Delgado López‐Cózar, E., Robinson‐García, N., Torres‐Salinas, D. (2014). The Google
Scholar experiment: How to index false papers and manipulate bibliometric indicators.
Journal of the Association for Information Science and Technology, 65(3), 446-454.
70
Errors in the data
Enough quality?
Even with «dirty» data,
it measures more and
better
Large units of analysis: no problem
Individuals: check data first
71
The
Googledependency
Drawbacks:
Google Scholar Metrics
74
75
Drawbacks:
Google Scholar Citations
77
78
79
80
Google Scholar Citations:
Laissez faire laissez passer
Be wary of CUT / COPY – BUTTON / COMAND
bibliometric products
Don’t mix apples and oranges
A final consideration…
To what end are we measuring scientific
activity?
Spreading light where there wasThank you very
much!

More Related Content

What's hot

UPR Science Direct Presentation June 2008
UPR Science Direct Presentation June 2008UPR Science Direct Presentation June 2008
UPR Science Direct Presentation June 2008Liz Pagan
 
social media cafe / organize your author identities
 social media cafe / organize your author identities social media cafe / organize your author identities
social media cafe / organize your author identitiesHugo Besemer
 
How to use Publish or Perish Effectively
How to use Publish or Perish EffectivelyHow to use Publish or Perish Effectively
How to use Publish or Perish EffectivelyAnne-Wil Harzing
 
Building your academic brand through engagement with social media
Building your academic brand through engagement with social mediaBuilding your academic brand through engagement with social media
Building your academic brand through engagement with social mediaAnne-Wil Harzing
 
Social media cafe ResearchGate
Social media cafe ResearchGateSocial media cafe ResearchGate
Social media cafe ResearchGateHugo Besemer
 
Research Sources & Techniques
Research Sources  & TechniquesResearch Sources  & Techniques
Research Sources & TechniquesGina Singh
 
Research Sources
Research SourcesResearch Sources
Research SourcesGina Singh
 
Astronomy libraries - your gateway to information
Astronomy libraries - your gateway to informationAstronomy libraries - your gateway to information
Astronomy libraries - your gateway to informationUta Grothkopf
 
What google scholar can do for you
What google scholar can do for youWhat google scholar can do for you
What google scholar can do for youpvhead123
 
Advanced Searching
Advanced SearchingAdvanced Searching
Advanced SearchingRos Pan
 
EngWri 300 (Gary)
EngWri 300 (Gary)EngWri 300 (Gary)
EngWri 300 (Gary)karlsen
 
Information retrieval for research, Autumn 2014
Information retrieval for research, Autumn 2014Information retrieval for research, Autumn 2014
Information retrieval for research, Autumn 2014Gina Bay
 
Search challenges for collections of book records
Search challenges for collections of book recordsSearch challenges for collections of book records
Search challenges for collections of book recordsArjen de Vries
 
Library Resources and the Literature Review
Library Resources and the Literature ReviewLibrary Resources and the Literature Review
Library Resources and the Literature Reviewjthiessen
 
Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)sbishoptcl
 
Research Sources & Techniques
Research Sources & TechniquesResearch Sources & Techniques
Research Sources & TechniquesGina Singh
 

What's hot (20)

UPR Science Direct Presentation June 2008
UPR Science Direct Presentation June 2008UPR Science Direct Presentation June 2008
UPR Science Direct Presentation June 2008
 
social media cafe / organize your author identities
 social media cafe / organize your author identities social media cafe / organize your author identities
social media cafe / organize your author identities
 
How to use Publish or Perish Effectively
How to use Publish or Perish EffectivelyHow to use Publish or Perish Effectively
How to use Publish or Perish Effectively
 
Building your academic brand through engagement with social media
Building your academic brand through engagement with social mediaBuilding your academic brand through engagement with social media
Building your academic brand through engagement with social media
 
Social media cafe ResearchGate
Social media cafe ResearchGateSocial media cafe ResearchGate
Social media cafe ResearchGate
 
Research Skills 2
Research Skills 2Research Skills 2
Research Skills 2
 
Research Sources & Techniques
Research Sources  & TechniquesResearch Sources  & Techniques
Research Sources & Techniques
 
Research Sources
Research SourcesResearch Sources
Research Sources
 
Astronomy libraries - your gateway to information
Astronomy libraries - your gateway to informationAstronomy libraries - your gateway to information
Astronomy libraries - your gateway to information
 
What google scholar can do for you
What google scholar can do for youWhat google scholar can do for you
What google scholar can do for you
 
Advanced Searching
Advanced SearchingAdvanced Searching
Advanced Searching
 
Chls 335 i_14
Chls 335 i_14Chls 335 i_14
Chls 335 i_14
 
EngWri 300 (Gary)
EngWri 300 (Gary)EngWri 300 (Gary)
EngWri 300 (Gary)
 
Information retrieval for research, Autumn 2014
Information retrieval for research, Autumn 2014Information retrieval for research, Autumn 2014
Information retrieval for research, Autumn 2014
 
Search challenges for collections of book records
Search challenges for collections of book recordsSearch challenges for collections of book records
Search challenges for collections of book records
 
Entomology 314 training
Entomology 314 trainingEntomology 314 training
Entomology 314 training
 
Library Resources and the Literature Review
Library Resources and the Literature ReviewLibrary Resources and the Literature Review
Library Resources and the Literature Review
 
Searching electronic resources effectively BLDS, November 2012
Searching electronic resources effectively BLDS, November 2012Searching electronic resources effectively BLDS, November 2012
Searching electronic resources effectively BLDS, November 2012
 
Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)
 
Research Sources & Techniques
Research Sources & TechniquesResearch Sources & Techniques
Research Sources & Techniques
 

Viewers also liked

Learn How to Build Mobile Apps Using Cloud Services
Learn How to Build Mobile Apps Using Cloud ServicesLearn How to Build Mobile Apps Using Cloud Services
Learn How to Build Mobile Apps Using Cloud ServicesMax Katz
 
Introduction to RichFaces
Introduction to RichFacesIntroduction to RichFaces
Introduction to RichFacesMax Katz
 
Learn Cloud Computing the right way!
Learn Cloud Computing the right way!Learn Cloud Computing the right way!
Learn Cloud Computing the right way!Stefano Bellasio
 
Starky and hutch 70s tv theme
Starky and hutch 70s tv themeStarky and hutch 70s tv theme
Starky and hutch 70s tv themeAlex Grasso
 
MongoDB World 2016: Lunch & Learn: Google Cloud for the Enterprise
MongoDB World 2016: Lunch & Learn: Google Cloud for the EnterpriseMongoDB World 2016: Lunch & Learn: Google Cloud for the Enterprise
MongoDB World 2016: Lunch & Learn: Google Cloud for the EnterpriseMongoDB
 
Google scholar and the academic web
Google scholar and the academic webGoogle scholar and the academic web
Google scholar and the academic webJamie Bisset
 
Journey Through the Cloud - Data Analysis
Journey Through the Cloud - Data AnalysisJourney Through the Cloud - Data Analysis
Journey Through the Cloud - Data AnalysisAmazon Web Services
 
A Tool For Big Data Analysis using Apache Spark
A Tool For Big Data Analysis using Apache SparkA Tool For Big Data Analysis using Apache Spark
A Tool For Big Data Analysis using Apache Sparkdatamantra
 
Scopus & SciVal Training for Researchers
Scopus & SciVal Training for ResearchersScopus & SciVal Training for Researchers
Scopus & SciVal Training for ResearchersCiarán Quinn
 
Big Data Step-by-Step: Infrastructure 3/3: Taking it to the cloud... easily.....
Big Data Step-by-Step: Infrastructure 3/3: Taking it to the cloud... easily.....Big Data Step-by-Step: Infrastructure 3/3: Taking it to the cloud... easily.....
Big Data Step-by-Step: Infrastructure 3/3: Taking it to the cloud... easily.....Jeffrey Breen
 

Viewers also liked (20)

Ii jornadas orientacion investigacion ugr emilio delgado lopez cozar estrateg...
Ii jornadas orientacion investigacion ugr emilio delgado lopez cozar estrateg...Ii jornadas orientacion investigacion ugr emilio delgado lopez cozar estrateg...
Ii jornadas orientacion investigacion ugr emilio delgado lopez cozar estrateg...
 
Learn How to Build Mobile Apps Using Cloud Services
Learn How to Build Mobile Apps Using Cloud ServicesLearn How to Build Mobile Apps Using Cloud Services
Learn How to Build Mobile Apps Using Cloud Services
 
Introduction to RichFaces
Introduction to RichFacesIntroduction to RichFaces
Introduction to RichFaces
 
Learn Cloud Computing the right way!
Learn Cloud Computing the right way!Learn Cloud Computing the right way!
Learn Cloud Computing the right way!
 
Starky and hutch 70s tv theme
Starky and hutch 70s tv themeStarky and hutch 70s tv theme
Starky and hutch 70s tv theme
 
MongoDB World 2016: Lunch & Learn: Google Cloud for the Enterprise
MongoDB World 2016: Lunch & Learn: Google Cloud for the EnterpriseMongoDB World 2016: Lunch & Learn: Google Cloud for the Enterprise
MongoDB World 2016: Lunch & Learn: Google Cloud for the Enterprise
 
Google scholar and the academic web
Google scholar and the academic webGoogle scholar and the academic web
Google scholar and the academic web
 
Hacer una tesis doctoral: Informarse, leer, indagar, escribir…publicar…
Hacer una tesis doctoral: Informarse, leer, indagar, escribir…publicar…Hacer una tesis doctoral: Informarse, leer, indagar, escribir…publicar…
Hacer una tesis doctoral: Informarse, leer, indagar, escribir…publicar…
 
Cómo ser doctor y no perecer en el intento: Los siete pecados capitales de l...
Cómo ser doctor y  no perecer en el intento: Los siete pecados capitales de l...Cómo ser doctor y  no perecer en el intento: Los siete pecados capitales de l...
Cómo ser doctor y no perecer en el intento: Los siete pecados capitales de l...
 
Journey Through the Cloud - Data Analysis
Journey Through the Cloud - Data AnalysisJourney Through the Cloud - Data Analysis
Journey Through the Cloud - Data Analysis
 
A Tool For Big Data Analysis using Apache Spark
A Tool For Big Data Analysis using Apache SparkA Tool For Big Data Analysis using Apache Spark
A Tool For Big Data Analysis using Apache Spark
 
¿Cómo buscar, organizar y redactar la bibliografía de mi tesis doctoral?
¿Cómo buscar, organizar y redactar la bibliografía de mi tesis doctoral?   ¿Cómo buscar, organizar y redactar la bibliografía de mi tesis doctoral?
¿Cómo buscar, organizar y redactar la bibliografía de mi tesis doctoral?
 
Scopus & SciVal Training for Researchers
Scopus & SciVal Training for ResearchersScopus & SciVal Training for Researchers
Scopus & SciVal Training for Researchers
 
The Google Scholar Revolution: opening the academic Pandora's box. The transf...
The Google Scholar Revolution: opening the academic Pandora's box. The transf...The Google Scholar Revolution: opening the academic Pandora's box. The transf...
The Google Scholar Revolution: opening the academic Pandora's box. The transf...
 
Cómo usar ImpactStory
Cómo usar ImpactStoryCómo usar ImpactStory
Cómo usar ImpactStory
 
¿Como escribir, publicar y difundir un artículo científico? Reglas y consejos...
¿Como escribir, publicar y difundir un artículo científico? Reglas y consejos...¿Como escribir, publicar y difundir un artículo científico? Reglas y consejos...
¿Como escribir, publicar y difundir un artículo científico? Reglas y consejos...
 
Big Data Step-by-Step: Infrastructure 3/3: Taking it to the cloud... easily.....
Big Data Step-by-Step: Infrastructure 3/3: Taking it to the cloud... easily.....Big Data Step-by-Step: Infrastructure 3/3: Taking it to the cloud... easily.....
Big Data Step-by-Step: Infrastructure 3/3: Taking it to the cloud... easily.....
 
Google Scholar: Redefiniendo el impacto académico
Google Scholar: Redefiniendo el impacto académicoGoogle Scholar: Redefiniendo el impacto académico
Google Scholar: Redefiniendo el impacto académico
 
Otras realidades, otros impactos, otras métricas: la nueva bibliometría
Otras realidades, otros impactos, otras métricas: la nueva bibliometríaOtras realidades, otros impactos, otras métricas: la nueva bibliometría
Otras realidades, otros impactos, otras métricas: la nueva bibliometría
 
Cómo construir una identidad académica digital
Cómo construir una identidad académica digitalCómo construir una identidad académica digital
Cómo construir una identidad académica digital
 

Similar to The Google Scholar Revolution: a big data bibliometric tool

Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...
Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...
Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...Anne-Wil Harzing
 
Latest trends in open science and big data analytical study: a decade of scie...
Latest trends in open science and big data analytical study: a decade of scie...Latest trends in open science and big data analytical study: a decade of scie...
Latest trends in open science and big data analytical study: a decade of scie...Oluwaseyi WUSU
 
Publish or Perish - Realising Google Scholar's potential to democratise citat...
Publish or Perish - Realising Google Scholar's potential to democratise citat...Publish or Perish - Realising Google Scholar's potential to democratise citat...
Publish or Perish - Realising Google Scholar's potential to democratise citat...Anne-Wil Harzing
 
Bibliometrics, Journal Impact Factors and Maximising the Cite-ability of Jour...
Bibliometrics, Journal Impact Factors and Maximising the Cite-ability of Jour...Bibliometrics, Journal Impact Factors and Maximising the Cite-ability of Jour...
Bibliometrics, Journal Impact Factors and Maximising the Cite-ability of Jour...Jamie Bisset
 
Bibliometrics and University Rankings
Bibliometrics and University RankingsBibliometrics and University Rankings
Bibliometrics and University RankingsR Pagell
 
بنك المعرفة-المصرى
بنك المعرفة-المصرىبنك المعرفة-المصرى
بنك المعرفة-المصرىghadeermagdy
 
بنك المعرفة المصرى Egyptian knowledge bank
بنك المعرفة المصرى  Egyptian knowledge bankبنك المعرفة المصرى  Egyptian knowledge bank
بنك المعرفة المصرى Egyptian knowledge banksameh shalash
 
Publishing and impact Wageningen University IL for PhD 20141202
Publishing and impact  Wageningen University IL for PhD 20141202Publishing and impact  Wageningen University IL for PhD 20141202
Publishing and impact Wageningen University IL for PhD 20141202Hugo Besemer
 
Google Scholar's citation graph: comprehensive, global... and inaccessible
Google Scholar's citation graph: comprehensive, global... and inaccessibleGoogle Scholar's citation graph: comprehensive, global... and inaccessible
Google Scholar's citation graph: comprehensive, global... and inaccessiblealbertomartin101
 
Biology spring 2012_neurobiology
Biology spring 2012_neurobiologyBiology spring 2012_neurobiology
Biology spring 2012_neurobiologyBruce Slutsky
 
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...Michael Levine-Clark
 
2014 09-04-foster-metricsworkshopslides
2014 09-04-foster-metricsworkshopslides2014 09-04-foster-metricsworkshopslides
2014 09-04-foster-metricsworkshopslidesNathalie Cornée
 
A new role for libraries in research assessments
A new role for libraries in research assessmentsA new role for libraries in research assessments
A new role for libraries in research assessmentsWouter Gerritsma
 
‘How does Open Access research sit in a Citation network?’ - Guillaume Rivall...
‘How does Open Access research sit in a Citation network?’ - Guillaume Rivall...‘How does Open Access research sit in a Citation network?’ - Guillaume Rivall...
‘How does Open Access research sit in a Citation network?’ - Guillaume Rivall...CONUL Conference
 
Introduction to Altmetrics
Introduction to Altmetrics Introduction to Altmetrics
Introduction to Altmetrics Linda Galloway
 
Scholarly writing and publishing research articles
Scholarly writing and publishing research articlesScholarly writing and publishing research articles
Scholarly writing and publishing research articlesDR.R.SASIPRIYA
 
Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...UoLResearchSupport
 

Similar to The Google Scholar Revolution: a big data bibliometric tool (20)

Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...
Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...
Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...
 
Latest trends in open science and big data analytical study: a decade of scie...
Latest trends in open science and big data analytical study: a decade of scie...Latest trends in open science and big data analytical study: a decade of scie...
Latest trends in open science and big data analytical study: a decade of scie...
 
Publish or Perish - Realising Google Scholar's potential to democratise citat...
Publish or Perish - Realising Google Scholar's potential to democratise citat...Publish or Perish - Realising Google Scholar's potential to democratise citat...
Publish or Perish - Realising Google Scholar's potential to democratise citat...
 
International publication of scientific research
International publication of scientific researchInternational publication of scientific research
International publication of scientific research
 
Bibliometrics, Journal Impact Factors and Maximising the Cite-ability of Jour...
Bibliometrics, Journal Impact Factors and Maximising the Cite-ability of Jour...Bibliometrics, Journal Impact Factors and Maximising the Cite-ability of Jour...
Bibliometrics, Journal Impact Factors and Maximising the Cite-ability of Jour...
 
Bibliometrics and University Rankings
Bibliometrics and University RankingsBibliometrics and University Rankings
Bibliometrics and University Rankings
 
بنك المعرفة-المصرى
بنك المعرفة-المصرىبنك المعرفة-المصرى
بنك المعرفة-المصرى
 
بنك المعرفة المصرى Egyptian knowledge bank
بنك المعرفة المصرى  Egyptian knowledge bankبنك المعرفة المصرى  Egyptian knowledge bank
بنك المعرفة المصرى Egyptian knowledge bank
 
Publishing and impact Wageningen University IL for PhD 20141202
Publishing and impact  Wageningen University IL for PhD 20141202Publishing and impact  Wageningen University IL for PhD 20141202
Publishing and impact Wageningen University IL for PhD 20141202
 
Google Scholar's citation graph: comprehensive, global... and inaccessible
Google Scholar's citation graph: comprehensive, global... and inaccessibleGoogle Scholar's citation graph: comprehensive, global... and inaccessible
Google Scholar's citation graph: comprehensive, global... and inaccessible
 
Biology spring 2012_neurobiology
Biology spring 2012_neurobiologyBiology spring 2012_neurobiology
Biology spring 2012_neurobiology
 
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
 
2014 09-04-foster-metricsworkshopslides
2014 09-04-foster-metricsworkshopslides2014 09-04-foster-metricsworkshopslides
2014 09-04-foster-metricsworkshopslides
 
Georgia ppt 06 08
Georgia ppt 06 08Georgia ppt 06 08
Georgia ppt 06 08
 
A new role for libraries in research assessments
A new role for libraries in research assessmentsA new role for libraries in research assessments
A new role for libraries in research assessments
 
‘How does Open Access research sit in a Citation network?’ - Guillaume Rivall...
‘How does Open Access research sit in a Citation network?’ - Guillaume Rivall...‘How does Open Access research sit in a Citation network?’ - Guillaume Rivall...
‘How does Open Access research sit in a Citation network?’ - Guillaume Rivall...
 
Introduction to Altmetrics
Introduction to Altmetrics Introduction to Altmetrics
Introduction to Altmetrics
 
Scholarly writing and publishing research articles
Scholarly writing and publishing research articlesScholarly writing and publishing research articles
Scholarly writing and publishing research articles
 
Introduction to Metrics and Impact Tracking
Introduction to Metrics and Impact TrackingIntroduction to Metrics and Impact Tracking
Introduction to Metrics and Impact Tracking
 
Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...
 

More from Emilio Delgado Lopez-Cozar, Universidad de Granada

More from Emilio Delgado Lopez-Cozar, Universidad de Granada (20)

Impacto de las revistas españolas de educacion a través de IN-RECS/IN-RECJ: u...
Impacto de las revistas españolas de educacion a través de IN-RECS/IN-RECJ: u...Impacto de las revistas españolas de educacion a través de IN-RECS/IN-RECJ: u...
Impacto de las revistas españolas de educacion a través de IN-RECS/IN-RECJ: u...
 
Hacer una tesis doctoral Informarse, leer, indagar, escribir… publicar… en TI...
Hacer una tesis doctoral Informarse, leer, indagar, escribir… publicar… en TI...Hacer una tesis doctoral Informarse, leer, indagar, escribir… publicar… en TI...
Hacer una tesis doctoral Informarse, leer, indagar, escribir… publicar… en TI...
 
La evaluación de las publicaciones científicas de Derecho en España
La evaluación de las publicaciones científicas de Derecho en EspañaLa evaluación de las publicaciones científicas de Derecho en España
La evaluación de las publicaciones científicas de Derecho en España
 
Buenas prácticas en la construcción de una identidad académica online para un...
Buenas prácticas en la construcción de una identidad académica online para un...Buenas prácticas en la construcción de una identidad académica online para un...
Buenas prácticas en la construcción de una identidad académica online para un...
 
Los nuevos espejos métricos de la ciencia: Google Scholar, ResearchGate y otr...
Los nuevos espejos métricos de la ciencia: Google Scholar, ResearchGate y otr...Los nuevos espejos métricos de la ciencia: Google Scholar, ResearchGate y otr...
Los nuevos espejos métricos de la ciencia: Google Scholar, ResearchGate y otr...
 
Creando una identidad académica digital
Creando una identidad académica digitalCreando una identidad académica digital
Creando una identidad académica digital
 
Como utilizar google scholar para mejorar la visibilidad de nuestra produccio...
Como utilizar google scholar para mejorar la visibilidad de nuestra produccio...Como utilizar google scholar para mejorar la visibilidad de nuestra produccio...
Como utilizar google scholar para mejorar la visibilidad de nuestra produccio...
 
Evaluación de la actividad investigadora española: luces y sombras
Evaluación de la actividad investigadora española: luces y sombrasEvaluación de la actividad investigadora española: luces y sombras
Evaluación de la actividad investigadora española: luces y sombras
 
Repercusion de los rankings de universidades en la prensa española (2004-2013)
Repercusion de los rankings de universidades en la prensa española (2004-2013)Repercusion de los rankings de universidades en la prensa española (2004-2013)
Repercusion de los rankings de universidades en la prensa española (2004-2013)
 
Cocinando rankings de universidades
Cocinando rankings de universidadesCocinando rankings de universidades
Cocinando rankings de universidades
 
Cómo crear y mantener un perfil en Google Scholar Citations
Cómo crear y mantener un perfil en Google Scholar CitationsCómo crear y mantener un perfil en Google Scholar Citations
Cómo crear y mantener un perfil en Google Scholar Citations
 
Cómo usar ResearchGate
Cómo usar ResearchGateCómo usar ResearchGate
Cómo usar ResearchGate
 
¿Evaluar la investigación con Google Scholar? Yes we can
¿Evaluar la investigación con  Google Scholar? Yes we can ¿Evaluar la investigación con  Google Scholar? Yes we can
¿Evaluar la investigación con Google Scholar? Yes we can
 
Difusión y visibilidad de la producción científica en la web def
Difusión y visibilidad de la producción científica en la web defDifusión y visibilidad de la producción científica en la web def
Difusión y visibilidad de la producción científica en la web def
 
¿Cómo buscar información? ¿Cómo estar permanentemente informado? ¿Cómo guarda...
¿Cómo buscar información? ¿Cómo estar permanentemente informado? ¿Cómo guarda...¿Cómo buscar información? ¿Cómo estar permanentemente informado? ¿Cómo guarda...
¿Cómo buscar información? ¿Cómo estar permanentemente informado? ¿Cómo guarda...
 
Como redactar un trabajo fin de grado. Reglas y consejos sobre redacción aca...
Como redactar un trabajo fin de grado.  Reglas y consejos sobre redacción aca...Como redactar un trabajo fin de grado.  Reglas y consejos sobre redacción aca...
Como redactar un trabajo fin de grado. Reglas y consejos sobre redacción aca...
 
Taller para las solicitudes de sexenios en Ciencias Sociales UGR 2011
Taller para las solicitudes de sexenios en Ciencias Sociales UGR 2011Taller para las solicitudes de sexenios en Ciencias Sociales UGR 2011
Taller para las solicitudes de sexenios en Ciencias Sociales UGR 2011
 
Emilio Delgado Lopez Cozar cómo utilizar los indicadores bibliometricos para ...
Emilio Delgado Lopez Cozar cómo utilizar los indicadores bibliometricos para ...Emilio Delgado Lopez Cozar cómo utilizar los indicadores bibliometricos para ...
Emilio Delgado Lopez Cozar cómo utilizar los indicadores bibliometricos para ...
 
Emilio delgado lopez cozar la evaluación de las publicaciones científicas en ...
Emilio delgado lopez cozar la evaluación de las publicaciones científicas en ...Emilio delgado lopez cozar la evaluación de las publicaciones científicas en ...
Emilio delgado lopez cozar la evaluación de las publicaciones científicas en ...
 
Evaluacion sexenios acreditacion uhu
Evaluacion sexenios acreditacion uhuEvaluacion sexenios acreditacion uhu
Evaluacion sexenios acreditacion uhu
 

Recently uploaded

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 

Recently uploaded (20)

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

The Google Scholar Revolution: a big data bibliometric tool

  • 1. The Google Scholar Revolution: a big data bibliometric tool Google Scholar Day: Changing current evaluation paradigms Cybermetrics Lab (IPP–CSIC) Madrid, 20 February 2017 Enrique Orduña-Malea, Alberto Martín-Martín, Juan M. Ayllón, Emilio Delgado López-Cózar EC3 Research Group-Scholar Division
  • 2. EC3 Research Group – Google Scholar Division
  • 4. The google search familly. Its goal: finding information Scholar Search Scholar Citations Scholar Metrics
  • 5. Simple Easy Fast Easy to understand and use Universal, international, global Multilingual Free Why is it successful?
  • 7. 7 Studying it from the bibliometric perspective: EC3-Scholar Division
  • 8. Opening the academic Pandora’s Box 2008-
  • 9. 9 Journals Authors Publishers Multifaceted model Library & Information Sciences (Spain) http://www.biblioteconomia-documentacion-española.infoec3.es Bibliometrics & Scientometrics (International) http://www.scholar-mirrors.infoec3.es What have we analyzed? Intensive, extensive, and obsessive work Conferences
  • 10.
  • 11. 11
  • 12. 12 • Publication data about 49,930 A&H and SS professors working in public Spanish universities was extracted from Google Scholar in 2012 • Only authors in the first tercile are displayed • 68 discipline rankings (49 in Social Sciences and Law, 39 in Arts and Humanities)
  • 14. 14 Collecting data Max. number of hits per page February 2013: from 100 to 20
  • 15. 15
  • 16. 16
  • 17. 17 Sample of highly cited books (top 3%) published by ~49k A&H and SS professors working in public Spanish universities Data collected from Google Scholar in 2012 (n ~ 7203) 68 discipline rankings (49 in Social Sciences and Law, 39 in Arts and Humanities)
  • 18. 18 Indicators: Nº of books, and sum of citations (relative to highest element in the ranking)
  • 19. 19
  • 20.
  • 23. 23
  • 24. 24
  • 25. 25
  • 26. 26
  • 27. 27 Indicators Extracted directly from Google Scholar Metrics Computed using the article and citation data available in Google Scholar Metrics H Index of documents published in the last 5 years Median of citation counts for articles published in last 5 years Sum of citations for articles above h5-index threshold
  • 29. 29 Coverage IMPORTANT: Google Scholar Metrics only covers journals that are indexed in Google Scholar, have published at least 100 articles in the last 5-year period, and have received at least 1 citation
  • 31. 31
  • 33. 33 LIS researchers in Spain 336 authors in GSC 68 not in GSC Other sources ResearcherID (WoS) ResearchGate Indicators Sum of citations H Index Nº of documents RG Score Impact Points Aggregating data Highly cited docs (HCD), % of HCD by journal, book publisher, and institution
  • 34. 34 The «Mirrors» approach There are many platforms that reflect (mirror) scientific activity on the Web. An inclusive study of the impact of scientific activity must contemplate as many of them as possible.
  • 35. 35
  • 36. Bibliometric potential of Google Scholar We have proved Yes, we can What do we know about Google Scholar? Its strengths and weaknesses
  • 37.
  • 38. 38 It’s the most used academic search engine
  • 39. Why do we call it a big data bibliometric tool? Source COVERAGE All documents Geographic COVERAGE All countries Linguistic COVERAGE All languages Discipline COVERAGE All of them GROWTH Faster than WoS and Scopus FAST TRACK CITATIONS Newly published documents indexed in a matter of days Big Data Big Data SIZE Largest bibliographic database in the world
  • 40. The search engine with the largest coverage Size matters Orduña-Malea, E., Ayllón, J. M., Martín-Martín, A., Delgado López-Cózar, E.. (2014). About the size of Google Scholar: playing the numbers. arXiv preprint arXiv:1407.6239. EC3 Working Papers 18 Orduna-Malea, E., Ayllón, J. M., Martín-Martín, A., Delgado López-Cózar, E. Methods for estimating the size of Google Scholar. Scientometrics 104 (3), 931-949 2015
  • 41. The search engine with the largest coverage Size matters 2017 Nº documents
  • 42. 2017
  • 43.
  • 44. Larger coverage, larger citation graph 0 0,5 1 1,5 2 2,5 3 Art & Humanities Social Sciences Sciences TOTAL RatioofGSCitationstoWoS Citations(avg) Documents published in 2009 Documents published in 2014 Analysis of most documents with a DOI published in 2009 and 2014 covered by Web of Science (~2.5 million documents)
  • 47. 47 Multilingual Documents covered by Google Scholar, Web of Science & Scopus by 13 languages (1800-2016)
  • 51. 51 Google Scholar offers a different vision of scientific production
  • 52. Web of Science documents (2009/2014) found in Google Scholar found in GS 96% not found in GS 4% 96% of the searched WoS documents were found in GS. 98% if we only consider journal articles. The rest might have been found as well if alternative search strategies had been used.
  • 53. Martín-Martín, A., Orduña-Malea, E., Ayllón, J. M., Delgado López-Cózar, E. (2014). Does Google Scholar contain all highly cited documents (1950-2013)?. arXiv preprint arXiv:1410.8464. Highly cited documents in Google Scholar (1950-2013) Half of them are not in WoS The ones who are in WoS: very high citation correlation
  • 54. 54 Confirmation PRELIMINARY RESULTS: Analysis of most documents with a DOI published in 2009 covered by Web of Science (~1 million documents) Citation Index N spearman.cor p.value prop.cited.gs prop.cited.wos ratio of gs_cit to wos_cit (avg) Sciences 863801 0,94 0,00 0,97 0,95 1,68 Social Sciences 109232 0,90 0,00 0,97 0,94 2,58 Art & Humanities 13487 0,83 0,00 0,84 0,69 2,52 Sciences Social Sciences Art & Humanities
  • 55. Logical when you see their sources elsevier.com 7.200.000 wiley.com 4,590,000 springer 3.290.000 taylor and francis 1.440.000 lww.com 1.030.000 sagepub.com 781.000 nature.com 428.000 bmj.com 370.000 Routledge 293.000 karger.com 188.000 degruyter.com 196.000 biomedcentral.com 121.000 liebertpub.com 90.900 emerald 167.000 books.google.com 14.000.000 ieee.org 2.990.000 jstor.org 2.210.000 acs.org 987.000 cambridge.org 893.000 oxfordjournals.org 872.000 acm.org 472.000 iop.org 462000 aip.org 451.000 arxiv.org 355.000 pnas.org 101.000 ams.org 98.000 sciencemag.org 62.600 nber.org 26.900 Scopus 2009 Web of Science Google Scholar
  • 56. 56 It has a better coverage of areas like Humanities, Social Sciences, Engineering…
  • 57. GS measures scientific impact and more Scientific Professional Educational Social
  • 58. What impact does it measure?
  • 59. A professional journal in Google Scholar
  • 61. 61 Lack of transparency There isn’t a public master list of the sources Google Scholar indexes (publishers, repositories, catalogues, bibliographic databases and repertoires, aggregators, journals…) There is no accurate method to estimate the size of Google Scholar
  • 62. 62 Similarly, even if a document has received more than 1000 citations, only the first 1000 can be displayed when clicking the “Cited by” link We have no control over the results we get Are the relevant results for my needs among those 1000 results? Usually yes… thanks to the ranking algorithm they use Only(!) returns 1000 results for any given query Is this really a bibliographic problem? Who is interesed in bibliographic searches of that size? Weaknesses
  • 63. 63 How does Google Scholar rank results?
  • 64. 64 There is no native method to easily extract bibliographic data massively. Only one by one. Weaknesses There is no API. What can we use for large downloads? A scraper
  • 65. 65 Weaknesses Limited filtering and sorting options (year and relevance) ≠
  • 66. 66 Weaknesses The advanced search form is limited to four search dimensions: keywords, authors, source of publication (journal, conference…), and year of publication
  • 67. 67 No quality control of sources indexed. Peer-reviewed documents coexist with documents that haven’t gone through that process. Is that really a problem? But… Google Scholar also shows which documents are covered by the Web of Science, and which of them are available from your library. YOUR CHOICE… Weaknesses It shows richness rather than a flaw
  • 68. 68 Weaknesses • Institutional affiliation of the authors of the documents is available (institution, country) • The language in which documents are written • The typology of each document is not clear (book, journal article, conference communication, thesis, report…). Only books are marked as such, usually when they have been found on Google Books • Not all documents have an abstract • The author-supplied keywords are not available • The list of cited references in each article is not available either It doesn’t offer information regarding
  • 69. 69 Greatest danger: manipulation Delgado López‐Cózar, E., Robinson‐García, N., Torres‐Salinas, D. (2014). The Google Scholar experiment: How to index false papers and manipulate bibliometric indicators. Journal of the Association for Information Science and Technology, 65(3), 446-454.
  • 70. 70 Errors in the data Enough quality? Even with «dirty» data, it measures more and better Large units of analysis: no problem Individuals: check data first
  • 71. 71
  • 74. 74
  • 75. 75
  • 77. 77
  • 78. 78
  • 79. 79
  • 80. 80 Google Scholar Citations: Laissez faire laissez passer Be wary of CUT / COPY – BUTTON / COMAND bibliometric products Don’t mix apples and oranges
  • 81. A final consideration… To what end are we measuring scientific activity? Spreading light where there wasThank you very much!