SlideShare a Scribd company logo
1 of 44
Scientometric approaches to classification
Nees Jan van Eck
Centre for Science and Technology Studies (CWTS), Leiden University
Colloquium Research Information Systems and Science Classifications: Revisiting the NARCIS Classification
Museum Meermanno, The Hague, The Netherlands
September 28, 2018
Outline
• Bibliographic databases
• Classification systems of scientific literature
• CWTS publication-level classification system of science
– Methodology
– Structure
– Applications
• Quality of classification systems
1
Bibliographic
databases
2
Bibliographic databases
3
Bibliographic databases
4
Web of Science Scopus
Journals 20,000 24,000
Publications 55 million 45 million
Citations 1.2 billion 1.2 billion
Classification systems
of scientific literature
5
Classification systems of scientific literature
• Mono-disciplinary vs. multidisciplinary
• Journal-level vs. publication-level
• Manual vs. algorithmic
6
Classification systems of scientific literature
• Mono-disciplinary:
– Chemical Abstracts: 80 different sections and 5 broad headings
– EconLit: Journal of Economic Literature (JEL) classification system
– PubMed: Medical Subject Headings (MeSH)
• Multidisciplinary:
– Web of Science: 250 categories
– Scopus (ASJC): bottom level has 304 categories and top level includes 27 categories
– Science-Metrix: 176 categories
– National Science Foundation (NSF): 125 categories
– University of California, San Diego (UCSD): more than 500 categories
– Australian and New Zealand Standard Research Classification (FoR): 3 hierarchical levels
7
CWTS publication-
level classification
system of science
8
Algorithmic classification system of science
• First version created in 2012
• Publications (not journals) are clustered into research areas based on citation
relations
• Research areas are defined at different levels of granularity and are
organized hierarchically
• Clustering is performed using the smart local moving algorithm (improved
Louvain algorithm; Waltman & Van Eck, 2013)
9
Objectives
To create a classification system
• in a fully algorithmic manner
• covering all sciences and social sciences
• at the level of individual publications
• with a hierarchical structure
• using transparent, freely available algorithms
• without excessive computational requirements
10
Main challenges
• Dealing with huge volumes of data
• Avoiding disciplinary biases
• Reaching a high level of accuracy
• Being flexible in terms of number of hierarchical levels and size of research
areas
• Obtaining proper labels for the research areas
• Keeping the methodology reasonably simple and transparent
11
Dealing with huge volumes of data
• Linking publications based on direct citations only; no co-citations,
bibliographic coupling, or word co-occurrences
• Efficient clustering algorithm based on ideas taken from:
– Newman (2004): Modularity-based clustering
– Blondel et al. (2008): ‘Louvain method’
– Waltman et al. (2010): VOS clustering technique
– Rotta & Noack (2011): Multilevel local search algorithms
12
Avoiding disciplinary biases
• cij: Relatedness of publications i and j, i.e., 1 if there is a direct citation
relation between i and j, 0 otherwise
• aij: Normalized relatedness of publications i and j, defined as
• Similar to fractional citation counting (Small & Sweeney, 1985)


k ik
ij
ij
c
c
a
13
Reaching a high level of accuracy
• Clustering technique based on maximization of a quality function:
• xi denotes the cluster (research area) to which publication i is assigned
• (xi, xj) = 1 if xi = xj and 0 otherwise
• r denotes a resolution parameter
• Quality function is maximized with respect to x1, ..., xn
 
i j
ijji raxx ))(,(
14
Being flexible in terms of number of hierarchical levels
and size of research areas
• Three types of parameters:
– Number of hierarchical levels
– Each level’s resolution parameter
– Each level’s minimum number of publications per research area
15
Obtaining proper labels for the research areas
1. Identification of terms in titles and abstracts of articles using part-of-speech
tagging
2. Calculation of term relevance scores based on a combination of a term’s
absolute and relative frequency of occurrence
3. Selection of the most relevant terms based on term relevance scores
combined with a filter for removing similar terms
16
CWTS publication-level classification system of
science
• 21.2 million publications from the period 2000–2017 indexed in Web of
Science
• 374.1 million citation relations
• Classification system of 3 hierarchical levels:
– 22 broad disciplines
– 868 fields
– 4,047 subfields
• Computational performance: less than 2 hours
17
18
Breakdown of scientific literature into 22 broad
disciplines
Social sciences
and humanities
Biomedical and
health sciences
Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
22 broad disciplines
19
20
Breakdown of scientific literature into 868 fields
Social sciences
and humanities
Biomedical and
health sciences
Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
21
Breakdown of scientific literature into 4,047 subfields
Social sciences
and humanities
Biomedical and
health sciences Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
22
Breakdown of scientific literature into 4,047 subfields
Social sciences
and humanities
Biomedical and
health sciences Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
Scientometrics
Summary of scientometrics subfield
23
Cluster: 145
No. publications: 16,312
Top 5 terms No. pubs
bibliometric analysis 852
impact factor 495
h index 264
peer review 515
citation 642
Top 5 publications No. cits
hirsch, je (2005). an index to quantify an individual's scientific research output. p natl acad sci usa, 102(46), 16569-16572. 2,635
wuchty, s; et al. (2007). the increasing dominance of teams in production of knowledge. science, 316(5827), 1036-1039. 699
egghe, l (2006). theory and practise of the g-index. scientometrics, 69(1), 131-152. 609
king, da (2004). the scientific impact of nations. nature, 430(6997), 311-316. 496
newman, mej (2004). coauthorship networks and patterns of scientific collaboration. p natl acad sci usa, 101, 5200-5205. 488
Top 5 authors No. pubs Top 5 journals No. pubs
bornmann, l 221 scientometrics 2,865
thelwall, m 202 journal of informetrics 700
leydesdorff, l 175 journal of the american society for information science and technology 613
rousseau, r 161 plos one 339
egghe, l 133 research evaluation 324
Top 5 institutes No. pubs Top 5 departments No. pubs
univ granada 316 sch lib & informat sci (indiana univ) 106
kathol univ leuven 256 amsterdam sch commun res ascor (univ amsterdam) 97
leiden univ 249 ctr sci & technol studies (leiden univ) 90
indiana univ 246 sch publ policy (georgia inst technol - atlanta) 88
univ wolverhampton 216 trend res ctr (asia univ) 84
0
200
400
600
800
1,000
1,200
1,400
1,600
2000 2002 2004 2006 2008 2010 2012 2014 2016
No.publications
Publications in scientometrics subfield
24
25
Term map of scientometrics subfield
Peer review,
OA, careers,
and gender
CollaborationScientometric
indicators and
networks
Medical research
Country-level
analyses
26
Time-line map of highly cited scientometrics
publications
27
Overlay visualizations
Social sciences
and humanities
Biomedical and
health sciences Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
Time trend
28
Social sciences
and humanities
Biomedical and
health sciences Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
Time trend
29
MicroRNA Graphene
Summary of graphene subfield
30
Cluster: 9
No. publications: 27,771
Top 5 terms No. pubs
bilayer graphene 836
epitaxial graphene 491
silicene 401
graphene nanoribbon 1,035
graphene field effect transistor 207
Top 5 publications No. cits
novoselov, ks; et al. (2004). electric field effect in atomically thin carbon films. science, 306(5696), 666-669. 27,743
geim, ak; et al. (2007). the rise of graphene. nat mater, 6(3), 183-191. 20,073
novoselov, ks; et al. (2005). two-dimensional gas of massless dirac fermions in graphene. nature, 438(7065), 197-200. 11,359
castro neto, ah; et al. (2009). the electronic properties of graphene. rev mod phys, 81(1), 109-162. 11,368
zhang, yb; et al. (2005). experimental observation of the quantum hall effect and berry's phase in graphene. nature, 438(7065), 201-204. 8,110
Top 5 authors No. pubs Top 5 journals No. pubs
watanabe, k 249 physical review b 4,013
taniguchi, t 240 applied physics letters 1,834
peeters, fm 233 carbon 994
lin, mf 178 nano letters 906
katsnelson, mi 177 journal of applied physics 841
Top 5 institutes No. pubs Top 5 departments No. pubs
chinese acad sci 1,394 dept phys (natl univ singapore) 257
russian acad sci 778 inst phys (chinese acad sci) 226
peking univ 557 inst mol & mat (radboud univ nijmegen) 216
natl univ singapore 482 dept phys (mit) 209
tsing hua univ 458 dept phys (univ calif berkeley and berkeley national lab) 206
0
500
1,000
1,500
2,000
2,500
3,000
3,500
4,000
2000 2002 2004 2006 2008 2010 2012 2014 2016
No.publications
Open access
31
Social sciences
and humanities
Biomedical and
health sciences Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
University profiles
32
Delft University of TechnologyLeiden University
Applications
• Field normalization
– CWTS Leiden Ranking/U-Multirank
– Dutch University Medical Centers
• Field delineation
– European research funders
• High-resolution research strengths analysis
– European universities
– European research funders
• Identification of interdisciplinary and emerging research areas
– UK Engineering and Physical Sciences Research Council
33
Adopters and potential adopters
• Adopters:
– CWTS
– SciTech Strategies (e.g. SciVal)
– Royal School of Technology (KTH) Stockholm
• Potential adopters:
– Chinese Academy of Sciences
– European Research Council
– Max Planck
34
Quality of
classification systems
35
Empirical micro study using papers on overall water
splitting
• Haunschild et al. (2018)
• Case study comparing CWTS classification to
journal-based and manually constructed
classifications
• Ability of CWTS classification to distinguish
between fields is questioned
36
Accuracy of the journal classification systems of Web
of Science and Scopus
• Wang and Waltman (2016)
• Two criteria to identify journals with questionable
classifications:
– journals that have weak connections with their assigned
categories
– journals that are not assigned to categories with which they
have strong connections
• Web of Science performs significantly better than
Scopus
37
Field classification of publications in Dimensions
• Bornmann (2018)
• Field classification in Dimensions:
– Based on Fields of Research (FOR) from Australian and New
Zealand Standard Research Classification (ANZSRC)
– Machine learning approach
– Each publication is assigned to at least one field
• Based on Bornmann’s own publications
• Questions reliability and validity of Dimensions
classification
38
Response from Dimensions
• Herzog and Lunn (2018)
• Implementation at launch was first step and
requires improvements:
– Improvement of training sets
– Adding new subcategories to FOR system
39
Large-scale system to organize publications into
hierarchical concept structure
• Shen et al. (2018)
• Core component in Microsoft Academic
• Iterative approach to:
– concept discovery (Wikipedia)
– concept tagging to publications (both textual data and graph
structure are considered)
– concept hierarchy construction
• Based on 2000 initial seed concepts, over 228K
concepts have been identified
• Concepts are organized in six-level hierarchy
• 1 billion publication-concept relations
40
Conclusions
41
Conclusions
• Algorithmic approaches can be used to construct large-scale classifications
• Algorithmic classifications at the level of publications gain popularity
• Algorithmic possibilities depend on data availability
• Algorithmic classifications may have the disadvantage of mixing up different
principles for classifying items (e.g., research topic, research method,
scientific community, theoretical tradition, basic vs. applied)
42
Thank you for your attention!
43

More Related Content

What's hot

VOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer TutorialVOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer TutorialNees Jan van Eck
 
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...Nees Jan van Eck
 
Large-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applicationsLarge-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applicationsLudo Waltman
 
Advanced citation matching and large-scale cited reference extraction
Advanced citation matching and large-scale cited reference extractionAdvanced citation matching and large-scale cited reference extraction
Advanced citation matching and large-scale cited reference extractionNees Jan van Eck
 
Science Mapping and Research Positioning
Science Mapping and Research PositioningScience Mapping and Research Positioning
Science Mapping and Research PositioningNees Jan van Eck
 
Intermediacy of publications
Intermediacy of publicationsIntermediacy of publications
Intermediacy of publicationsNees Jan van Eck
 
VOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literatureVOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literatureNees Jan van Eck
 
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...Ludo Waltman
 
Bibliometric visualization using VOSviewer
Bibliometric visualization using VOSviewerBibliometric visualization using VOSviewer
Bibliometric visualization using VOSviewerLudo Waltman
 
Applications of community detection in bibliometric network analysis
Applications of community detection in bibliometric network analysisApplications of community detection in bibliometric network analysis
Applications of community detection in bibliometric network analysisNees Jan van Eck
 
Large-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sourcesLarge-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sourcesNees Jan van Eck
 
Scientometrics for research assessment
Scientometrics for research assessmentScientometrics for research assessment
Scientometrics for research assessmentLudo Waltman
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesLudo Waltman
 
Crossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadataCrossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadataNees Jan van Eck
 
Comparing bibliographic data sources
Comparing bibliographic data sourcesComparing bibliographic data sources
Comparing bibliographic data sourcesLudo Waltman
 
The landscape of research on research
The landscape of research on researchThe landscape of research on research
The landscape of research on researchLudo Waltman
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewerNees Jan van Eck
 
A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...Nees Jan van Eck
 
Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...Ludo Waltman
 
Multiple perspectives on bibliometric data
Multiple perspectives on bibliometric dataMultiple perspectives on bibliometric data
Multiple perspectives on bibliometric dataNees Jan van Eck
 

What's hot (20)

VOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer TutorialVOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer Tutorial
 
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
 
Large-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applicationsLarge-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applications
 
Advanced citation matching and large-scale cited reference extraction
Advanced citation matching and large-scale cited reference extractionAdvanced citation matching and large-scale cited reference extraction
Advanced citation matching and large-scale cited reference extraction
 
Science Mapping and Research Positioning
Science Mapping and Research PositioningScience Mapping and Research Positioning
Science Mapping and Research Positioning
 
Intermediacy of publications
Intermediacy of publicationsIntermediacy of publications
Intermediacy of publications
 
VOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literatureVOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literature
 
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
 
Bibliometric visualization using VOSviewer
Bibliometric visualization using VOSviewerBibliometric visualization using VOSviewer
Bibliometric visualization using VOSviewer
 
Applications of community detection in bibliometric network analysis
Applications of community detection in bibliometric network analysisApplications of community detection in bibliometric network analysis
Applications of community detection in bibliometric network analysis
 
Large-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sourcesLarge-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sources
 
Scientometrics for research assessment
Scientometrics for research assessmentScientometrics for research assessment
Scientometrics for research assessment
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunities
 
Crossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadataCrossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadata
 
Comparing bibliographic data sources
Comparing bibliographic data sourcesComparing bibliographic data sources
Comparing bibliographic data sources
 
The landscape of research on research
The landscape of research on researchThe landscape of research on research
The landscape of research on research
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewer
 
A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...
 
Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...
 
Multiple perspectives on bibliometric data
Multiple perspectives on bibliometric dataMultiple perspectives on bibliometric data
Multiple perspectives on bibliometric data
 

Similar to Scientometric approaches to classification

MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataHerbert Van de Sompel
 
Investigation of Partition Cells as a Structural Basis Suitable for Assessmen...
Investigation of Partition Cells as a Structural Basis Suitable for Assessmen...Investigation of Partition Cells as a Structural Basis Suitable for Assessmen...
Investigation of Partition Cells as a Structural Basis Suitable for Assessmen...Nadine Rons
 
Using Bibliometrics in the Library
Using Bibliometrics in the LibraryUsing Bibliometrics in the Library
Using Bibliometrics in the LibraryState Of Innovation
 
Paper 6: World University's Evaluation (Qiu & Zhao)
Paper 6: World University's Evaluation (Qiu & Zhao)Paper 6: World University's Evaluation (Qiu & Zhao)
Paper 6: World University's Evaluation (Qiu & Zhao)Kent Business School
 
Bibliometric analysis tools on top of the university’s bibliographic database...
Bibliometric analysis tools on top of the university’s bibliographic database...Bibliometric analysis tools on top of the university’s bibliographic database...
Bibliometric analysis tools on top of the university’s bibliographic database...Wouter Gerritsma
 
A new role for libraries in research assessments
A new role for libraries in research assessmentsA new role for libraries in research assessments
A new role for libraries in research assessmentsWouter Gerritsma
 
Where to publish_130709
Where to publish_130709Where to publish_130709
Where to publish_130709opl10
 
Publication strategy for LEI
Publication strategy for LEIPublication strategy for LEI
Publication strategy for LEIWouter Gerritsma
 
Presentation of a bibliometric Analysis of Quantum machine Learning.ppt
Presentation of a bibliometric Analysis of Quantum machine Learning.pptPresentation of a bibliometric Analysis of Quantum machine Learning.ppt
Presentation of a bibliometric Analysis of Quantum machine Learning.pptaliasgharahmadikia77
 
Broad altmetric analysis of Mendeley readerships through the ‘academic status...
Broad altmetric analysis of Mendeley readerships through the ‘academic status...Broad altmetric analysis of Mendeley readerships through the ‘academic status...
Broad altmetric analysis of Mendeley readerships through the ‘academic status...Zohreh Zahedi
 
What is your h-index and other measures of impact
What is your h-index and other measures of impactWhat is your h-index and other measures of impact
What is your h-index and other measures of impactBerenika Webster
 
Towards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD DatasetsTowards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD DatasetsBlerina Spahiu
 
THOR Workshop - Data Publishing PLOS
THOR Workshop - Data Publishing PLOSTHOR Workshop - Data Publishing PLOS
THOR Workshop - Data Publishing PLOSMaaike Duine
 
Determining cognitive distance between publication portfolios of evaluators a...
Determining cognitive distance between publication portfolios of evaluators a...Determining cognitive distance between publication portfolios of evaluators a...
Determining cognitive distance between publication portfolios of evaluators a...Jakaria Rahman
 
بنك المعرفة-المصرى
بنك المعرفة-المصرىبنك المعرفة-المصرى
بنك المعرفة-المصرىghadeermagdy
 
A new software tool for large-scale analysis of citation networks
A new software tool for large-scale analysis of citation networksA new software tool for large-scale analysis of citation networks
A new software tool for large-scale analysis of citation networksNees Jan van Eck
 
بنك المعرفة المصرى Egyptian knowledge bank
بنك المعرفة المصرى  Egyptian knowledge bankبنك المعرفة المصرى  Egyptian knowledge bank
بنك المعرفة المصرى Egyptian knowledge banksameh shalash
 

Similar to Scientometric approaches to classification (20)

MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage data
 
Value-added services for the Wageningen Institutional Repository (WaY)
Value-added services for the Wageningen Institutional Repository (WaY)Value-added services for the Wageningen Institutional Repository (WaY)
Value-added services for the Wageningen Institutional Repository (WaY)
 
Investigation of Partition Cells as a Structural Basis Suitable for Assessmen...
Investigation of Partition Cells as a Structural Basis Suitable for Assessmen...Investigation of Partition Cells as a Structural Basis Suitable for Assessmen...
Investigation of Partition Cells as a Structural Basis Suitable for Assessmen...
 
Using Bibliometrics in the Library
Using Bibliometrics in the LibraryUsing Bibliometrics in the Library
Using Bibliometrics in the Library
 
Paper 6: World University's Evaluation (Qiu & Zhao)
Paper 6: World University's Evaluation (Qiu & Zhao)Paper 6: World University's Evaluation (Qiu & Zhao)
Paper 6: World University's Evaluation (Qiu & Zhao)
 
Bibliometric analysis tools on top of the university’s bibliographic database...
Bibliometric analysis tools on top of the university’s bibliographic database...Bibliometric analysis tools on top of the university’s bibliographic database...
Bibliometric analysis tools on top of the university’s bibliographic database...
 
A new role for libraries in research assessments
A new role for libraries in research assessmentsA new role for libraries in research assessments
A new role for libraries in research assessments
 
Where to publish_130709
Where to publish_130709Where to publish_130709
Where to publish_130709
 
Öppen data och forskningens genomslag
Öppen data och forskningens genomslagÖppen data och forskningens genomslag
Öppen data och forskningens genomslag
 
Publication strategy for LEI
Publication strategy for LEIPublication strategy for LEI
Publication strategy for LEI
 
Presentation of a bibliometric Analysis of Quantum machine Learning.ppt
Presentation of a bibliometric Analysis of Quantum machine Learning.pptPresentation of a bibliometric Analysis of Quantum machine Learning.ppt
Presentation of a bibliometric Analysis of Quantum machine Learning.ppt
 
Broad altmetric analysis of Mendeley readerships through the ‘academic status...
Broad altmetric analysis of Mendeley readerships through the ‘academic status...Broad altmetric analysis of Mendeley readerships through the ‘academic status...
Broad altmetric analysis of Mendeley readerships through the ‘academic status...
 
What is your h-index and other measures of impact
What is your h-index and other measures of impactWhat is your h-index and other measures of impact
What is your h-index and other measures of impact
 
Towards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD DatasetsTowards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD Datasets
 
PLOS Visualization Project
PLOS Visualization ProjectPLOS Visualization Project
PLOS Visualization Project
 
THOR Workshop - Data Publishing PLOS
THOR Workshop - Data Publishing PLOSTHOR Workshop - Data Publishing PLOS
THOR Workshop - Data Publishing PLOS
 
Determining cognitive distance between publication portfolios of evaluators a...
Determining cognitive distance between publication portfolios of evaluators a...Determining cognitive distance between publication portfolios of evaluators a...
Determining cognitive distance between publication portfolios of evaluators a...
 
بنك المعرفة-المصرى
بنك المعرفة-المصرىبنك المعرفة-المصرى
بنك المعرفة-المصرى
 
A new software tool for large-scale analysis of citation networks
A new software tool for large-scale analysis of citation networksA new software tool for large-scale analysis of citation networks
A new software tool for large-scale analysis of citation networks
 
بنك المعرفة المصرى Egyptian knowledge bank
بنك المعرفة المصرى  Egyptian knowledge bankبنك المعرفة المصرى  Egyptian knowledge bank
بنك المعرفة المصرى Egyptian knowledge bank
 

More from Nees Jan van Eck

Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...Nees Jan van Eck
 
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...Nees Jan van Eck
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university rankingNees Jan van Eck
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university rankingNees Jan van Eck
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingNees Jan van Eck
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewerNees Jan van Eck
 
How to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparisonHow to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparisonNees Jan van Eck
 
Advanced bibliometric software tools for publishers and editors
Advanced bibliometric software tools for publishers and editorsAdvanced bibliometric software tools for publishers and editors
Advanced bibliometric software tools for publishers and editorsNees Jan van Eck
 
Large-scale analysis of bibliometric networks
Large-scale analysis of bibliometric networksLarge-scale analysis of bibliometric networks
Large-scale analysis of bibliometric networksNees Jan van Eck
 
Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Nees Jan van Eck
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingNees Jan van Eck
 

More from Nees Jan van Eck (13)

Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...
 
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university ranking
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university ranking
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewer
 
How to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparisonHow to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparison
 
Advanced bibliometric software tools for publishers and editors
Advanced bibliometric software tools for publishers and editorsAdvanced bibliometric software tools for publishers and editors
Advanced bibliometric software tools for publishers and editors
 
Large-scale analysis of bibliometric networks
Large-scale analysis of bibliometric networksLarge-scale analysis of bibliometric networks
Large-scale analysis of bibliometric networks
 
On cluster stability
On cluster stabilityOn cluster stability
On cluster stability
 
Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...
 
Cluster stability
Cluster stabilityCluster stability
Cluster stability
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
 

Recently uploaded

Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
Servosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicServosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicAditi Jain
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingNetHelix
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024Jene van der Heide
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxRitchAndruAgustin
 
Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...navyadasi1992
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxMurugaveni B
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomyDrAnita Sharma
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayupadhyaymani499
 

Recently uploaded (20)

Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
Servosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicServosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by Petrovic
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
 
Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomy
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyay
 

Scientometric approaches to classification

  • 1. Scientometric approaches to classification Nees Jan van Eck Centre for Science and Technology Studies (CWTS), Leiden University Colloquium Research Information Systems and Science Classifications: Revisiting the NARCIS Classification Museum Meermanno, The Hague, The Netherlands September 28, 2018
  • 2. Outline • Bibliographic databases • Classification systems of scientific literature • CWTS publication-level classification system of science – Methodology – Structure – Applications • Quality of classification systems 1
  • 5. Bibliographic databases 4 Web of Science Scopus Journals 20,000 24,000 Publications 55 million 45 million Citations 1.2 billion 1.2 billion
  • 7. Classification systems of scientific literature • Mono-disciplinary vs. multidisciplinary • Journal-level vs. publication-level • Manual vs. algorithmic 6
  • 8. Classification systems of scientific literature • Mono-disciplinary: – Chemical Abstracts: 80 different sections and 5 broad headings – EconLit: Journal of Economic Literature (JEL) classification system – PubMed: Medical Subject Headings (MeSH) • Multidisciplinary: – Web of Science: 250 categories – Scopus (ASJC): bottom level has 304 categories and top level includes 27 categories – Science-Metrix: 176 categories – National Science Foundation (NSF): 125 categories – University of California, San Diego (UCSD): more than 500 categories – Australian and New Zealand Standard Research Classification (FoR): 3 hierarchical levels 7
  • 10. Algorithmic classification system of science • First version created in 2012 • Publications (not journals) are clustered into research areas based on citation relations • Research areas are defined at different levels of granularity and are organized hierarchically • Clustering is performed using the smart local moving algorithm (improved Louvain algorithm; Waltman & Van Eck, 2013) 9
  • 11. Objectives To create a classification system • in a fully algorithmic manner • covering all sciences and social sciences • at the level of individual publications • with a hierarchical structure • using transparent, freely available algorithms • without excessive computational requirements 10
  • 12. Main challenges • Dealing with huge volumes of data • Avoiding disciplinary biases • Reaching a high level of accuracy • Being flexible in terms of number of hierarchical levels and size of research areas • Obtaining proper labels for the research areas • Keeping the methodology reasonably simple and transparent 11
  • 13. Dealing with huge volumes of data • Linking publications based on direct citations only; no co-citations, bibliographic coupling, or word co-occurrences • Efficient clustering algorithm based on ideas taken from: – Newman (2004): Modularity-based clustering – Blondel et al. (2008): ‘Louvain method’ – Waltman et al. (2010): VOS clustering technique – Rotta & Noack (2011): Multilevel local search algorithms 12
  • 14. Avoiding disciplinary biases • cij: Relatedness of publications i and j, i.e., 1 if there is a direct citation relation between i and j, 0 otherwise • aij: Normalized relatedness of publications i and j, defined as • Similar to fractional citation counting (Small & Sweeney, 1985)   k ik ij ij c c a 13
  • 15. Reaching a high level of accuracy • Clustering technique based on maximization of a quality function: • xi denotes the cluster (research area) to which publication i is assigned • (xi, xj) = 1 if xi = xj and 0 otherwise • r denotes a resolution parameter • Quality function is maximized with respect to x1, ..., xn   i j ijji raxx ))(,( 14
  • 16. Being flexible in terms of number of hierarchical levels and size of research areas • Three types of parameters: – Number of hierarchical levels – Each level’s resolution parameter – Each level’s minimum number of publications per research area 15
  • 17. Obtaining proper labels for the research areas 1. Identification of terms in titles and abstracts of articles using part-of-speech tagging 2. Calculation of term relevance scores based on a combination of a term’s absolute and relative frequency of occurrence 3. Selection of the most relevant terms based on term relevance scores combined with a filter for removing similar terms 16
  • 18. CWTS publication-level classification system of science • 21.2 million publications from the period 2000–2017 indexed in Web of Science • 374.1 million citation relations • Classification system of 3 hierarchical levels: – 22 broad disciplines – 868 fields – 4,047 subfields • Computational performance: less than 2 hours 17
  • 19. 18 Breakdown of scientific literature into 22 broad disciplines Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 21. 20 Breakdown of scientific literature into 868 fields Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 22. 21 Breakdown of scientific literature into 4,047 subfields Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 23. 22 Breakdown of scientific literature into 4,047 subfields Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering Scientometrics
  • 24. Summary of scientometrics subfield 23 Cluster: 145 No. publications: 16,312 Top 5 terms No. pubs bibliometric analysis 852 impact factor 495 h index 264 peer review 515 citation 642 Top 5 publications No. cits hirsch, je (2005). an index to quantify an individual's scientific research output. p natl acad sci usa, 102(46), 16569-16572. 2,635 wuchty, s; et al. (2007). the increasing dominance of teams in production of knowledge. science, 316(5827), 1036-1039. 699 egghe, l (2006). theory and practise of the g-index. scientometrics, 69(1), 131-152. 609 king, da (2004). the scientific impact of nations. nature, 430(6997), 311-316. 496 newman, mej (2004). coauthorship networks and patterns of scientific collaboration. p natl acad sci usa, 101, 5200-5205. 488 Top 5 authors No. pubs Top 5 journals No. pubs bornmann, l 221 scientometrics 2,865 thelwall, m 202 journal of informetrics 700 leydesdorff, l 175 journal of the american society for information science and technology 613 rousseau, r 161 plos one 339 egghe, l 133 research evaluation 324 Top 5 institutes No. pubs Top 5 departments No. pubs univ granada 316 sch lib & informat sci (indiana univ) 106 kathol univ leuven 256 amsterdam sch commun res ascor (univ amsterdam) 97 leiden univ 249 ctr sci & technol studies (leiden univ) 90 indiana univ 246 sch publ policy (georgia inst technol - atlanta) 88 univ wolverhampton 216 trend res ctr (asia univ) 84 0 200 400 600 800 1,000 1,200 1,400 1,600 2000 2002 2004 2006 2008 2010 2012 2014 2016 No.publications
  • 26. 25 Term map of scientometrics subfield Peer review, OA, careers, and gender CollaborationScientometric indicators and networks Medical research Country-level analyses
  • 27. 26 Time-line map of highly cited scientometrics publications
  • 28. 27 Overlay visualizations Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 29. Time trend 28 Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 31. Summary of graphene subfield 30 Cluster: 9 No. publications: 27,771 Top 5 terms No. pubs bilayer graphene 836 epitaxial graphene 491 silicene 401 graphene nanoribbon 1,035 graphene field effect transistor 207 Top 5 publications No. cits novoselov, ks; et al. (2004). electric field effect in atomically thin carbon films. science, 306(5696), 666-669. 27,743 geim, ak; et al. (2007). the rise of graphene. nat mater, 6(3), 183-191. 20,073 novoselov, ks; et al. (2005). two-dimensional gas of massless dirac fermions in graphene. nature, 438(7065), 197-200. 11,359 castro neto, ah; et al. (2009). the electronic properties of graphene. rev mod phys, 81(1), 109-162. 11,368 zhang, yb; et al. (2005). experimental observation of the quantum hall effect and berry's phase in graphene. nature, 438(7065), 201-204. 8,110 Top 5 authors No. pubs Top 5 journals No. pubs watanabe, k 249 physical review b 4,013 taniguchi, t 240 applied physics letters 1,834 peeters, fm 233 carbon 994 lin, mf 178 nano letters 906 katsnelson, mi 177 journal of applied physics 841 Top 5 institutes No. pubs Top 5 departments No. pubs chinese acad sci 1,394 dept phys (natl univ singapore) 257 russian acad sci 778 inst phys (chinese acad sci) 226 peking univ 557 inst mol & mat (radboud univ nijmegen) 216 natl univ singapore 482 dept phys (mit) 209 tsing hua univ 458 dept phys (univ calif berkeley and berkeley national lab) 206 0 500 1,000 1,500 2,000 2,500 3,000 3,500 4,000 2000 2002 2004 2006 2008 2010 2012 2014 2016 No.publications
  • 32. Open access 31 Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 33. University profiles 32 Delft University of TechnologyLeiden University
  • 34. Applications • Field normalization – CWTS Leiden Ranking/U-Multirank – Dutch University Medical Centers • Field delineation – European research funders • High-resolution research strengths analysis – European universities – European research funders • Identification of interdisciplinary and emerging research areas – UK Engineering and Physical Sciences Research Council 33
  • 35. Adopters and potential adopters • Adopters: – CWTS – SciTech Strategies (e.g. SciVal) – Royal School of Technology (KTH) Stockholm • Potential adopters: – Chinese Academy of Sciences – European Research Council – Max Planck 34
  • 37. Empirical micro study using papers on overall water splitting • Haunschild et al. (2018) • Case study comparing CWTS classification to journal-based and manually constructed classifications • Ability of CWTS classification to distinguish between fields is questioned 36
  • 38. Accuracy of the journal classification systems of Web of Science and Scopus • Wang and Waltman (2016) • Two criteria to identify journals with questionable classifications: – journals that have weak connections with their assigned categories – journals that are not assigned to categories with which they have strong connections • Web of Science performs significantly better than Scopus 37
  • 39. Field classification of publications in Dimensions • Bornmann (2018) • Field classification in Dimensions: – Based on Fields of Research (FOR) from Australian and New Zealand Standard Research Classification (ANZSRC) – Machine learning approach – Each publication is assigned to at least one field • Based on Bornmann’s own publications • Questions reliability and validity of Dimensions classification 38
  • 40. Response from Dimensions • Herzog and Lunn (2018) • Implementation at launch was first step and requires improvements: – Improvement of training sets – Adding new subcategories to FOR system 39
  • 41. Large-scale system to organize publications into hierarchical concept structure • Shen et al. (2018) • Core component in Microsoft Academic • Iterative approach to: – concept discovery (Wikipedia) – concept tagging to publications (both textual data and graph structure are considered) – concept hierarchy construction • Based on 2000 initial seed concepts, over 228K concepts have been identified • Concepts are organized in six-level hierarchy • 1 billion publication-concept relations 40
  • 43. Conclusions • Algorithmic approaches can be used to construct large-scale classifications • Algorithmic classifications at the level of publications gain popularity • Algorithmic possibilities depend on data availability • Algorithmic classifications may have the disadvantage of mixing up different principles for classifying items (e.g., research topic, research method, scientific community, theoretical tradition, basic vs. applied) 42
  • 44. Thank you for your attention! 43