This document summarizes the evolving landscape of bibliometric data sources and opportunities for bibliometric visualization. It discusses how alternative data sources like Dimensions, Crossref, and OpenCitations Corpus provide more open citation data than traditional sources like Web of Science and Scopus. While coverage varies, Dimensions and Crossref provide reasonably complete publication and citation data. Discrepancies between sources are due to reference inaccuracies and inconsistencies in citation matching. VOSviewer software supports network analysis and visualization using multiple data sources. The document calls for expanding open citation indexing to further open science.
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bibliometric data sources
1. Web of Science, Scopus, Dimensions, and beyond:
The evolving landscape of bibliometric data sources
Ludo Waltman, Martijn Visser, Nees Jan van Eck
Centre for Science and Technology Studies (CWTS), Leiden University
ROI-AV Conference: Visuals and Analytics that Matter
Copenhagen, Denmark
October 3, 2018
4. Introduction
• Increasing number of alternatives (Microsoft Academic, Dimensions,
Crossref, OpenCitations Corpus) to traditional bibliographic data sources
(Web of Science, Scopus, Google Scholar)
• Alternative data sources are more open than traditional ones
• How do the various data sources compare in terms of the completeness and
quality of their citation data?
3
5. Data sources
• Scopus
– May 2018
– Requires subscription
• Web of Science
– SCIE, SSCI, AHCI, CPCI
– June 2018
– Requires subscription
• Dimensions
– June 2018
– Openly available through web interface
• Crossref
– August 2018
– Openly available through API
4
6. Coverage of publications
5
All publications Publications with DOI
Publications with
unique DOI
Web of Science 40.1 100.0% 18.8 46.9% 18.8 46.9%
Scopus 44.9 100.0% 31.1 69.2% 30.6 68.3%
Dimensions 57.5 100.0% 55.1 95.9% 55.0 95.6%
Crossref 57.3 100.0% 57.3 100.0% 57.3 100.0%
• Publication counts in millions
• Time period 1996-2017
8. Comparison of citation data
7
Scopus-WoS overlap: 460.0M
Only in Scopus: 24.9M
Only in WoS: 15.5M
Scopus-Dimensions overlap: 414.3M
Only in Scopus: 43.5M
Only in Dimensions: 17.9M
Scopus-Crossref overlap: 171.3M
Only in Scopus: 292.1M
Only in Crossref: 6.4M
In these pairwise comparisons of data sources, only
citation links between citing and cited publications
indexed in both data sources are considered
9. Causes of discrepancies between data sources
• Inaccuracies in references
• Inaccuracies in reference data
• Inaccuracies in citation matching
• Multiple versions of a publication
• Multiple records for a publication
• References not having been deposited, being closed or not having been
matched
8
15. Conclusions
• Substantial discrepancies between data sources
• Reasonably complete citation data in Dimensions
• Large gaps in citation data in Crossref, due to references not having been
deposited, being closed or not having been matched
• Need for transparent high-quality citation matching algorithm
• Completeness and quality of other metadata?
14
29. Wish list for improving open data sources
• Expanding coverage of publications (OCC)
• Opening citations (Crossref)
• Opening other metadata, e.g. abstracts (Dimensions, Crossref, OCC)
• Improving completeness and standardization of metadata (Crossref)
• Speeding up APIs
28
30. Toward contextualized scientometrics
• Integrating data collection and visual analysis
• Interactive visual exploration (e.g., drilling down from high-level visual
overviews to underlying data)
• Large-scale visual analyses
• Moving from visualization as a tool for representation to visualization as a
tool for exploration, discussion and reflection
29
It is not certain why so many citation links are missing in WoS. Some references that are very similar to the ones above are linked in WoS. Probably it has to do with group author and supplement,