Information science in practice - research at a Trusted Digital Archive
1. DANS is an institute of KNAW and NWO
Data Archiving and Networked Services
“Information science in practice
– research at a Trusted Digital
Archive”
Andrea Scharnhorst
Head of Research
August 31, 2017
Summer school, honors students, University of Washington
2. Story line
Personal introduction – and a new project
Where are you? Welcome to DANS
Why does an archive/library need research? The DANS research group
Current hot topics at DANS
Research topic
Herbert van de Sompel “Reference Rot in Scholarly Communication”
3.
Source: Places and Spaces – scimaps.org – Klavans/Boyack
Personal introduction
4. From Knowledge Space Lab to Digging into Knowledge Graphs
Almila Akdag Salah, Cheng Gao, Krzysztof Suchecki
Richard Smiraglia, Richard Szostak, Frank van Harmelen
Different knowledge representations
5.
Almila Akdag Salah, Cheng Gao, Krzysztof Suchecki, Andrea Scharnhorst;
Places and Spaces, 7th Iteration, see http://www.scimaps.org/flat/exhibit_info/#7
6.
7.
https://diggingintodata.org/awards/2016/project/digging-knowledge-graph
8.
DARIAH
"Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/"
9.
http://lodlaundromat.org
10. Data Archiving and Networked Services - an archive for research data
• Institute of the Royal Netherlands Academy of Arts and Sciences and the Research Funding Organisation (KNAW & NWO) since 2005
• First predecessor dates back to 1964 (Steinmetz Foundation); Historical Data Archive 1989
• Mission: promote and provide permanent access to digital research information (started with digital archives in the humanities and social sciences)
• Since 2011 – eResearch/research and innovation group
11.
16. Data Reviews, peer reviewed research data
Oral history, enhanced publications – Veteran Tapes
Dutch Parliamentary Election Studies, Data Source Book 1971-2006.
Time and Space. New applications of GIS for humanities.
Exploration of youth data.
CEDAR – Linked Data in Social History
Visually enhanced browsing - KnowEscape
Data research engine (Elsevier)
MIXED: Migration to Intermediate XML for Electronic Data
Enhanced Publications - Verrijkte Publicaties
Persistent Identifiers
XML P.I. : Analysis tool for XML sources
EDNA - e-depot for Dutch archaeology
Geographic map in EASY (Electronic Archiving System)
Integrating NARCIS-DATAVERSE-EASY
Linked Data curation
ResourceSync applied – EHRI
Preparing Data for sharing; Guide to social science data archiving
Driven by data – exploring the research horizon
Digital Research Infrastructure for the Arts and Humanities (DARIAH)
DRIVER (Digital Repository Infrastructure Vision for European Research)
Connecting ARchaeology and ARchitecture in Europeana (CARARE); ARIADNE EU project
Data Seal of Approval
FAIR data principles
Technological basis [DIGITAL DATA]
Science policy agendas [RESEARCH VS. R-INFRASTRUCTURE]
Scientific practices [ARCHIVES]
17. Steering Committee DANS
Scientific Advisory Board DANS
Director
Archive & Support; Research & Innovation; Policy
Office Support
Software Development
Departments
Competence groups
20170901
Research & Innovation
Policy
Projects & Services
Archive
18. Daily life of a research group (2012)
Abbreviation | Name | DANS staff involved | Funded
ACUMEN | Academic Careers Understood through Measurement and Norms; develops criteria and guidelines for Good Evaluation Practices of individual careers (GEP) | Frank van der Most; Andrea Scharnhorst | EC; FP7
APARSEN | Alliance Permanent Access to the Records of Science in Europe Network | René van Horik, Marjan Grootveld, Heiko Tjalsma | EC; FP7
CEDAR | From fragment to fabric – Dutch census data in a web of global cultural and historic information | Christophe Guéret, Albert Meroño Peñuela, Andrea Scharnhorst, Maarten Hoogerwerf, Valentijn Gilissen, Leen Breure, Peter Doorn | KNAW Computational Humanities
Data2Semantics | From Data to Semantics for Scientific Data Publishers, http://www.data2semantics.org/ | Christophe Guéret, Albert Meroño Peñuela, Andrea Scharnhorst, Leen Breure, Peter Doorn | NL Agency of the Dutch Ministry of Economic Affairs; COMMIT programme
Community reviews (led by Community support) | Implementing data reviews in EASY and publishing about it | Marjan Grootveld, Jeff van Egmond, Eko Indarto, Andrea Scharnhorst | Internally
CKCC | Geleerdenbrieven: Circulation of Knowledge and Learned Practices in the 17th-century Dutch Republic. A Web-based Humanities’ Collaboratory on Correspondences | Dirk Roorda, Marjan Grootveld | NWO Middelgroot
CRISP | Context and Role of Interactive Scientific Publications | Leen Breure | Internally
NeDIMAH | Network for Digital Methods in the Arts and Humanities | René van Horik | EC; ESF Network Grant
Visualization EASY | Visualization Descriptors EASY | Olav ten Bosch (external), Andrea Scharnhorst, Peter Doorn | Internally, KDP
VIVO - NARCIS visualization | Visual exploration of NARCIS and mapping NARCIS into VIVO | Linda Reijnhoudt, Andrea Scharnhorst, Chris Baars, Katy Börner, Christophe Guéret, Dirk Roorda | Internally
EINS – Internet science | Developing an integrated and interdisciplinary scientific understanding of Internet networks and their co-evolution with society | Andrea Scharnhorst (with Sally Wyatt) | EC, NoE, FP7
XPOS’RE 2.0 | XML Publications On Scientific Research | Leen Breure; Peter Doorn; Maarten Hoogerwerf; René van Horik | Internally
HistTel | New Interface to the Historic Census data | René van Horik, Christophe Guéret, Ashkan Ashkpour, Albert Meroño Peñuela, Peter Doorn | Internally
KnowEscape | COST action | Andrea Scharnhorst | FP7
ENS | Elite Network Shifts; Computational Humanities KNAW | Andrea Scharnhorst | KNAW
2012: 9 research group members + collaborators; 15 projects + …
2017: 14 research group members; 11 projects across all competence groups
19. Hot topics at DANS
• Working with institutions
• Certification – storing is not archiving – Data Seal of Approval [poster]
• WDS, RDA, European Research Infrastructures (DARIAH, EHRI)
• Working with users – who is using DANS? [next slide]
• Research Data Management Plans [flyer] (Producers)
• Training Data Stewards/Scientists [flyer] (Archivists)
• Quality of service – FAIR data [poster] (Consumers)
• Innovating services
• Archive of the future -> Research projects -> Herbert van de Sompel
20. Types of users – general classification
👤 Individuals
👤 Institutions
Organisations
DANS user study (Christine Borgman team, Herbert van de Sompel, Andrew Treloar)
• Use of EASY
• Persons
• Qualitative research based on interviews with users in all categories
21. Herbert van de Sompel – pioneer in digital information services
22.
Memento
23. MESUR: Studying science from large-scale usage data
Bollen, Johan, Lyudmila Balakireva, Luís Bettencourt, Ryan Chute, Aric Hagberg, Marko A. Rodriguez, and Herbert Van de Sompel. 2009. “Clickstream Data Yields High-Resolution Maps of Science.” PLoS ONE 4 (3): 1-11.
Bollen, Johan, Lyudmila Balakireva, Luís Bettencourt, Ryan Chute, Aric Hagberg, Marko A. Rodriguez, and Herbert Van de Sompel. 2008. A Clickstream Map of Science. Courtesy of Los Alamos National Laboratory. In “5th Iteration (2009): Science Maps for Science Policy-Makers,” Places & Spaces: Mapping Science, edited by Katy Börner and Elisha F. Hardy. http://scimaps.org.
24.
Editor's Notes
I always start with a mutual introduction, and I like to use this map of science to illustrate my journey into science…
Where do you come from?
Add something on Digging into Data and a literature reference to the article with Richard Smiraglia
Use the English version of this!
Add dataverse here
Take new snapshots
DANS as project-driven organisation
Why does an archive need research?
Bringing water to the point
THERE IS ALREADY A LOT OF RESEARCH, OR RESEARCH AND DEVELOPMENT, AT DANS
SELECTION OF PROJECTS ORDERED ALONG THREE DIMENSIONS
POLICY, TECHNOLOGY, RESEARCH WORLD
DANS as a service institution has done research to support its services. This past research can be ordered along these dimensions. Our research programme will start from this experience and knowledge and add a layer of fundamental research to it, located in the amorphous field of information science (library and documentation science, computer science, scientific communication).
When I started at DANS I realized that, although it is a service institute, a lot of research is actually done at DANS. This is an overview of projects and publications of the last 5 years, ordered along three important dimensions in which DANS is active. How does DANS differ from a research institute? DANS employees do not publish to take part in the academic discourse; they publish in their role as service and information providers (hence guidelines and reports dominate). They foster publications such as Wat veteranen vertellen (so authorship is not a primary goal, and sometimes DANS employees are even hard to find). They react to needs from scientific communities (intense communication) and sometimes also ‘activate’ communities (by supporting their most innovative members).
Last but not least, they provide a data archiving system (a production system); this is a different kind of sport with its own rules.
Add organogram
The different activities are mirrored in the competence groups
Special: maintain service (production systems) – expand service (experimental systems mostly in projects)
In the DANS user study with Christine Borgman’s team we started with the classical, high-level categories of Consumers and Producers of data (here concentrating on content in EASY). We added Archivists, because the DANS staff are by no means passive curators of these processes. Through our competence group Data Services, in the interaction with depositors (data managers in Archive) and in the interaction with libraries, scientific communities, and other archives (Policy & Communication), we induce the production of data, and via Software Development we influence the re-use or consumption of data.
In the DANS user study, we concentrate on individuals, and all exploratory data analysis has therefore focused on persons.
In the inventory of information needs, monitoring institutions is one important element that differs from the DANS user studies. Another is monitoring the service itself (smooth performance, but also migration of datasets and updates to the metadata of the AIP).
On a theoretical level (see the MindNode map in the supplementary material) most of the questions contain three elements: users, activities, and content (mainly provided by services, but also by the organisation as a whole – strategic monitoring). But users can be depositors as well as data managers, and content can be an archival package, a NARCIS record, or a project.
Relationship management …
Intro; career; important research projects (Memento, MESUR, Hiberlink); recent prize
Google profile?