Presentation given at Macquarie University in support of the ARDC 'institutional role in the data commons' project on "Implementing FAIR: Standards in Research Data Management" https://ardc.edu.au/news/data-and-services-discovery-activities-successful-applicants/
What Are The Drone Anti-jamming Systems Technology?
Ā
What it means to be FAIR
1. What it means to be FAIR
Sarah Jones
Digital Curation Centre
sarah.jones@glasgow.ac.uk
Twitter: @sjDCC
FAIR session, Macquarie University, 7th August 2019
2. What is Digital Curation Centre?
a centre of expertise in digital information curation with a focus
on building capacity, capability and skills for research data
management and open science
www.dcc.ac.uk
Training | Events | Tools | Advocacy | Consultancy | Guidance | Publications | Projects
3. Who am I?
ā¢ Archivist with humanities background
ā¢ Coordinator of DMPonline service
ā¢ Heavily involved in Research Data Alliance
ā¢ Co-Chair on Data Science Schools
ā¢ Rapporteur of FAIR Expert Group
ā¢ Independent member of EOSC Executive
Board
ā¢ From a seaside town ā hence why I love
beach and sunshine here :o)
FAIR session, Macquarie University, 7th August 2019
4. All the fun of the FAIR
Image Israel Palacio https://unsplash.com/photos/P6FgiDNe6W4
5. What is FAIR?
A set of principles that describe the attributes
data need to have to enable and enhance reuse,
by humans and machines
FAIR session, Macquarie University, 7th August 2019
Image CC-BY-SA by SangyaPundir
6. What FAIR means: 15 principles
Findable
F1. (meta)data are assigned a globally unique and
eternally persistent identifier.
F2. data are described with rich metadata.
F3. (meta)data are registered or indexed in a searchable
resource.
F4. metadata specify the data identifier.
Interoperable
I1. (meta)data use a formal, accessible, shared, and
broadly applicable language for knowledge
representation.
I2. (meta)data use vocabularies that follow FAIR
principles.
I3. (meta)data include qualified references to other
(meta)data.
Accessible
A1 (meta)data are retrievable by their identifier using a
standardized communications protocol.
A1.1 the protocol is open, free, and universally
implementable.
A1.2 the protocol allows for an authentication and
authorization procedure, where necessary.
A2 metadata are accessible, even when the data are no
longer available.
Reusable
R1. meta(data) have a plurality of accurate and relevant
attributes.
R1.1. (meta)data are released with a clear and
accessible data usage license.
R1.2. (meta)data are associated with their provenance.
R1.3. (meta)data meet domain-relevant community
standards.
Slide CC-BY by Erik Schultes, Leiden UMC
doi: 10.1038/sdata.2016.18
FAIR session, Macquarie University, 7th August 2019
7. The FAIR data principles explained
ā¢ Clarifications from the Dutch
Techcentre for Life Sciences
ā¢ Each principle is a link to further
clarification, examples and
context
https://www.dtls.nl/fair-data/fair-
principles-explained
R1. Meta(data) are richly described with a plurality of accurate and relevant
attributes
ā¢ By giving data many ālabelsā, it will be much easier to find and reuse the data.
ā¢ Provide not just metadata that allows discovery, but also metadata that richly
describes the context under which that data was generated
ā¢ āpluralityā indicates that metadata should be as generous as possible, even to the
point of providing information that may seem irrelevant.
FAIR session, Macquarie University, 7th August 2019
8. FAIR data checklist
ā¢ Findable
- Persistent Identifier
- Metadata online
ā¢ Accessible
- Data online
- Restrictions where needed
ā¢ Interoperable
- Use standards, controlled vocabs
- Common (open) formats
ā¢ Reusable
- Rich documentation
- Clear usage licence
https://doi.org/10.5281/zenodo.1065991FAIR session, Macquarie University, 7th August 2019
9. FAIR is nothing new
ā¢ Various research communities have been sharing their data
in a āFAIRā way long before the term emerged
ā¢ Meaningful and memorable articulation of concepts
ā¢ Natural desire to want to be āfairā
ā¢ FAIR is gaining significant international traction
FAIR session, Macquarie University, 7th August 2019
10. Open, FAIR and RDM ā setting FAIR in context
Image Richard Balog https://unsplash.com/photos/P6FgiDNe6W4
11. Ultimately funders expect:
ā¢ timely release of data
- once patents are filed or on (acceptance for) publication
ā¢ open data sharing
- As open as possible as closed as necessary
ā¢ preservation of data
- typically 5-10+ years if of long-term value
ā¢ evidence of following policy
- a Data Management Plan or institutional policy and services
See the SPARC Europe funder policy overview:
https://sparceurope.org/latest-update-to-european-open-data-
and-open-science-policies-released
12. Shifting language: policy examples
FAIR session, Macquarie University, 7th August 2019
c.2000 ā 2008
ā¢ Data management
ā¢ Data sharing
ā¢ Preservation
ā¢ Good research
ā¢ conduct codes
c.2010 on
ā¢ Open Science
ā¢ Open Data
c.2016 on
ā¢ FAIR data
ā¢ Reproducibility
ā¢ Ethical
?
* Anecdotal, not scientific. Personal observation on how I feel global data policy rhetoric and terminology has changed
13. Advice
Terminology changes but ideas persist.
Focus on core concepts:
ā¢ managing data well
ā¢ ensuring ethical conduct
ā¢ good quality, reusable data
ā¢ open sharing where possible
FAIR session, Macquarie University, 7th August 2019 Image by Headway
https://unsplash.com/photos/5QgIuuBxKwM
14. Forerunners to FAIR
OECD Principles and Guidelines for Access to
Research Data from Public Funding (2007)
A. Openness
B. Flexibility
C. Transparency
D. Legal conformity
E. Protection of IP
F. Formal responsibility
G. Professionalism
H. Interoperability
I. Quality
J. Security
K. Efficiency
L. Accountability
M. Sustainability
Science as an Open Enterprise (2012)
notion of āintelligent opennessā where data are
accessible, intelligible, assessable and useable
āOpen scientific research data should be easily
discoverable, accessible, assessable,
intelligible, useable, and wherever possible
interoperable to specific quality standards.ā
G8 Science Ministers Statement (2013)
FAIR session, Macquarie University, 7th August 2019
15. How do Open, FAIR & RDM intersect?
Open
FAIR data
Managed data
Internal
Self-interest
External
Community benefit
FAIR session, Macquarie University, 7th August 2019
16. FAIR and Open
ā¢ The greatest potential
reuse comes when data
are both FAIR and Open
ā¢ Align and harmonise FAIR
and Open data policy
FAIR session, Macquarie University, 7th August 2019
Concepts of FAIR and Open should not be conflated.
Data can be FAIR or Open, both or neither
17. Open, FAIR and RDM
FAIR session, Macquarie University, 7th August 2019
ā¢ Paper explores overlaps between
concepts of Open, FAIR and RDM.
ā¢ Proposes using Open and FAIR as
ways to engage researchers in
managing data well, as this is a
prerequisite for both.
ā¢ Recommends making data FAIR
and Open wherever possible
Higman, R., Bangert, D. and Jones, S., 2019. Three camps, one destination: the
intersections of research data management, FAIR and Open. Insights, 32(1), p.18.
DOI: http://doi.org/10.1629/uksg.468
18. Turning FAIR into Reality
Image Kid Circus https://unsplash.com/photos/7vSlK_9gHWA
19. FAIR Data Expert Group
Take a holistic approach to lay out what needs to be done to
make FAIR a reality, in general and for EOSC
Addresses the following key areas:
1. Concepts for FAIR
2. Creating a FAIR culture
3. Creating a technical ecosystem for FAIR
4. Skills and capacity building
5. Incentives and metrics
6. Investment and sustainability
Turning FAIR into Reality: Report and Action Plan
https://doi.org/10.2777/1524
20. Address culture and technology
FAIR session, Macquarie University, 7th August 2019
Incentives
Metrics
Skills
Investment
Cultural and
social aspects
that drive the
ecosystem and
enact change
Cloudofregistries
Two sides of one whole
21. FAIR Digital Objects
ā¢ Can include data, software,
and other research resources
ā¢ Universal use of PIDs
ā¢ Use of common formats
ā¢ Data accompanied
by code
ā¢ Rich metadata
ā¢ Clear licensing
FAIR session, Macquarie University, 7th August 2019
22. FAIR EG recommendations
FAIR session, Macquarie University, 7th August 2019
ā¢ Research communities
ā¢ Data service providers
ā¢ Standards bodies
ā¢ Coordination fora
ā¢ Policymakers
ā¢ Research funders
ā¢ Institutions
ā¢ Publishers
Recommendations
aimed at multiple
stakeholders:
23. FAIR metrics: data and services
FAIR session, Macquarie University, 7th August 2019
DATA REPOSITORY
F4. (meta)data are registered or
indexed in a searchable resource
+ TECHNOLOGIES
+ PROCEDURES
+ EXPERTISE
+ PEOPLE
(META)DATA
F1. (meta)data are assigned a
globally unique and persistent
identifier
F2. data are described with
rich metadata
F3. metadata clearly and
explicitly include the identifier
of the data it describes
Assessing FAIRness of data
Critical role of environment &
services in making data FAIR
24. FAIR metrics
ā¢ A set of metrics for FAIR Digital Objects should be developed
and implemented, starting from the basic common core of
descriptive metadata, PIDs and access.
ā¢ Build on existing work in this space ā RDA Working Group
ā¢ https://www.rd-alliance.org/groups/fair-data-maturity-model-wg
ā¢ Certification schemes are needed to assess all components of
the ecosystem as services that enable FAIR
FAIR session, Macquarie University, 7th August 2019
25. Services that enable FAIR
Many aspects of FAIR apply to services (findability, accessibility,
use of standardsā¦) but you also want to check:
ā¢ Appropriate policy is in place
ā¢ Robustness of business processes
ā¢ Expertise of current staff
ā¢ Value proposition / business model
ā¢ Succession plans
ā¢ Trustworthiness
FAIR session, Macquarie University, 7th August 2019
26. From metrics to incentives
ā¢ Use metrics to measure practice but beware misuse
ā¢ Generate genuine incentives ā career progression for data
sharing & curation, recognise all outputs of research, include
in recruitment and project evaluation processesā¦
ā¢ Implement ānext-generationā metrics
ā¢ Automate reporting as far as possible
FAIR session, Macquarie University, 7th August 2019
27. Many, many H2020 FAIR projects
clusters
National initiatives
ā¢ EOSC-Nordic
ā¢ EOSC-Pillar
ā¢ EOSC-synergy
ā¢ ExPaNDS
ā¢ NI4OS-Europe
FAIR session, Macquarie University, 7th August 2019
28. The European Open Science Cloud
Image Kyle Hinkson https://unsplash.com/photos/xyXcGADvAwE
29. An open festival for science
ā¢ Virtual space where science producers and science
consumers come together
ā¢ Federation of existing infrastructure and services
ā¢ An open-ended range of content and services
ā¢ Quality mark Ā« Data made in Europe Ā»
A platform for European research
FAIR session, Macquarie University, 7th August 2019
31. Executive Board
FAIR session, Macquarie University, 7th August 2019
ā¢ Karel Luyben & Cathrin
Stover as Co-Chairs
ā¢ 8 representatives of
stakeholder groups
ā¢ 3 independent experts
https://www.eoscsecretariat.eu/
eb-profiles
FAIR session, Macquarie University, 7th August 2019
32. EOSC Exec Board Working Groups
GB/EB comms
and engagement
sub-group
Skills WG
Going Global WG
Others under
consideration
FAIR session, Macquarie University, 7th August 2019
33. What is each WG is doing?
ā¢ Map EOSC-relevant national infrastructures
ā¢ Analyse Member State readiness to provide financial resource (with Sustainability)
ā¢ Propose mechanisms to facilitate convergence and alignment
Landscape
ā¢ Recommend a minimal set of Rules of Participation that define the rights,
obligations and accountability governing all EOSC transactions
ā¢ Embrace the principles of openness, transparency and inclusiveness
Rules of P.
ā¢ Define, agree and develop an interoperability layer to federate systems i.e.
standards, open APIs and protocols
ā¢ Offer a catalogue of EOSC datasets and services
Architecture
ā¢ FAIR practices ļ EOSC interoperability framework (with Arch. & RoP)
ā¢ Persistent Identifier (PID) policy for EOSC (with Architecture)
ā¢ Frameworks to assess FAIR data and certify services that enable FAIR
FAIR
ā¢ Provide a set of strategic and financing orientations for EOSC post 2020
ā¢ In-depth analysis of business models and their different implications
ā¢ Options for a governance framework to steer & oversee EOSC operations
Sustainability
FAIR session, Macquarie University, 7th August 2019
https://www.eoscsecretariat.eu/eosc-working-groups
FAIR session, Macquarie University, 7th August 2019
34. All the fun of the FAIR
Keep up to date with progress on the EOSCsecretariat blog:
https://www.eoscsecretariat.eu/news-events-opinion/opinion
FAIR session, Macquarie University, 7th August 2019
35. Where next at Macquarie?
Image David Iskander https://unsplash.com/photos/iWTamkU5kiI
36. Hook DMPs into existing processes
ā¢ Good idea to use Infonethica
ā¢ Do lots of user testing and be willing to iterate
ā¢ Keep the DMP short ā reuse info where possible
ā¢ Focusing on HDR students can be a good way to
seed good practice up ā some UK unis get
supervisors to review / approve DMPs
37. Use existing fora to get advice
ā¢ RDA Active DMPs Interest Group
ā¢ https://rd-alliance.org/groups/active-data-management-plans.html
ā¢ ARDC DMP Community of Practice
ā¢ https://ardc.edu.au/resources/communities-of-practice
ā¢ Jiscmail list with global RDM community. 1751
subscribers, running since 2008, email archiveā¦
ā¢ https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=RESEARCH-DATAMAN
FAIR session, Macquarie University, 7th August 2019
38. Focus on FAIR basics
For researchers
ā¢ Document data
ā¢ Use standards
ā¢ Deposit in a repository
ā¢ Assign a licence
ā¢ Get a PID
For services
ā¢ Get researchers thinking
early ā DMP to plan
ā¢ Advise on standards
ā¢ Offer / point to repositories
ā¢ Assign and use PIDs
ā¢ Foster a culture of sharing
ā¢ Recognise and reward FAIR
FAIR session, Macquarie University, 7th August 2019
40. KEEP
CALM
Lots of others may
have done lots of
FAIR things, but this
is an opportunity.
Learn from their
mistakes and copy
good practice.
Donāt fret about
being behindā¦
Image Joe DeSousa https://unsplash.com/photos/0MGhdhObDXA
OECD ā 13 principles e.g. openness, flexible, transparent, legal, interoperable, quality, secure, accountable, efficientā¦
OECD preconditions: ādata must be accessible and readily located; they must be intelligible to those who wish to scrutinise them; data must be assessable so that judgments can be made about their reliability and the competence of those who created them; and they must be usable by others.ā
G8 statement adopted verbatim in the European Commissionās first data guidelines for the Horizon 2020 framework programme later the same year.
When asked what kind of party the EOSC would be, one group suggested an open festival as itās a ā¦