Solving the data problem for research and beyond
Matthew Dovey, Head of e-infrastructure strategy, Jisc
John Kaye, Senior co-design manager - research data, Jisc
28/04/2017
Jisc research strategy
28/04/2017 Solving the data problem 2
Research is changing
»The 4th Paradigm of data-intensive research and
data-driven innovation
»Open by default
»Dependency on digital infrastructures and digital transformation
»Globally competitive environment – digital transformation is open
to everyone
The vision
»Jisc’s vision is to make the UK the most digitally advanced
research nation in the world by fully exploiting the possibilities of
modern digital empowerment, content and connectivity
»Jisc will provide the underlying infrastructure which can scale and
flex to enable researchers to deliver the outcomes that funders,
government, industry and society want from the sector
»Our vision is of a seamless, interoperable digital infrastructure
which gives researchers and research organisations the freedom
to apply their strategic resources to maximise their research
impact and minimise the cost and burden of the supporting
operations
The vision
Underpinning infrastructure
Information model
Dynamic research platform
»Cyber-Security Support
»Data Assurance
»Network Performance Optimisation
»Procurement Frameworks
»Research Analytics
»Research Outputs - Publication, Curation, Archiving and Preservation
»Content Licensing, Discovery and Management
»Standards and Identifiers
»Vocabularies
»Data Model
»Janet Backbone
»Federated Access and Identity Management
»Data Centres
Research enabling services
»Advanced Networking Technologies
»Data Warehouse
»Flexible Storage
»Metadata Profiles
»Application Profiles
»Data Brokerage
Top three priorities
»Comprehensive connectivity across the infrastructure at a
range of scales (local, regional, national, international)
»A coherent suite of research services which reduces the burden on
institutions, increases efficiency, delivers solutions to common
problems and improves the UK’s research performance
»Representation of the UK’s digital needs in our engagements and
advocacy in the national and international arena
Jisc will provide three elements of the vision
Research strategy outcomes
1. The UK’s research environment is underpinned by flexible, scalable infrastructure where
standards-based approaches ensure that data can be generated, moved, stored, found
and used with the minimum of cost or burden to the institution and the researcher
2. The transition from Open Access to Open Science where research objects are findable,
accessible, interoperable and reusable by academia, industry and society for wider
economic and social benefit
3. UK interests are represented in both international policy and operational environments
enabling UK researchers to collaborate, compete and comply with the global research
community
4. The UK maintains its position as a digital thought leader and shaper of both research
infrastructures and the wider scholarly communications environment
5. The investment in the mission-critical UK e-infrastructure required by the research base
is safeguarded for the long term, enabling UK research to continue to punch above its
weight in the global research environment
Tiered storage
Motivation and engagement
»Initial interest explored with SDC-North tenants
»Informal vendor discussions to determine technical feasibility
»Requirements workshop – November 2016
»Active working group to develop full business case for phased
implementation in 2017
»Progress and input from wider community via
https://community.jisc.ac.uk/groups/tiered-storage
Opportunities
» Provide a national storage provision filling a current gap
› Universities looking at ever-increasing storage requirements and needs
› Confused by different approaches (in house, cloud, hybrid), technologies, solutions,
pricing structures
› Different requirements and policies (internal, and externally imposed)
» Remove headache of procurement and management across multiple providers and
technologies
» Maximise Janet network value
» De-risk University in area of exponential growth
› Low-risk PAYG infrastructure avoids over-investment
Benefits
» Savings on costs of power, cooling and carbon arising from a modern consolidated
infrastructure in a high-specification datacentre with modern cooling
» Procurement cost savings not just from quantity of procurements, but also from
timeliness of procurements: you will get cheaper overall storage costs by procuring 100TB
a year in each of five years than procuring 500TB once (simply because you get more
storage for your money as time goes on)
» Operational savings on time for installing and managing storage hardware
» Clear compliance with research council expectations for appropriate data management
across the research lifecycle
» Benefits across the University sector of providing a standard for research data
management and a standard costing
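The procurement-timing benefit above (100TB a year for five years versus 500TB once) can be illustrated with a short calculation. This is only a sketch: the starting price of 100 per TB and the 20% annual fall in unit price are assumed figures chosen for illustration, not Jisc or vendor data, and the function names are hypothetical.

```python
def upfront_cost(total_tb, price_per_tb):
    """Cost of buying all capacity in year 0 at today's price."""
    return total_tb * price_per_tb


def staged_cost(total_tb, years, price_per_tb, annual_decline):
    """Cost of buying equal tranches each year while the unit price falls."""
    tranche_tb = total_tb / years
    return sum(
        tranche_tb * price_per_tb * (1 - annual_decline) ** year
        for year in range(years)
    )


# Assumed, illustrative inputs (not taken from the slides).
PRICE_PER_TB = 100.0   # price per TB in year 0
ANNUAL_DECLINE = 0.20  # unit price falls by 20% each year

once = upfront_cost(500, PRICE_PER_TB)
staged = staged_cost(500, 5, PRICE_PER_TB, ANNUAL_DECLINE)
print(f"500TB once: {once:,.0f}")
print(f"100TB a year for five years: {staged:,.0f}")
```

Under these assumed inputs the staged purchase costs roughly two thirds of the single purchase. The comparison only holds if the capacity is genuinely needed gradually, which is the situation a pay-as-you-go service is aimed at.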
Multi-vendor tiered storage proposal
[Diagram: Jisc tiered storage service]
Consumers: Customer infrastructure (eg VMware vSphere), customer applications, RDM shared services, applications
Access protocols: iSCSI, SMB, CIFS, NFS, S3, https, Swift, ceph, …
Storage pools: Distributed storage pool, cloud storage pool, archival storage pool
Providers shown: AWS, Google, Cloud9, Amazon Glacier, Arkivum
HSM appliance
HSM data policy
» Pool prioritisation
» Replication
» Snapshots
» SLAs (eg retention, availability, security)
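The HSM data policy items listed for this slide (pool prioritisation, replication, snapshots, SLAs) could be modelled along the following lines. This is a hypothetical sketch, not the design of the Jisc service: the class names, the priority ordering and the recovery-time figures are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Pool:
    name: str
    priority: int                        # lower number = preferred first (assumed ordering)
    max_recovery_hours: Optional[float]  # None = no recovery-time commitment


@dataclass(frozen=True)
class DataPolicy:
    replicas: int
    snapshots: bool
    retention_years: int
    required_recovery_hours: float


def choose_pool(policy, pools):
    """Return the most-preferred pool whose recovery time meets the policy, or None."""
    for pool in sorted(pools, key=lambda p: p.priority):
        if (pool.max_recovery_hours is not None
                and pool.max_recovery_hours <= policy.required_recovery_hours):
            return pool
    return None


# Assumed example pools; the numbers are placeholders, not service commitments.
POOLS = [
    Pool("archival", priority=1, max_recovery_hours=None),
    Pool("cloud", priority=2, max_recovery_hours=1.0),
    Pool("distributed", priority=3, max_recovery_hours=0.1),
]

policy = DataPolicy(replicas=2, snapshots=True, retention_years=10,
                    required_recovery_hours=1.0)
chosen = choose_pool(policy, POOLS)
print(chosen.name if chosen else "no pool meets the policy")
```

Here the preferred pool is skipped when it cannot meet the recovery-time requirement, so the choice falls through to the next pool in priority order.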
Tiered storage proposal - pools
Pool: Distributed storage pool
Overview: Data stored near sites (possibly based on SDC1, SDC2 and other locations eg national research e-infrastructure centres, other NRENs) to give onsite/near-site recovery times. Use of erasure-encoding to give equivalence of 2 copies with ~1.6 times storage capacity
Class: Lever Janet backbone to deliver onsite equivalence
Copies: Equivalent to 2 copies including offsite
Recovery Time Objective: Onsite/near-site equivalent
Recovery Point Objective: <1 hour
Pool: Cloud storage pool
Overview: Managing data copies across multiple cloud providers
Class: Archive
Copies: Equivalent to 2 copies including offsite
Recovery Time Objective: <1 hour
Recovery Point Objective: 1-24 hours
Pool: Archival storage pool
Overview: Managing data copies across multiple cloud “vault” providers (ie 99% or 100% guaranteed data recovery)
Class: Vault
Copies: Guaranteed recovery
Recovery Time Objective: N/A
Recovery Point Objective: N/A
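The distributed storage pool entry above mentions erasure-encoding with ~1.6 times storage capacity. For an erasure code that splits data into k data fragments plus m parity fragments, the raw storage used is (k + m) / k times the data size, and a maximum-distance-separable code such as Reed-Solomon can rebuild the data after losing any m fragments. The sketch below compares that overhead with plain two-copy replication. The 10 + 6 layout is an assumed example chosen because it yields 1.6; the slides do not state which scheme is proposed.

```python
from fractions import Fraction


def storage_overhead(data_fragments, parity_fragments):
    """Raw capacity needed per unit of stored data for a k+m layout."""
    return Fraction(data_fragments + parity_fragments, data_fragments)


# Two full copies behave like k=1, m=1: twice the raw capacity.
two_copies = storage_overhead(1, 1)

# Assumed illustrative erasure-coded layout: 10 data + 6 parity fragments.
erasure_coded = storage_overhead(10, 6)

print(f"two copies:     {float(two_copies):.1f}x raw capacity")
print(f"10+6 erasure:   {float(erasure_coded):.1f}x raw capacity")
```

In this assumed layout the erasure-coded pool uses less raw capacity than two full copies while surviving the loss of more fragments, which is the trade-off the pool description points at.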
Requirements and demand working group
Current members
»University of Oxford
»University of Leeds
»University of Manchester
»University College London
»London School of Economics
»Natural History Museum
»Additions welcome
Key outputs
»Phased technical specification
»Use scenarios
› (eg data movement)
»Business and financial case
› (including TCO analysis)
»Market review and supplier engagement
Tiered storage positioning
[Diagram labels: Storage Providers; Jisc Tiered Storage; Storage Policy (shown four times); Jisc RDSS; Other Jisc Services; Local Research Data Systems; Other local systems (financial, T&L, etc)]
Jisc research data shared service
The futures portfolio consists of three big areas
[Diagram labels: Store services; Playlists; Diagnostic tool builder; Curation and remix; Learner; Analytics Services; Digital capability; Learning analytics; Digital launchpad; Apprentice workforce development; Digital leadership; Summer of student innovation; Analytics academy; Analytics labs; Qualification verification; App and content store; Research data discovery; Research data usage metrics; Equipment data; Repository and preservation platform; Research data shared service; ?]
Research data discovery service
Alpha site
Research data usage and metrics
Shared Service Goals
»Policy compliance
»Efficiency
»Better research
A key requirement
…..but a challenging problem
Implementing Archivematica for research data preservation at York and Hull
Jenny Mitcham (Digital Archivist) - University of York
Research data shared service overview
Data model
Service MVP (Alpha – July 2017)
Service MVP (Alpha – July 2017)
Pilot MVP components
RDSS Component Offer    Number of Pilots Requiring (total = 17)
RDSS Repository         14
RDSS Preservation       17
RDSS Reporting          14 (TBC)*
RDSS Storage            16
* Under review as additional reporting options may be available, also differing offers from
full dashboard/analytics to API only. Further discovery work is underway.
Pilot Alpha MVP integrations
RDSS Component Offer    Number of Pilots Requiring (total = 17)
Eprints (Repository)    12
Dspace (Repository)     4
Hydra (Repository)      2
Symplectic (CRIS)*      4
Pure (CRIS)             3
Converis (CRIS)         1
Authentication          17
* RDSS Framework Supplier
Middlesex Figshare implementation
»Accelerated deployment in 10 weeks
(Installation by 10th November)
»Stakeholder engagement
»Development of institutional requirements
»Sign up to Datacite membership
»Implementation team (informal)
»Integration with Jisc Storage
»Implementation of pilot data repository
The University of Jisc Sandbox
» Scratch environment for testing of
configuration and integration of service
platform components
» A mock HEI to integrate with
» Infrastructure as code, learning from
building and managing the mixture of
SaaS and custom applications. This will
allow easy push-button install of
products
» Working with test data and metadata
taken from real HEI repositories
» Consistent and standardised UX
» Bespoke development environment
[Diagram: University of Jisc sandbox components]
Apps
CRIS
Test data: Zenodo, RDSS pilot HEI repositories, publisher data
AWS storage + tools
Data repositories: Figshare, Hydra, Islandora, Haplo
Publication repositories: Eprints, D-space
Preservation systems: Preservica, Archivematica
Additional software and services
Assessing researchers’ needs - Data asset framework
Preservation of research data
“I currently spend about £1,200 pa on data
storage from my own salary. I have the highest
data needs in my School, and there is no plan in
place for storing my data.”
Sensitive research data
“It would be helpful to clarify the rules for storing
anonymised data on cloud services. My
departmental rules say this is never OK, however
this seems to contradict University rules.”
University services to support RDM
“Support is woeful in the university currently, in
particular long-term data archiving is critically
required. Most of my non-current data is rotting
on CD's and hard-drives.”
University services to support RDM
“Please, individualise the support. Workshop are
useless, emails with information are useless,
brochures are useless, posters are useless.”
Researchdata.network
Discussion
What we’d like to know…..
» What are your current priorities and pain points with managing data?
» Do you have or are you expecting a data deluge?
» What would you like Jisc to provide for managing data?
» What would you like the Jisc offer to look like?
» Have we missed anything in our pilots? Are there gaps?
» Are there any aspects of data management you’d like to keep ‘in-house’?
» Do you have issues around research systems user experience for researchers and staff?
» Do you have issues around systems interoperability?
» Do you have preservation needs beyond research data (eg records management, archives)?
» Can you share any hooks or incentives to engage researchers in data management services?
» Any tips for success and lessons learned that we can utilise in implementing systems?
» Anything else…..
Matthew Dovey
Head of e-infrastructure strategy
matthew.dovey@jisc.ac.uk
John Kaye
Senior co-design manager – Research Data
john.kaye@jisc.ac.uk
jisc.ac.uk/rd/projects/research-data-shared-service
https://community.jisc.ac.uk/groups/tiered-storage

More Related Content

What's hot

The way forward together
The way forward togetherThe way forward together
The way forward togetherJisc
 
Stakeholder forum 2015 - The way forward together - Phil Richards
Stakeholder forum 2015 - The way forward together - Phil RichardsStakeholder forum 2015 - The way forward together - Phil Richards
Stakeholder forum 2015 - The way forward together - Phil RichardsJisc
 
Janet in a changing world
Janet in a changing world Janet in a changing world
Janet in a changing world Jisc
 
Stakeholder strategic update webinar - research
 Stakeholder strategic update webinar - research Stakeholder strategic update webinar - research
Stakeholder strategic update webinar - researchJisc
 
Why I love Jisc - presentation from Paul Bartholomew
Why I love Jisc - presentation from Paul BartholomewWhy I love Jisc - presentation from Paul Bartholomew
Why I love Jisc - presentation from Paul BartholomewJisc
 
The year in review - you said, we're doing - presentation from David Maguire
The year in review - you said, we're doing - presentation from David MaguireThe year in review - you said, we're doing - presentation from David Maguire
The year in review - you said, we're doing - presentation from David MaguireJisc
 
Jisc's international strategy
Jisc's international strategyJisc's international strategy
Jisc's international strategyJisc
 
The year ahead
The year aheadThe year ahead
The year aheadJisc
 
Stakeholder strategic update webinar - higher education
Stakeholder strategic update webinar - higher educationStakeholder strategic update webinar - higher education
Stakeholder strategic update webinar - higher educationJisc
 
David Maguire - Serving the FE and HE sectors
David Maguire - Serving the FE and HE sectorsDavid Maguire - Serving the FE and HE sectors
David Maguire - Serving the FE and HE sectorsJisc
 
Stakeholder strategic update webinar - further education and skills
Stakeholder strategic update webinar - further education and skillsStakeholder strategic update webinar - further education and skills
Stakeholder strategic update webinar - further education and skillsJisc
 
Enabling a national digital library
Enabling a national digital libraryEnabling a national digital library
Enabling a national digital libraryJisc
 
Business intelligence for education
Business intelligence for education  Business intelligence for education
Business intelligence for education Jisc
 
Liz Barnes - Changing for Education 4.0
Liz Barnes - Changing for Education 4.0Liz Barnes - Changing for Education 4.0
Liz Barnes - Changing for Education 4.0Jisc
 
Open Access Pathfinder Case Study - Lincoln
Open Access Pathfinder Case Study - LincolnOpen Access Pathfinder Case Study - Lincoln
Open Access Pathfinder Case Study - LincolnDavid Young
 
Coming soon
Coming soonComing soon
Coming soonJisc
 
Stakeholder forum 2015 - Engaging across the uk - Robert Haymon-Collins
Stakeholder forum 2015 - Engaging across the uk - Robert Haymon-CollinsStakeholder forum 2015 - Engaging across the uk - Robert Haymon-Collins
Stakeholder forum 2015 - Engaging across the uk - Robert Haymon-CollinsJisc
 
Highlights of what is coming - presentation from Paul Feldman
Highlights of what is coming - presentation from Paul FeldmanHighlights of what is coming - presentation from Paul Feldman
Highlights of what is coming - presentation from Paul FeldmanJisc
 
Stakeholder forum 2015 - Jisc engagement architecture - Martyn harrow
Stakeholder forum 2015 - Jisc engagement architecture - Martyn harrowStakeholder forum 2015 - Jisc engagement architecture - Martyn harrow
Stakeholder forum 2015 - Jisc engagement architecture - Martyn harrowJisc
 
The Kent PSN, govroam and HSCN
The Kent PSN, govroam and HSCNThe Kent PSN, govroam and HSCN
The Kent PSN, govroam and HSCNJisc
 

What's hot (20)

The way forward together
The way forward togetherThe way forward together
The way forward together
 
Stakeholder forum 2015 - The way forward together - Phil Richards
Stakeholder forum 2015 - The way forward together - Phil RichardsStakeholder forum 2015 - The way forward together - Phil Richards
Stakeholder forum 2015 - The way forward together - Phil Richards
 
Janet in a changing world
Janet in a changing world Janet in a changing world
Janet in a changing world
 
Stakeholder strategic update webinar - research
 Stakeholder strategic update webinar - research Stakeholder strategic update webinar - research
Stakeholder strategic update webinar - research
 
Why I love Jisc - presentation from Paul Bartholomew
Why I love Jisc - presentation from Paul BartholomewWhy I love Jisc - presentation from Paul Bartholomew
Why I love Jisc - presentation from Paul Bartholomew
 
The year in review - you said, we're doing - presentation from David Maguire
The year in review - you said, we're doing - presentation from David MaguireThe year in review - you said, we're doing - presentation from David Maguire
The year in review - you said, we're doing - presentation from David Maguire
 
Jisc's international strategy
Jisc's international strategyJisc's international strategy
Jisc's international strategy
 
The year ahead
The year aheadThe year ahead
The year ahead
 
Stakeholder strategic update webinar - higher education
Stakeholder strategic update webinar - higher educationStakeholder strategic update webinar - higher education
Stakeholder strategic update webinar - higher education
 
David Maguire - Serving the FE and HE sectors
David Maguire - Serving the FE and HE sectorsDavid Maguire - Serving the FE and HE sectors
David Maguire - Serving the FE and HE sectors
 
Stakeholder strategic update webinar - further education and skills
Stakeholder strategic update webinar - further education and skillsStakeholder strategic update webinar - further education and skills
Stakeholder strategic update webinar - further education and skills
 
Enabling a national digital library
Enabling a national digital libraryEnabling a national digital library
Enabling a national digital library
 
Business intelligence for education
Business intelligence for education  Business intelligence for education
Business intelligence for education
 
Liz Barnes - Changing for Education 4.0
Liz Barnes - Changing for Education 4.0Liz Barnes - Changing for Education 4.0
Liz Barnes - Changing for Education 4.0
 
Open Access Pathfinder Case Study - Lincoln
Open Access Pathfinder Case Study - LincolnOpen Access Pathfinder Case Study - Lincoln
Open Access Pathfinder Case Study - Lincoln
 
Coming soon
Coming soonComing soon
Coming soon
 
Stakeholder forum 2015 - Engaging across the uk - Robert Haymon-Collins
Stakeholder forum 2015 - Engaging across the uk - Robert Haymon-CollinsStakeholder forum 2015 - Engaging across the uk - Robert Haymon-Collins
Stakeholder forum 2015 - Engaging across the uk - Robert Haymon-Collins
 
Highlights of what is coming - presentation from Paul Feldman
Highlights of what is coming - presentation from Paul FeldmanHighlights of what is coming - presentation from Paul Feldman
Highlights of what is coming - presentation from Paul Feldman
 
Stakeholder forum 2015 - Jisc engagement architecture - Martyn harrow
Stakeholder forum 2015 - Jisc engagement architecture - Martyn harrowStakeholder forum 2015 - Jisc engagement architecture - Martyn harrow
Stakeholder forum 2015 - Jisc engagement architecture - Martyn harrow
 
The Kent PSN, govroam and HSCN
The Kent PSN, govroam and HSCNThe Kent PSN, govroam and HSCN
The Kent PSN, govroam and HSCN
 

Similar to Solving the data problem for research beyond

Research Data Shared Service
Research Data Shared ServiceResearch Data Shared Service
Research Data Shared ServiceJisc
 
UKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John Kaye
UKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John KayeUKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John Kaye
UKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John KayeUKSG: connecting the knowledge community
 
RD shared services and research data spring
RD shared services and research data springRD shared services and research data spring
RD shared services and research data springJisc RDM
 
Business cases and costs RDN
Business cases and costs RDNBusiness cases and costs RDN
Business cases and costs RDNJisc RDM
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc RDM
 
Research Data Shared Service update at DPC
Research Data Shared Service update at DPCResearch Data Shared Service update at DPC
Research Data Shared Service update at DPCJisc RDM
 
Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharingJisc RDM
 
Jisc research data shared service overview IDCC 2016
Jisc research data shared service overview IDCC 2016Jisc research data shared service overview IDCC 2016
Jisc research data shared service overview IDCC 2016Jisc RDM
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013SALCTG
 
RDM shared services at IDCC
RDM shared services at IDCCRDM shared services at IDCC
RDM shared services at IDCCJisc RDM
 
UKSG Conference 2017 Breakout - Research Data Management: developing a system...
UKSG Conference 2017 Breakout - Research Data Management: developing a system...UKSG Conference 2017 Breakout - Research Data Management: developing a system...
UKSG Conference 2017 Breakout - Research Data Management: developing a system...UKSG: connecting the knowledge community
 
Research Data Shared Service Webinar #1
Research Data Shared Service Webinar #1Research Data Shared Service Webinar #1
Research Data Shared Service Webinar #1Jisc RDM
 
Jisc unleashing data 5 minutes
Jisc unleashing data 5 minutesJisc unleashing data 5 minutes
Jisc unleashing data 5 minutesDaniela G. Duca
 
Managing data behind creative masterpieces
Managing data behind creative masterpiecesManaging data behind creative masterpieces
Managing data behind creative masterpiecesJisc RDM
 
Jisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 PaperJisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 PaperJisc RDM
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataMartin Hamilton
 
Repositories unleashing data and Jisc projects
Repositories unleashing data and Jisc projectsRepositories unleashing data and Jisc projects
Repositories unleashing data and Jisc projectsJisc RDM
 
UK data management environment and support
UK data management environment and supportUK data management environment and support
UK data management environment and supportJisc
 
Shared services - the future of HPC and big data facilities for UK research
Shared services - the future of HPC and big data facilities for UK researchShared services - the future of HPC and big data facilities for UK research
Shared services - the future of HPC and big data facilities for UK researchMartin Hamilton
 
Digital notebooks - a Jisc perspective
Digital notebooks - a Jisc perspectiveDigital notebooks - a Jisc perspective
Digital notebooks - a Jisc perspectiveChristopher Brown
 

Similar to Solving the data problem for research beyond (20)

Research Data Shared Service
Research Data Shared ServiceResearch Data Shared Service
Research Data Shared Service
 
UKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John Kaye
UKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John KayeUKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John Kaye
UKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John Kaye
 
RD shared services and research data spring
RD shared services and research data springRD shared services and research data spring
RD shared services and research data spring
 
Business cases and costs RDN
Business cases and costs RDNBusiness cases and costs RDN
Business cases and costs RDN
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016
 
Research Data Shared Service update at DPC
Research Data Shared Service update at DPCResearch Data Shared Service update at DPC
Research Data Shared Service update at DPC
 
Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharing
 
Jisc research data shared service overview IDCC 2016
Jisc research data shared service overview IDCC 2016Jisc research data shared service overview IDCC 2016
Jisc research data shared service overview IDCC 2016
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
 
RDM shared services at IDCC
RDM shared services at IDCCRDM shared services at IDCC
RDM shared services at IDCC
 
UKSG Conference 2017 Breakout - Research Data Management: developing a system...
UKSG Conference 2017 Breakout - Research Data Management: developing a system...UKSG Conference 2017 Breakout - Research Data Management: developing a system...
UKSG Conference 2017 Breakout - Research Data Management: developing a system...
 
Research Data Shared Service Webinar #1
Research Data Shared Service Webinar #1Research Data Shared Service Webinar #1
Research Data Shared Service Webinar #1
 
Jisc unleashing data 5 minutes
Jisc unleashing data 5 minutesJisc unleashing data 5 minutes
Jisc unleashing data 5 minutes
 
Managing data behind creative masterpieces
Managing data behind creative masterpiecesManaging data behind creative masterpieces
Managing data behind creative masterpieces
 
Jisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 PaperJisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 Paper
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
 
Repositories unleashing data and Jisc projects
Repositories unleashing data and Jisc projectsRepositories unleashing data and Jisc projects
Repositories unleashing data and Jisc projects
 
UK data management environment and support
UK data management environment and supportUK data management environment and support
UK data management environment and support
 
Shared services - the future of HPC and big data facilities for UK research
Shared services - the future of HPC and big data facilities for UK researchShared services - the future of HPC and big data facilities for UK research
Shared services - the future of HPC and big data facilities for UK research
 
Digital notebooks - a Jisc perspective
Digital notebooks - a Jisc perspectiveDigital notebooks - a Jisc perspective
Digital notebooks - a Jisc perspective
 

More from Jisc

Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
International students’ digital experience: understanding and mitigating the ...




Solving the data problem for research and beyond

  • 1. Solving the data problem for research and beyond Matthew Dovey, Head of e-infrastructure strategy, Jisc John Kaye, Senior co-design manager - research data, Jisc 28/04/2017 1
  • 2. Jisc research strategy
  • 3. Research is changing »The 4th Paradigm of data-intensive research and data-driven innovation »Open by default »Dependency on digital infrastructures and digital transformation »Globally competitive environment – digital transformation is open to everyone
  • 4. The vision »Jisc’s vision is to make the UK the most digitally advanced research nation in the world by fully exploiting the possibilities of modern digital empowerment, content and connectivity »Jisc will provide the underlying infrastructure which can scale and flex to enable researchers to deliver the outcomes that funders, government, industry and society want from the sector »Our vision is of a seamless, interoperable digital infrastructure which enables researchers and research organisations the freedom to apply their strategic resources to maximise their research impact and minimise the cost and burden of the supporting operations
  • 5. The vision Underpinning infrastructure Information model Dynamic research platform »Cyber-Security Support »Data Assurance »Network Performance Optimisation »Procurement Frameworks »Research Analytics »Research Outputs - Publication, Curation, Archiving and Preservation »Content Licensing, Discovery and Management »Standards and Identifiers »Vocabularies »Data Model »Janet Backbone »Federated Access and Identity Management »Data Centres Research enabling services »Advanced Networking Technologies »Data Warehouse »Flexible Storage »Metadata Profiles »Application Profiles »Data Brokerage
  • 6. Top three priorities »Comprehensive connectivity across the infrastructure at a diversity of scales (local, regional, national, international) »A coherent suite of research services which reduces the burden on institutions, increases efficiency, delivers solutions to common problems and improves the UK’s research performance »Representation of the UK’s digital needs in our engagements and advocacy in the national and international arena Jisc will provide three elements of the vision
  • 7. Research strategy outcomes 1. The UK’s research environment is underpinned by flexible, scalable infrastructure where standards-based approaches ensure that data can be generated, moved, stored, found and used with the minimum of cost or burden to the institution and the researcher 2. The transition from Open Access to Open Science where research objects are findable, accessible, interoperable and reusable by academia, industry and society for wider economic and social benefit 3. UK interests are represented in both international policy and operational environments, enabling UK researchers to collaborate, compete and comply with the global research community 4. The UK maintains its position as a digital thought leader and shaper of both research infrastructures and the wider scholarly communications environment 5. The investment in the mission-critical UK e-infrastructure required by the research base is safeguarded for the long term, enabling UK research to continue to punch above its weight in the global research environment
  • 9. Motivation and engagement »Initial interest explored with SDC-North tenants »Informal vendor discussions to determine technical feasibility »Requirements workshop – November 2016 »Active working group to develop full business case for phased implementation in 2017 »Progress and input from wider community via https://community.jisc.ac.uk/groups/tiered-storage
  • 10. Opportunities » Provide a national storage provision filling a current gap › Universities looking at ever-increasing storage requirements and needs › Confused by different approaches (in house, cloud, hybrid), technologies, solutions, pricing structures › Different requirements and policies (internal, and externally imposed) » Remove headache of procurement and management across multiple providers and technologies » Maximise Janet network value » De-risk University in area of exponential growth › Low-risk PAYG infrastructure avoids over-investment
  • 11. Benefits » Savings on costs of power, cooling and carbon arising from a modern consolidated infrastructure in a high-specification datacentre with modern cooling » Procurement cost savings not just from quantity of procurements, but also from timeliness of procurements: you will get cheaper overall storage costs by procuring 100TB a year in each of five years than procuring 500TB once (simply because you get more storage for your money as time goes on) » Operational savings on time for installing and managing storage hardware » Clear compliance with research council expectations for appropriate data management across the research lifecycle » Benefits across the University sector of providing a standard for research data management and a standard costing
  • 12. Multi-vendor tiered storage proposal [Architecture diagram; labels only] Customer applications; RDM share services; Applications; Customer infrastructure (eg VMware vSphere); Jisc tiered storage service; HSM Appliance; Distributed storage pool; Cloud storage pool; Archival storage pool; AWS; Google; Amazon Glacier; Arkivum; Cloud9; iSCSI, SMB, CIF, NFS, S3, https, Swift, ceph, … HSM data policy » Pool prioritisation » Replication » Snapshots » SLAs (eg retention, availability, security)
  • 13. Tiered storage proposal - pools
      Distributed storage pool
        Overview: Data stored near sites (possibly based on SDC1, SDC2 and other locations, eg national research e-infrastructure centres, other NRENs) to give onsite/near-site recovery times. Use of erasure-encoding to give equivalence of 2 copies with ~1.6 times storage capacity. Lever Janet backbone to deliver
        Class: Onsite equivalence
        Copies: Equivalent to 2 copies including offsite
        Recovery Time Objective: Onsite/near-site equivalent
        Recovery Point Objective: < 1 hour
      Cloud storage pool
        Overview: Managing data copies across multiple cloud providers
        Class: Archive
        Copies: Equivalent to 2 copies including offsite
        Recovery Time Objective: < 1 hour
        Recovery Point Objective: 1-24 hours
      Archival storage pool
        Overview: Managing data copies across multiple cloud “vault” providers (ie 99% or 100% guaranteed data recovery)
        Class: Vault
        Copies: Guaranteed recovery
        Recovery Time Objective: N/A
        Recovery Point Objective: N/A
  • 14. Requirements and demand working group Current members »University of Oxford »University of Leeds »University of Manchester »University College London »London School of Economics »Natural History Museum »Additions welcome Key outputs »Phased technical specification »Use scenarios › (eg data movement) »Business and financial case › (including TCO analysis) »Market review and supplier engagement
  • 15. Tiered storage positioning Storage Providers Jisc Tiered Storage Other Jisc Services Storage Policy Storage Policy Storage Policy Storage Policy Jisc RDSS Local Research Data Systems Other local systems (financial, T&L, etc)
  • 16. Jisc research data shared service
  • 17. The futures portfolio consists of three big areas Store services Playlists Diagnostic tool builder Curation and remix Learner Analytics Services Digital capability Learning analytics Digital launchpad Apprentice workforce development Digital leadership Summer of student innovation Analytics academy Analytics labs Qualification verification App and content store Research data discovery Research data usage metrics Equipment data Repository and preservation platform Research data shared service ?
  • 18. Research data discovery service Alpha site
  • 19. Research data usage and metrics
  • 20. Shared Service Goals »Policy compliance »Efficiency »Better research
  • 21. A key requirement
  • 22. …..but a challenging problem Implementing Archivematica for research data preservation at York and Hull Jenny Mitcham (Digital Archivist) - University of York
  • 23. Research data shared service overview
  • 24. Data model
  • 25. Service MVP (Alpha – July 2017)
  • 26. Service MVP (Alpha – July 2017)
  • 27. Pilot MVP components
      RDSS component offer: number of pilots requiring (total = 17)
        RDSS Repository: 14
        RDSS Preservation: 17
        RDSS Reporting: 14 (TBC)*
        RDSS Storage: 16
      * Under review as additional reporting options may be available, also differing offers from full dashboard/analytics to API only. Further discovery work is underway.
  • 28. Pilot Alpha MVP integrations
      RDSS component offer: number of pilots requiring (total = 17)
        Eprints (Repository): 12
        Dspace (Repository): 4
        Hydra (Repository): 2
        Symplectic (CRIS)*: 4
        Pure (CRIS): 3
        Converis (CRIS): 1
        Authentication: 17
      * RDSS Framework Supplier
  • 29. Middlesex Figshare implementation »Accelerated deployment in 10 weeks (Installation by 10th November) »Stakeholder engagement »Development of institutional requirements »Sign up to DataCite membership »Implementation team (informal) »Integration with Jisc Storage »Implementation of pilot data repository
  • 30. The University of Jisc Sandbox » Scratch environment for testing of configuration and integration of service platform components » A mock HEI to integrate with » Infrastructure as code, learning from building and managing the mixture of SaaS and custom applications. This will allow easy push-button install of products » Working with test data and metadata taken from real HEI repositories » Consistent and standardised UX » Bespoke development environment Apps CRIS Test data Zenodo RDSS pilot HEI repositories Publisher data AWS storage + tools Data repositories Figshare, Hydra Islandora, Haplo Publication repositories Eprints D-space Preservation systems Preservica Archivematica Additional software and services
  • 31. Assessing researchers’ needs - Data asset framework
  • 32. Preservation of research data “I currently spend about £1,200 pa on data storage from my own salary. I have the highest data needs in my School, and there is no plan in place for storing my data.”
  • 33. Sensitive research data “It would be helpful to clarify the rules for storing anonymised data on cloud services. My departmental rules say this is never OK, however this seems to contradict University rules.”
  • 34. University services to support RDM “Support is woeful in the university currently, in particular long-term data archiving is critically required. Most of my non-current data is rotting on CD's and hard-drives.”
  • 35. University services to support RDM “Please, individualise the support. Workshop are useless, emails with information are useless, brochures are useless, posters are useless.”
  • 38. What we’d like to know….. » What are your current priorities and pain points with managing data? » Do you have or are you expecting a data deluge? » What would you like Jisc to provide for managing data? » What would you like the Jisc offer to look like? » Have we missed anything in our pilots? Are there gaps? » Are there any aspects of data management you’d like to keep ‘in-house’? » Do you have issues around research systems user experience for researchers and staff? » Do you have issues around systems interoperability? » Do you have preservation needs beyond research data (eg records management, archives)? » Can you share any hooks or incentives to engage researchers in data management services? » Any tips for success and lessons learned that we can utilise in implementing systems? » Anything else…..
  • 39. Matthew Dovey Head of e-infrastructure strategy matthew.dovey@jisc.ac.uk John Kaye Senior co-design manager – Research Data john.kaye@jisc.ac.uk jisc.ac.uk/rd/projects/research-data-shared-service https://community.jisc.ac.uk/groups/tiered-storage
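
The storage arithmetic behind slides 11 and 13 can be sketched in a few lines of Python. This is an illustration under stated assumptions, not Jisc's costing model: the price of £100 per TB, the 20% annual price fall and the 10+6 shard layout are all hypothetical values chosen to make the sums easy to follow. The slides say only that storage gets cheaper over time and that erasure encoding needs about 1.6 times the raw capacity.

```python
def upfront_cost(total_tb: float, price_per_tb: float) -> float:
    """Cost of buying all the capacity in one procurement at today's price."""
    return total_tb * price_per_tb


def phased_cost(total_tb: float, years: int, price_per_tb: float,
                annual_price_drop: float) -> float:
    """Cost of buying equal yearly tranches while the unit price falls.

    annual_price_drop is a fraction (0.20 means 20% cheaper each year).
    The first tranche is bought at today's price. Hypothetical model.
    """
    tranche_tb = total_tb / years
    return sum(
        tranche_tb * price_per_tb * (1 - annual_price_drop) ** year
        for year in range(years)
    )


def erasure_overhead(data_shards: int, parity_shards: int) -> float:
    """Raw capacity needed per unit of user data for a k+m erasure code."""
    return (data_shards + parity_shards) / data_shards


if __name__ == "__main__":
    # Hypothetical figures: 500 TB over five years, 100 GBP per TB today.
    print(upfront_cost(500, 100.0))
    print(phased_cost(500, 5, 100.0, 0.20))
    # A hypothetical 10+6 layout, compared with 2.0 for two full copies.
    print(erasure_overhead(10, 6))
```

Under these assumed figures the phased purchase comes to roughly £33,600 against £50,000 up front, and a 10+6 layout needs 1.6 times the user data rather than the 2 times of two full copies. Real savings depend on actual price trends and on the erasure scheme chosen, neither of which the slides specify.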

Editor's Notes

  1. What we have now – fragmentation, lack of interoperability, some good practice within subject areas but not the efficiencies possible when we deliver at scale.
  2. Vision: Researchers shouldn’t need to think (too much!) about Research Data Management. "Visible data, invisible infrastructure". Provide researchers with intuitive, easy functionality to publish, archive and preserve their research outputs. Provide interoperable systems to allow researchers and institutions to fulfil and go beyond policy requirements and adhere to best practice throughout the RDM lifecycle. Goals: RDM policy compliance; increased sector efficiencies (procurement, data re-use, interoperability opportunities); improving the integrity of research; addressing market gaps (integrated RDM system, preservation gap, usability); accelerating Research Data Management in institutions; supporting institutions to meet Open Access/REF requirements.
  3. Preservation: This is the big gap – many institutions are only now starting to address this need, in particular the question of what to keep (and what not to keep) and how long to keep things for. While there are solutions like Arkivum, there is a gap in terms of curating for preservation – tools that allow file format identification, metadata and the creation of archival information packages – data integrity and even emulation. There is also a lack of true integration from data creation through to long-term preservation.
  4. The long tail: the long tail of unidentifiable files that we will have to deal with. Mention Jenny Mitcham's stats: around 60% of items in the RDM collection are unidentifiable using existing workflows. PDFs are easy to deal with, as that problem has been solved by global initiatives, eg JHOVE, veraPDF.
  5. Interoperability: In many ways the integration with other existing systems is the key USP for many potential stakeholders. No two institutional set-ups are the same and the shared service has to integrate with each one, so the integration piece across all of the lots shown here, and plugging those into reporting services, aggregators and funder systems, is a major challenge. We do it because it is hard. Worktribe -
  6. Note it is data as a top line BUT our solution WILL meet text requirements hence the OA / REF one here.
  7. Some of the important issues and requirements that will be addressed in beta are the service approach to managing large datasets, and storage and access management for sensitive datasets. The beta phase also covers significant development and improvements to the user experience and integration with additional institutional systems such as HR, finance and ethics. Powerfolder
  8. Data Centres not financially viable.
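
The file format identification step mentioned in notes 3 and 4 can be illustrated with a toy signature check. This is a minimal sketch, assuming a hand-written table of three well-known leading byte signatures; it is not how JHOVE, veraPDF or any preservation workflow named in the deck works internally. The function names are hypothetical.

```python
from typing import Iterable, Optional

# Toy signature table: leading bytes of three common formats.
# Real identification tools rely on much larger registries.
SIGNATURES = {
    b"%PDF-": "PDF",
    b"\x89PNG\r\n\x1a\n": "PNG",
    b"PK\x03\x04": "ZIP container",
}


def identify(header: bytes) -> Optional[str]:
    """Return a format name if the header starts with a known signature."""
    for magic, name in SIGNATURES.items():
        if header.startswith(magic):
            return name
    return None


def unidentified_share(headers: Iterable[bytes]) -> float:
    """Fraction of files whose format could not be identified."""
    headers = list(headers)
    if not headers:
        return 0.0
    unknown = sum(1 for header in headers if identify(header) is None)
    return unknown / len(headers)


if __name__ == "__main__":
    # Hypothetical sample: two recognisable headers, two unknown ones.
    sample = [b"%PDF-1.4 ...", b"PK\x03\x04....", b"\x00\x01\x02", b"instrument-dump"]
    print(unidentified_share(sample))
```

Anything that falls through the table counts towards the long tail. Research data collections tend to contain many instrument-specific and bespoke formats, which is why the unidentified share can be large even when common formats such as PDF are handled well.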