SlideShare a Scribd company logo
1 of 17
The Now and Future of
Data Publishing
Oxford University – 22nd May
Ruth Wilson
Publisher
Nature Publishing Group
22
Overview
Context
Scientific Data
– Concept
– Data descriptor
– Licenses
– Team
Evolution
– Better integration of SI
– Source data
– Data citations
33
Data, data, data
Two important factors are driving to make research data more available and
reusable:
• To ensure the scientific process is transparent and can be scrutinised and
research results reproduced
• To speed the scientific process, lead to new insights and reduce duplicated
and repeated work
To achieve this research data needs to be
– Available
– Findable
– Interpretable
– Re-usable
– Citable
44
Existing challenges
• Data producers do not necessarily get
appropriate credit for their work
• Traditional publications are focused on
hypothesis/conclusions
• The peer review process at many research
journals is not focused on ensuring data release
and data standards
• Data and info about datasets often ends in supp.
material
• Potentially valuable datasets are not released
5
Calling for submissions in Fall 2013, launching in Spring 2014
nature.com/scientificdata
66
What is Scientific Data?
• Scientific Data is an Open Access, online-only
platform containing data descriptors that
describe and explain datasets, supported by
an APC model.
• Data descriptors are a new type of content
and can be viewed as ‘secondary’ material
aimed at increasing the visibility and usability
of datasets and to aid research reproducibility
• For all types of data the descriptor will be peer
reviewed
77
What is Scientific Data..?
• As part of the peer review process we will
check that the data is publically available in an
approved data repository and follows
community guidelines
• All content will be published open access with
the author able to select from a number of
options. In addition the descriptor metadata
will be available under CC0.
• An in-house editorial team and new authoring
tools are being developed to ensure the
creation, submission, curation and publication
of data descriptors is as simple as possible
• The external advisory board will represent
different stakeholder views and provide
feedback on key services.
88
8
Data Descriptors
a new publication type for describing scientifically valuable
datasets
SciData DD
Structured
content
Export to various
formats
(ISA_tab, RDF, etc
)
Datasets
Interoperate with Community resources
Code Workflows
Advanced Search
and Discovery
functions
SciData DD
Structured
content
SciData DD
Structured
content
SciData DD
Structured
content
Link to related
Content
Nature Methods
Scientific Reports
Nature Genetics
99
Narrative content
complements both journal articles and repository records
Includes
– Highly detailed, reproducible methods descriptions
– Quality control & technical validation experiments
– Searchable, machine-readable meta-data
Does Not Include
– In depth analysis or tests of hypotheses
– New scientific conclusions
– Exploratory analysis (e.g. clustering)
1010
10
Structured content
It will be based on and compatible with ISA-tab and
undergo technical review by biocuration/standards referees
Submit ISA-tab files directly OR Submission tools and simple templates
help authors provide the information
without special tools
In-house curator
standardizes the
structured content
1111
License types
Data: the raw datasets will reside in public
repositories and likely to be CC0 similar to
Figshare and Dryad etc…
DATA DESCRIPTOR
Metadata: as NPG has already done with its
existing Linked Data Portal the metadata about
data descriptors in Scientific Data will be CC0
Narative/Figures: the narrative describing the
methodology of data generation/collection and
processing will be licensed under either of the
following, by author choice:
1212
Susanna-Assunta Sansone - Honorary Academic Editor
Andrew L Hufton - Managing Editor
Advisory Panel
Supported by
Joseph R. Ecker
Salk Institute, USA
Mark Forster
Syngenta, UK
Stephen Friend
Sage Bionetworks, USA
Pascale Gaudet
Swiss Institute of
Bioinformatics, Switzerland
Anne-Claude Gavin
EMBL, Germany
Albert J. R. Heck
Utrecht University, The Netherlands
Wolfram Horstmann
University of Oxford, UK
Johanna McEntyre
EMBL-EBI, European Bioinformatics Institute, UK
Anthony Rowe
Johnson & Johnson, USA
Richard H. Scheuermann
J. Craig Venter Institute, USA
Caroline Shamu
Harvard Medical School, USA
Jessica Tenenbaum
Duke Translational Medicine Institute, USA
Weida Tong
National Center for Toxicological
Research, FDA, USA
Judith A. Blake
The Jackson Laboratory, USA
Chris Bowler
IBENS, France
Piero Carninci
RIKEN Omics Science
Center, Japan
David Carr
Wellcome Trust, UK
Stephen Chanock
National Cancer Institute, USA
Simon Hodson
Jisc, UK
Who are we?
1313
Contacts
Call for submission Fall 2013
Launching in Spring 2014
13
• www.nature.com/scientificdata
• Email: scientificdata@nature.com
• Twitter: @ScientificData
Evolution
1515
Evolution - SI
• Greater accessibility/visibility
• Greater discoverability
• Currently about to be piloted on
• Nature Structural and Molecular Biology
• Nature Cell Biology
1616
Evolution
Source Data
About to be implemented on Nature
branded life science journals
Initially data behind figures
Data Citations
Thankyou

More Related Content

What's hot

Peer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journalPeer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journalVarsha Khodiyar
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data ManagementAnita de Waard
 
On community-standards, data curation and scholarly communication" Stanford M...
On community-standards, data curation and scholarly communication" Stanford M...On community-standards, data curation and scholarly communication" Stanford M...
On community-standards, data curation and scholarly communication" Stanford M...Susanna-Assunta Sansone
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersIncisive_Events
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystemVarsha Khodiyar
 
Publishing and impact 20141028
Publishing and impact 20141028Publishing and impact 20141028
Publishing and impact 20141028Hugo Besemer
 
NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016Susanna-Assunta Sansone
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016 Rebecca Raworth, MLIS
 
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...Merce Crosas
 
The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpVarsha Khodiyar
 
A Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data RepositoriesA Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data RepositoriesLIBER Europe
 
Workflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopterWorkflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopterVarsha Khodiyar
 
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Merce Crosas
 
Simon Goudie - Wiley’s Recommendations for Journal Data Policies
Simon Goudie - Wiley’s Recommendations for Journal Data PoliciesSimon Goudie - Wiley’s Recommendations for Journal Data Policies
Simon Goudie - Wiley’s Recommendations for Journal Data PoliciesWiley
 
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...Susanna-Assunta Sansone
 
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014Susanna-Assunta Sansone
 
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...ASIS&T
 
Open Science: Research Data Management
Open Science: Research Data ManagementOpen Science: Research Data Management
Open Science: Research Data ManagementLibrary_Connect
 

What's hot (20)

Peer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journalPeer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journal
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
On community-standards, data curation and scholarly communication" Stanford M...
On community-standards, data curation and scholarly communication" Stanford M...On community-standards, data curation and scholarly communication" Stanford M...
On community-standards, data curation and scholarly communication" Stanford M...
 
Researcher perspectives on publication and peer review of data.
Researcher perspectives on publication and peer review of data.Researcher perspectives on publication and peer review of data.
Researcher perspectives on publication and peer review of data.
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producers
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
 
Publishing and impact 20141028
Publishing and impact 20141028Publishing and impact 20141028
Publishing and impact 20141028
 
NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
 
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
 
The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can help
 
A Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data RepositoriesA Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data Repositories
 
Workflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopterWorkflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopter
 
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
 
Simon Goudie - Wiley’s Recommendations for Journal Data Policies
Simon Goudie - Wiley’s Recommendations for Journal Data PoliciesSimon Goudie - Wiley’s Recommendations for Journal Data Policies
Simon Goudie - Wiley’s Recommendations for Journal Data Policies
 
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
 
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
 
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
 
Open Science: Research Data Management
Open Science: Research Data ManagementOpen Science: Research Data Management
Open Science: Research Data Management
 
Burton - Security, Privacy and Trust
Burton - Security, Privacy and TrustBurton - Security, Privacy and Trust
Burton - Security, Privacy and Trust
 

Similar to Wilson-npg-scientific data-nfdp13

Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishingVarsha Khodiyar
 
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Susanna-Assunta Sansone
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataSusanna-Assunta Sansone
 
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...The University of Edinburgh
 
Research Transparency in the Social Sciences: DA-RT
Research Transparency in the Social Sciences: DA-RTResearch Transparency in the Social Sciences: DA-RT
Research Transparency in the Social Sciences: DA-RTARDC
 
Effective research data management
Effective research data managementEffective research data management
Effective research data managementCatherine Gold
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ LibraryARDC
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objectsseanb
 
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014Susanna-Assunta Sansone
 
Data publication: Discover, Explore, Visualise
Data publication: Discover, Explore, VisualiseData publication: Discover, Explore, Visualise
Data publication: Discover, Explore, VisualiseAlejandra Gonzalez-Beltran
 
Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)aaroncollie
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016Rebecca Raworth, MLIS
 
Toward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemToward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemGlobus
 
Identifying and tracking research resources using RRIDs: a practical approach
Identifying and tracking research resources using RRIDs:  a practical approachIdentifying and tracking research resources using RRIDs:  a practical approach
Identifying and tracking research resources using RRIDs: a practical approachdkNET
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital EnvironmentManaging, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital Environmentphilipdurbin
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing dataWorld Agroforestry (ICRAF)
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceNC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceSusanna-Assunta Sansone
 

Similar to Wilson-npg-scientific data-nfdp13 (20)

Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishing
 
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
 
Research Transparency in the Social Sciences: DA-RT
Research Transparency in the Social Sciences: DA-RTResearch Transparency in the Social Sciences: DA-RT
Research Transparency in the Social Sciences: DA-RT
 
Effective research data management
Effective research data managementEffective research data management
Effective research data management
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ Library
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objects
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
 
Data publication: Discover, Explore, Visualise
Data publication: Discover, Explore, VisualiseData publication: Discover, Explore, Visualise
Data publication: Discover, Explore, Visualise
 
Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
 
Toward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemToward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data Ecosystem
 
Identifying and tracking research resources using RRIDs: a practical approach
Identifying and tracking research resources using RRIDs:  a practical approachIdentifying and tracking research resources using RRIDs:  a practical approach
Identifying and tracking research resources using RRIDs: a practical approach
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital EnvironmentManaging, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital Environment
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing data
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceNC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
 

More from DataDryad

Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13DataDryad
 
Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13DataDryad
 
Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13DataDryad
 
Hole-data journal-nfdp13
Hole-data journal-nfdp13Hole-data journal-nfdp13
Hole-data journal-nfdp13DataDryad
 
Shotton force11-nfdp13
Shotton force11-nfdp13Shotton force11-nfdp13
Shotton force11-nfdp13DataDryad
 
Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13DataDryad
 
Irving-TeraData: data and science driven big industry-nfdp13
Irving-TeraData: data and science driven big industry-nfdp13Irving-TeraData: data and science driven big industry-nfdp13
Irving-TeraData: data and science driven big industry-nfdp13DataDryad
 
Mounce-Herding Cats
Mounce-Herding CatsMounce-Herding Cats
Mounce-Herding CatsDataDryad
 
Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13DataDryad
 
Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13DataDryad
 
Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13DataDryad
 
Costas-data metrics-nfdp13
Costas-data metrics-nfdp13Costas-data metrics-nfdp13
Costas-data metrics-nfdp13DataDryad
 
Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13DataDryad
 
Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13DataDryad
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13DataDryad
 
Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13DataDryad
 
Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13DataDryad
 
Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13DataDryad
 
Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013DataDryad
 
Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13DataDryad
 

More from DataDryad (20)

Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13
 
Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13
 
Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13
 
Hole-data journal-nfdp13
Hole-data journal-nfdp13Hole-data journal-nfdp13
Hole-data journal-nfdp13
 
Shotton force11-nfdp13
Shotton force11-nfdp13Shotton force11-nfdp13
Shotton force11-nfdp13
 
Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13
 
Irving-TeraData: data and science driven big industry-nfdp13
Irving-TeraData: data and science driven big industry-nfdp13Irving-TeraData: data and science driven big industry-nfdp13
Irving-TeraData: data and science driven big industry-nfdp13
 
Mounce-Herding Cats
Mounce-Herding CatsMounce-Herding Cats
Mounce-Herding Cats
 
Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13
 
Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13
 
Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13
 
Costas-data metrics-nfdp13
Costas-data metrics-nfdp13Costas-data metrics-nfdp13
Costas-data metrics-nfdp13
 
Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13
 
Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
 
Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13
 
Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13
 
Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13
 
Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013
 
Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13
 

Recently uploaded

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 

Recently uploaded (20)

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Wilson-npg-scientific data-nfdp13

  • 1. The Now and Future of Data Publishing Oxford University – 22nd May Ruth Wilson Publisher Nature Publishing Group
  • 2. 22 Overview Context Scientific Data – Concept – Data descriptor – Licenses – Team Evolution – Better integration of SI – Source data – Data citations
  • 3. 33 Data, data, data Two important factors are driving to make research data more available and reusable: • To ensure the scientific process is transparent and can be scrutinised and research results reproduced • To speed the scientific process, lead to new insights and reduce duplicated and repeated work To achieve this research data needs to be – Available – Findable – Interpretable – Re-usable – Citable
  • 4. 44 Existing challenges • Data producers do not necessarily get appropriate credit for their work • Traditional publications are focused on hypothesis/conclusions • The peer review process at many research journals is not focused on ensuring data release and data standards • Data and info about datasets often ends in supp. material • Potentially valuable datasets are not released
  • 5. 5 Calling for submissions in Fall 2013, launching in Spring 2014 nature.com/scientificdata
  • 6. 66 What is Scientific Data? • Scientific Data is an Open Access, online-only platform containing data descriptors that describe and explain datasets, supported by an APC model. • Data descriptors are a new type of content and can be viewed as ‘secondary’ material aimed at increasing the visibility and usability of datasets and to aid research reproducibility • For all types of data the descriptor will be peer reviewed
  • 7. 77 What is Scientific Data..? • As part of the peer review process we will check that the data is publically available in an approved data repository and follows community guidelines • All content will be published open access with the author able to select from a number of options. In addition the descriptor metadata will be available under CC0. • An in-house editorial team and new authoring tools are being developed to ensure the creation, submission, curation and publication of data descriptors is as simple as possible • The external advisory board will represent different stakeholder views and provide feedback on key services.
  • 8. 88 8 Data Descriptors a new publication type for describing scientifically valuable datasets SciData DD Structured content Export to various formats (ISA_tab, RDF, etc ) Datasets Interoperate with Community resources Code Workflows Advanced Search and Discovery functions SciData DD Structured content SciData DD Structured content SciData DD Structured content Link to related Content Nature Methods Scientific Reports Nature Genetics
  • 9. 99 Narrative content complements both journal articles and repository records Includes – Highly detailed, reproducible methods descriptions – Quality control & technical validation experiments – Searchable, machine-readable meta-data Does Not Include – In depth analysis or tests of hypotheses – New scientific conclusions – Exploratory analysis (e.g. clustering)
  • 10. 1010 10 Structured content It will be based on and compatible with ISA-tab and undergo technical review by biocuration/standards referees Submit ISA-tab files directly OR Submission tools and simple templates help authors provide the information without special tools In-house curator standardizes the structured content
  • 11. 1111 License types Data: the raw datasets will reside in public repositories and likely to be CC0 similar to Figshare and Dryad etc… DATA DESCRIPTOR Metadata: as NPG has already done with its existing Linked Data Portal the metadata about data descriptors in Scientific Data will be CC0 Narative/Figures: the narrative describing the methodology of data generation/collection and processing will be licensed under either of the following, by author choice:
  • 12. 1212 Susanna-Assunta Sansone - Honorary Academic Editor Andrew L Hufton - Managing Editor Advisory Panel Supported by Joseph R. Ecker Salk Institute, USA Mark Forster Syngenta, UK Stephen Friend Sage Bionetworks, USA Pascale Gaudet Swiss Institute of Bioinformatics, Switzerland Anne-Claude Gavin EMBL, Germany Albert J. R. Heck Utrecht University, The Netherlands Wolfram Horstmann University of Oxford, UK Johanna McEntyre EMBL-EBI, European Bioinformatics Institute, UK Anthony Rowe Johnson & Johnson, USA Richard H. Scheuermann J. Craig Venter Institute, USA Caroline Shamu Harvard Medical School, USA Jessica Tenenbaum Duke Translational Medicine Institute, USA Weida Tong National Center for Toxicological Research, FDA, USA Judith A. Blake The Jackson Laboratory, USA Chris Bowler IBENS, France Piero Carninci RIKEN Omics Science Center, Japan David Carr Wellcome Trust, UK Stephen Chanock National Cancer Institute, USA Simon Hodson Jisc, UK Who are we?
  • 13. 1313 Contacts Call for submission Fall 2013 Launching in Spring 2014 13 • www.nature.com/scientificdata • Email: scientificdata@nature.com • Twitter: @ScientificData
  • 15. 1515 Evolution - SI • Greater accessibility/visibility • Greater discoverability • Currently about to be piloted on • Nature Structural and Molecular Biology • Nature Cell Biology
  • 16. 1616 Evolution Source Data About to be implemented on Nature branded life science journals Initially data behind figures Data Citations

Editor's Notes

  1. Very broad theme and not so much time so will concentrate on two aspects of linking between publications and research data at NPG, one is a new product Scientific Data – A new data focused OA peer reviewed platform, other is evolution of practises for existing journals.A small amount of context
  2. What are the existing challengesWe know that much research data is stored in draws if stored at all….
  3. Response to challenges NPG is launching Scientific Data - focused on data interpretation and reuseCalling for submissions in Fall 2013, launching in Spring 2014Six Principles. The Scientific Brand: Innovative new publishing brand from NPG. Open-access, community-driven. Feature-rich. Complements the more tradition-bound Nature titles.
  4. New layer in between traditional journal articles and Repositories. We don’t store the data.
  5. Blessing and a curse……UnderutilisedPublishers (including NPG) do little with Supplementary Information (SI), other than present it with the article in PDF form (not being in xml/html format makes it hard to index and find)Growth difficult to managePublishers are struggling with the growing amount of SI in the life sciences: since 2010 the Journal of Neuroscience no longer accepts SI as it felt it was adversely affecting peer review. In 2009 Cell restricted the number and volume of SIIncreasing volumes at NPGNumber of pieces of SI in NG grown by 65% between 2008 and 2011There were 1515 pieces of SI in Nature Genetics (incl. figures and tables) in the first half of 2011, compared to 915 in the same time period in 2008 The volume of SI across NPG has grown from 5299 files, to 6469 and 7120 (2008 – 2010)(22%, 10%) These figures do not break out individual figures and tables but instead look at the number of PDF files, doc files, xls files etc. Approx 60% are PDF files
  6. Source data – has been on EMBO and MSB for some time….Linked to Nature journals’ updated editorial policies aim to improve transparency and reproducibility by: -Both requiring much more precise description of statistics and employing the expertise of a statistics consultant, where needed;-Increasing the lengths of Methods sections in journals to allow authors to be much more descriptive and facilitate replication of their findings and;-Publishing source data: first the actual data points, that is, tabular source data, behind figures; next additional forms of source data.To this point we have been citing data sets in an online Accession Codes section in our articles online by listing the repository name and, via the persistent identifier, linking to the data set entry in the repository. We will further formalize Data Citations by having them appear in a similar manner to bibliographic references including ensuring that data set authors are more granularly credited for their work (and including the date, minimally year, of data deposition).