1. Research Data & Output Management
Business Analytics Research Team
25 September, 2015
Jane Frazier
Data Librarian, Australian National Data Service
@mignon1915
This work is licensed under creativecommons.org/licenses/by/2.0/au/
5. What is research data?
● pieces of information held in any format or media
● raw, cleaned or processed
● experimental or observational
● numerical, descriptive, visual or tactile
● used as primary sources for research
● necessary to validate research findings
6. What is research output?
https://www.auckland.ac.nz/en/about/the-university/how-university-works/policy-and-administration/research/output-system-and-reports/research-outputs--definition-and-categories.html
9. Reproducibility in computational research
Peng. Reproducible Research in Computational Science.
Science, 2011. [DOI:10.1126/science.1213847]
10. Reproducibility in computational research
general principles
● Assign a unique ID for each version of released data & code
● Use open licensing for data & code
● Workflow tracking should happen during the research process
● In your publication, include a statement describing computing environment(s) and software version(s)
● Make data, code and methods available and accessible
○ Version control
● Publish data, code and methods in non-proprietary formats (if possible)
● Cite any 3rd party data and code
● Follow data and code sharing guidelines for funded research
Stodden and Miguez. Best practices for computational science: software infrastructure and environments for reproducible and extensible research.
Journal of Open Research Software, 2014. [DOI:10.5334/jors.ay]
Yale Law School Roundtable on Data and Code Sharing. Reproducible research: addressing the need for data and code sharing in computational science.
Columbia University Academic Commons, 2010. [DOI:10.1109/MCSE.2010.113]
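One of the principles above asks for a statement describing the computing environment(s) and software version(s). A minimal sketch of how such a statement could be generated in Python follows. The function name `environment_statement` and the package names passed to it are hypothetical choices for illustration and do not come from the cited papers.

```python
import platform
from importlib import metadata


def environment_statement(packages):
    """Build a one-line description of the interpreter, OS and package versions.

    `packages` is a list of distribution names chosen by the researcher
    (hypothetical input; list whatever your analysis actually depends on).
    """
    parts = [
        f"Python {platform.python_version()} on "
        f"{platform.system()} {platform.release()} ({platform.machine()})"
    ]
    for name in packages:
        try:
            version = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Report missing packages explicitly instead of failing.
            version = "not installed"
        parts.append(f"{name} {version}")
    return "; ".join(parts)


if __name__ == "__main__":
    # Example package names only; replace with your own dependencies.
    print(environment_statement(["numpy", "pandas"]))
```

The printed line can be pasted into a methods section or saved alongside the released data and code, so that the environment is recorded during the research process rather than reconstructed afterwards.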
11. Version Control
for data, for code, for environments
No widely accepted conventions yet, but lots of work being done… {assignment of a DOI for each version is recommended}
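As an illustration only: a DOI has to be minted through a registration service, but a content checksum can be computed locally and recorded next to each released version, so that readers can confirm they hold exactly the file a given version refers to. The sketch below assumes SHA-256 as the checksum and uses a hypothetical function name, `version_fingerprint`. It is a complement to a per-version DOI, not a replacement for one, and is not a convention taken from the slides.

```python
import hashlib
from pathlib import Path


def version_fingerprint(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks.

    Identical file contents always give the same digest, and any change
    to the contents gives a different one.
    """
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks so large data files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

For example, `version_fingerprint("results_v2.csv")` (a hypothetical file name) returns a 64-character hexadecimal string that could be listed in a README or in the metadata deposited with that version.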
12. Further Resources
Guide to best practices for researchers publishing computational results [http://wiki.stodden.net/Best_Practices]
W3C Provenance Working Group [http://www.w3.org/2011/prov/wiki/Main_Page]
On dynamic data citation [https://rd-alliance.org/groups/data-citation-wg/wiki/scalable-dynamic-data-citation-rda-wg-dc-position-paper.html and http://dublincore.org/resources/training/ASIST_Webinar_20150408/Rauber-2015-04-08.pdf]
Research Data Alliance Data Citation Working Group [https://rd-alliance.org/groups/data-citation-wg.html]
Research Data Alliance Research Data Provenance Interest Group [https://rd-alliance.org/groups/research-data-provenance.html]
Research Data Alliance Reproducibility Interest Group [https://rd-alliance.org/groups/reproducibility-ig.html]