SlideShare a Scribd company logo
1 of 61
Directions in Open Science
Mike Travers
SRI Bioinformatics Research
Group
For AIC Lunch and Learn, 30 Jan 2012
About this talk
• Partly a trip report from Open Science
Summit 2011
• Partly an attempt to define open science
and explore its impact
• Partly an excuse to talk about some of my
own vaguely related work
• And partly some semi-crazy speculation
about future projects in this space
The Open Science Summit unites researchers, life science
industry professionals, students, patients and other
stakeholders to discuss the future of collaborative science
and innovation.
…in-depth sessions on new models for drug discovery and
clinical trials, personal genomics, the patent system, the
future of scientific publications, and more.
What is Open Science?
• Many different things, but boils down to:
• Removing barriers to scientific
communication and collaboration:
– Social
– Technical
– Legal
– Economic
– Bureaucratic
• To accelerate scientific progress
• Utilizing modern technology
Driven by technological change
• The Internet has radically reduced
communication costs
• So old institutions of scientific
communication are now obstacles
– Closed academic publishers, notably:
• Internet will transform scientific media just
like it has newspapers, TV, social life….
• The difference is: science is more
important than sharing cat pictures
For-profit
academic
publishing is a
racket
A very lucrative
one
Starting to be
rumbles of
complaint
(boycotts) from
academics
Open
• Most visible and successful branch
of open science
• Articles are free to read, pay
to publish
• Funders are starting to require
some form of public access
Gold: OA journal, Green: OA self-archiving
Open Access to the Scientific Journal Literature: Situation 2009, PLoS ONE, Bo-Christer Björk et al
Research Works Act
• H.R.3699 – “A bill to ensure the continued
publication and integrity of the peer-reviewed
research works by the private sector.”
No Federal agency may adopt, implement, maintain, continue, or otherwise engage in any
policy, program, or other activity that--
(1) causes, permits, or authorizes network dissemination of any private-sector research
work without the prior consent of the publisher of such work; or
(2) requires that any actual or prospective author, or the employer of such an actual or
prospective author, assent to network dissemination of a private-sector research work.
Myth 1: American consumers have a right to free access to articles their tax
dollars fund.
Fact
American taxpayers do not fund peer reviewed research articles; they fund some
of the research that is used in those articles…
Beyond Open Access
• Not going to say a whole lot about OA, because:
• It’s easy to understand
• It’s pretty clearly going to win in the long term
• By itself, not a very radical change to how science is
done:
– Knowledge is still in paper-sized chunks
– Papers are peer-reviewed prior to publication;
– Once something is published, it’s static
• All these parameters are being challenged in some
way by other efforts
• George Whitesides (Harvard chemist): “The concept
of the scientific paper is eroding before our very
eyes”
Variations on publishing
• “Peer review is broken”
– Too slow
– Too biased
– Too rigid
– May be “the worst system except for all the others”
• Pre-peer-review publication
– Eg arXiv.org
• Micropublication
– Crowdsourcing, blogs, wikis….
• Open-notebook science
– No gap at all between bench and publication
• Database-linked publications
• Dynamic Review Papers
Biggest sequencing operation
in the world
Generating 6 terabytes/day of
genomic data
Open-Source Genomic Analysis of Shiga-Toxin–Producing E. coli
O104:H4 Rohde et al 2011 (NEJM)
Toxic E. coli outbreak in Germany May 2011:
We released these data into the public domain… which elicited a burst of crowd-
sourced, curiosity-driven analyses carried out by bioinformaticians on four
continents. Twenty-four hours after the release of the genome, it had been
assembled; … Five days after the release of the sequence data, we had designed and
released strain-specific diagnostic primer sequences, and within a week, two dozen
reports had been filed on an open-source wiki …dedicated to analysis of the strain
https://github.com/ehec-outbreak-crowdsourced
GigaScience is a new integrated database and journal
co-published in collaboration between BGI Shenzhen
and BioMed Central, to meet the needs of a new
generation of biological and biomedical research as it
enters the era of "big-data."
Dynamic Review Papers
Conventional paper
Paired with
Dynamically-
updated,
wiki-based
paper/database/mo
del
Driving apps
Who comes to Open Science
Summits?
Activist Organizations
Participatory Medicine
& Disease Foundations
Startups
Social paper and citation
management
Scientific services
marketplace
Web-based molecule
library management
Citizen Science
Somewhat less garage-
• Independent research institute, started
from data released by Merck
• Repository of experimental data (Sage
Commons)
• Network of cooperating institutions
• Starting to build a computational platform
(Synapse)
Synthetic Biology
And some individual
researchers
• Peter Murray-Rust
Chemist, Cambridge,
promoter of Chemical Markup Language and
semantic web
“Closed science makes people die!”
• Victoria Stodden
Statistician, Columbia,
reproducibility of computational science
(cf ClimateGate)
Some open science success
stories
• Galaxy Zoo
• FoldIt
• Nutrient Network (NutNet)
• Prazinquantel synthesis
Galaxy Zoo
• Citizen science (loosely)
• Image classification task
• Mechanical Turk-like approach (but
unpaid)
• About 200K participants
• Discovered a whole new class of galaxies
(“green pea”) and a quasar mirror
• 22 published papers in 3 years
Social sharing of algorithms (“recipes”)
Descent with modification
Matthew Todd, chemist at U
of Syndney
Schistosmiasis
Looking for synthesis for
known drug Prazinquantel
(PZQ) in enantiopure form
Open-notebook science
(LabTrove)
Nutrient Network (NutNet)
What paper has the most
authors?
• NutNet paper:
40 authors, 41 institutions
• This one from SLAC and elsewhere:
407 authors, but only 35 institutions
Three variations on the scientific
process
• Automated Science
• Distributed Science
• Web-scale Intelligent Science
• Open Science as the lubrication /
accelerant that makes these feasible
Afferent: Automation for Drug
Discovery
• Combinatorial Chemistry
• Planning software to drive lab robots
Distributed Science
• Some science (eg evaluation of drug
candidates) is highly parallelizable,
• Hence distributable
• CollabRx was initially supposed to support
“virtual pharma companies” that would tie
disparate academic research efforts into
focused teams
Web-scale Intelligent Science
• Imagine all of science as a giant distributed
computational process
• Individual scientists are agents
– working on a small part of the problem
– Sharing their results
– Getting feedback and funding dependent on
success
• Centralized data integration and decision
tools used to help determine next useful
experiment
Steps towards distributed
intelligence
• Adaptive clinical trials
– Rather than a classical trial with two arms run to
completion
– Change the distribution of test cases based on
ongoing results
• Now imagine this strategy applied more globally
across all treatments for a disease
• Credit for this slightly mad vision goes mainly to
Marty Tenenbaum:
– AI Meets Web 2.0 (2006)
– Shrager, Tenenbaum, Travers, Cancer Commons:
Biomedicine in the Internet Age (2011)
What does all that have to do with
Open Science?
• Open Science is lowering barriers to
collaboration
• So it’s a necessary but not sufficient step
towards this new kind of science
• CollabRx may just have been too early:
– the groundwork hasn’t been laid yet,
– we are still working on basics
– (eg standards for representation)
• Reducing friction (or transaction costs) can
be incredibly important
“Changing the cost of innovation
fundamentally changes the nature of
innovation”
– Joichi Ito
TCP, HTTP etc are the
containerization of
data.
So what’s the analog
for scientific
knowledge?
Standardized Legal and
Institutional Mechanisms
A mix of technical,
institutional, and legal
standardization:
-Standard licenses
(parameterizable)
-RDF representation for
licenses.
-Web Tools to generate
these
-Sites that collect and
“market” available
materials.
BioBike, a platform for open
science
• Conceived of as a vehicle for getting
biologists to do their own knowledge-
based biocomputing.
• Lisp + Frame system + Bioinformatics
Tools
– Through-the-web programmability
– Community sharing of code and data
– Visual Programming Language
• Open Source
•
Jeff Elhai, Arnaud Taton, J. P. Massar, John K. Myers, Michael Travers, Johnny Casey, Mark
Slupesky, Jeff Shrager. BioBIKE: A Web-based, programmable, integrated biological knowledge
base. Nucleic Acids Research, 2009
BioBike and Open Science
• BioBike wasn’t for Open Science per se
• But it did explore some ideas in web-
based biocomputation
• The next-generation BioBike platform:
– Data: Big data, Open data, semantic web
integrated
– Programming: Able to deal with large scale
and distributed workflows with human
elements
– Collaboration: Integrating different
communities in a “trading zone”
KnowOS: The (Re)Birth of the Knowledge Operating
System. Mike Travers, JP Massar, and Jeff Shrager,
International Lisp Conference 2005
What is a platform?
• The economic meaning of “platform” is interesting
• Something that:
– Supports two-sided network effects
– Stands in the middle and extracts a toll
• Examples:
– Credit cards
(merchants ↔ consumers)
– Operating systems
(application developers ↔ users)
• Science has more complicated networks and relations
– Data providers
– Data consumers
– Service providers
– Analysts (statisticians, eg)
– Patients
• A science platform is not going to make anyone rich like Facebook,
but it would be nice to have a powerful and standard way for all
these groups to collaborate.
Open Data is outstripping analysis
capacity
• Or in other words:
– data is cheap,
– attention, knowledge, & expertise are
expensive
• A platform for collaborative computational
interpretation of biological data
• To better leverage the expensive
resources
identifies advancing new computational infrastructure as a
priority for driving innovation in science and engineering.
Scientific discovery and innovation are advancing
along fundamentally new pathways opened by the
development of increasingly sophisticated software.
the overarching goal of transforming
innovations in research and education into
sustained software resources that are an
integral part of the cyberinfrastructure
Anti-open arguments
• Peer-review is an essential filter; without it
too much nonsense gets out
• Electronic availability of articles actually leads
to narrowing of science (Evans, 2008)
• Privacy, HIPAA, etc.
• Need to retain IP for economic motivation
• The problem isn’t availability of data; it’s
making sense of what we do have
• See PRISM for more
Opener Science
• Science is already
pretty open!
• institutions of openness
played a role in the
foundation of science,
including the first
scientific journals
Historical Origins of Open
Science
• Before the invention of science,
knowledge of the natural world was closely
guarded, passed down from master to
apprentice.
• The development of institutions of
openness was a key factor in the scientific
revolution (Paul David, Stanford
economist)
• …and the printing press was a key factor
in that.
So…
• The printing press is almost 600 years old
• The scientific journal is almost 350 years old
• There’s been some advancement in
communication technology since then…
• Science will eventually change:
– Either a modest acceleration of the scientific
process,
– Or as significant and discontinuous as the first
scientific revolution
• Which one? An open question.
Further Reading
End

More Related Content

What's hot

Data visualisations: drawing actionable insights from science and technology ...
Data visualisations: drawing actionable insights from science and technology ...Data visualisations: drawing actionable insights from science and technology ...
Data visualisations: drawing actionable insights from science and technology ...EFSA EU
 
20160719 23 Research Data Things
20160719 23 Research Data Things20160719 23 Research Data Things
20160719 23 Research Data ThingsKatina Toufexis
 
20160523 23 Research Data Things
20160523 23 Research Data Things20160523 23 Research Data Things
20160523 23 Research Data ThingsKatina Toufexis
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Jisc
 
Winning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceMartin Donnelly
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data ThingsKatina Toufexis
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?Philip Bourne
 
A Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterpriseA Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterprisePhilip Bourne
 
Open Science, Why not?
Open Science, Why not?Open Science, Why not?
Open Science, Why not?Nancy Pontika
 
Presentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical SocietyPresentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical Societyosimod
 
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017Sandra Gesing
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamPlatforma Otwartej Nauki
 
Symbiosis—Is Collaboration the New Innovation? (Part 3 of 3), Mike Conlon
Symbiosis—Is Collaboration the New Innovation?  (Part 3 of 3), Mike ConlonSymbiosis—Is Collaboration the New Innovation?  (Part 3 of 3), Mike Conlon
Symbiosis—Is Collaboration the New Innovation? (Part 3 of 3), Mike ConlonAllen Press
 
Research data spring: streamlining deposit
Research data spring: streamlining depositResearch data spring: streamlining deposit
Research data spring: streamlining depositJisc RDM
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science LandscapePhilip Bourne
 
How to support decisions with online collaborative models?
How to support decisions with online collaborative models?How to support decisions with online collaborative models?
How to support decisions with online collaborative models?EFSA EU
 
How to Improve Research Visibility and Impact: Session 5, Online Repository
How to Improve Research Visibility and Impact: Session 5, Online RepositoryHow to Improve Research Visibility and Impact: Session 5, Online Repository
How to Improve Research Visibility and Impact: Session 5, Online RepositoryNader Ale Ebrahim
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectivePhilip Bourne
 

What's hot (20)

Cern webfest
Cern webfestCern webfest
Cern webfest
 
Data visualisations: drawing actionable insights from science and technology ...
Data visualisations: drawing actionable insights from science and technology ...Data visualisations: drawing actionable insights from science and technology ...
Data visualisations: drawing actionable insights from science and technology ...
 
20160719 23 Research Data Things
20160719 23 Research Data Things20160719 23 Research Data Things
20160719 23 Research Data Things
 
20160523 23 Research Data Things
20160523 23 Research Data Things20160523 23 Research Data Things
20160523 23 Research Data Things
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014
 
Winning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open Science
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data Things
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
 
A Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterpriseA Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital Enterprise
 
Open Science, Why not?
Open Science, Why not?Open Science, Why not?
Open Science, Why not?
 
Presentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical SocietyPresentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical Society
 
The State of Open Data Report by @figshare
The State of Open Data Report  by @figshareThe State of Open Data Report  by @figshare
The State of Open Data Report by @figshare
 
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 
Symbiosis—Is Collaboration the New Innovation? (Part 3 of 3), Mike Conlon
Symbiosis—Is Collaboration the New Innovation?  (Part 3 of 3), Mike ConlonSymbiosis—Is Collaboration the New Innovation?  (Part 3 of 3), Mike Conlon
Symbiosis—Is Collaboration the New Innovation? (Part 3 of 3), Mike Conlon
 
Research data spring: streamlining deposit
Research data spring: streamlining depositResearch data spring: streamlining deposit
Research data spring: streamlining deposit
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 
How to support decisions with online collaborative models?
How to support decisions with online collaborative models?How to support decisions with online collaborative models?
How to support decisions with online collaborative models?
 
How to Improve Research Visibility and Impact: Session 5, Online Repository
How to Improve Research Visibility and Impact: Session 5, Online RepositoryHow to Improve Research Visibility and Impact: Session 5, Online Repository
How to Improve Research Visibility and Impact: Session 5, Online Repository
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH Perspective
 

Viewers also liked

Building Capacity for Open Science
Building Capacity for Open ScienceBuilding Capacity for Open Science
Building Capacity for Open ScienceKaitlin Thaney
 
The Future of Open Science
The Future of Open ScienceThe Future of Open Science
The Future of Open SciencePhilip Bourne
 
What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?Leslie Chan
 
Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Rene Von schomberg
 
Introduction to open science
Introduction to open scienceIntroduction to open science
Introduction to open scienceReme Melero
 
Principles and practice of Open Science
Principles and practice of Open SciencePrinciples and practice of Open Science
Principles and practice of Open Sciencepetermurrayrust
 
Open Science at the European Commission
Open Science at the European CommissionOpen Science at the European Commission
Open Science at the European CommissionCarl-Christian Buhr
 
Open Science: from the Knowledge Economy to Wisdom Societies
Open Science: from the Knowledge Economy to Wisdom SocietiesOpen Science: from the Knowledge Economy to Wisdom Societies
Open Science: from the Knowledge Economy to Wisdom SocietiesGuillaume Dumas
 
Efforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research LibrariesEfforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research LibrariesLIBER Europe
 
Brigita Serafinavičiūtė. Open science – The Essential Ingredient of Our Future
Brigita Serafinavičiūtė. Open science – The Essential Ingredient of Our FutureBrigita Serafinavičiūtė. Open science – The Essential Ingredient of Our Future
Brigita Serafinavičiūtė. Open science – The Essential Ingredient of Our FutureAidis Stukas
 
The current state(s) of Open Science
The current state(s) of Open ScienceThe current state(s) of Open Science
The current state(s) of Open Scienceuherb
 
Bioinformatics for Computer Scientists
Bioinformatics for Computer Scientists Bioinformatics for Computer Scientists
Bioinformatics for Computer Scientists Aung Thu Rha Hein
 
Columbia Talk on Open Notebook Science
Columbia Talk on Open Notebook ScienceColumbia Talk on Open Notebook Science
Columbia Talk on Open Notebook ScienceJean-Claude Bradley
 
Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020 Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020 Reme Melero
 
Open data and Open Science
Open data and Open ScienceOpen data and Open Science
Open data and Open Sciencepetermurrayrust
 
Unit 1, Lesson 1.8 - The Scientific Method (Part Two)
Unit 1, Lesson 1.8 - The Scientific Method (Part Two)Unit 1, Lesson 1.8 - The Scientific Method (Part Two)
Unit 1, Lesson 1.8 - The Scientific Method (Part Two)judan1970
 

Viewers also liked (20)

Building Capacity for Open Science
Building Capacity for Open ScienceBuilding Capacity for Open Science
Building Capacity for Open Science
 
The Future of Open Science
The Future of Open ScienceThe Future of Open Science
The Future of Open Science
 
What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?
 
Open science
Open scienceOpen science
Open science
 
Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts';
 
Introduction to open science
Introduction to open scienceIntroduction to open science
Introduction to open science
 
Principles and practice of Open Science
Principles and practice of Open SciencePrinciples and practice of Open Science
Principles and practice of Open Science
 
Open Science at the European Commission
Open Science at the European CommissionOpen Science at the European Commission
Open Science at the European Commission
 
Open Science: from the Knowledge Economy to Wisdom Societies
Open Science: from the Knowledge Economy to Wisdom SocietiesOpen Science: from the Knowledge Economy to Wisdom Societies
Open Science: from the Knowledge Economy to Wisdom Societies
 
Open Science and Identifiers
Open Science and IdentifiersOpen Science and Identifiers
Open Science and Identifiers
 
Efforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research LibrariesEfforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research Libraries
 
Brigita Serafinavičiūtė. Open science – The Essential Ingredient of Our Future
Brigita Serafinavičiūtė. Open science – The Essential Ingredient of Our FutureBrigita Serafinavičiūtė. Open science – The Essential Ingredient of Our Future
Brigita Serafinavičiūtė. Open science – The Essential Ingredient of Our Future
 
The current state(s) of Open Science
The current state(s) of Open ScienceThe current state(s) of Open Science
The current state(s) of Open Science
 
Bioinformatics for Computer Scientists
Bioinformatics for Computer Scientists Bioinformatics for Computer Scientists
Bioinformatics for Computer Scientists
 
Columbia Talk on Open Notebook Science
Columbia Talk on Open Notebook ScienceColumbia Talk on Open Notebook Science
Columbia Talk on Open Notebook Science
 
Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020 Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020
 
Relationships between Open Science, Science 2.0, and Social Media
Relationships between Open Science, Science 2.0, and Social MediaRelationships between Open Science, Science 2.0, and Social Media
Relationships between Open Science, Science 2.0, and Social Media
 
Open data and Open Science
Open data and Open ScienceOpen data and Open Science
Open data and Open Science
 
Open Science: What, why, how?
Open Science: What, why, how? Open Science: What, why, how?
Open Science: What, why, how?
 
Unit 1, Lesson 1.8 - The Scientific Method (Part Two)
Unit 1, Lesson 1.8 - The Scientific Method (Part Two)Unit 1, Lesson 1.8 - The Scientific Method (Part Two)
Unit 1, Lesson 1.8 - The Scientific Method (Part Two)
 

Similar to Directions in Open Science

Nicole Nogoy: GigaScience...how licensing can change the way we do research
Nicole Nogoy: GigaScience...how licensing can change the way we do researchNicole Nogoy: GigaScience...how licensing can change the way we do research
Nicole Nogoy: GigaScience...how licensing can change the way we do researchGigaScience, BGI Hong Kong
 
Biomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterpriseBiomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterprisePhilip Bourne
 
Share: Science Information Life Cycle
Share: Science Information Life CycleShare: Science Information Life Cycle
Share: Science Information Life Cyclekauberry
 
Understanding knowledge as a commons
Understanding knowledge as a commonsUnderstanding knowledge as a commons
Understanding knowledge as a commonsawillway
 
EoE HLN summer conference 2015 IK - open access & research data management
EoE HLN summer conference 2015 IK - open access & research data managementEoE HLN summer conference 2015 IK - open access & research data management
EoE HLN summer conference 2015 IK - open access & research data managementIsla Kuhn
 
FORCE11: Future of Research Communications and e-Scholarship
FORCE11:  Future of Research Communications and e-ScholarshipFORCE11:  Future of Research Communications and e-Scholarship
FORCE11: Future of Research Communications and e-ScholarshipMaryann Martone
 
How to Execute A Research Paper
How to Execute A Research PaperHow to Execute A Research Paper
How to Execute A Research PaperAnita de Waard
 
Next Generation Preprint Service
Next Generation Preprint ServiceNext Generation Preprint Service
Next Generation Preprint ServicePhilip Bourne
 
Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Maryann Martone
 
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016Jisc
 
Open science in RIKEN-KI doctorial course on March 20, 2019
Open science in RIKEN-KI doctorial course on March 20, 2019Open science in RIKEN-KI doctorial course on March 20, 2019
Open science in RIKEN-KI doctorial course on March 20, 2019Takeya Kasukawa
 
Practical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscapePractical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscapeDigital Science
 
Ontologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowOntologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowBarry Smith
 
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...GigaScience, BGI Hong Kong
 
Altmetrics: The Movement, The Tools, and the Implications
Altmetrics: The Movement, The Tools, and the ImplicationsAltmetrics: The Movement, The Tools, and the Implications
Altmetrics: The Movement, The Tools, and the ImplicationsKR_Barker
 

Similar to Directions in Open Science (20)

Nicole Nogoy: GigaScience...how licensing can change the way we do research
Nicole Nogoy: GigaScience...how licensing can change the way we do researchNicole Nogoy: GigaScience...how licensing can change the way we do research
Nicole Nogoy: GigaScience...how licensing can change the way we do research
 
Data and Research Infrastructures and Open Science
Data and Research Infrastructures and Open ScienceData and Research Infrastructures and Open Science
Data and Research Infrastructures and Open Science
 
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content TypesIlik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
 
Biomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital EnterpriseBiomedical Research as an Open Digital Enterprise
Biomedical Research as an Open Digital Enterprise
 
Share: Science Information Life Cycle
Share: Science Information Life CycleShare: Science Information Life Cycle
Share: Science Information Life Cycle
 
Understanding knowledge as a commons
Understanding knowledge as a commonsUnderstanding knowledge as a commons
Understanding knowledge as a commons
 
EoE HLN summer conference 2015 IK - open access & research data management
EoE HLN summer conference 2015 IK - open access & research data managementEoE HLN summer conference 2015 IK - open access & research data management
EoE HLN summer conference 2015 IK - open access & research data management
 
FORCE11: Future of Research Communications and e-Scholarship
FORCE11:  Future of Research Communications and e-ScholarshipFORCE11:  Future of Research Communications and e-Scholarship
FORCE11: Future of Research Communications and e-Scholarship
 
Alpsp final martone
Alpsp final martoneAlpsp final martone
Alpsp final martone
 
The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?
 
Science 2.0
Science 2.0Science 2.0
Science 2.0
 
How to Execute A Research Paper
How to Execute A Research PaperHow to Execute A Research Paper
How to Execute A Research Paper
 
Next Generation Preprint Service
Next Generation Preprint ServiceNext Generation Preprint Service
Next Generation Preprint Service
 
Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11
 
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
 
Open science in RIKEN-KI doctorial course on March 20, 2019
Open science in RIKEN-KI doctorial course on March 20, 2019Open science in RIKEN-KI doctorial course on March 20, 2019
Open science in RIKEN-KI doctorial course on March 20, 2019
 
Practical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscapePractical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscape
 
Ontologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowOntologies: What Librarians Need to Know
Ontologies: What Librarians Need to Know
 
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
 
Altmetrics: The Movement, The Tools, and the Implications
Altmetrics: The Movement, The Tools, and the ImplicationsAltmetrics: The Movement, The Tools, and the Implications
Altmetrics: The Movement, The Tools, and the Implications
 

Recently uploaded

TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 

Recently uploaded (20)

TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 

Directions in Open Science

  • 1. Directions in Open Science Mike Travers SRI Bioinformatics Research Group For AIC Lunch and Learn, 30 Jan 2012
  • 2. About this talk • Partly a trip report from Open Science Summit 2011 • Partly an attempt to define open science and explore its impact • Partly an excuse to talk about some of my own vaguely related work • And partly some semi-crazy speculation about future projects in this space
  • 3. The Open Science Summit unites researchers, life science industry professionals, students, patients and other stakeholders to discuss the future of collaborative science and innovation. …in-depth sessions on new models for drug discovery and clinical trials, personal genomics, the patent system, the future of scientific publications, and more.
  • 4. What is Open Science? • Many different things, but boils down to: • Removing barriers to scientific communication and collaboration: – Social – Technical – Legal – Economic – Bureaucratic • To accelerate scientific progress • Utilizing modern technology
  • 5. Driven by technological change • The Internet has radically reduced communication costs • So old institutions of scientific communication are now obstacles – Closed academic publishers, notably: • Internet will transform scientific media just like it has newspapers, TV, social life…. • The difference is: science is more important than sharing cat pictures
  • 6. For-profit academic publishing is a racket A very lucrative one Starting to be rumbles of complaint (boycotts) from academics
  • 7. Open • Most visible and successful branch of open science • Articles are free to read, pay to publish • Funders are starting to require some form of public access
  • 8. Gold: OA journal, Green: OA self-archiving Open Access to the Scientific Journal Literature: Situation 2009, PLoS ONE, Bo-Christer Björk et al
  • 9. Research Works Act • H.R.3699 – “A bill to ensure the continued publication and integrity of the peer-reviewed research works by the private sector.” No Federal agency may adopt, implement, maintain, continue, or otherwise engage in any policy, program, or other activity that-- (1) causes, permits, or authorizes network dissemination of any private-sector research work without the prior consent of the publisher of such work; or (2) requires that any actual or prospective author, or the employer of such an actual or prospective author, assent to network dissemination of a private-sector research work.
  • 10. Myth 1: American consumers have a right to free access to articles their tax dollars fund. Fact American taxpayers do not fund peer reviewed research articles; they fund some of the research that is used in those articles…
  • 11. Beyond Open Access • Not going to say a whole lot about OA, because: • It’s easy to understand • It’s pretty clearly going to win in the long term • By itself, not a very radical change to how science is done: – Knowledge is still in paper-sized chunks – Papers are peer-reviewed prior to publication; – Once something is published, it’s static • All these parameters are being challenged in some way by other efforts • George Whitesides (Harvard chemist): “The concept of the scientific paper is eroding before our very eyes”
  • 12. Variations on publishing • “Peer review is broken” – Too slow – Too biased – Too rigid – May be “the worst system except for all the others” • Pre-peer-review publication – Eg arXiv.org • Micropublication – Crowdsourcing, blogs, wikis…. • Open-notebook science – No gap at all between bench and publication • Database-linked publications • Dynamic Review Papers
  • 13. Biggest sequencing operation in the world Generating 6 terabytes/day of genomic data Open-Source Genomic Analysis of Shiga-Toxin–Producing E. coli O104:H4 Rohde et al 2011 (NEJM) Toxic E. coli outbreak in Germany May 2011: We released these data into the public domain… which elicited a burst of crowd- sourced, curiosity-driven analyses carried out by bioinformaticians on four continents. Twenty-four hours after the release of the genome, it had been assembled; … Five days after the release of the sequence data, we had designed and released strain-specific diagnostic primer sequences, and within a week, two dozen reports had been filed on an open-source wiki …dedicated to analysis of the strain https://github.com/ehec-outbreak-crowdsourced
  • 14. GigaScience is a new integrated database and journal co-published in collaboration between BGI Shenzhen and BioMed Central, to meet the needs of a new generation of biological and biomedical research as it enters the era of "big-data."
  • 15. Dynamic Review Papers Conventional paper Paired with Dynamically- updated, wiki-based paper/database/mo del
  • 16.
  • 18. Who comes to Open Science Summits?
  • 21. Startups Social paper and citation management Scientific services marketplace Web-based molecule library management
  • 23. Somewhat less garage- • Independent research institute, started from data released by Merck • Repository of experimental data (Sage Commons) • Network of cooperating institutions • Starting to build a computational platform (Synapse)
  • 25. And some individual researchers • Peter Murray-Rust Chemist, Cambridge, promoter of Chemical Markup Language and semantic web “Closed science makes people die!” • Victoria Stodden Statistician, Columbia, reproducibility of computational science (cf ClimateGate)
  • 26. Some open science success stories • Galaxy Zoo • FoldIt • Nutrient Network (NutNet) • Prazinquantel synthesis
  • 27.
  • 28. Galaxy Zoo • Citizen science (loosely) • Image classification task • Mechanical Turk-like approach (but unpaid) • About 200K participants • Discovered a whole new class of galaxies (“green pea”) and a quasar mirror • 22 published papers in 3 years
  • 29.
  • 30.
  • 31.
  • 32. Social sharing of algorithms (“recipes”) Descent with modification
  • 33. Matthew Todd, chemist at U of Syndney Schistosmiasis Looking for synthesis for known drug Prazinquantel (PZQ) in enantiopure form Open-notebook science (LabTrove)
  • 34.
  • 36.
  • 37. What paper has the most authors? • NutNet paper: 40 authors, 41 institutions • This one from SLAC and elsewhere: 407 authors, but only 35 institutions
  • 38. Three variations on the scientific process • Automated Science • Distributed Science • Web-scale Intelligent Science • Open Science as the lubrication / accelerant that makes these feasible
  • 39. Afferent: Automation for Drug Discovery • Combinatorial Chemistry • Planning software to drive lab robots
  • 40.
  • 41. Distributed Science • Some science (eg evaluation of drug candidates) is highly parallelizable, • Hence distributable • CollabRx was initially supposed to support “virtual pharma companies” that would tie disparate academic research efforts into focused teams
  • 42.
  • 43. Web-scale Intelligent Science • Imagine all of science as a giant distributed computational process • Individual scientists are agents – working on a small part of the problem – Sharing their results – Getting feedback and funding dependent on success • Centralized data integration and decision tools used to help determine next useful experiment
  • 44. Steps towards distributed intelligence • Adaptive clinical trials – Rather than a classical trial with two arms run to completion – Change the distribution of test cases based on ongoing results • Now imagine this strategy applied more globally across all treatments for a disease • Credit for this slightly mad vision goes mainly to Marty Tenenbaum: – AI Meets Web 2.0 (2006) – Shrager, Tenenbaum, Travers, Cancer Commons: Biomedicine in the Internet Age (2011)
  • 45. What does all that have to do with Open Science? • Open Science is lowering barriers to collaboration • So it’s a necessary but not sufficient step towards this new kind of science • CollabRx may just have been too early: – the groundwork hasn’t been laid yet, – we are still working on basics – (eg standards for representation) • Reducing friction (or transaction costs) can be incredibly important
  • 46. “Changing the cost of innovation fundamentally changes the nature of innovation” – Joichi Ito TCP, HTTP etc are the containerization of data. So what’s the analog for scientific knowledge?
  • 48. A mix of technical, institutional, and legal standardization: -Standard licenses (parameterizable) -RDF representation for licenses. -Web Tools to generate these -Sites that collect and “market” available materials.
  • 49. BioBike, a platform for open science • Conceived of as a vehicle for getting biologists to do their own knowledge- based biocomputing. • Lisp + Frame system + Bioinformatics Tools – Through-the-web programmability – Community sharing of code and data – Visual Programming Language • Open Source • Jeff Elhai, Arnaud Taton, J. P. Massar, John K. Myers, Michael Travers, Johnny Casey, Mark Slupesky, Jeff Shrager. BioBIKE: A Web-based, programmable, integrated biological knowledge base. Nucleic Acids Research, 2009
  • 50.
  • 51.
  • 52. BioBike and Open Science • BioBike wasn’t for Open Science per se • But it did explore some ideas in web- based biocomputation • The next-generation BioBike platform: – Data: Big data, Open data, semantic web integrated – Programming: Able to deal with large scale and distributed workflows with human elements – Collaboration: Integrating different communities in a “trading zone” KnowOS: The (Re)Birth of the Knowledge Operating System. Mike Travers, JP Massar, and Jeff Shrager, International Lisp Conference 2005
  • 53. What is a platform? • The economic meaning of “platform” is interesting • Something that: – Supports two-sided network effects – Stands in the middle and extracts a toll • Examples: – Credit cards (merchants ↔ consumers) – Operating systems (application developers ↔ users) • Science has more complicated networks and relations – Data providers – Data consumers – Service providers – Analysts (statisticians, eg) – Patients • A science platform is not going to make anyone rich like Facebook, but it would be nice to have a powerful and standard way for all these groups to collaborate.
  • 54. Open Data is outstripping analysis capacity • Or in other words: – data is cheap, – attention, knowledge, & expertise are expensive • A platform for collaborative computational interpretation of biological data • To better leverage the expensive resources
  • 55. identifies advancing new computational infrastructure as a priority for driving innovation in science and engineering. Scientific discovery and innovation are advancing along fundamentally new pathways opened by the development of increasingly sophisticated software. the overarching goal of transforming innovations in research and education into sustained software resources that are an integral part of the cyberinfrastructure
  • 56. Anti-open arguments • Peer-review is an essential filter; without it too much nonsense gets out • Electronic availability of articles actually leads to narrowing of science (Evans, 2008) • Privacy, HIPAA, etc. • Need to retain IP for economic motivation • The problem isn’t availability of data; it’s making sense of what we do have • See PRISM for more
  • 57. Opener Science • Science is already pretty open! • institutions of openness played a role in the foundation of science, including the first scientific journals
  • 58. Historical Origins of Open Science • Before the invention of science, knowledge of the natural world was closely guarded, passed down from master to apprentice. • The development of institutions of openness was a key factor in the scientific revolution (Paul David, Stanford economist) • …and the printing press was a key factor in that.
  • 59. So… • The printing press is almost 600 years old • The scientific journal is almost 350 years old • There’s been some advancement in communication technology since then… • Science will eventually change: – Either a modest acceleration of the scientific process, – Or as significant and discontinuous as the first scientific revolution • Which one? An open question.
  • 61. End

Editor's Notes

  1. An OA journal is just a joiurnal…
  2. Datacite: working with data centres to assign persistent identifiers to datasets, we are developing an infrastructure that supports simple and effective methods of data citation, discovery, and access.
  3. Artificial artificila intelligence