SlideShare a Scribd company logo
1 of 42
Download to read offline
The end of
the scientific paper
as we know it
(in 4 easy steps)
Frank van Harmelen
(+ Paul Groth)
VU Amsterdam
Reports on
the death of
the scientific paper
have been greatly
exaggerated
Frank van Harmelen
(+ Paul Groth)
VU Amsterdam
And how the Semantic Web
makes it possible
Semsci 2017 workshop
• 7/10 papers about data
• 3/10 are about papers
and they are about papers written by&for people
Thanks (in order of appearance) to:
• Paul Groth
• Tobias Kuhn
• Jan Velterop
• Barend Mons
• Anita de Waard
• Carole Goble
Scientific publishing hasn’t changed
in 350 years
• Letter from Christian Huygens (1652)
• Writing to his prof in Mathematics
• Citing (and complaining about)
work of Descartes
• One of 3000 letters by Huygens
2017: Only superficial changes
• Different format & style
• Different medium
(Web, PDF)
• Different speed
(PubMed = 2 papers/min)
Section 1: Related work
Section 2: Research question
Section 3: Experimental design
Section 4: Experimental findings
Section 5: Interpretation, conclusions
And our papers still follow
this storyline:
Step 1: Study & interpret literature
Step 2: Formulate hypothesis
Step 3: Design experiment
Step 4: Execute experiment
Step 5: Publish results
This storyline is important,
but only readable by people,
not for machines
How to make our papers more usable?
“We only need information extraction
because we first did information burial” (Barend Mons)
“A journal paper
is a state-funeral
for your results”
(Hans Akkermans)
Step 1: explicit rhetorical structure
Capture the roles of blocks of text &
make these roles explicit
1 paper = 1 Network of blocks
N papers = 1 Network of blocks
Results Results
Interpretati
ons
Interpretati
ons
Conclusio
ns
Problem
Method
Results
Interpretati
ons
Conclusio
ns
Problem
Method
One paper Another paper
Step 2: explicit fine-grained
rhetorical structure
Locate individual knowledge items
and their relationships
Example: Scholonto, ClaiMaker [Buckinham-Shum]
Paper = set of claims
Claim = text – relation – text
Relation = causes, predicts, prevents; addresses, solves
equals, is-similar-to; proofs, supports, challenges
1 paper = 1 fine-grained network of relations
N papers = 1 fine-grained network of relations
Step 3: do away with the paper altogether.
• Any fact is a relation between two things (“triple”)
• Count each fact as a nano-publication
• Together, these nano-publications form a
huge very fine-grained network of relations,
a web of knowledge,
a “semantic web”
• Computers as colleagues,
not (only) tools
Just publish the facts
What is a Nanopublication
“A nanopublication is the smallest unit of
publishable information: an assertion about
anything that can be uniquely identified and
attributed to its author”
http://nanopub.org
Step 4: turning context into a
1st class citizen
• Link to all the stuff that goes on before publication:
– Datasets, workflows
– Open Lab books
– Open peer reviewing
• Link to all the stuff that goes on after publication:
– Websites
– Blogs
– Emails
– Tweets
– Give web-addresses to objects (URIs)
– Use the web to link between the objects
– Provide meaning in a form that computers can handle (RDF)
These principles embodied
in already deployed technology
We can build this using
semantic web technology
So now we have…
No longer a set of
disconnected monolithic PDFs
A network of facts, reviews,
evidence, opinions, data
The story so far…
• Publishing hasn't changed for 300+ years
• The structure and format of our papers
is still based on this
• Deconstruct the scientific paper
– from monolithic block of text
– to a network of computer readable facts & context
• All of this made possible by the semantic web
But…….
Pragmatic infeasiblility
Pragmatic infeasiblility
Previous experiments in formalising (social) science
turned out to be very hard:
• Hannan and Freeman's theory of organizational inertia
in first-order logic
American Sociological Review 59(4):571-593 · August
1994
• Caroll & Hannan’s resource portioning theory
in first order logic
Computational & Mathematical Organization Theory 7,
87–111, 2001.
Pragmatic (in)feasiblility
Many sciences are quantitave,
but I guess this is still possible in RDF + MathML:
Pragmatic infeasiblility
Science is a social activity, which includes persuasion,
rhetorics, deliberate ambiguity, etc.
Issue #3: hedging
s
CACM, Vol. 22, No. 5, May 1979
“A proof doesn't settle a mathematical argument.
Contrary to what its name suggests,
a proof is only one step in the direction of confidence.
We believe that, in the end,
it is a social process that determines whether
mathematicians feel confident about a theorem.
Thomas, J., The Axiom of Choice, North-Holland, Amsterdam, 1973
(a historical review of independence results in set theory)
Technical infeasibility: Scalability
Scalability
#statements/year =
#statements/nanopub x #nanopubs/paper x #papers/year
= 30 x N x 1.5M = N x 45M/yr
Let’s hope N ≈ O(10)….
Technical infeasibility: expressivity
• RDF hopelessly simple
• Needs at least DL:
“Mosquito’s transmit malaria“
All? no.
Some? yes.
Only? probably.
transmit. Malaria  Mosquitos
Many? Most?
• Beyond DL:
Probabilities, fuzziness, inconsistencies
Technical (in)feasibility:
Argumentation graphs
Escilatopram does not inhibit CYP2D6”
Micropublications, Clark, Ciccarese, Goble, 2013
Technical (in)feasibility:
Argumentation graphs
Argumentation graphs require:
• Defeasible logic
• Modal logic
• Higher-order logic
• ….
at scale of 450M statement/yr 
Should we give up on computers
as scientific colleagues?
• A more modest role for nano-publications?
– Annotations of datasets?
– Very approximate annotations of papers?
• Make them speak our language
instead of us speaking theirs?

More Related Content

What's hot

Once upon a time in Datatown ...
Once upon a time in Datatown ...Once upon a time in Datatown ...
Once upon a time in Datatown ...srazniewski
 
JHU Data Science MOOCs - Behind the Scenes
JHU Data Science MOOCs - Behind the ScenesJHU Data Science MOOCs - Behind the Scenes
JHU Data Science MOOCs - Behind the Scenesjtleek
 
Shebanq roma-2013-10-01
Shebanq roma-2013-10-01Shebanq roma-2013-10-01
Shebanq roma-2013-10-01Dirk Roorda
 
The Semantic Web: 2010 Update
The Semantic Web: 2010 Update The Semantic Web: 2010 Update
The Semantic Web: 2010 Update James Hendler
 
What the Adoption of schema.org Tells about Linked Open Data
What the Adoption of schema.org Tells about Linked Open DataWhat the Adoption of schema.org Tells about Linked Open Data
What the Adoption of schema.org Tells about Linked Open DataHeiko Paulheim
 
Recommending Items in Social Tagging Systems Using Tag and Time Information
Recommending Items in Social Tagging Systems Using Tag and Time InformationRecommending Items in Social Tagging Systems Using Tag and Time Information
Recommending Items in Social Tagging Systems Using Tag and Time InformationChristoph Trattner
 
Data Science Education at JHSPH
Data Science Education at JHSPHData Science Education at JHSPH
Data Science Education at JHSPHjtleek
 
Network analysis: People and open source communities
Network analysis: People and open source communitiesNetwork analysis: People and open source communities
Network analysis: People and open source communitiesDawn Foster
 
Network Relationships and Job Changes of Software Developers at Sunbelt 2016
Network Relationships and Job Changes of Software Developers at Sunbelt 2016Network Relationships and Job Changes of Software Developers at Sunbelt 2016
Network Relationships and Job Changes of Software Developers at Sunbelt 2016Dawn Foster
 
OpenML Tutorial: Networked Science in Machine Learning
OpenML Tutorial: Networked Science in Machine LearningOpenML Tutorial: Networked Science in Machine Learning
OpenML Tutorial: Networked Science in Machine LearningJoaquin Vanschoren
 
Network Analysis: Tech Evangelism London Meetup
Network Analysis: Tech Evangelism London MeetupNetwork Analysis: Tech Evangelism London Meetup
Network Analysis: Tech Evangelism London MeetupDawn Foster
 
Interlinking Data and Knowledge in Enterprises, Research and Society with Lin...
Interlinking Data and Knowledge in Enterprises, Research and Society with Lin...Interlinking Data and Knowledge in Enterprises, Research and Society with Lin...
Interlinking Data and Knowledge in Enterprises, Research and Society with Lin...Christoph Lange
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataMatthew Rowe
 
Semantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsSemantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsTobias Kuhn
 
Data and Donuts: Data organization
Data and Donuts: Data organizationData and Donuts: Data organization
Data and Donuts: Data organizationC. Tobin Magle
 
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...Dawn Foster
 
Context is King: On Semantic Publishing
Context is King: On Semantic PublishingContext is King: On Semantic Publishing
Context is King: On Semantic PublishingStefan Gradmann
 
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
Webometrics 1.0 - from AltaVista to Small Worlds and Genre DriftWebometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Driftguest5ec99a
 

What's hot (20)

Once upon a time in Datatown ...
Once upon a time in Datatown ...Once upon a time in Datatown ...
Once upon a time in Datatown ...
 
JHU Data Science MOOCs - Behind the Scenes
JHU Data Science MOOCs - Behind the ScenesJHU Data Science MOOCs - Behind the Scenes
JHU Data Science MOOCs - Behind the Scenes
 
Shebanq roma-2013-10-01
Shebanq roma-2013-10-01Shebanq roma-2013-10-01
Shebanq roma-2013-10-01
 
The Semantic Web: 2010 Update
The Semantic Web: 2010 Update The Semantic Web: 2010 Update
The Semantic Web: 2010 Update
 
What the Adoption of schema.org Tells about Linked Open Data
What the Adoption of schema.org Tells about Linked Open DataWhat the Adoption of schema.org Tells about Linked Open Data
What the Adoption of schema.org Tells about Linked Open Data
 
Recommending Items in Social Tagging Systems Using Tag and Time Information
Recommending Items in Social Tagging Systems Using Tag and Time InformationRecommending Items in Social Tagging Systems Using Tag and Time Information
Recommending Items in Social Tagging Systems Using Tag and Time Information
 
Data Science Education at JHSPH
Data Science Education at JHSPHData Science Education at JHSPH
Data Science Education at JHSPH
 
Network analysis: People and open source communities
Network analysis: People and open source communitiesNetwork analysis: People and open source communities
Network analysis: People and open source communities
 
Network Relationships and Job Changes of Software Developers at Sunbelt 2016
Network Relationships and Job Changes of Software Developers at Sunbelt 2016Network Relationships and Job Changes of Software Developers at Sunbelt 2016
Network Relationships and Job Changes of Software Developers at Sunbelt 2016
 
Murpha11
Murpha11Murpha11
Murpha11
 
How to search a network
How to search a networkHow to search a network
How to search a network
 
OpenML Tutorial: Networked Science in Machine Learning
OpenML Tutorial: Networked Science in Machine LearningOpenML Tutorial: Networked Science in Machine Learning
OpenML Tutorial: Networked Science in Machine Learning
 
Network Analysis: Tech Evangelism London Meetup
Network Analysis: Tech Evangelism London MeetupNetwork Analysis: Tech Evangelism London Meetup
Network Analysis: Tech Evangelism London Meetup
 
Interlinking Data and Knowledge in Enterprises, Research and Society with Lin...
Interlinking Data and Knowledge in Enterprises, Research and Society with Lin...Interlinking Data and Knowledge in Enterprises, Research and Society with Lin...
Interlinking Data and Knowledge in Enterprises, Research and Society with Lin...
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic Data
 
Semantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsSemantic Publishing and Nanopublications
Semantic Publishing and Nanopublications
 
Data and Donuts: Data organization
Data and Donuts: Data organizationData and Donuts: Data organization
Data and Donuts: Data organization
 
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...
 
Context is King: On Semantic Publishing
Context is King: On Semantic PublishingContext is King: On Semantic Publishing
Context is King: On Semantic Publishing
 
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
Webometrics 1.0 - from AltaVista to Small Worlds and Genre DriftWebometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
 

Similar to The end of the scientific paper as we know it (or not...)

The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)Frank van Harmelen
 
Studying archives of online behavior
Studying archives of online behaviorStudying archives of online behavior
Studying archives of online behaviorJames Howison
 
Algorithms - Jeff Erickson.pdf
Algorithms - Jeff Erickson.pdfAlgorithms - Jeff Erickson.pdf
Algorithms - Jeff Erickson.pdfHannah Baker
 
The culture of researchData
The culture of researchData The culture of researchData
The culture of researchData TheContentMine
 
The Culture of Research Data, by Peter Murray-Rust
The Culture of Research Data, by Peter Murray-RustThe Culture of Research Data, by Peter Murray-Rust
The Culture of Research Data, by Peter Murray-RustLEARN Project
 
How to Execute A Research Paper
How to Execute A Research PaperHow to Execute A Research Paper
How to Execute A Research PaperAnita de Waard
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchDatapetermurrayrust
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTAMartin Wynne
 
jon-on reasearch.ppt
jon-on reasearch.pptjon-on reasearch.ppt
jon-on reasearch.pptSumit Roy
 
Digital scholarship - all day workshop
Digital scholarship - all day workshopDigital scholarship - all day workshop
Digital scholarship - all day workshopMartin Weller
 
As thurston says (prague)
As thurston says (prague)As thurston says (prague)
As thurston says (prague)Brendan Larvor
 
As thurston says (prague)
As thurston says (prague)As thurston says (prague)
As thurston says (prague)Brendan Larvor
 
Norway talk #1 dual level theory ppt
Norway talk #1 dual level theory pptNorway talk #1 dual level theory ppt
Norway talk #1 dual level theory pptdjleu
 
Alec Fisher-The Logic Of Real Arguments-Cambridge University Press (2004).Pdf
Alec Fisher-The Logic Of Real Arguments-Cambridge University Press (2004).PdfAlec Fisher-The Logic Of Real Arguments-Cambridge University Press (2004).Pdf
Alec Fisher-The Logic Of Real Arguments-Cambridge University Press (2004).PdfTodd Turner
 
Being an Open Scholar in a Connected World
Being an Open Scholar in a Connected WorldBeing an Open Scholar in a Connected World
Being an Open Scholar in a Connected WorldStian Håklev
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataShenghui Wang
 
OntoMath digital ecosystem
OntoMath digital ecosystemOntoMath digital ecosystem
OntoMath digital ecosystemAlik Kirillovich
 
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...Wouter Beek
 
I want to know more about compuerized text analysis
I want to know more about   compuerized text analysisI want to know more about   compuerized text analysis
I want to know more about compuerized text analysisLuke Czarnecki
 

Similar to The end of the scientific paper as we know it (or not...) (20)

The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)
 
Ngsp
NgspNgsp
Ngsp
 
Studying archives of online behavior
Studying archives of online behaviorStudying archives of online behavior
Studying archives of online behavior
 
Algorithms - Jeff Erickson.pdf
Algorithms - Jeff Erickson.pdfAlgorithms - Jeff Erickson.pdf
Algorithms - Jeff Erickson.pdf
 
The culture of researchData
The culture of researchData The culture of researchData
The culture of researchData
 
The Culture of Research Data, by Peter Murray-Rust
The Culture of Research Data, by Peter Murray-RustThe Culture of Research Data, by Peter Murray-Rust
The Culture of Research Data, by Peter Murray-Rust
 
How to Execute A Research Paper
How to Execute A Research PaperHow to Execute A Research Paper
How to Execute A Research Paper
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchData
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTA
 
jon-on reasearch.ppt
jon-on reasearch.pptjon-on reasearch.ppt
jon-on reasearch.ppt
 
Digital scholarship - all day workshop
Digital scholarship - all day workshopDigital scholarship - all day workshop
Digital scholarship - all day workshop
 
As thurston says (prague)
As thurston says (prague)As thurston says (prague)
As thurston says (prague)
 
As thurston says (prague)
As thurston says (prague)As thurston says (prague)
As thurston says (prague)
 
Norway talk #1 dual level theory ppt
Norway talk #1 dual level theory pptNorway talk #1 dual level theory ppt
Norway talk #1 dual level theory ppt
 
Alec Fisher-The Logic Of Real Arguments-Cambridge University Press (2004).Pdf
Alec Fisher-The Logic Of Real Arguments-Cambridge University Press (2004).PdfAlec Fisher-The Logic Of Real Arguments-Cambridge University Press (2004).Pdf
Alec Fisher-The Logic Of Real Arguments-Cambridge University Press (2004).Pdf
 
Being an Open Scholar in a Connected World
Being an Open Scholar in a Connected WorldBeing an Open Scholar in a Connected World
Being an Open Scholar in a Connected World
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadata
 
OntoMath digital ecosystem
OntoMath digital ecosystemOntoMath digital ecosystem
OntoMath digital ecosystem
 
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
 
I want to know more about compuerized text analysis
I want to know more about   compuerized text analysisI want to know more about   compuerized text analysis
I want to know more about compuerized text analysis
 

More from Frank van Harmelen

The K in "neuro-symbolic" stands for "knowledge"
The K in "neuro-symbolic" stands for "knowledge"The K in "neuro-symbolic" stands for "knowledge"
The K in "neuro-symbolic" stands for "knowledge"Frank van Harmelen
 
Adoption of Knowledge Graphs, mid 2022 (incomplete)
Adoption of Knowledge Graphs, mid 2022 (incomplete)Adoption of Knowledge Graphs, mid 2022 (incomplete)
Adoption of Knowledge Graphs, mid 2022 (incomplete)Frank van Harmelen
 
Modular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyModular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyFrank van Harmelen
 
Adoption of Knowledge Graphs, late 2019
Adoption of Knowledge Graphs, late 2019Adoption of Knowledge Graphs, late 2019
Adoption of Knowledge Graphs, late 2019Frank van Harmelen
 
Adoption of Knowledge Graphs, mid 2019
Adoption of Knowledge Graphs, mid 2019Adoption of Knowledge Graphs, mid 2019
Adoption of Knowledge Graphs, mid 2019Frank van Harmelen
 
On the nature of AI, and the relation between symbolic and statistical approa...
On the nature of AI, and the relation between symbolic and statistical approa...On the nature of AI, and the relation between symbolic and statistical approa...
On the nature of AI, and the relation between symbolic and statistical approa...Frank van Harmelen
 
Linked Open Data for Medical Guidelines Interactions
Linked Open Data for Medical  Guidelines InteractionsLinked Open Data for Medical  Guidelines Interactions
Linked Open Data for Medical Guidelines InteractionsFrank van Harmelen
 
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...Frank van Harmelen
 
How the Web can change social science research (including yours)
How the Web can change social science research (including yours)How the Web can change social science research (including yours)
How the Web can change social science research (including yours)Frank van Harmelen
 
4 Popular Fallacies about the Semantic Web
4 Popular Fallacies about the Semantic Web4 Popular Fallacies about the Semantic Web
4 Popular Fallacies about the Semantic WebFrank van Harmelen
 
Semantic Web research anno 2006:main streams, popular falacies, current statu...
Semantic Web research anno 2006:main streams, popular falacies, current statu...Semantic Web research anno 2006:main streams, popular falacies, current statu...
Semantic Web research anno 2006:main streams, popular falacies, current statu...Frank van Harmelen
 
Ontology mapping needs context & approximation
Ontology mapping needs context & approximationOntology mapping needs context & approximation
Ontology mapping needs context & approximationFrank van Harmelen
 
Ontology Mapping - Out Of The Babel Tower
Ontology Mapping - Out Of The Babel TowerOntology Mapping - Out Of The Babel Tower
Ontology Mapping - Out Of The Babel TowerFrank van Harmelen
 
LarKC: the large knowledge collider
LarKC: the large knowledge colliderLarKC: the large knowledge collider
LarKC: the large knowledge colliderFrank van Harmelen
 

More from Frank van Harmelen (20)

The K in "neuro-symbolic" stands for "knowledge"
The K in "neuro-symbolic" stands for "knowledge"The K in "neuro-symbolic" stands for "knowledge"
The K in "neuro-symbolic" stands for "knowledge"
 
Adoption of Knowledge Graphs, mid 2022 (incomplete)
Adoption of Knowledge Graphs, mid 2022 (incomplete)Adoption of Knowledge Graphs, mid 2022 (incomplete)
Adoption of Knowledge Graphs, mid 2022 (incomplete)
 
Modular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyModular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxology
 
Adoption of Knowledge Graphs, late 2019
Adoption of Knowledge Graphs, late 2019Adoption of Knowledge Graphs, late 2019
Adoption of Knowledge Graphs, late 2019
 
Adoption of Knowledge Graphs, mid 2019
Adoption of Knowledge Graphs, mid 2019Adoption of Knowledge Graphs, mid 2019
Adoption of Knowledge Graphs, mid 2019
 
Empirical Semantics
Empirical SemanticsEmpirical Semantics
Empirical Semantics
 
On the nature of AI, and the relation between symbolic and statistical approa...
On the nature of AI, and the relation between symbolic and statistical approa...On the nature of AI, and the relation between symbolic and statistical approa...
On the nature of AI, and the relation between symbolic and statistical approa...
 
Linked Open Data for Medical Guidelines Interactions
Linked Open Data for Medical  Guidelines InteractionsLinked Open Data for Medical  Guidelines Interactions
Linked Open Data for Medical Guidelines Interactions
 
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
 
How the Web can change social science research (including yours)
How the Web can change social science research (including yours)How the Web can change social science research (including yours)
How the Web can change social science research (including yours)
 
4 Popular Fallacies about the Semantic Web
4 Popular Fallacies about the Semantic Web4 Popular Fallacies about the Semantic Web
4 Popular Fallacies about the Semantic Web
 
WCIT2010
WCIT2010WCIT2010
WCIT2010
 
Het slimme Web 3.0
Het slimme Web 3.0Het slimme Web 3.0
Het slimme Web 3.0
 
OWL briefing
OWL briefingOWL briefing
OWL briefing
 
RDF briefing
RDF briefingRDF briefing
RDF briefing
 
Semantic Web research anno 2006:main streams, popular falacies, current statu...
Semantic Web research anno 2006:main streams, popular falacies, current statu...Semantic Web research anno 2006:main streams, popular falacies, current statu...
Semantic Web research anno 2006:main streams, popular falacies, current statu...
 
Ontology mapping needs context & approximation
Ontology mapping needs context & approximationOntology mapping needs context & approximation
Ontology mapping needs context & approximation
 
Ontology Mapping - Out Of The Babel Tower
Ontology Mapping - Out Of The Babel TowerOntology Mapping - Out Of The Babel Tower
Ontology Mapping - Out Of The Babel Tower
 
Where Does It Break?
Where Does It Break?Where Does It Break?
Where Does It Break?
 
LarKC: the large knowledge collider
LarKC: the large knowledge colliderLarKC: the large knowledge collider
LarKC: the large knowledge collider
 

Recently uploaded

Pests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPirithiRaju
 
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's SurvivalHarry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survivalkevin8smith
 
Loudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxLoudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxpriyankatabhane
 
Food_safety_Management_pptx.pptx in microbiology
Food_safety_Management_pptx.pptx in microbiologyFood_safety_Management_pptx.pptx in microbiology
Food_safety_Management_pptx.pptx in microbiologyHemantThakare8
 
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...Chayanika Das
 
AICTE activity on Water Conservation spreading awareness
AICTE activity on Water Conservation spreading awarenessAICTE activity on Water Conservation spreading awareness
AICTE activity on Water Conservation spreading awareness1hk20is002
 
Understanding Nutrition, 16th Edition pdf
Understanding Nutrition, 16th Edition pdfUnderstanding Nutrition, 16th Edition pdf
Understanding Nutrition, 16th Edition pdfHabibouKarbo
 
Environment modelling and its environmental aspects
Environment modelling and its environmental aspectsEnvironment modelling and its environmental aspects
Environment modelling and its environmental aspectsMansi Rastogi
 
Unit-V-Introduction to Data Mining.pptx
Unit-V-Introduction to  Data Mining.pptxUnit-V-Introduction to  Data Mining.pptx
Unit-V-Introduction to Data Mining.pptxHarsha Patel
 
Advances in AI-driven Image Recognition for Early Detection of Cancer
Advances in AI-driven Image Recognition for Early Detection of CancerAdvances in AI-driven Image Recognition for Early Detection of Cancer
Advances in AI-driven Image Recognition for Early Detection of CancerLuis Miguel Chong Chong
 
Introduction of Organ-On-A-Chip - Creative Biolabs
Introduction of Organ-On-A-Chip - Creative BiolabsIntroduction of Organ-On-A-Chip - Creative Biolabs
Introduction of Organ-On-A-Chip - Creative BiolabsCreative-Biolabs
 
Science (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsScience (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsDobusch Leonhard
 
Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsTotal Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsMarkus Roggen
 
BACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
BACTERIAL SECRETION SYSTEM by Dr. Chayanika DasBACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
BACTERIAL SECRETION SYSTEM by Dr. Chayanika DasChayanika Das
 
Think Science: What Are Eclipses (101), by Craig Bobchin
Think Science: What Are Eclipses (101), by Craig BobchinThink Science: What Are Eclipses (101), by Craig Bobchin
Think Science: What Are Eclipses (101), by Craig BobchinNathan Cone
 
Interpreting SDSS extragalactic data in the era of JWST
Interpreting SDSS extragalactic data in the era of JWSTInterpreting SDSS extragalactic data in the era of JWST
Interpreting SDSS extragalactic data in the era of JWSTAlexander F. Mayer
 
Timeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological CorrelationsTimeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological CorrelationsDanielBaumann11
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPRPirithiRaju
 

Recently uploaded (20)

Pests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPR
 
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's SurvivalHarry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
 
Loudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxLoudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptx
 
Food_safety_Management_pptx.pptx in microbiology
Food_safety_Management_pptx.pptx in microbiologyFood_safety_Management_pptx.pptx in microbiology
Food_safety_Management_pptx.pptx in microbiology
 
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
 
AICTE activity on Water Conservation spreading awareness
AICTE activity on Water Conservation spreading awarenessAICTE activity on Water Conservation spreading awareness
AICTE activity on Water Conservation spreading awareness
 
Understanding Nutrition, 16th Edition pdf
Understanding Nutrition, 16th Edition pdfUnderstanding Nutrition, 16th Edition pdf
Understanding Nutrition, 16th Edition pdf
 
Environment modelling and its environmental aspects
Environment modelling and its environmental aspectsEnvironment modelling and its environmental aspects
Environment modelling and its environmental aspects
 
Unit-V-Introduction to Data Mining.pptx
Unit-V-Introduction to  Data Mining.pptxUnit-V-Introduction to  Data Mining.pptx
Unit-V-Introduction to Data Mining.pptx
 
Advances in AI-driven Image Recognition for Early Detection of Cancer
Advances in AI-driven Image Recognition for Early Detection of CancerAdvances in AI-driven Image Recognition for Early Detection of Cancer
Advances in AI-driven Image Recognition for Early Detection of Cancer
 
Introduction of Organ-On-A-Chip - Creative Biolabs
Introduction of Organ-On-A-Chip - Creative BiolabsIntroduction of Organ-On-A-Chip - Creative Biolabs
Introduction of Organ-On-A-Chip - Creative Biolabs
 
Science (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsScience (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and Pitfalls
 
Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsTotal Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
 
BACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
BACTERIAL SECRETION SYSTEM by Dr. Chayanika DasBACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
BACTERIAL SECRETION SYSTEM by Dr. Chayanika Das
 
Think Science: What Are Eclipses (101), by Craig Bobchin
Think Science: What Are Eclipses (101), by Craig BobchinThink Science: What Are Eclipses (101), by Craig Bobchin
Think Science: What Are Eclipses (101), by Craig Bobchin
 
Interpreting SDSS extragalactic data in the era of JWST
Interpreting SDSS extragalactic data in the era of JWSTInterpreting SDSS extragalactic data in the era of JWST
Interpreting SDSS extragalactic data in the era of JWST
 
Bioenergetics and the role of ATP to drive the beats of life.
Bioenergetics and the role of ATP to drive the beats of life.Bioenergetics and the role of ATP to drive the beats of life.
Bioenergetics and the role of ATP to drive the beats of life.
 
Timeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological CorrelationsTimeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
Timeless Cosmology: Towards a Geometric Origin of Cosmological Correlations
 
PLASMODIUM. PPTX
PLASMODIUM. PPTXPLASMODIUM. PPTX
PLASMODIUM. PPTX
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
 

The end of the scientific paper as we know it (or not...)

  • 1. The end of the scientific paper as we know it (in 4 easy steps) Frank van Harmelen (+ Paul Groth) VU Amsterdam
  • 2. Reports on the death of the scientific paper have been greatly exaggerated Frank van Harmelen (+ Paul Groth) VU Amsterdam And how the Semantic Web makes it possible
  • 3. Semsci 2017 workshop • 7/10 papers about data • 3/10 are about papers and they are about papers written by&for people Thanks (in order of appearance) to: • Paul Groth • Tobias Kuhn • Jan Velterop • Barend Mons • Anita de Waard • Carole Goble
  • 4. Scientific publishing hasn’t changed in 350 years • Letter from Christian Huygens (1652) • Writing to his prof in Mathematics • Citing (and complaining about) work of Descartes • One of 3000 letters by Huygens
  • 5. 2017: Only superficial changes • Different format & style • Different medium (Web, PDF) • Different speed (PubMed = 2 papers/min)
  • 6. Section 1: Related work Section 2: Research question Section 3: Experimental design Section 4: Experimental findings Section 5: Interpretation, conclusions And our papers still follow this storyline: Step 1: Study & interpret literature Step 2: Formulate hypothesis Step 3: Design experiment Step 4: Execute experiment Step 5: Publish results This storyline is important, but only readable by people, not for machines
  • 7. How to make our papers more usable? “We only need information extraction because we first did information burial” (Barend Mons) “A journal paper is a state-funeral for your results” (Hans Akkermans)
  • 8.
  • 9.
  • 10. Step 1: explicit rhetorical structure Capture the roles of blocks of text & make these roles explicit 1 paper = 1 Network of blocks N papers = 1 Network of blocks Results Results Interpretati ons Interpretati ons Conclusio ns Problem Method Results Interpretati ons Conclusio ns Problem Method One paper Another paper
  • 11. Step 2: explicit fine-grained rhetorical structure Locate individual knowledge items and their relationships Example: Scholonto, ClaiMaker [Buckinham-Shum] Paper = set of claims Claim = text – relation – text Relation = causes, predicts, prevents; addresses, solves equals, is-similar-to; proofs, supports, challenges 1 paper = 1 fine-grained network of relations N papers = 1 fine-grained network of relations
  • 12.
  • 13. Step 3: do away with the paper altogether. • Any fact is a relation between two things (“triple”) • Count each fact as a nano-publication • Together, these nano-publications form a huge very fine-grained network of relations, a web of knowledge, a “semantic web” • Computers as colleagues, not (only) tools Just publish the facts
  • 14. What is a Nanopublication “A nanopublication is the smallest unit of publishable information: an assertion about anything that can be uniquely identified and attributed to its author” http://nanopub.org
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. Step 4: turning context into a 1st class citizen • Link to all the stuff that goes on before publication: – Datasets, workflows – Open Lab books – Open peer reviewing • Link to all the stuff that goes on after publication: – Websites – Blogs – Emails – Tweets
  • 21. – Give web-addresses to objects (URIs) – Use the web to link between the objects – Provide meaning in a form that computers can handle (RDF) These principles embodied in already deployed technology We can build this using semantic web technology
  • 22. So now we have… No longer a set of disconnected monolithic PDFs A network of facts, reviews, evidence, opinions, data
  • 23.
  • 24.
  • 25.
  • 26. The story so far… • Publishing hasn't changed for 300+ years • The structure and format of our papers is still based on this • Deconstruct the scientific paper – from monolithic block of text – to a network of computer readable facts & context • All of this made possible by the semantic web
  • 29. Pragmatic infeasiblility Previous experiments in formalising (social) science turned out to be very hard: • Hannan and Freeman's theory of organizational inertia in first-order logic American Sociological Review 59(4):571-593 · August 1994 • Caroll & Hannan’s resource portioning theory in first order logic Computational & Mathematical Organization Theory 7, 87–111, 2001.
  • 30. Pragmatic (in)feasiblility Many sciences are quantitave, but I guess this is still possible in RDF + MathML:
  • 31. Pragmatic infeasiblility Science is a social activity, which includes persuasion, rhetorics, deliberate ambiguity, etc.
  • 32.
  • 33.
  • 35. s
  • 36. CACM, Vol. 22, No. 5, May 1979 “A proof doesn't settle a mathematical argument. Contrary to what its name suggests, a proof is only one step in the direction of confidence. We believe that, in the end, it is a social process that determines whether mathematicians feel confident about a theorem.
  • 37. Thomas, J., The Axiom of Choice, North-Holland, Amsterdam, 1973 (a historical review of independence results in set theory)
  • 38. Technical infeasibility: Scalability Scalability #statements/year = #statements/nanopub x #nanopubs/paper x #papers/year = 30 x N x 1.5M = N x 45M/yr Let’s hope N ≈ O(10)….
  • 39. Technical infeasibility: expressivity • RDF hopelessly simple • Needs at least DL: “Mosquito’s transmit malaria“ All? no. Some? yes. Only? probably. transmit. Malaria  Mosquitos Many? Most? • Beyond DL: Probabilities, fuzziness, inconsistencies
  • 40. Technical (in)feasibility: Argumentation graphs Escilatopram does not inhibit CYP2D6” Micropublications, Clark, Ciccarese, Goble, 2013
  • 41. Technical (in)feasibility: Argumentation graphs Argumentation graphs require: • Defeasible logic • Modal logic • Higher-order logic • …. at scale of 450M statement/yr 
  • 42. Should we give up on computers as scientific colleagues? • A more modest role for nano-publications? – Annotations of datasets? – Very approximate annotations of papers? • Make them speak our language instead of us speaking theirs?

Editor's Notes

  1. Use circular diagram?
  2. Move this to start of talk: this workshop does mostly before & after, not papers themselves.
  3. Overall conclusion: Publishing hasn't changed for 300+ years The storyline of our papers is still based on this Deconstruct the scientific paper from monolithic block to a network of facts & context All of this made possible by the semantic web Consequences for science mapping: Science maps will get better Science maps will be more needed