SlideShare a Scribd company logo
1 of 50
Towards an
Open Research Knowledge
Graph
Sören Auer
Gottfried Wilhelm Leibniz
* 21. Juni/ 1. Juli 1646 in Leipzig
† 14. November 1716 in Hannover
Namesake Member of
Library of
Namesake
Had to do some research on
serials…
5
Serials
Mail order catalogs
6
7
Mail order catalogs
8
9
10
Road Maps
11
Phone Books
How does it work today?
13
14
15
16
New means adapted to the new posibilities were developed, e.g. „zooming“,
dynamics
Business models changed completely
More focus on data, interlinking of data and services and search in the data
Integration, crowdsourcing play an important role
The World of Publishing & Communication
has profundely changed
What about Scholarly
Communication?
18
Scientific publishing in the 17th
century
One of the earliest research
journals: Philosophical Transactions of the
Royal Society
© CC BY Henry Oldenburg
19
Publishing in 1970s
20
Scientific publishing today
We have:
BUT
• Mainly based on PDF
• Is only partially machine-readable
• Does not preserve structure
• Does not allow embedding of semantics
• Does not facilitate interactivity/dynamicity/
repurposing
• …
21
Proliferation of scientific literature
Duplication and inefficiency
Deficiency of peer-review
Reproducibility crisis
Science is Seriously Flawed
22
Science and engineering articles by region, country: 2004 and 2014
Proliferation of scientific literature
National Science Foundation: Science and Engineering Publication Output Trends: https://www.nsf.gov/statistics/2018/nsf18300/nsf18300.pdf
23
1,500 scientists lift the lid on reproducibility
Monya Baker in Nature, 2016. 533 (7604): 452–454. doi:10.1038/533452a:
• 70% failed to reproduce at least one other scientist's experiment
• 50% failed to reproduce one of their own
experiments
Failure to reproduce results among disciplines
(in brackets own results):
• chemistry: 87% (64%),
• biology: 77% (60%),
• physics and engineering: 69% (51%),
• Earth sciences: 64% (41%).
Reproducibility Crisis
© Stanford Medicine - Stanford University
24
How can we avoid duplication if the terminology, research problems, approaches,
methods, characteristics, evaluations, … are not properly defined and identified?
How would you build an engine/building without properly defining their parts,
relationships, materials, characteristics … ?
Duplication and Inefficiency
25
Lack of:
• Transparency – information is hidden in text
• Integratability – fitting different research results together
• Machine assistance – unstructured content is hard to process
• Identifyability of concepts beyond metadata
• Collaboration – one brain barrier
• Overview – scientists look for the needle in the haystack
Root Cause - Deficiency of Scholarly
Communication?
How can we fix it?
26
27
Realizing Vannevar Bush‘s
vision of Memex
Linked Data Principles
1. Use URIs to identify the “things” in your data
2. Use http:// URIs so people (and machines) can look them up on the web
3. When a URI is looked up, return a description of the thing in the W3C
Resource Description Format (RDF)
4. Include links to related things
http://www.w3.org/DesignIssues/LinkedData.html
28
[1] Auer, Lehmann, Ngomo, Zaveri: Introduction to Linked Data and Its Lifecycle on the Web. Reasoning Web 2013
Page 29
1. Graph based RDF data model consisting of S-P-O statements (facts)
RDF & Linked Data in a Nutshell
NasigConf2018
dbpedia:Atlanta
09.06.2018
NASIG
conf:organizes
conf:starts
conf:takesPlaceIn
2. Serialised as RDF Triples:
NASIG conf:organizes NasigConf2018 .
NasigConf2018 conf:starts “2018-06-09”^^xsd:date .
NasigConf2018 conf:takesPlaceAt dbpedia:Atlanta .
3. Publication under URL in Web, Intranet, Extranet
Subject Predicate Object
Page 30
Creating Knowledge Graphs with RDF
Linked Data
located in
label
industry
headquarters
full nameDHL
Post Tower
162.5 m
Bonn
Logistics Logistik
DHL International GmbH
height
物流
label
Page 31
Graph consists of:
 Resources (identified via URIs)
 Literals: data values with data type (URI) or language (multilinguality integrated)
 Attributes of resources are also URI-identified (from vocabularies)
Various data sources and vocabularies can be arbitrarily mixed and meshed
URIs can be shortened with namespace prefixes; e.g. dbp: → http://dbpedia.org/resource/
RDF Data Model (a bit more technical)
gn:locatedIn
rdfs:label
dbo:industry
ex:headquarters
foaf:namedbp:DHL_International_GmbH
dbp:Post_Tower
"162.5"^^xsd:decimal
dbp:Bonn
dbp:Logistics
"Logistik"@de
"DHL International GmbH"^^xsd:string
ex:height
"物流"@zh
rdfs:label
rdf:value
unit:Meter
ex:unit
Page 32
• Fabric of concept, class, property, relationships, entity descriptions
• Uses a knowledge representation formalism
(typically RDF, RDF-Schema, OWL)
• Holistic knowledge (multi-domain, source, granularity):
• instance data (ground truth),
• open (e.g. DBpedia, WikiData), private (e.g. supply chain data),
closed data (product models),
• derived, aggregated data,
• schema data (vocabularies, ontologies)
• meta-data (e.g. provenance, versioning, documentation licensing)
• comprehensive taxonomies to categorize entities
• links between internal and external data
• mappings to data stored in other systems and databases
Knowledge Graphs – A definition
Smart Data for Machine
Learning
Page 33
Page 34
Search Engine Optimization & Web-Commerce
 Schema.org used by >20% of Web sites
 Major search engines exploit semantic descriptions
Pharma, Lifesciences
 Mature, comprehensive vocabularies and ontologies
 Billions of disease, drug, clinical trial descriptions
Digital Libraries
 Many established vocabularies (DublinCore, FRBR, EDM)
 Millions of aggregated from thousands of memory institutions in
Europeana, German Digital Library
Emerging Knowledge Graphs & Data Spaces
Paradigm Change in Scholarly Communication
Towards more Knowledge-based Information Flows
36
Paradigm Change in Scholarly Communication Knowledge-based
Information Flows in Science & Technology
Challenges: Digitalisation of Science, monopolisation by commercial actors,
Proliferation of publications, Reproducibility Crisis
37
Mathematics
• Definitions
• Theorems
• Proofs
• Methods
• …
Physics
• Experiments
• Data
• Models
• …
Chemistry
• Substances
• Structures
• Reactions
• …
Computer Science
• Concepts
• Implemen-
tations
• Evaluations
• …
Technology
• Standards
• Processes
• Elements
• Units,
Sensor data
Architecture
• Regulations
• Elements
• Models
• …
Open Research Knowledge Graph
Overarching Concepts
 Research problems
 Definitions
 Research approaches
 Methods
Artefacts
 Publications
 Data
 Software
 Image/Audio/Video
 Knowledge Graphs / Ontologies
Domain specific concepts
Open Research Knowledge
Graph makes comprehensive
and subject-specific concepts
clearly identifiable and links
them semantically (with
clearly described relations)
with each other and with
relevant further artifacts.
38
39
Search for CRISPR:
>4.000 Results
40
Chemistry Example: CRISPR/Cas Genome Editing
41
Semantic Representation using a Knowledge Graph
Author Robert Reed
Research Problem
Methods
Experimental Data
related Concepts
Genome editing in Lepidoptera
CRISPR/cas9
Lepidoptera; Genome editing; CRSIPR
https://doi.org/10.5281/zenodo.896916
A practial guide to CRISPR/cas9
editing in Lepidoptera
<https://doi.org/10.1101/130344>
Robert Reed
<https://orcid.org/0000-0002-
6065-6728>
Genome editing in
Lepidoptera
Experimental Data
https://doi.org/10.528
1/zenodo.896916
isAuthorOf
adresses
CRSPRS/cas9
isImplementedBy
isEvaluatedWith
Genome editing
<https://www.wikidata.or
g/wiki/Q24630389>
relatesConcept
3. Graph representation
2. Graph Curation Form
1. Original Publication
42
Automatic Generation of Comparisons/Surveys
43
Open Research Knowledge Graph
interlinks existing Services and Resources
44
Interlinking Article, Software, Video and
Graph resources describing the research
47
Advantages of knowledge based scholarly communication
 Clear identification of all relevant artifacts, concepts, attributes, relationships 
terminological and conceptual precision and sharpness, less ambiguity
 Better and explicit networking of all relevant artifacts and information sources 
traceability
 ORKG machine-readability  new search, retrieval, mining and assistance
applications
 Avoidance of media discontinuities in the different phases of scientific work 
Increased efficiency
 Use of concepts and relationships across disciplinary boundaries 
Interdisciplinarity and transdisciplinarity
 Halting the proliferation of scientific publications  less duplication
 Facilitating the entry of young academics or laypersons  Open Science
48
There is a lot to do:
• Equip existing services with Linked Data interfaces
• Enable the deep semantic description of research, requires
• Good user interfaces
• Scalable storage and search facility
• Collaboration between scientists, libariens, knowledge engineers, machines
Stay tuned
• Mailinglist/group: https://groups.google.com/forum/#!forum/orkg
• Comming soon: Open Research Knowledge Graph: https://orkg.org
• Next workshop at TIB on November, 22nd (after DILS Conference:
https://events.tib.eu/dils2018/)
Outlook
https://de.linkedin.com/in/soerenauer
https://twitter.com/soerenauer
https://www.xing.com/profile/Soeren_Auer
http://www.researchgate.net/profile/Soeren_Auer
TIB & Leibniz University of Hannover
Soeren.Auer@tib.eu
Sören Auer
50
Said Fathalla, Sahar Vahdati, Sören Auer, Christoph Lange:
Towards a Knowledge Graph Representing Research Findings by Semantifying
Survey Articles. TPDL 2017: 315-327,
https://www.researchgate.net/publication/319419350
Sahar Vahdati, Natanael Arndt, Sören Auer, Christoph Lange:
OpenResearch: Collaborative Management of Scholarly Communication Metadata.
EKAW 2016: 778-793, https://www.researchgate.net/publication/309700661
Sören Auer: Towards an Open Research Knowledge Graph
https://zenodo.org/record/1157185
Sören Auer, Viktor Kovtun, Manuel Prinz, Anna Kasprzik, Markus Stocker: Towards a
Knowledge Graph for Science. https://doi.org/10.15488/3401
References

More Related Content

What's hot

FIWARE Training: JSON-LD and NGSI-LD
FIWARE Training: JSON-LD and NGSI-LDFIWARE Training: JSON-LD and NGSI-LD
FIWARE Training: JSON-LD and NGSI-LD
FIWARE
 

What's hot (20)

Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge Graph
 
ArCo: the Italian Cultural Heritage Knowledge Graph
ArCo: the Italian Cultural Heritage Knowledge GraphArCo: the Italian Cultural Heritage Knowledge Graph
ArCo: the Italian Cultural Heritage Knowledge Graph
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Introduction of Knowledge Graphs
Introduction of Knowledge GraphsIntroduction of Knowledge Graphs
Introduction of Knowledge Graphs
 
MongoDB Schema Design: Four Real-World Examples
MongoDB Schema Design: Four Real-World ExamplesMongoDB Schema Design: Four Real-World Examples
MongoDB Schema Design: Four Real-World Examples
 
The Basics of MongoDB
The Basics of MongoDBThe Basics of MongoDB
The Basics of MongoDB
 
Mongodb basics and architecture
Mongodb basics and architectureMongodb basics and architecture
Mongodb basics and architecture
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
 
Spark SQL Tutorial | Spark Tutorial for Beginners | Apache Spark Training | E...
Spark SQL Tutorial | Spark Tutorial for Beginners | Apache Spark Training | E...Spark SQL Tutorial | Spark Tutorial for Beginners | Apache Spark Training | E...
Spark SQL Tutorial | Spark Tutorial for Beginners | Apache Spark Training | E...
 
RDF, linked data and semantic web
RDF, linked data and semantic webRDF, linked data and semantic web
RDF, linked data and semantic web
 
201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning
 
FIWARE Training: JSON-LD and NGSI-LD
FIWARE Training: JSON-LD and NGSI-LDFIWARE Training: JSON-LD and NGSI-LD
FIWARE Training: JSON-LD and NGSI-LD
 
Introduction to Spark with Python
Introduction to Spark with PythonIntroduction to Spark with Python
Introduction to Spark with Python
 
ETL VS ELT.pdf
ETL VS ELT.pdfETL VS ELT.pdf
ETL VS ELT.pdf
 
MongoDB Schema Design
MongoDB Schema DesignMongoDB Schema Design
MongoDB Schema Design
 
Data Modeling with NGSI, NGSI-LD
Data Modeling with NGSI, NGSI-LDData Modeling with NGSI, NGSI-LD
Data Modeling with NGSI, NGSI-LD
 
Introduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AIIntroduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AI
 
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI InitiativesDatabricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI Initiatives
 
Data Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph DatabasesData Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph Databases
 
Enterprise Knowledge Graph
Enterprise Knowledge GraphEnterprise Knowledge Graph
Enterprise Knowledge Graph
 

Similar to Towards an Open Research Knowledge Graph

ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
Jon Voss
 

Similar to Towards an Open Research Knowledge Graph (20)

Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
Open data and linked data
Open data and linked dataOpen data and linked data
Open data and linked data
 
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
 
20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies
 
Linked library data
Linked library dataLinked library data
Linked library data
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
Linked Data: Why Bother?
Linked Data:  Why Bother?Linked Data:  Why Bother?
Linked Data: Why Bother?
 
LKG Editor Dev
LKG Editor DevLKG Editor Dev
LKG Editor Dev
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge Graphs
 
Metadata for researchers
Metadata for researchers Metadata for researchers
Metadata for researchers
 
From Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and CollaborationsFrom Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and Collaborations
 
Semantic Web Technologies: Changing Bibliographic Descriptions?
Semantic Web Technologies: Changing Bibliographic Descriptions?Semantic Web Technologies: Changing Bibliographic Descriptions?
Semantic Web Technologies: Changing Bibliographic Descriptions?
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Measuring Science – Tracing the authors
Measuring Science – Tracing the authorsMeasuring Science – Tracing the authors
Measuring Science – Tracing the authors
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?
 
Connecting Heterogeneous Collections using Linked Data
Connecting Heterogeneous Collections using Linked DataConnecting Heterogeneous Collections using Linked Data
Connecting Heterogeneous Collections using Linked Data
 
Bibliotheek & Onderzoek 2.0?
Bibliotheek & Onderzoek 2.0?Bibliotheek & Onderzoek 2.0?
Bibliotheek & Onderzoek 2.0?
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 

More from Sören Auer

Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Sören Auer
 
Open data for smart cities
Open data for smart citiesOpen data for smart cities
Open data for smart cities
Sören Auer
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
Sören Auer
 
Das Semantische Daten Web für Unternehmen
Das Semantische Daten Web für UnternehmenDas Semantische Daten Web für Unternehmen
Das Semantische Daten Web für Unternehmen
Sören Auer
 
Creating knowledge out of interlinked data
Creating knowledge out of interlinked dataCreating knowledge out of interlinked data
Creating knowledge out of interlinked data
Sören Auer
 
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeFrom Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
Sören Auer
 
ESWC2010 "Linked Data: Now what?" Panel Discussion slides
ESWC2010 "Linked Data: Now what?" Panel Discussion slidesESWC2010 "Linked Data: Now what?" Panel Discussion slides
ESWC2010 "Linked Data: Now what?" Panel Discussion slides
Sören Auer
 
LESS - Template-based Syndication and Presentation of Linked Data for End-users
LESS - Template-based Syndication and Presentation of Linked Data for End-usersLESS - Template-based Syndication and Presentation of Linked Data for End-users
LESS - Template-based Syndication and Presentation of Linked Data for End-users
Sören Auer
 

More from Sören Auer (20)

Knowledge Graph Research and Innovation Challenges
Knowledge Graph Research and Innovation ChallengesKnowledge Graph Research and Innovation Challenges
Knowledge Graph Research and Innovation Challenges
 
Describing Scholarly Contributions semantically with the Open Research Knowle...
Describing Scholarly Contributions semantically with the Open Research Knowle...Describing Scholarly Contributions semantically with the Open Research Knowle...
Describing Scholarly Contributions semantically with the Open Research Knowle...
 
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
 
Cognitive data
Cognitive dataCognitive data
Cognitive data
 
DBpedia - 10 year ISWC SWSA best paper award presentation
DBpedia  - 10 year ISWC SWSA best paper award presentationDBpedia  - 10 year ISWC SWSA best paper award presentation
DBpedia - 10 year ISWC SWSA best paper award presentation
 
Enterprise knowledge graphs
Enterprise knowledge graphsEnterprise knowledge graphs
Enterprise knowledge graphs
 
Towards digitizing scholarly communication
Towards digitizing scholarly communicationTowards digitizing scholarly communication
Towards digitizing scholarly communication
 
Project overview big data europe
Project overview big data europeProject overview big data europe
Project overview big data europe
 
LDOW2015 Position Talk and Discussion
LDOW2015 Position Talk and DiscussionLDOW2015 Position Talk and Discussion
LDOW2015 Position Talk and Discussion
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data Integration
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital libraries
 
Open data for smart cities
Open data for smart citiesOpen data for smart cities
Open data for smart cities
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
Проект Евросоюза LOD2 и Британский Институт Открытых данных
Проект Евросоюза LOD2 и Британский Институт Открытых данныхПроект Евросоюза LOD2 и Британский Институт Открытых данных
Проект Евросоюза LOD2 и Британский Институт Открытых данных
 
Das Semantische Daten Web für Unternehmen
Das Semantische Daten Web für UnternehmenDas Semantische Daten Web für Unternehmen
Das Semantische Daten Web für Unternehmen
 
Creating knowledge out of interlinked data
Creating knowledge out of interlinked dataCreating knowledge out of interlinked data
Creating knowledge out of interlinked data
 
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeFrom Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikis
 
ESWC2010 "Linked Data: Now what?" Panel Discussion slides
ESWC2010 "Linked Data: Now what?" Panel Discussion slidesESWC2010 "Linked Data: Now what?" Panel Discussion slides
ESWC2010 "Linked Data: Now what?" Panel Discussion slides
 
LESS - Template-based Syndication and Presentation of Linked Data for End-users
LESS - Template-based Syndication and Presentation of Linked Data for End-usersLESS - Template-based Syndication and Presentation of Linked Data for End-users
LESS - Template-based Syndication and Presentation of Linked Data for End-users
 

Recently uploaded

development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 

Recently uploaded (20)

development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 

Towards an Open Research Knowledge Graph

  • 1. Towards an Open Research Knowledge Graph Sören Auer
  • 2.
  • 3. Gottfried Wilhelm Leibniz * 21. Juni/ 1. Juli 1646 in Leipzig † 14. November 1716 in Hannover Namesake Member of Library of Namesake
  • 4. Had to do some research on serials…
  • 6. 6
  • 8. 8
  • 9. 9
  • 12. How does it work today?
  • 13. 13
  • 14. 14
  • 15. 15
  • 16. 16 New means adapted to the new posibilities were developed, e.g. „zooming“, dynamics Business models changed completely More focus on data, interlinking of data and services and search in the data Integration, crowdsourcing play an important role The World of Publishing & Communication has profundely changed
  • 18. 18 Scientific publishing in the 17th century One of the earliest research journals: Philosophical Transactions of the Royal Society © CC BY Henry Oldenburg
  • 20. 20 Scientific publishing today We have: BUT • Mainly based on PDF • Is only partially machine-readable • Does not preserve structure • Does not allow embedding of semantics • Does not facilitate interactivity/dynamicity/ repurposing • …
  • 21. 21 Proliferation of scientific literature Duplication and inefficiency Deficiency of peer-review Reproducibility crisis Science is Seriously Flawed
  • 22. 22 Science and engineering articles by region, country: 2004 and 2014 Proliferation of scientific literature National Science Foundation: Science and Engineering Publication Output Trends: https://www.nsf.gov/statistics/2018/nsf18300/nsf18300.pdf
  • 23. 23 1,500 scientists lift the lid on reproducibility Monya Baker in Nature, 2016. 533 (7604): 452–454. doi:10.1038/533452a: • 70% failed to reproduce at least one other scientist's experiment • 50% failed to reproduce one of their own experiments Failure to reproduce results among disciplines (in brackets own results): • chemistry: 87% (64%), • biology: 77% (60%), • physics and engineering: 69% (51%), • Earth sciences: 64% (41%). Reproducibility Crisis © Stanford Medicine - Stanford University
  • 24. 24 How can we avoid duplication if the terminology, research problems, approaches, methods, characteristics, evaluations, … are not properly defined and identified? How would you build an engine/building without properly defining their parts, relationships, materials, characteristics … ? Duplication and Inefficiency
  • 25. 25 Lack of: • Transparency – information is hidden in text • Integratability – fitting different research results together • Machine assistance – unstructured content is hard to process • Identifyability of concepts beyond metadata • Collaboration – one brain barrier • Overview – scientists look for the needle in the haystack Root Cause - Deficiency of Scholarly Communication?
  • 26. How can we fix it? 26
  • 28. Linked Data Principles 1. Use URIs to identify the “things” in your data 2. Use http:// URIs so people (and machines) can look them up on the web 3. When a URI is looked up, return a description of the thing in the W3C Resource Description Format (RDF) 4. Include links to related things http://www.w3.org/DesignIssues/LinkedData.html 28 [1] Auer, Lehmann, Ngomo, Zaveri: Introduction to Linked Data and Its Lifecycle on the Web. Reasoning Web 2013
  • 29. Page 29 1. Graph based RDF data model consisting of S-P-O statements (facts) RDF & Linked Data in a Nutshell NasigConf2018 dbpedia:Atlanta 09.06.2018 NASIG conf:organizes conf:starts conf:takesPlaceIn 2. Serialised as RDF Triples: NASIG conf:organizes NasigConf2018 . NasigConf2018 conf:starts “2018-06-09”^^xsd:date . NasigConf2018 conf:takesPlaceAt dbpedia:Atlanta . 3. Publication under URL in Web, Intranet, Extranet Subject Predicate Object
  • 30. Page 30 Creating Knowledge Graphs with RDF Linked Data located in label industry headquarters full nameDHL Post Tower 162.5 m Bonn Logistics Logistik DHL International GmbH height 物流 label
  • 31. Page 31 Graph consists of:  Resources (identified via URIs)  Literals: data values with data type (URI) or language (multilinguality integrated)  Attributes of resources are also URI-identified (from vocabularies) Various data sources and vocabularies can be arbitrarily mixed and meshed URIs can be shortened with namespace prefixes; e.g. dbp: → http://dbpedia.org/resource/ RDF Data Model (a bit more technical) gn:locatedIn rdfs:label dbo:industry ex:headquarters foaf:namedbp:DHL_International_GmbH dbp:Post_Tower "162.5"^^xsd:decimal dbp:Bonn dbp:Logistics "Logistik"@de "DHL International GmbH"^^xsd:string ex:height "物流"@zh rdfs:label rdf:value unit:Meter ex:unit
  • 32. Page 32 • Fabric of concept, class, property, relationships, entity descriptions • Uses a knowledge representation formalism (typically RDF, RDF-Schema, OWL) • Holistic knowledge (multi-domain, source, granularity): • instance data (ground truth), • open (e.g. DBpedia, WikiData), private (e.g. supply chain data), closed data (product models), • derived, aggregated data, • schema data (vocabularies, ontologies) • meta-data (e.g. provenance, versioning, documentation licensing) • comprehensive taxonomies to categorize entities • links between internal and external data • mappings to data stored in other systems and databases Knowledge Graphs – A definition Smart Data for Machine Learning
  • 34. Page 34 Search Engine Optimization & Web-Commerce  Schema.org used by >20% of Web sites  Major search engines exploit semantic descriptions Pharma, Lifesciences  Mature, comprehensive vocabularies and ontologies  Billions of disease, drug, clinical trial descriptions Digital Libraries  Many established vocabularies (DublinCore, FRBR, EDM)  Millions of aggregated from thousands of memory institutions in Europeana, German Digital Library Emerging Knowledge Graphs & Data Spaces
  • 35. Paradigm Change in Scholarly Communication Towards more Knowledge-based Information Flows
  • 36. 36 Paradigm Change in Scholarly Communication Knowledge-based Information Flows in Science & Technology Challenges: Digitalisation of Science, monopolisation by commercial actors, Proliferation of publications, Reproducibility Crisis
  • 37. 37 Mathematics • Definitions • Theorems • Proofs • Methods • … Physics • Experiments • Data • Models • … Chemistry • Substances • Structures • Reactions • … Computer Science • Concepts • Implemen- tations • Evaluations • … Technology • Standards • Processes • Elements • Units, Sensor data Architecture • Regulations • Elements • Models • … Open Research Knowledge Graph Overarching Concepts  Research problems  Definitions  Research approaches  Methods Artefacts  Publications  Data  Software  Image/Audio/Video  Knowledge Graphs / Ontologies Domain specific concepts Open Research Knowledge Graph makes comprehensive and subject-specific concepts clearly identifiable and links them semantically (with clearly described relations) with each other and with relevant further artifacts.
  • 38. 38
  • 41. 41 Semantic Representation using a Knowledge Graph Author Robert Reed Research Problem Methods Experimental Data related Concepts Genome editing in Lepidoptera CRISPR/cas9 Lepidoptera; Genome editing; CRSIPR https://doi.org/10.5281/zenodo.896916 A practial guide to CRISPR/cas9 editing in Lepidoptera <https://doi.org/10.1101/130344> Robert Reed <https://orcid.org/0000-0002- 6065-6728> Genome editing in Lepidoptera Experimental Data https://doi.org/10.528 1/zenodo.896916 isAuthorOf adresses CRSPRS/cas9 isImplementedBy isEvaluatedWith Genome editing <https://www.wikidata.or g/wiki/Q24630389> relatesConcept 3. Graph representation 2. Graph Curation Form 1. Original Publication
  • 42. 42 Automatic Generation of Comparisons/Surveys
  • 43. 43 Open Research Knowledge Graph interlinks existing Services and Resources
  • 44. 44
  • 45. Interlinking Article, Software, Video and Graph resources describing the research
  • 46.
  • 47. 47 Advantages of knowledge based scholarly communication  Clear identification of all relevant artifacts, concepts, attributes, relationships  terminological and conceptual precision and sharpness, less ambiguity  Better and explicit networking of all relevant artifacts and information sources  traceability  ORKG machine-readability  new search, retrieval, mining and assistance applications  Avoidance of media discontinuities in the different phases of scientific work  Increased efficiency  Use of concepts and relationships across disciplinary boundaries  Interdisciplinarity and transdisciplinarity  Halting the proliferation of scientific publications  less duplication  Facilitating the entry of young academics or laypersons  Open Science
  • 48. 48 There is a lot to do: • Equip existing services with Linked Data interfaces • Enable the deep semantic description of research, requires • Good user interfaces • Scalable storage and search facility • Collaboration between scientists, libariens, knowledge engineers, machines Stay tuned • Mailinglist/group: https://groups.google.com/forum/#!forum/orkg • Comming soon: Open Research Knowledge Graph: https://orkg.org • Next workshop at TIB on November, 22nd (after DILS Conference: https://events.tib.eu/dils2018/) Outlook
  • 50. 50 Said Fathalla, Sahar Vahdati, Sören Auer, Christoph Lange: Towards a Knowledge Graph Representing Research Findings by Semantifying Survey Articles. TPDL 2017: 315-327, https://www.researchgate.net/publication/319419350 Sahar Vahdati, Natanael Arndt, Sören Auer, Christoph Lange: OpenResearch: Collaborative Management of Scholarly Communication Metadata. EKAW 2016: 778-793, https://www.researchgate.net/publication/309700661 Sören Auer: Towards an Open Research Knowledge Graph https://zenodo.org/record/1157185 Sören Auer, Viktor Kovtun, Manuel Prinz, Anna Kasprzik, Markus Stocker: Towards a Knowledge Graph for Science. https://doi.org/10.15488/3401 References

Editor's Notes

  1. SITUATION Wissensaustausch erfolgt nach wie vor mittels Dokumenten ■ HOHER AUFWAND beim Erstellen und Lesen der Dokumente / Artikel ■ Maschinelle Unterstützung bei der Verarbeitung / Suche nur begrenzt möglich ■ Viele REIBUNGSVERLUSTE durch Ambiguität, fehlende Vergleichbarkeit ZIEL Digitalisierung der Wissenschaft durch Etablierung wissensbasierter Informationsflüsse ■ Repräsentation und Kommunikation mittels Wissensgraphen ■ GEMEINSAMES VERSTÄNDNIS von Daten und Informationen durch dezentrale, kollaborative Kuratierung von Wissensgraphen ■ INTEGRATION in existierende und neue Dienste ERGEBNIS Wissenschaftliches Arbeiten wird revolutioniert ■ Informationen und Forschungsergebnisse können MITEINANDER VERNETZT und besser mit komplexen Informationsbedürfnissen in Verbindung gebracht werden. ■ EFFIZIENZGEWINNE, da Ergebnisse direkt vergleichbar und leichter wiederverwendbar
  2. Verfügbare Genome editing Verfahren Site-specificity Hohe Zielgenauigkeit: Wird eine Region ab 18 Nukleotiden sicher erkannt, spricht man von einer eineindeutigen Erkennungsrate der Nukleotidsequenz. Liegt der Wert darunter, steigt die Wahrscheinlichkeit, einen unerwünschten Bereich des Genoms zu erwischen Ease-of-Use / Cost-Efficiency Meganukleasen. Erkennen zwar lange Nukleotidsequenzen, aber dafür ist es sehr aufwändig eine passende Meganuklease für eine gewünschte Sequenz zu finden. Sowohl das Engineering als auch das Screening sind kostenintensiv ZFN. Hohe Screening-Kosten, da Specifity schwer vorherzusagen
  3. Der von den TIB mit Partnerorganisationen entwickelte Open Research Knowledge Graph (1) repräsentiert originäre Forschungsergebnisse explizit semantisch und (2) verknüpft vorhandene Metadaten, Daten, Wissens- & Informationsressourcen reichhaltig miteinander. Der Graph kann von Forschungsgemeinschaften kollaborativ kuratiert werden, sichert die Herkunft (Provenance), repräsentiert den wissenschaftlichen Diskurs und Evolution.