SlideShare a Scribd company logo
1 of 15
David Kuilman, Gina Donato, Dr. Rinke Hoekstra
A content standard for data-platform use cases:
Content Profiles
& linked documents
NISO Diversity of formats
February 10, 2021 11:00am
Working Group initiative to create a NISO standard for the interchange
of academic, research, and professional content, data, and semantics
2
Elsevier Data Platform vision
…entity-driven processes
(Early) access
and visibility
Expedite shapes
Lineage
Provenance
Policy / license
Priority of
content and
authorship
Content is data
Content and data
operate seamlessly
Content structure
follows document
entity structure
Rich HTML5 literals
for UI/UX use cases
Role based
processing
Content typology
Granular
Context-based
using process
and purpose
intelligence
Content is
shared
All content can be
leveraged throughout
the platform by all
contributor/consumer
roles using a common
vocabulary
Zero organisational
boundaries
Policies for compliance
Continuous
flow and
hydration
Partial and
complete resources
Extensible types
and enrichments
Optimisation
of formats
Machine
learning
Human
interaction
Agile, extensible
and resilient
Fast services development
Nimble models
Extensible models
Arbitrary content (types)
Service level agreement
Handle exception flows
gracefully and informed
Business requirement: from a content perspective
Anatomy of content entity processes on a data platform
Source
Data
Harvesting Normalisation Extraction matching Linking Curation Publishing
… entity driven workflow
Classic document driven workflow…
manuscript Internal format copyedit Mastercopy Product
mappings mappings
The Content Profiles & Linked Document standard (CP/LD) is the result of
adopting content platform principles to provide the flexibility, extensibility and
connectivity required on a
data platform for academic, research and professional content
Lets consider a few critical design considerations first…
Pipeline to cyclic
Human-in-the-loop
Merging data entities and content entities on demand
Sourcing
Harvesting
Normalizing
Extraction
Matching
linking
Publishing
Key concept: think cyclic, not linear…
Sourcing
Harvesting
Normalizing
Extraction
Matching
linking
Publishing
Sourcing
Harvesting
Normalizing
Extraction
Matching
linking
Publishing
Sourcing
Harvesting
Normalizing
Extraction
Matching
linking
Publishing
Sourcing Harvesting Normalisation Extraction matching Linking Curation Publishing
… in parallel workflows
… author
… review
… approve
… connect
… edit
… recommend
… annotate
…
Human-in-the-loop
Key concept: think human-in-the-loop and machine learning
Sourcing
Harvesting
Normalizing
Extraction
Matching
linking
Publishing
Gold set
Test sets
Human curation within
content centric workflows
Human curation within
Machine Learning
Contributor
Consumer
Continuous improvement
Content operations
Platform operations
Continuous deployment
Model operations
Content
artefacts
Enhanced
Content
artefacts
Human supervised
Content usage metrics
The CP/LD standard uses established standards to create the
format framework that supports data platform content
operations without compromise
Linked data and HTML5 unite syntax, structure and semantics
needed on the platform
HTML5
JSON-LD +
Structured narrative
Semantic data layer
XHTML dialect
Linked Data
Usage standard and guidelines
Independent of any particular use case
Content Profile standard & Linked Document
XML Schema
RDF Schema
SHACL
XML
Schema
RDF: Discovery
XML: consistency
JSON: messaging
JSON-LD: knowledge infusion
HTML5: representation
Business roles
This is a part of text that has a specific style (italic)
This is a paragraph
This paragraph is the abstract of the paper
This paragraph is the title of the paper
This is author Alba Grifoni
This is a citation of another paper
This is a result reported on in this paper
This is a mention of the “COVID-19” concept
This is a mention of the “SARS-CoV2” concept
This states that “SARS-CoV2” reactive “CD4+ T-cells” exist in ~40%-
60% of unexposed individuals, suggesting cross-reactive T-cell
recognition with “common cold”
doi:10.1126/sciimunol.aan5393
“55425663600”
hgraph:id-88f9e4ca-c776-3380-933b-f1218c4ef1fd (COVID-19)
hgraph:id-2ab6cd87-e543-3229-85ff-c862a90f415c (SARS-CoV2)
hgraph:id-88f9e4ca-c776-3380-933b-f1218c4ef1fd (T-CD4+)
hgraph:id-2ab6cd87-e543-3229-85ff-c862a90f415c (SARS-CoV2)
hgraph:id-a28e7725-1919-34f0-a648-45721d8bd6a2 (common cold)
reactive to
reactive to
The anatomy of a Linked
Document
service
service
service
service
service
service
service
service
service
service
assertions
documents
resources
Aggregations
products
Content Topics blueprint for data platform
Bespoke normalizers Linked Data processors Query
harversting
Harvested
manuscript
Normalized
document
Enriched article A finished
article
Article
Author
Document
Document
Document
Author
Document
Article
Document
Author
attributes
Manuscript
Conclusion
Abstract
Author
String
Author
String
Activating the platform: listen and merge application
An author manuscript
Author mention
Author as Person Entity
Author as Entity and representation
Conclusion
Abstract
service
service
service
merge
Activating the platform: merge topics and create a product view
After merging the topics, the
finished view offers:
• A manuscript becomes an
Document
• the position of an abstract
and a conclusion
• An person has been identified
as author
• The author string has been
identified within the
document.
• The author has entity
attributes
• The document assembly is a
scientific article of type
‘Finished’ because it satisfies
the above criteria
merge
Article Author
Author
attributes
Abstract
Author
String
Conclusion
Outside document
Inside document
HTML5 vocabulary
JSON-LD predicates
Relationships legend
A finished article
Key takeaways
• Content is data; treat it as data not as documents
• Normalization is great divider from files to entities, items and assertions
• Entity-designed data and Author-designed data become blended
• Machine learner and researcher forge alliance
On standards & formats…
• RDF and XML schema technology (remain) backbone for information
modelling
• JSON, JSON-LD and HTML5 serialisations dominant for content standards
Working Group initiative to create a NISO standard for the interchange
of academic, research, and professional content, data, and semantics
Further information:
Kuliman "Content Profiles & linked documents"

More Related Content

What's hot

Regression Testing - An Overview
Regression Testing - An OverviewRegression Testing - An Overview
Regression Testing - An OverviewBugRaptors
 
UX + BA: Working Together In Harmony [updated]
UX + BA: Working Together In Harmony [updated]UX + BA: Working Together In Harmony [updated]
UX + BA: Working Together In Harmony [updated]Jacklyn Burgan
 
Accelerate : la vitesse conditionne l'excellence
Accelerate : la vitesse conditionne l'excellence Accelerate : la vitesse conditionne l'excellence
Accelerate : la vitesse conditionne l'excellence OCTO Technology
 
Understanding DevOps
Understanding DevOpsUnderstanding DevOps
Understanding DevOpsInnoTech
 
Number of oocytes and progesterone levels in IVF: Do they matter?
Number of oocytes and progesterone levels in IVF: Do they matter?Number of oocytes and progesterone levels in IVF: Do they matter?
Number of oocytes and progesterone levels in IVF: Do they matter?Sandro Esteves
 
Fertility Enhancing Laparoscopic Surgeries Panel Discussion
Fertility Enhancing Laparoscopic Surgeries Panel DiscussionFertility Enhancing Laparoscopic Surgeries Panel Discussion
Fertility Enhancing Laparoscopic Surgeries Panel DiscussionRajesh Gajbhiye
 
Getting Started with Azure DevOps
Getting Started with Azure DevOpsGetting Started with Azure DevOps
Getting Started with Azure DevOpsJessica Deen
 
Role of tubal surgery in era of ivf
Role of tubal surgery in era of ivfRole of tubal surgery in era of ivf
Role of tubal surgery in era of ivfSanjay Makwana
 
Integrating User Centered Design with Agile Development
Integrating User Centered Design with Agile DevelopmentIntegrating User Centered Design with Agile Development
Integrating User Centered Design with Agile DevelopmentJulia Borkenhagen
 
Integrating Automated Testing into DevOps
Integrating Automated Testing into DevOpsIntegrating Automated Testing into DevOps
Integrating Automated Testing into DevOpsTechWell
 
Transform Agile Development With Practical DevOps
Transform Agile Development With Practical DevOpsTransform Agile Development With Practical DevOps
Transform Agile Development With Practical DevOpsGaurav Sharma
 
Building a DevOps organization
Building a DevOps organizationBuilding a DevOps organization
Building a DevOps organizationZinnov
 
A (Brief) History of User Experience
A (Brief) History of User ExperienceA (Brief) History of User Experience
A (Brief) History of User ExperienceChris Pallé
 

What's hot (20)

Regression Testing - An Overview
Regression Testing - An OverviewRegression Testing - An Overview
Regression Testing - An Overview
 
DevOps Best Practices
DevOps Best PracticesDevOps Best Practices
DevOps Best Practices
 
DevOps Foundation
DevOps FoundationDevOps Foundation
DevOps Foundation
 
UX + BA: Working Together In Harmony [updated]
UX + BA: Working Together In Harmony [updated]UX + BA: Working Together In Harmony [updated]
UX + BA: Working Together In Harmony [updated]
 
Accelerate : la vitesse conditionne l'excellence
Accelerate : la vitesse conditionne l'excellence Accelerate : la vitesse conditionne l'excellence
Accelerate : la vitesse conditionne l'excellence
 
DevOps - A Gentle Introduction
DevOps - A Gentle IntroductionDevOps - A Gentle Introduction
DevOps - A Gentle Introduction
 
Understanding DevOps
Understanding DevOpsUnderstanding DevOps
Understanding DevOps
 
Number of oocytes and progesterone levels in IVF: Do they matter?
Number of oocytes and progesterone levels in IVF: Do they matter?Number of oocytes and progesterone levels in IVF: Do they matter?
Number of oocytes and progesterone levels in IVF: Do they matter?
 
Fertility Enhancing Laparoscopic Surgeries Panel Discussion
Fertility Enhancing Laparoscopic Surgeries Panel DiscussionFertility Enhancing Laparoscopic Surgeries Panel Discussion
Fertility Enhancing Laparoscopic Surgeries Panel Discussion
 
Devops ppt
Devops pptDevops ppt
Devops ppt
 
Getting Started with Azure DevOps
Getting Started with Azure DevOpsGetting Started with Azure DevOps
Getting Started with Azure DevOps
 
Role of tubal surgery in era of ivf
Role of tubal surgery in era of ivfRole of tubal surgery in era of ivf
Role of tubal surgery in era of ivf
 
Integrating User Centered Design with Agile Development
Integrating User Centered Design with Agile DevelopmentIntegrating User Centered Design with Agile Development
Integrating User Centered Design with Agile Development
 
Infertility and PCOS
Infertility and PCOSInfertility and PCOS
Infertility and PCOS
 
Intro to DevOps
Intro to DevOpsIntro to DevOps
Intro to DevOps
 
Integrating Automated Testing into DevOps
Integrating Automated Testing into DevOpsIntegrating Automated Testing into DevOps
Integrating Automated Testing into DevOps
 
Transform Agile Development With Practical DevOps
Transform Agile Development With Practical DevOpsTransform Agile Development With Practical DevOps
Transform Agile Development With Practical DevOps
 
DevOps introduction
DevOps introductionDevOps introduction
DevOps introduction
 
Building a DevOps organization
Building a DevOps organizationBuilding a DevOps organization
Building a DevOps organization
 
A (Brief) History of User Experience
A (Brief) History of User ExperienceA (Brief) History of User Experience
A (Brief) History of User Experience
 

Similar to Kuliman "Content Profiles & linked documents"

Building an effective sharepoint team
Building an effective sharepoint teamBuilding an effective sharepoint team
Building an effective sharepoint teamBaris Bruce Tuncertan
 
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...PwC
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewDataWorks Summit/Hadoop Summit
 
FHIR Client Development with .NET
FHIR Client Development with .NETFHIR Client Development with .NET
FHIR Client Development with .NETBrian Postlethwaite
 
LavaCon 2017 - Authored by Man and Machine: Interactive Documents?
LavaCon 2017 - Authored by Man and Machine: Interactive Documents?LavaCon 2017 - Authored by Man and Machine: Interactive Documents?
LavaCon 2017 - Authored by Man and Machine: Interactive Documents?Jack Molisani
 
The path to an hybrid open source paradigm
The path to an hybrid open source paradigmThe path to an hybrid open source paradigm
The path to an hybrid open source paradigmJonathan Challener
 
TSPUG: Content Management in SharePoint 2010
TSPUG: Content Management in SharePoint 2010TSPUG: Content Management in SharePoint 2010
TSPUG: Content Management in SharePoint 2010Eli Robillard
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...Gabriel Moreira
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...Gabriel Moreira
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersDebdoot Mukherjee
 
Research Object Community Update
Research Object Community UpdateResearch Object Community Update
Research Object Community UpdateCarole Goble
 
Modular Documentation Joe Gelb Techshoret 2009
Modular Documentation Joe Gelb Techshoret 2009Modular Documentation Joe Gelb Techshoret 2009
Modular Documentation Joe Gelb Techshoret 2009Suite Solutions
 
Approaches to machine actionable links
Approaches to machine actionable linksApproaches to machine actionable links
Approaches to machine actionable linksStephen Richard
 
How to govern and secure a Data Mesh?
How to govern and secure a Data Mesh?How to govern and secure a Data Mesh?
How to govern and secure a Data Mesh?confluent
 
Mark Orange - SharePoint 2010 Content Types Model - SPC NZ 2011
Mark Orange - SharePoint 2010 Content Types Model - SPC NZ 2011Mark Orange - SharePoint 2010 Content Types Model - SPC NZ 2011
Mark Orange - SharePoint 2010 Content Types Model - SPC NZ 2011Knowledge Cue
 
Enterprise Content Management Migration Best Practices Feat Migrations From...
Enterprise Content Management Migration Best Practices   Feat Migrations From...Enterprise Content Management Migration Best Practices   Feat Migrations From...
Enterprise Content Management Migration Best Practices Feat Migrations From...Alfresco Software
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 

Similar to Kuliman "Content Profiles & linked documents" (20)

Building an effective sharepoint team
Building an effective sharepoint teamBuilding an effective sharepoint team
Building an effective sharepoint team
 
OpenKM commercial
OpenKM commercialOpenKM commercial
OpenKM commercial
 
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
 
FHIR Client Development with .NET
FHIR Client Development with .NETFHIR Client Development with .NET
FHIR Client Development with .NET
 
LavaCon 2017 - Authored by Man and Machine: Interactive Documents?
LavaCon 2017 - Authored by Man and Machine: Interactive Documents?LavaCon 2017 - Authored by Man and Machine: Interactive Documents?
LavaCon 2017 - Authored by Man and Machine: Interactive Documents?
 
The path to an hybrid open source paradigm
The path to an hybrid open source paradigmThe path to an hybrid open source paradigm
The path to an hybrid open source paradigm
 
TSPUG: Content Management in SharePoint 2010
TSPUG: Content Management in SharePoint 2010TSPUG: Content Management in SharePoint 2010
TSPUG: Content Management in SharePoint 2010
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge Workers
 
Research Object Community Update
Research Object Community UpdateResearch Object Community Update
Research Object Community Update
 
Modular Documentation Joe Gelb Techshoret 2009
Modular Documentation Joe Gelb Techshoret 2009Modular Documentation Joe Gelb Techshoret 2009
Modular Documentation Joe Gelb Techshoret 2009
 
Approaches to machine actionable links
Approaches to machine actionable linksApproaches to machine actionable links
Approaches to machine actionable links
 
How to govern and secure a Data Mesh?
How to govern and secure a Data Mesh?How to govern and secure a Data Mesh?
How to govern and secure a Data Mesh?
 
Microsoft SharePoint Syntex
Microsoft SharePoint SyntexMicrosoft SharePoint Syntex
Microsoft SharePoint Syntex
 
Mark Orange - SharePoint 2010 Content Types Model - SPC NZ 2011
Mark Orange - SharePoint 2010 Content Types Model - SPC NZ 2011Mark Orange - SharePoint 2010 Content Types Model - SPC NZ 2011
Mark Orange - SharePoint 2010 Content Types Model - SPC NZ 2011
 
Enterprise Content Management Migration Best Practices Feat Migrations From...
Enterprise Content Management Migration Best Practices   Feat Migrations From...Enterprise Content Management Migration Best Practices   Feat Migrations From...
Enterprise Content Management Migration Best Practices Feat Migrations From...
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 

More from National Information Standards Organization (NISO)

More from National Information Standards Organization (NISO) (20)

Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
 
Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"
 
Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"
 
Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"
 

Recently uploaded

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 

Recently uploaded (20)

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 

Kuliman "Content Profiles & linked documents"

  • 1. David Kuilman, Gina Donato, Dr. Rinke Hoekstra A content standard for data-platform use cases: Content Profiles & linked documents NISO Diversity of formats February 10, 2021 11:00am Working Group initiative to create a NISO standard for the interchange of academic, research, and professional content, data, and semantics
  • 2. 2 Elsevier Data Platform vision …entity-driven processes
  • 3. (Early) access and visibility Expedite shapes Lineage Provenance Policy / license Priority of content and authorship Content is data Content and data operate seamlessly Content structure follows document entity structure Rich HTML5 literals for UI/UX use cases Role based processing Content typology Granular Context-based using process and purpose intelligence Content is shared All content can be leveraged throughout the platform by all contributor/consumer roles using a common vocabulary Zero organisational boundaries Policies for compliance Continuous flow and hydration Partial and complete resources Extensible types and enrichments Optimisation of formats Machine learning Human interaction Agile, extensible and resilient Fast services development Nimble models Extensible models Arbitrary content (types) Service level agreement Handle exception flows gracefully and informed Business requirement: from a content perspective
  • 4. Anatomy of content entity processes on a data platform Source Data Harvesting Normalisation Extraction matching Linking Curation Publishing … entity driven workflow Classic document driven workflow… manuscript Internal format copyedit Mastercopy Product mappings mappings
  • 5. The Content Profiles & Linked Document standard (CP/LD) is the result of adopting content platform principles to provide the flexibility, extensibility and connectivity required on a data platform for academic, research and professional content Lets consider a few critical design considerations first… Pipeline to cyclic Human-in-the-loop Merging data entities and content entities on demand
  • 6. Sourcing Harvesting Normalizing Extraction Matching linking Publishing Key concept: think cyclic, not linear… Sourcing Harvesting Normalizing Extraction Matching linking Publishing Sourcing Harvesting Normalizing Extraction Matching linking Publishing Sourcing Harvesting Normalizing Extraction Matching linking Publishing Sourcing Harvesting Normalisation Extraction matching Linking Curation Publishing … in parallel workflows … author … review … approve … connect … edit … recommend … annotate … Human-in-the-loop
  • 7. Key concept: think human-in-the-loop and machine learning Sourcing Harvesting Normalizing Extraction Matching linking Publishing Gold set Test sets Human curation within content centric workflows Human curation within Machine Learning Contributor Consumer Continuous improvement Content operations Platform operations Continuous deployment Model operations Content artefacts Enhanced Content artefacts Human supervised Content usage metrics
  • 8. The CP/LD standard uses established standards to create the format framework that supports data platform content operations without compromise Linked data and HTML5 unite syntax, structure and semantics needed on the platform
  • 9. HTML5 JSON-LD + Structured narrative Semantic data layer XHTML dialect Linked Data Usage standard and guidelines Independent of any particular use case Content Profile standard & Linked Document XML Schema RDF Schema SHACL XML Schema RDF: Discovery XML: consistency JSON: messaging JSON-LD: knowledge infusion HTML5: representation Business roles
  • 10. This is a part of text that has a specific style (italic) This is a paragraph This paragraph is the abstract of the paper This paragraph is the title of the paper This is author Alba Grifoni This is a citation of another paper This is a result reported on in this paper This is a mention of the “COVID-19” concept This is a mention of the “SARS-CoV2” concept This states that “SARS-CoV2” reactive “CD4+ T-cells” exist in ~40%- 60% of unexposed individuals, suggesting cross-reactive T-cell recognition with “common cold” doi:10.1126/sciimunol.aan5393 “55425663600” hgraph:id-88f9e4ca-c776-3380-933b-f1218c4ef1fd (COVID-19) hgraph:id-2ab6cd87-e543-3229-85ff-c862a90f415c (SARS-CoV2) hgraph:id-88f9e4ca-c776-3380-933b-f1218c4ef1fd (T-CD4+) hgraph:id-2ab6cd87-e543-3229-85ff-c862a90f415c (SARS-CoV2) hgraph:id-a28e7725-1919-34f0-a648-45721d8bd6a2 (common cold) reactive to reactive to The anatomy of a Linked Document
  • 11. service service service service service service service service service service assertions documents resources Aggregations products Content Topics blueprint for data platform Bespoke normalizers Linked Data processors Query harversting Harvested manuscript Normalized document Enriched article A finished article
  • 12. Article Author Document Document Document Author Document Article Document Author attributes Manuscript Conclusion Abstract Author String Author String Activating the platform: listen and merge application An author manuscript Author mention Author as Person Entity Author as Entity and representation Conclusion Abstract service service service merge
  • 13. Activating the platform: merge topics and create a product view After merging the topics, the finished view offers: • A manuscript becomes an Document • the position of an abstract and a conclusion • An person has been identified as author • The author string has been identified within the document. • The author has entity attributes • The document assembly is a scientific article of type ‘Finished’ because it satisfies the above criteria merge Article Author Author attributes Abstract Author String Conclusion Outside document Inside document HTML5 vocabulary JSON-LD predicates Relationships legend A finished article
  • 14. Key takeaways • Content is data; treat it as data not as documents • Normalization is great divider from files to entities, items and assertions • Entity-designed data and Author-designed data become blended • Machine learner and researcher forge alliance On standards & formats… • RDF and XML schema technology (remain) backbone for information modelling • JSON, JSON-LD and HTML5 serialisations dominant for content standards Working Group initiative to create a NISO standard for the interchange of academic, research, and professional content, data, and semantics Further information:

Editor's Notes

  1. XML DTD 5.6 (OPS), XOCS… Common Index Profile (CIP) -> structure & metadata NLP: CM2, FPE, Leadmine, MedScan, Termite (SciBite) … Linking: Parity, FPE, …