SlideShare a Scribd company logo
1 of 37
Download to read offline
Jem Rayfield, November, 2018
Towards Data Driven
Publishing
Leveraging Knowledge Graphs and
Text Analytics
Contech: 2018
Outline
● From; Unstructured Ambiguous Content
● Knowledge Graphs
● Ontotext Platform
● To; Data driven publishing
How can I get OP?From: Unstructured
Ambiguous Content
S
NP VP
Adj N
Stolen painting found by tree
V
PP
P N
S
NP VP
Adj N
Stolen painting found by tree
V
PP
P N
Knowledge and Graphs
Traditional relational
databases only store
information...
Graphs treat the
connections between
information with
equal importance.
Knowledge graphs represent
information in a manner
similar to how a human
understands information.
Ontotext GraphDb; uses graph
statements to reason and
infer additional knowledge.
Vector space indices
for similarity.
Graph; Reasoning & Inference
S = Berners-Lee
P = type
O = Person
S = Person
P = subClassOf
O = Mammal
S = Berners-Lee
P = type
O = Mammal
DATA (RDF)
KNOWLEDGE
(ONTOLOGY)
NEW
Implied
DATA
(RDF)
Graph & Vector Space;
Entity Awareness, Similarity
+
Big Knowledge Graphs; Provide Awareness
● Important airports near london?
● Most popular banks in UK
● People mentioned together with Apple
in the news
Vector Space; Similarity & Concordance
● Find similar content
● Find similar concepts and link
● Find relevant concepts for content
Vector
Space
Index
Similarity
Documents
Annotated
With
Graph Ids
GraphDb Vector Space; Similarity & Concordance
urn:Car
urn:Make
urn:Model
urn:Tires
urn:Car
urn:Engine
urn:Tires
urn:SUV
urn:Make
urn:MLModel
urn:Markov
urn:Car
urn:Engine
urn:Tires
urn:SUV
urn:Make
urn:Model
urn:MLModel
urn:Markov
1 1
1
1
0
0
0
0
0
0
1
0
0
1
0
01
1
0
1
0
0
0
0
Ontotext Platform
Analyses content
Text
Analytics
API
Content
Concept
Suggestions
Classification
Relationships
...
Sentiment
Relationships
TA: Vocabulary Aware
Semantic Disambiguation
GraphDB
Vocabulary
Vocabulary Gazetteer
Disambiguation
(ML Model)
NLP Pipeline
Language Detection
POS
...
...
...
Relevance Ranking
(Statistical)
...
Dynamic
Vocabulary
Get
Suggestions
Annotate
Content
Apple : Organisation
Tim Cook : Person, CEO
Tim Cook : Person, Footballer
Samsung : Organisation
Apple : Organisation
Tim Cook : Person, CEO
Tim Cook : Person, Footballer
Samsung : Organisation
87% - Tim Cook : Person, CEO
68% - Apple : Organisation
56% - Samsung : Organisation
Apple CEO Tim Cook
was at a conference
with the CEO of
Samsung. Tim
explained how smart
phones are changing
the consumer
electronics market.
Suggestions
Entity Detection from Vocab
Disambiguation
Relevance
Automated (Governed) Machine Learning
Text Analytics
Machine Learnt
Model
Curation
Accept|Reject|Modify
Gold Standard Corpus
[W3C Open Annotation]
Re-train
moderate
suggestmodify
corpus
load
update
model
Annotates content with knowledge
Open
Annotation
API
Content
Content
Semantic
Fingerprint
Content Vocabulary Annotation Graph
Content
Apple
Organisation
SamsungAnnotation
textpos:123,142
relevance:56%
mentions
Annotation
textpos:123,142
relevance:68%
about
Tim Cook Person
target
target
tag
tag
ceo
type
type
competitor
Annotation
textpos:123,142
relevance:87%
about
target
tag
USA
NASDAQ
Computer
Hardware
location
exchange
sector
Understands content
Content
Knowledge Graph
Content
UK
Apple
Samsung
USA
located in
mentions
about
Tim
Cook
USA
NASDAQ
exchange
headquarters
Tim
Cook
Computer
Hardware
ceo
industry
about
Understands users
User Data
Knowledge Graph
User
UK
Apple
Inc
Samsung
USA
NASDAQ
lives in
employed by
interested in
Tim
Cook
Computer
Hardware
located in
headquartered in
exchange
ceo
industry
Captures behaviour
Event
APIEvents User
Event
Index
Understands behaviour
User
Social
Behaviour
User
content:view
content:scroll
content:dwell
concept:follow
hashtag:follow
tweet:view
User
Behaviour
Mine social behaviour
User
Behaviour
Social
APIEvents
User
Event
Index
Behavioral + Contextual recommendation
Reads
Behavioral
similarity
Increased Engagement
Content
Knowledge Graph
User Data
Knowledge Graph
+
Social
Behaviour
User
Behaviour+ =
Architecture
Unstructured Content
Users + Events
Content
Concordance
Search
Annotation
User Events
Text Analytics
Recommendation
Knowledge Graph
Semantic
Fingerprint
Structured
Reference data
OP APIs Tools &
Visualisations
To; Data driven publishing
Dynamic Semantic Publishing
Authoring
● Rapid high value, lower cost content curation
● Capture knowledge and meaning as re-usable data
Search & Discovery
● Unambiguous semantic search
● Recommendation and Similarity
Product
● Re-purpose and aggregate with Business context
● Generate new revenue streams
Enhanced Publishing Workflow
Authoring Editorial Production Delivery
Discover
Related
Content
Add
references
Add
Context
Annotate
With Concepts
& Relations
Organise &
Improve
Workflow
Link to
products &
archive
Dynamic
Data driven
products
Content
Transformation
Domain
Modelled
IA
Contextual
Semantic
Search
Recommend
Related
Content
Personalised
Content
Streams
DSP - BBC Sport
o Goals
✓ Create a dynamic semantic publishing
platform that assembles web pages
on-the-fly using a variety of data
sources
✓ Deliver highly relevant data to web site
visitors with sub-second response
"The goal is to be able to more easily and accurately aggregate
content, find it and share it across many sources. From these
simple relationships and building blocks you can dynamically build
up incredibly rich sites and navigation on any platform."
John O’Donovan, Chief Technical Architect, BBC
The IET
o Goals
✓ Manageable, discoverable, searchable;
Journals, research papers and articles
✓ Semantic search using existing
taxonomies
✓ Intelligent citations and data
provenance
✓ Automated, dynamic repurposing of
content assets
✓ Enable new revenue opportunities
Thank you!
Experience the technology with our demonstrators
NOW: Semantic News Portal http://now.ontotext.com
RANK: News popularity ranking for companies http://rank.ontotext.com
FactForge: Knowledge graph of linked open data and news
about People and Organizations
http://factforge.net

More Related Content

More from Ontotext

Best Practices for Large Scale Text Mining Processing
Best Practices for Large Scale Text Mining ProcessingBest Practices for Large Scale Text Mining Processing
Best Practices for Large Scale Text Mining Processing
Ontotext
 

More from Ontotext (20)

Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
 
Hercule: Journalist Platform to Find Breaking News and Fight Fake Ones
Hercule: Journalist Platform to Find Breaking News and Fight Fake OnesHercule: Journalist Platform to Find Breaking News and Fight Fake Ones
Hercule: Journalist Platform to Find Breaking News and Fight Fake Ones
 
How to migrate to GraphDB in 10 easy to follow steps
How to migrate to GraphDB in 10 easy to follow steps How to migrate to GraphDB in 10 easy to follow steps
How to migrate to GraphDB in 10 easy to follow steps
 
GraphDB Cloud: Enterprise Ready RDF Database on Demand
GraphDB Cloud: Enterprise Ready RDF Database on DemandGraphDB Cloud: Enterprise Ready RDF Database on Demand
GraphDB Cloud: Enterprise Ready RDF Database on Demand
 
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
 
Smarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformSmarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing Platform
 
How is smart data cooked?
How is smart data cooked?How is smart data cooked?
How is smart data cooked?
 
Efficient Practices for Large Scale Text Mining Process
Efficient Practices for Large Scale Text Mining ProcessEfficient Practices for Large Scale Text Mining Process
Efficient Practices for Large Scale Text Mining Process
 
The Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open DataThe Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open Data
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
 
The Knowledge Discovery Quest
The Knowledge Discovery Quest The Knowledge Discovery Quest
The Knowledge Discovery Quest
 
Best Practices for Large Scale Text Mining Processing
Best Practices for Large Scale Text Mining ProcessingBest Practices for Large Scale Text Mining Processing
Best Practices for Large Scale Text Mining Processing
 
Build Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
Build Narratives, Connect Artifacts: Linked Open Data for Cultural HeritageBuild Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
Build Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
 
Semantic Data Normalization For Efficient Clinical Trial Research
Semantic Data Normalization For Efficient Clinical Trial ResearchSemantic Data Normalization For Efficient Clinical Trial Research
Semantic Data Normalization For Efficient Clinical Trial Research
 
Gain Super Powers in Data Science: Relationship Discovery Across Public Data
Gain Super Powers in Data Science: Relationship Discovery Across Public DataGain Super Powers in Data Science: Relationship Discovery Across Public Data
Gain Super Powers in Data Science: Relationship Discovery Across Public Data
 
Gaining Advantage in e-Learning with Semantic Adaptive Technology
Gaining Advantage in e-Learning with Semantic Adaptive TechnologyGaining Advantage in e-Learning with Semantic Adaptive Technology
Gaining Advantage in e-Learning with Semantic Adaptive Technology
 
Cooking up the Semantic Web
Cooking up the Semantic WebCooking up the Semantic Web
Cooking up the Semantic Web
 
Diving in Panama Papers and Open Data to Discover Emerging News
Diving in Panama Papers and Open Data to Discover Emerging NewsDiving in Panama Papers and Open Data to Discover Emerging News
Diving in Panama Papers and Open Data to Discover Emerging News
 
How to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsHow to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk Analytics
 
Why Semantics Matter? Adding the semantic edge to your content, right from au...
Why Semantics Matter? Adding the semantic edge to your content,right from au...Why Semantics Matter? Adding the semantic edge to your content,right from au...
Why Semantics Matter? Adding the semantic edge to your content, right from au...
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Towards data driven publishing. Leveraging Knowledge Graphs and Text Analytics.