SlideShare a Scribd company logo
1 of 41
schema.org
Linked Data’s Gateway Drug
<script type="application/ld+json">
{
"@context": "http://schema.org",
"@type": "Drug",
"name": "schema.org",
"activeIngredient": "Linked data",
"dosageForm": "Structured data",
"recognizingAuthority": [{
"@type": "Organization",
"name": "Bing"
},{
"@type": "Organization",
"name": "Google"
},{
"@type": "Organization",
"name": "Yahoo"
},{
"@type": "Organization",
"name": "Yandex"
}]
}
</script>
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Electronic Arts
schema.org/worksFor
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
bit.ly/semsearch
schema.org
pending.schema.org/knowsAbout
bit.ly/sdataevents schema.org/WebSite
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
schema.org
pending.schema.org/knowsAbout
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
History and adoption
schema.org followed in the footsteps of other structured data initiatives, but appears to
have enjoyed much broader adoption
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
schema.org
Microformats (2004)
Broad search engine support
data-vocabulary.org (2009)
data-vocabulary.org
Open Graph Protocol (2007)
Partial search engine support
GoodRelations (2007)
DCMI Terms (2003)
FOAF (2000)
No explicit search engine support
Structured data existed prior to schema.org, but often with little or no search engine support
The road to schema.org
schema.org (2011)
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
A “collection of shared vocabularies … that can be understood by the major search engines”
schema.org in a nutshell
Structure
• A collection of schemas consisting of types, properties and
enumerations
• Types – classes and subclasses (e.g. “Book”)
• Properties – attributes expecting a value of a particular data type
(e.g. “sameAs”), or relations expecting an instance of a particular
type (e.g. “author”) or an enumeration member (e.g. “availability”)
• Enumerations – a class (e.g. “ItemAvailability) whose members
are considered neither types nor properties (e.g. “InStock”)
Search engine support
• A joint initiative supported at launch by Bing, Google and
Yahoo, and soon after by Yandex
Supported encoding formats
• Microdata and RDFa supported at launch, with RDFa Lite and
JSON-LD support following
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
All data from Web Data Commons
0.00%
2.00%
4.00%
6.00%
8.00%
10.00%
12.00%
14.00%
16.00%
2012 Aug 2013 Nov 2014 Dec 2015 Nov 2016 Oct 2017 Nov
Format Use as a Percentage of Sampled Domains
RDFa Microdata JSON-LD
Robust schema.org adoption data is hard to come by, but format use helps paint the picture
schema.org adoption as inferred from Web Data Commons data
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
What’s currently being encoded with these syntaxes is almost exclusively schema.org
For microdata and JSON-LD, it’s schema.org all the way down
Top Classes, Microdata, Nov. 2017 Top Classes, JSON-LD, Nov. 2017
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
All data from Web Data Commons
Format Use by Number of Domains in Sample
Raw Web Data Commons format usage data belies the relative expressiveness of schema.org
A relatively large vocabulary results in more assertions
2012 2017
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Raw Web Data Commons format usage data belies the relative expressiveness of schema.org
A relatively large vocabulary results in more assertions
<span class= "author vcard">
<a href=
"http://www.seoskeptic.com/
aaron-bradley/"
class="url fn">Aaron Bradley</a>
“... OGP (Open Graph Protocol) and
microformat approaches can be found on
approximately as many sites as Schema.org,
but given their much smaller vocabularies,
they appear on less than fewer than half as
many pages and contain fewer than a quarter
as many logical assertions.”
Guha, Brickley and Macbeth, Dec. 2015
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Such as they are
schema.org use by the numbers
Apr. 2014 Dec. 2014 Dec. 2015 Nov. 2018
0.3% 22.0% 31.3%
21.9%
JSON-LD
15.6%
Microdata
% of domains
SearchMetrics
500K domains
Microdata only?
% of pages
Guha, Brickley, Macbeth
10B pages
% of websites
W3Techs
Top 10M websites
(Alexa)
% of pages
Guha, Brickley, Macbeth
10B pages
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
The path to adoption
The vocabulary launched with a clear value proposition for webmasters, and has been
buoyed since by a collaborative vocabulary development model, a modified extension
mechanism and the added flexibility afforded by JSON-LD
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Event
Recipe, AggregateRating
Product, AggregateRating
The search engines incentivized schema.org use right out of the gate with rich snippets
Rich results at launch
Rich results post-launch
The search engines have been steadily adding new search features as the vocabulary grows
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Organization.logo, Organization.sameAs JobPosting ClaimReview
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
23 March 20174 May 2016
0 200 400 600 800 1000
Jun-11
Nov-15
Nov-18
Classes in schema.org, 2011-2018
Core Extensions Pending
A living vocabulary
Over the course of time schema.org has become more and more expressive
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
public-schemaorg W3C Mailing List
schema.org provides multiple mechanisms for collaborative vocabulary development
Making vocabulary development a community affair
schema.org on Github Partnerships
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
GS1’s SmartSearch is powered by a schema.org
external extension
schema.org’s extension mechanism was completely revamped in v2.0 (May 2015)
Extending schema.org with more specialized vocabulary
SmartSearch in action at Tesco
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
schema.org endorsed JSON-LD in 2013; Google started using it in 2014, with full support by 2016
JSON-LD: developer-friendly linked data
“…the whole point about it is, it is JSON first and RDF
second. And the fact that it carries RDF is simply
unimportant. And it's particularly unimportant to people
who are JSON users – which is basically every web
developer these days.
“People don't need to know everything, they can create
really cool applications, and if they find JSON-LD useful
– fantastic. If they don't know that it's RDF, I don't care.”
Phil Archer, Aug. 2014
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Separation of the data and presentation layers makes life considerably easier for web developers
JSON-LD versus inline markup: no contest
Product Details Page: Before Product Details Page: After
<script type="application/ld+json">
{
"@context": "http://schema.org",
"@type": "Product",
"name": "Bob's Best Basic T"
"image": "bbbt-pink.jpg",
"offers": {
"@type": "Offer",
"price": "$28",
"priceCurrency": "$USD",
},
"aggregateRating": {
…
<script type="application/ld+json">
{
"@context": "http://schema.org",
"@type": "Product",
"name": "Bob's Best Basic T"
"image": "bbbt-pink.jpg",
"offers": {
"@type": "Offer",
"price": "$28",
"priceCurrency": "$USD",
},
"aggregateRating": {
…
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
schema.org beyond search
Seemingly striking the right balance between expressiveness and complexity, the
vocabulary is being used for applications outside of search, and is increasingly the
starting point for ground-up linked data initiatives
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Pinterest uses schema.org to populate Article, Product and Recipe Rich Pins
Leveraging structured data to enhance the presentation layer
Pinterest Product Rich Pin Offer Information on Pin Source Page
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
When Google needed vocabulary for its Assistant it unsurprisingly turned to schema.org
Virtual assistants and schema.org
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
When Google needed vocabulary for its Assistant it unsurprisingly turned to schema.org
Virtual assistants and schema.org
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Amazon’s Alexa Meaning Representation Language is based on schema.org
Virtual assistants and schema.org
“The Alexa ontology utilized schema.org as
its base and has been updated to include
support for spoken language. In addition,
using schema.org as the base of the Alexa
Ontology means that it shares a vocabulary
used by more than 10 million websites, which
can be linked to the Alexa ontology”
Thomas Kollar et al, Jun. 2018
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
A New Zealand health insurance company used the vocabulary to kickstart product development
Bootstrapping development with schema.org
David Gibson, Feb. 2018
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
The vocabulary allows linked data practitioners to construct knowledge graphs with relative ease
Bootstrapping development with schema.org
“…the knowledge graph is implemented as a
triple store where the data has been
represented using a small number of
vocabularies (mostly schema.org with some
terms borrowed from TAXREF-LD and the
TDWG LSID vocabularies).”
Rod Page, Ozymandias
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Chinese search engine Baidu appears to have based its knowledge graph on schema.org
Bootstrapping development with schema.org
Via Google Translate
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Electronic Arts used the vocabulary as the basis for their domain ontology
Bootstrapping development with schema.org
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Boundaries of the vocabulary
As schema.org is adopted for use in increasingly diverse domains, there’s more and
more demands to add to the vocabulary: does it risk becoming too much “an ontology of
everything”, or is it actually not expressive enough?
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Is it an animal?
Just how much can we say about each entity?
Let’s play 20 questions using schema.org vocabulary!
Is it a vegetable? Is it a mineral?
It’s a Thing It’s a Thing It’s a Thing
More expressive exceptions:
Person, Product
More expressive exception:
Product
More expressive exception:
Product
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
But there’s always a tension between adding to schema.org and referencing existing vocabularies
The “add animals and plants” discussion has recently reignited
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
But there’s always a tension between adding to schema.org and referencing existing vocabularies
The “add animals and plants” discussion has recently reignited
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Recent developments and future
directions
At the same time that the improved ability of machines to understand content makes
structured data use less of an imperative, schema.org is increasingly finding itself useful
as a mechanism for serialized linked data
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
If machines are eventually able to parse content like humans will structured data still be necessary?
Will AI and related technologies render schema.org obsolete?
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Leveraging schema.org allows Google to improve the discoverability of datasets
Bridging the semantic gap with Dataset Search
Year of Birth No. of cases
1976 1
1977 1
1980 1
1981 2
1982 7
1983 8
1984 7
1985 7
1986 11
…
Total 89
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
JSON-LD data feeds enable publishers to support user-initiated video or audio playback
Bridging the action gap with Google Media Actions
<script type="application/ld+json">
{
"@context": ["http://schema.org",
{"@language": "en"}],
"@type": "Movie",
"@id": "http://example.com/M",
"url": "http://example.com/M",
"name": “M",
"potentialAction": {
"@type": "WatchAction",
"target": {
"@type": "EntryPoint",
"urlTemplate":
"http://example.com/M?autoplay=true",
"inLanguage": "en",
"actionPlatform": [
"http://schema.org/DesktopWebPlatform",
"http://schema.org/MobileWebPlatform",
"http://schema.org/AndroidPlatform",
"http://schema.org/IOSPlatform",
"http://schema.googleapis.com/GoogleVideoCa
st"
]
…
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
This Google tool supports direct entry of ClaimReview data, which then appears on dataCommons.org
Bridging the markup gap with the Fact Check Markup Tool
...
"@type" : "DataFeedItem",
"dateModified" : "2018-10-24T15:00:14.238315+00:00",
"item" :
[
{
"@context" : "schema.org",
"@type" : "ClaimReview",
"author" :
{
"@type" : "Organization",
"name" : "Sens3",
"url" : "http://fct.sens3.com/"
},
"claimReviewed" : "I play the trumpet!",
"datePublished" : "2018-10-09",
"itemReviewed" :
{
"@type" : "Claim",
"author" :
{
"@type" : "Person",
"name" : "Paul McCartney"
}
},
"reviewRating" :
...
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
This Google tool supports direct entry of ClaimReview data, which then appears on dataCommons.org
Bridging the markup gap with the Fact Check Markup Tool
...
"@type" : "DataFeedItem",
"dateModified" : "2018-10-24T15:00:14.238315+00:00",
"item" :
[
{
"@context" : "schema.org",
"@type" : "ClaimReview",
"author" :
{
"@type" : "Organization",
"name" : "Sens3",
"url" : "http://fct.sens3.com/"
},
"claimReviewed" : "I play the trumpet!",
"datePublished" : "2018-10-09",
"itemReviewed" :
{
"@type" : "Claim",
"author" :
{
"@type" : "Person",
"name" : "Paul McCartney"
}
},
"reviewRating" :
...
"@type": "Rating",
"ratingValue": “2",
"alternateName" : “Mostly False",
"bestRating": "5",
"worstRating": "1“
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
schema.org has established common ground on shared terminology: is it time to address identifiers?
Questions of identity
“Very early in the formation of schema.org we made a strong decision, which was not
to support canonical IDs, and I think it was an important thing because it would have
been very politically contentious at the time to support it, because we basically would
have had to pick somebody's ID system to have canonical IDs.
“I think the time has come for canonical IDs, so I would love to see schema.org or
some other organization take on canonical IDs.”
Steve Macbeth, Microsoft, Apr. 2018
Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
Let’s keep the conversation going
Thanks!
<script type="application/ld+json">
{
"@context": "http://schema.org",
"@type": "CommunicateAction",
"agent": {
"@type": "Person",
"name": "Aaron"
},
"recipient": {
"@type": "PeopleAudience",
"name": "CDL2018 Attendees"
},
"object": "Stay in touch!"
}
</script>
Twitter
@aaranged
LinkedIn
linkedin.com/in/aaranged/
Semantic Search Marketing
bit.ly/semsearch

More Related Content

What's hot

How to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsHow to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsOntotext
 
Lessons Learned from a Community of Practice about Enterprise Search
Lessons Learned from a Community of Practice about Enterprise Search Lessons Learned from a Community of Practice about Enterprise Search
Lessons Learned from a Community of Practice about Enterprise Search Kurt Kragh Sørensen
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Juan Sequeda
 
How Graph Databases efficiently store, manage and query connected data at s...
How Graph Databases efficiently  store, manage and query  connected data at s...How Graph Databases efficiently  store, manage and query  connected data at s...
How Graph Databases efficiently store, manage and query connected data at s...jexp
 
Smarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformSmarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformOntotext
 
ODI Summit 2016 - Linked Open Data at Springer Nature
ODI Summit 2016 - Linked Open Data at Springer NatureODI Summit 2016 - Linked Open Data at Springer Nature
ODI Summit 2016 - Linked Open Data at Springer NatureMichele Pasin
 
Text Analytics Online Knowledge Base / Database
Text Analytics Online Knowledge Base / DatabaseText Analytics Online Knowledge Base / Database
Text Analytics Online Knowledge Base / DatabaseNaveen Kumar
 
Identify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UKIdentify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UKRinggold Inc
 
Why is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncWhy is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncFranz Inc. - AllegroGraph
 
Graph & Amazon Neptune: Database Week SF
Graph & Amazon Neptune: Database Week SFGraph & Amazon Neptune: Database Week SF
Graph & Amazon Neptune: Database Week SFAmazon Web Services
 
First Logistics Seo analysis logistics - logistics news page
First Logistics Seo analysis   logistics - logistics news pageFirst Logistics Seo analysis   logistics - logistics news page
First Logistics Seo analysis logistics - logistics news pageBrian Bateman
 
Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesSrinath Srinivasa
 
Semantics for Big Data Integration and Analysis
Semantics for Big Data Integration and AnalysisSemantics for Big Data Integration and Analysis
Semantics for Big Data Integration and AnalysisCraig Knoblock
 
Intro to Neo4j with Ruby
Intro to Neo4j with RubyIntro to Neo4j with Ruby
Intro to Neo4j with RubyMax De Marzi
 
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...semanticsconference
 

What's hot (20)

How to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsHow to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk Analytics
 
Lessons Learned from a Community of Practice about Enterprise Search
Lessons Learned from a Community of Practice about Enterprise Search Lessons Learned from a Community of Practice about Enterprise Search
Lessons Learned from a Community of Practice about Enterprise Search
 
Charles Ivie
Charles Ivie Charles Ivie
Charles Ivie
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5
 
How Graph Databases efficiently store, manage and query connected data at s...
How Graph Databases efficiently  store, manage and query  connected data at s...How Graph Databases efficiently  store, manage and query  connected data at s...
How Graph Databases efficiently store, manage and query connected data at s...
 
Smarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformSmarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing Platform
 
Graph based data models
Graph based data modelsGraph based data models
Graph based data models
 
ODI Summit 2016 - Linked Open Data at Springer Nature
ODI Summit 2016 - Linked Open Data at Springer NatureODI Summit 2016 - Linked Open Data at Springer Nature
ODI Summit 2016 - Linked Open Data at Springer Nature
 
JSON-LD Update
JSON-LD UpdateJSON-LD Update
JSON-LD Update
 
Graph & Neptune
Graph & NeptuneGraph & Neptune
Graph & Neptune
 
Text Analytics Online Knowledge Base / Database
Text Analytics Online Knowledge Base / DatabaseText Analytics Online Knowledge Base / Database
Text Analytics Online Knowledge Base / Database
 
Identify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UKIdentify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UK
 
Lju Lazarevic
Lju LazarevicLju Lazarevic
Lju Lazarevic
 
Why is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncWhy is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz Inc
 
Graph & Amazon Neptune: Database Week SF
Graph & Amazon Neptune: Database Week SFGraph & Amazon Neptune: Database Week SF
Graph & Amazon Neptune: Database Week SF
 
First Logistics Seo analysis logistics - logistics news page
First Logistics Seo analysis   logistics - logistics news pageFirst Logistics Seo analysis   logistics - logistics news page
First Logistics Seo analysis logistics - logistics news page
 
Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and Opportunities
 
Semantics for Big Data Integration and Analysis
Semantics for Big Data Integration and AnalysisSemantics for Big Data Integration and Analysis
Semantics for Big Data Integration and Analysis
 
Intro to Neo4j with Ruby
Intro to Neo4j with RubyIntro to Neo4j with Ruby
Intro to Neo4j with Ruby
 
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
 

Similar to schema.org: Linked Data's Gateway Drug

Search Engines After The Semanatic Web
Search Engines After The Semanatic WebSearch Engines After The Semanatic Web
Search Engines After The Semanatic Websamar_slideshare
 
The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014
The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014
The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014Robert Meusel
 
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)Ig Bittencourt
 
Schema.org where did that come from?
Schema.org where did that come from?Schema.org where did that come from?
Schema.org where did that come from?Richard Wallis
 
SMX Advanced 2012 - Catching up with the Semantic Web
SMX Advanced 2012 - Catching up with the Semantic WebSMX Advanced 2012 - Catching up with the Semantic Web
SMX Advanced 2012 - Catching up with the Semantic WebMatthew Brown
 
How google is using linked data today and vision for tomorrow
How google is using linked data today and vision for tomorrowHow google is using linked data today and vision for tomorrow
How google is using linked data today and vision for tomorrowVasu Jain
 
Koneksys - Offering Services to Connect Data using the Data Web
Koneksys - Offering Services to Connect Data using the Data WebKoneksys - Offering Services to Connect Data using the Data Web
Koneksys - Offering Services to Connect Data using the Data WebKoneksys
 
Introduction to Microdata & Google Rich Snippets
Introduction to Microdata  & Google Rich SnippetsIntroduction to Microdata  & Google Rich Snippets
Introduction to Microdata & Google Rich SnippetsKishan Gor
 
360i POV on the Schema.org Markup Initiative
360i POV on the Schema.org Markup Initiative360i POV on the Schema.org Markup Initiative
360i POV on the Schema.org Markup Initiative360i
 
Graph and Amazon Neptune - Bill Baldwin
Graph and Amazon Neptune - Bill BaldwinGraph and Amazon Neptune - Bill Baldwin
Graph and Amazon Neptune - Bill BaldwinAmazon Web Services
 
Deep Dive on Amazon Neptune - AWS Online Tech Talks
Deep Dive on Amazon Neptune - AWS Online Tech TalksDeep Dive on Amazon Neptune - AWS Online Tech Talks
Deep Dive on Amazon Neptune - AWS Online Tech TalksAmazon Web Services
 
APIs and Linked Data: A match made in Heaven
APIs and Linked Data: A match made in HeavenAPIs and Linked Data: A match made in Heaven
APIs and Linked Data: A match made in HeavenMichael Petychakis
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapubeswcsummerschool
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data21Style
 
Employing Google Refine to publish Linked Data
Employing Google Refine to publish Linked DataEmploying Google Refine to publish Linked Data
Employing Google Refine to publish Linked DataFadi Maali
 
Schema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowRichard Wallis
 

Similar to schema.org: Linked Data's Gateway Drug (20)

Search Engines After The Semanatic Web
Search Engines After The Semanatic WebSearch Engines After The Semanatic Web
Search Engines After The Semanatic Web
 
The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014
The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014
The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014
 
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
 
Schema.org where did that come from?
Schema.org where did that come from?Schema.org where did that come from?
Schema.org where did that come from?
 
SMX Advanced 2012 - Catching up with the Semantic Web
SMX Advanced 2012 - Catching up with the Semantic WebSMX Advanced 2012 - Catching up with the Semantic Web
SMX Advanced 2012 - Catching up with the Semantic Web
 
How google is using linked data today and vision for tomorrow
How google is using linked data today and vision for tomorrowHow google is using linked data today and vision for tomorrow
How google is using linked data today and vision for tomorrow
 
Koneksys - Offering Services to Connect Data using the Data Web
Koneksys - Offering Services to Connect Data using the Data WebKoneksys - Offering Services to Connect Data using the Data Web
Koneksys - Offering Services to Connect Data using the Data Web
 
Introduction to Microdata & Google Rich Snippets
Introduction to Microdata  & Google Rich SnippetsIntroduction to Microdata  & Google Rich Snippets
Introduction to Microdata & Google Rich Snippets
 
360i POV on the Schema.org Markup Initiative
360i POV on the Schema.org Markup Initiative360i POV on the Schema.org Markup Initiative
360i POV on the Schema.org Markup Initiative
 
Graph and Amazon Neptune - Bill Baldwin
Graph and Amazon Neptune - Bill BaldwinGraph and Amazon Neptune - Bill Baldwin
Graph and Amazon Neptune - Bill Baldwin
 
Graph and Amazon Neptune
Graph and Amazon NeptuneGraph and Amazon Neptune
Graph and Amazon Neptune
 
Why rdfa
Why rdfaWhy rdfa
Why rdfa
 
Deep Dive on Amazon Neptune - AWS Online Tech Talks
Deep Dive on Amazon Neptune - AWS Online Tech TalksDeep Dive on Amazon Neptune - AWS Online Tech Talks
Deep Dive on Amazon Neptune - AWS Online Tech Talks
 
APIs and Linked Data: A match made in Heaven
APIs and Linked Data: A match made in HeavenAPIs and Linked Data: A match made in Heaven
APIs and Linked Data: A match made in Heaven
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapub
 
Pratical Deep Dive into the Semantic Web - #smconnect
Pratical Deep Dive into the Semantic Web - #smconnectPratical Deep Dive into the Semantic Web - #smconnect
Pratical Deep Dive into the Semantic Web - #smconnect
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
 
Graph and Amazon Neptune
Graph and Amazon NeptuneGraph and Amazon Neptune
Graph and Amazon Neptune
 
Employing Google Refine to publish Linked Data
Employing Google Refine to publish Linked DataEmploying Google Refine to publish Linked Data
Employing Google Refine to publish Linked Data
 
Schema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & How
 

Recently uploaded

TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxAndrieCagasanAkio
 
Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxmibuzondetrabajo
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa494f574xmv
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
ETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxNIMMANAGANTI RAMAKRISHNA
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxMario
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predieusebiomeyer
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书zdzoqco
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书rnrncn29
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119APNIC
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书rnrncn29
 

Recently uploaded (11)

TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptx
 
Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptx
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
ETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptx
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptx
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predi
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
 

schema.org: Linked Data's Gateway Drug

  • 1. schema.org Linked Data’s Gateway Drug <script type="application/ld+json"> { "@context": "http://schema.org", "@type": "Drug", "name": "schema.org", "activeIngredient": "Linked data", "dosageForm": "Structured data", "recognizingAuthority": [{ "@type": "Organization", "name": "Bing" },{ "@type": "Organization", "name": "Google" },{ "@type": "Organization", "name": "Yahoo" },{ "@type": "Organization", "name": "Yandex" }] } </script> Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged
  • 2. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Electronic Arts schema.org/worksFor
  • 3. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged bit.ly/semsearch schema.org pending.schema.org/knowsAbout bit.ly/sdataevents schema.org/WebSite
  • 4. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged schema.org pending.schema.org/knowsAbout
  • 5. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged History and adoption schema.org followed in the footsteps of other structured data initiatives, but appears to have enjoyed much broader adoption
  • 6. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged schema.org Microformats (2004) Broad search engine support data-vocabulary.org (2009) data-vocabulary.org Open Graph Protocol (2007) Partial search engine support GoodRelations (2007) DCMI Terms (2003) FOAF (2000) No explicit search engine support Structured data existed prior to schema.org, but often with little or no search engine support The road to schema.org schema.org (2011)
  • 7. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged A “collection of shared vocabularies … that can be understood by the major search engines” schema.org in a nutshell Structure • A collection of schemas consisting of types, properties and enumerations • Types – classes and subclasses (e.g. “Book”) • Properties – attributes expecting a value of a particular data type (e.g. “sameAs”), or relations expecting an instance of a particular type (e.g. “author”) or an enumeration member (e.g. “availability”) • Enumerations – a class (e.g. “ItemAvailability) whose members are considered neither types nor properties (e.g. “InStock”) Search engine support • A joint initiative supported at launch by Bing, Google and Yahoo, and soon after by Yandex Supported encoding formats • Microdata and RDFa supported at launch, with RDFa Lite and JSON-LD support following
  • 8. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged All data from Web Data Commons 0.00% 2.00% 4.00% 6.00% 8.00% 10.00% 12.00% 14.00% 16.00% 2012 Aug 2013 Nov 2014 Dec 2015 Nov 2016 Oct 2017 Nov Format Use as a Percentage of Sampled Domains RDFa Microdata JSON-LD Robust schema.org adoption data is hard to come by, but format use helps paint the picture schema.org adoption as inferred from Web Data Commons data
  • 9. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged What’s currently being encoded with these syntaxes is almost exclusively schema.org For microdata and JSON-LD, it’s schema.org all the way down Top Classes, Microdata, Nov. 2017 Top Classes, JSON-LD, Nov. 2017
  • 10. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged All data from Web Data Commons Format Use by Number of Domains in Sample Raw Web Data Commons format usage data belies the relative expressiveness of schema.org A relatively large vocabulary results in more assertions 2012 2017
  • 11. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Raw Web Data Commons format usage data belies the relative expressiveness of schema.org A relatively large vocabulary results in more assertions <span class= "author vcard"> <a href= "http://www.seoskeptic.com/ aaron-bradley/" class="url fn">Aaron Bradley</a> “... OGP (Open Graph Protocol) and microformat approaches can be found on approximately as many sites as Schema.org, but given their much smaller vocabularies, they appear on less than fewer than half as many pages and contain fewer than a quarter as many logical assertions.” Guha, Brickley and Macbeth, Dec. 2015
  • 12. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Such as they are schema.org use by the numbers Apr. 2014 Dec. 2014 Dec. 2015 Nov. 2018 0.3% 22.0% 31.3% 21.9% JSON-LD 15.6% Microdata % of domains SearchMetrics 500K domains Microdata only? % of pages Guha, Brickley, Macbeth 10B pages % of websites W3Techs Top 10M websites (Alexa) % of pages Guha, Brickley, Macbeth 10B pages
  • 13. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged The path to adoption The vocabulary launched with a clear value proposition for webmasters, and has been buoyed since by a collaborative vocabulary development model, a modified extension mechanism and the added flexibility afforded by JSON-LD
  • 14. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Event Recipe, AggregateRating Product, AggregateRating The search engines incentivized schema.org use right out of the gate with rich snippets Rich results at launch
  • 15. Rich results post-launch The search engines have been steadily adding new search features as the vocabulary grows Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Organization.logo, Organization.sameAs JobPosting ClaimReview
  • 16. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged 23 March 20174 May 2016 0 200 400 600 800 1000 Jun-11 Nov-15 Nov-18 Classes in schema.org, 2011-2018 Core Extensions Pending A living vocabulary Over the course of time schema.org has become more and more expressive
  • 17. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged public-schemaorg W3C Mailing List schema.org provides multiple mechanisms for collaborative vocabulary development Making vocabulary development a community affair schema.org on Github Partnerships
  • 18. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged GS1’s SmartSearch is powered by a schema.org external extension schema.org’s extension mechanism was completely revamped in v2.0 (May 2015) Extending schema.org with more specialized vocabulary SmartSearch in action at Tesco
  • 19. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged schema.org endorsed JSON-LD in 2013; Google started using it in 2014, with full support by 2016 JSON-LD: developer-friendly linked data “…the whole point about it is, it is JSON first and RDF second. And the fact that it carries RDF is simply unimportant. And it's particularly unimportant to people who are JSON users – which is basically every web developer these days. “People don't need to know everything, they can create really cool applications, and if they find JSON-LD useful – fantastic. If they don't know that it's RDF, I don't care.” Phil Archer, Aug. 2014
  • 20. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Separation of the data and presentation layers makes life considerably easier for web developers JSON-LD versus inline markup: no contest Product Details Page: Before Product Details Page: After <script type="application/ld+json"> { "@context": "http://schema.org", "@type": "Product", "name": "Bob's Best Basic T" "image": "bbbt-pink.jpg", "offers": { "@type": "Offer", "price": "$28", "priceCurrency": "$USD", }, "aggregateRating": { … <script type="application/ld+json"> { "@context": "http://schema.org", "@type": "Product", "name": "Bob's Best Basic T" "image": "bbbt-pink.jpg", "offers": { "@type": "Offer", "price": "$28", "priceCurrency": "$USD", }, "aggregateRating": { …
  • 21. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged schema.org beyond search Seemingly striking the right balance between expressiveness and complexity, the vocabulary is being used for applications outside of search, and is increasingly the starting point for ground-up linked data initiatives
  • 22. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Pinterest uses schema.org to populate Article, Product and Recipe Rich Pins Leveraging structured data to enhance the presentation layer Pinterest Product Rich Pin Offer Information on Pin Source Page
  • 23. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged When Google needed vocabulary for its Assistant it unsurprisingly turned to schema.org Virtual assistants and schema.org
  • 24. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged When Google needed vocabulary for its Assistant it unsurprisingly turned to schema.org Virtual assistants and schema.org
  • 25. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Amazon’s Alexa Meaning Representation Language is based on schema.org Virtual assistants and schema.org “The Alexa ontology utilized schema.org as its base and has been updated to include support for spoken language. In addition, using schema.org as the base of the Alexa Ontology means that it shares a vocabulary used by more than 10 million websites, which can be linked to the Alexa ontology” Thomas Kollar et al, Jun. 2018
  • 26. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged A New Zealand health insurance company used the vocabulary to kickstart product development Bootstrapping development with schema.org David Gibson, Feb. 2018
  • 27. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged The vocabulary allows linked data practitioners to construct knowledge graphs with relative ease Bootstrapping development with schema.org “…the knowledge graph is implemented as a triple store where the data has been represented using a small number of vocabularies (mostly schema.org with some terms borrowed from TAXREF-LD and the TDWG LSID vocabularies).” Rod Page, Ozymandias
  • 28. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Chinese search engine Baidu appears to have based its knowledge graph on schema.org Bootstrapping development with schema.org Via Google Translate
  • 29. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Electronic Arts used the vocabulary as the basis for their domain ontology Bootstrapping development with schema.org
  • 30. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Boundaries of the vocabulary As schema.org is adopted for use in increasingly diverse domains, there’s more and more demands to add to the vocabulary: does it risk becoming too much “an ontology of everything”, or is it actually not expressive enough?
  • 31. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Is it an animal? Just how much can we say about each entity? Let’s play 20 questions using schema.org vocabulary! Is it a vegetable? Is it a mineral? It’s a Thing It’s a Thing It’s a Thing More expressive exceptions: Person, Product More expressive exception: Product More expressive exception: Product
  • 32. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged But there’s always a tension between adding to schema.org and referencing existing vocabularies The “add animals and plants” discussion has recently reignited
  • 33. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged But there’s always a tension between adding to schema.org and referencing existing vocabularies The “add animals and plants” discussion has recently reignited
  • 34. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Recent developments and future directions At the same time that the improved ability of machines to understand content makes structured data use less of an imperative, schema.org is increasingly finding itself useful as a mechanism for serialized linked data
  • 35. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged If machines are eventually able to parse content like humans will structured data still be necessary? Will AI and related technologies render schema.org obsolete?
  • 36. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Leveraging schema.org allows Google to improve the discoverability of datasets Bridging the semantic gap with Dataset Search Year of Birth No. of cases 1976 1 1977 1 1980 1 1981 2 1982 7 1983 8 1984 7 1985 7 1986 11 … Total 89
  • 37. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged JSON-LD data feeds enable publishers to support user-initiated video or audio playback Bridging the action gap with Google Media Actions <script type="application/ld+json"> { "@context": ["http://schema.org", {"@language": "en"}], "@type": "Movie", "@id": "http://example.com/M", "url": "http://example.com/M", "name": “M", "potentialAction": { "@type": "WatchAction", "target": { "@type": "EntryPoint", "urlTemplate": "http://example.com/M?autoplay=true", "inLanguage": "en", "actionPlatform": [ "http://schema.org/DesktopWebPlatform", "http://schema.org/MobileWebPlatform", "http://schema.org/AndroidPlatform", "http://schema.org/IOSPlatform", "http://schema.googleapis.com/GoogleVideoCa st" ] …
  • 38. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged This Google tool supports direct entry of ClaimReview data, which then appears on dataCommons.org Bridging the markup gap with the Fact Check Markup Tool ... "@type" : "DataFeedItem", "dateModified" : "2018-10-24T15:00:14.238315+00:00", "item" : [ { "@context" : "schema.org", "@type" : "ClaimReview", "author" : { "@type" : "Organization", "name" : "Sens3", "url" : "http://fct.sens3.com/" }, "claimReviewed" : "I play the trumpet!", "datePublished" : "2018-10-09", "itemReviewed" : { "@type" : "Claim", "author" : { "@type" : "Person", "name" : "Paul McCartney" } }, "reviewRating" : ...
  • 39. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged This Google tool supports direct entry of ClaimReview data, which then appears on dataCommons.org Bridging the markup gap with the Fact Check Markup Tool ... "@type" : "DataFeedItem", "dateModified" : "2018-10-24T15:00:14.238315+00:00", "item" : [ { "@context" : "schema.org", "@type" : "ClaimReview", "author" : { "@type" : "Organization", "name" : "Sens3", "url" : "http://fct.sens3.com/" }, "claimReviewed" : "I play the trumpet!", "datePublished" : "2018-10-09", "itemReviewed" : { "@type" : "Claim", "author" : { "@type" : "Person", "name" : "Paul McCartney" } }, "reviewRating" : ... "@type": "Rating", "ratingValue": “2", "alternateName" : “Mostly False", "bestRating": "5", "worstRating": "1“
  • 40. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged schema.org has established common ground on shared terminology: is it time to address identifiers? Questions of identity “Very early in the formation of schema.org we made a strong decision, which was not to support canonical IDs, and I think it was an important thing because it would have been very politically contentious at the time to support it, because we basically would have had to pick somebody's ID system to have canonical IDs. “I think the time has come for canonical IDs, so I would love to see schema.org or some other organization take on canonical IDs.” Steve Macbeth, Microsoft, Apr. 2018
  • 41. Aaron Bradley, Connected Data London 2018 ▪ #CDL2018 ▪ @aaranged Let’s keep the conversation going Thanks! <script type="application/ld+json"> { "@context": "http://schema.org", "@type": "CommunicateAction", "agent": { "@type": "Person", "name": "Aaron" }, "recipient": { "@type": "PeopleAudience", "name": "CDL2018 Attendees" }, "object": "Stay in touch!" } </script> Twitter @aaranged LinkedIn linkedin.com/in/aaranged/ Semantic Search Marketing bit.ly/semsearch