SlideShare a Scribd company logo
1 of 58
Introduction to
Metadata
Application Profiles
DCMI Webinar
Karen Coyle
2018
Data silos
Data silos
DC BIBO MARC21
Data silos
MARC21 MARC21C21 MARC21
What are application profiles?
• Record your institution or project's choices
• Form a basis for developing a consensus around your own
data
• Express specific practices, rules
• Tell data consumers what to expect
Why do we need them?
• How can someone else understand your
data well enough to make use of it?
• Not unlike open source problem: you can
declare your code ‘open’ and wish people
‘good luck’ or you can provide support.
Who needs them?
• Creators: anyone providing data
• Users
• anyone who can/is allowed to access the data
• both people AND machines - not an either/or, but
should be both
What are they?
• Basic structure of the data
• the story that the data tells; what you are trying to say
• what are the things? how are they described?
• What are the properties and the rules for property use?
• What are the values?
How are they?
• What will a profile be? How can it be implemented?
• Documents (PDF)
• Spreadsheets
• Code (RDF, JSON, XML)
What does an application profile
look like?
Dublin Core and
Application Profiles
Dublin Core Singapore Framework for
Application Profiles (2007)
Domain model
Domain
model
Functional requirements
• Before developing any solutions, define problems
• Decide which problems you can solve
• State the requirements for success
Vocabularies
• Profiles reuse vocabularies
• Profiles can select from a single
vocabulary
• Profiles can extend a vocabulary
• Profiles can combine vocabularies
Term reuse & semantics
• Reuse can narrow semantics but should never contradict how
the term is defined at its origin
• Terms with strict definitions (e.g. OWL constraints, limits on
valid values, disjoint with other terms) are the hardest to
reuse
• Base vocabularies are best if they employ minimum semantic
commitment
Components of a profile
• Vocabulary
• Definitions
• Usage rules
• Cardinality of terms and values
• Examples
• Validation rules
This is not a full list!
Validation rules
• Can have foaf:name or (foaf:foreName + foaf:familyName)
• dct:date cannot be > 2020
• Subjects must be from http://id.loc.gov/authorities/subjects/
Validation
• Non-RDF (e.g. XML schema)
• SHACL – W3C recommendation (SHApes Constraint Language)
• https://www.w3.org/TR/shacl/
• ShEx – W3C community group (Shape Expressions)
• http://shex.io/
Validation
• Non-RDF (e.g. XML schema)
• SHACL – W3C recommendation (SHApes Constraint Language)
• https://www.w3.org/TR/shacl/
• ShEx – W3C community group (Shape Expressions)
• http://shex.io/
Validation
• Non-RDF (e.g. XML schema)
• SHACL – W3C recommendation (SHApes Constraint Language)
• https://www.w3.org/TR/shacl/
• ShEx – W3C community group (Shape Expressions)
• http://shex.io/
my:IssueShape {
ex:state [ex:unassigned
ex:assigned];
}
Not everything can be validated
• "Recommended" "Mandatory if applicable"
• Names, resource titles, other string-based data
Maintaining profiles
Profile maintenance
• Who maintains the profile?
• How will new terms be added?
• What can be changed?
• How can the profile be extended?
What we need so that
we can (easily) create
profiles
Some profile-related efforts
• Dublin Core (since the late 1990's) based on Singapore Framework
• http://dublincore.org/documents/singapore-framework/
• http://dublincore.org/documents/profile-guidelines/
• DXWG – Data eXchange Working Group, W3C, application profile guidance
(2017, due 2019)
• https://www.w3.org/2017/dxwg/wiki/Main_Page
Standard profile language(s)
• Core for the simplest needs, or for getting started
• shows domain model
• lists vocabulary terms
• can express basic rules for vocabulary members, especially cardinality & values
• documentation for human readers
Generic domain model - DC
Profile
Resource
Property
Value
"things"
"terms or elements"
"data"
MyBookCase
Profile: MyBookCase
Resource: Book
Resource: Person
http://dublincore.org/documents/profile-guidelines/
MyBookCase
Profile: MyBookCase
Resource: Book
Property: title
Property: author
Property: size
Resource: Person
Property: name
MyBookCase
Profile: MyBookCase
Resource: Book
Property: title
min:1, max:1
Property: author
min:0, max:3
Property: size
min:1, max:1
Resource: Person
Property: name
MyBookCase
Profile: MyBookCase
Resource: Book
Property: title
min:1, max:1
value type: literal
Property: author
min:0, max:3
value type: IRI
Property: size
min:1, max:1
value type: integer
Resource: Person
Property: name
APs as spreadsheets?
Can we make validation "easy"?
• Valid properties ✔
• Valid values ✔
• Value types
• Value lists (text or URIs)
• Conditional rules 
• If A not B
• A or (B & C)
Validation – bridging the gap
• Profile may need validation pseudo-code
• Pseudo-code -> validation standard (SHACL, ShEx)?
• What to do with non-actionable statements of validation (“mandatory if
applicable”)?
Summary: Functions of a profile
• Consensus-building
• Documentation
• Input/output control
• Validation (input and output and sharing)
Thank you
kcoyle@kcoyle.net
https://github.com/kcoyle/RDF-AP

More Related Content

What's hot

Ontology Web services for Semantic Applications
Ontology Web services for Semantic ApplicationsOntology Web services for Semantic Applications
Ontology Web services for Semantic ApplicationsTrish Whetzel
 
RDA Toolkit Essentials 2015-09-24
RDA Toolkit Essentials 2015-09-24RDA Toolkit Essentials 2015-09-24
RDA Toolkit Essentials 2015-09-24jhennelly
 
RDA Toolkit Essentials 2015-06-11
RDA Toolkit Essentials 2015-06-11RDA Toolkit Essentials 2015-06-11
RDA Toolkit Essentials 2015-06-11jhennelly
 
RDA Toolkit Essentials 2014-12-17
RDA Toolkit Essentials 2014-12-17RDA Toolkit Essentials 2014-12-17
RDA Toolkit Essentials 2014-12-17jhennelly
 
Sharepoint taxonomy introduction us
Sharepoint taxonomy introduction   usSharepoint taxonomy introduction   us
Sharepoint taxonomy introduction usQUONTRASOLUTIONS
 
RDA Toolkit Essentials 09.17.2014
RDA Toolkit Essentials 09.17.2014RDA Toolkit Essentials 09.17.2014
RDA Toolkit Essentials 09.17.2014jhennelly
 
RDA Toolkit Essentials 2015-03-18
RDA Toolkit Essentials 2015-03-18RDA Toolkit Essentials 2015-03-18
RDA Toolkit Essentials 2015-03-18jhennelly
 
01 18 rda toolkit essentials v6
01 18 rda toolkit essentials v601 18 rda toolkit essentials v6
01 18 rda toolkit essentials v6jhennelly
 
Open Data Management for Public Automated Translation
Open Data Management for Public Automated TranslationOpen Data Management for Public Automated Translation
Open Data Management for Public Automated TranslationDave Lewis
 
03.21 rda toolkit essentials
03.21 rda toolkit essentials03.21 rda toolkit essentials
03.21 rda toolkit essentialsjhennelly
 
RDA Toolkit Essentials - 06.18.2014
RDA Toolkit Essentials - 06.18.2014RDA Toolkit Essentials - 06.18.2014
RDA Toolkit Essentials - 06.18.2014jhennelly
 
RDA Toolkit Essentials webinar 03.19.14
RDA Toolkit Essentials webinar 03.19.14RDA Toolkit Essentials webinar 03.19.14
RDA Toolkit Essentials webinar 03.19.14jhennelly
 
05.16 rda toolkit essentials
05.16 rda toolkit essentials05.16 rda toolkit essentials
05.16 rda toolkit essentialsjhennelly
 
4. pentz orcid outreach_20121016
4. pentz orcid outreach_201210164. pentz orcid outreach_20121016
4. pentz orcid outreach_20121016ORCID, Inc
 
ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)
ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)
ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)Crossref
 
2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...
2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...
2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...Crossref
 
Ed Pentz: Executive Summary #crossref15
Ed Pentz: Executive Summary #crossref15Ed Pentz: Executive Summary #crossref15
Ed Pentz: Executive Summary #crossref15Crossref
 

What's hot (20)

Ontology Web services for Semantic Applications
Ontology Web services for Semantic ApplicationsOntology Web services for Semantic Applications
Ontology Web services for Semantic Applications
 
RDA Toolkit Essentials 2015-09-24
RDA Toolkit Essentials 2015-09-24RDA Toolkit Essentials 2015-09-24
RDA Toolkit Essentials 2015-09-24
 
RDA Toolkit Essentials 2015-06-11
RDA Toolkit Essentials 2015-06-11RDA Toolkit Essentials 2015-06-11
RDA Toolkit Essentials 2015-06-11
 
RDA Toolkit Essentials 2014-12-17
RDA Toolkit Essentials 2014-12-17RDA Toolkit Essentials 2014-12-17
RDA Toolkit Essentials 2014-12-17
 
Sharepoint taxonomy introduction us
Sharepoint taxonomy introduction   usSharepoint taxonomy introduction   us
Sharepoint taxonomy introduction us
 
RDA Toolkit Essentials 09.17.2014
RDA Toolkit Essentials 09.17.2014RDA Toolkit Essentials 09.17.2014
RDA Toolkit Essentials 09.17.2014
 
RDA Toolkit Essentials 2015-03-18
RDA Toolkit Essentials 2015-03-18RDA Toolkit Essentials 2015-03-18
RDA Toolkit Essentials 2015-03-18
 
01 18 rda toolkit essentials v6
01 18 rda toolkit essentials v601 18 rda toolkit essentials v6
01 18 rda toolkit essentials v6
 
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti... NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 
Open Data Management for Public Automated Translation
Open Data Management for Public Automated TranslationOpen Data Management for Public Automated Translation
Open Data Management for Public Automated Translation
 
03.21 rda toolkit essentials
03.21 rda toolkit essentials03.21 rda toolkit essentials
03.21 rda toolkit essentials
 
Thompson 6-jun15-final
Thompson 6-jun15-finalThompson 6-jun15-final
Thompson 6-jun15-final
 
RDA Toolkit Essentials - 06.18.2014
RDA Toolkit Essentials - 06.18.2014RDA Toolkit Essentials - 06.18.2014
RDA Toolkit Essentials - 06.18.2014
 
RDA Toolkit Essentials webinar 03.19.14
RDA Toolkit Essentials webinar 03.19.14RDA Toolkit Essentials webinar 03.19.14
RDA Toolkit Essentials webinar 03.19.14
 
05.16 rda toolkit essentials
05.16 rda toolkit essentials05.16 rda toolkit essentials
05.16 rda toolkit essentials
 
4. pentz orcid outreach_20121016
4. pentz orcid outreach_201210164. pentz orcid outreach_20121016
4. pentz orcid outreach_20121016
 
ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)
ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)
ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)
 
2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...
2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...
2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...
 
Xml
XmlXml
Xml
 
Ed Pentz: Executive Summary #crossref15
Ed Pentz: Executive Summary #crossref15Ed Pentz: Executive Summary #crossref15
Ed Pentz: Executive Summary #crossref15
 

Similar to An introduction to Metadata Application Profiles

Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) robin fay
 
DXWG Profiles Guidance & Vocabulary
DXWG Profiles Guidance & VocabularyDXWG Profiles Guidance & Vocabulary
DXWG Profiles Guidance & Vocabularynjcar
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchChimezie Ogbuji
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentPeter Haase
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Gautier Poupeau
 
Structural Metadata in RDF (IS575)
Structural Metadata in RDF (IS575)Structural Metadata in RDF (IS575)
Structural Metadata in RDF (IS575)Robert Sanderson
 
Web 3 final(1)
Web 3 final(1)Web 3 final(1)
Web 3 final(1)Venky Dood
 
DataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE
 
NHSPUG June 2015 - Must Love Term Sets: The New and Improved Managed Metadat...
NHSPUG June 2015  - Must Love Term Sets: The New and Improved Managed Metadat...NHSPUG June 2015  - Must Love Term Sets: The New and Improved Managed Metadat...
NHSPUG June 2015 - Must Love Term Sets: The New and Improved Managed Metadat...Jonathan Ralton
 
Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com
Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com
Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com Lucidworks
 
Vectors in Search - Towards More Semantic Matching
Vectors in Search - Towards More Semantic MatchingVectors in Search - Towards More Semantic Matching
Vectors in Search - Towards More Semantic MatchingSimon Hughes
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudOntotext
 
Delivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphsDelivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphsBen Gardner
 
Haystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon HughesHaystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon HughesOpenSource Connections
 
Searching with vectors
Searching with vectorsSearching with vectors
Searching with vectorsSimon Hughes
 

Similar to An introduction to Metadata Application Profiles (20)

Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries)
 
DXWG Profiles Guidance & Vocabulary
DXWG Profiles Guidance & VocabularyDXWG Profiles Guidance & Vocabulary
DXWG Profiles Guidance & Vocabulary
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes research
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 
A theory of Metadata enriching & filtering
A theory of  Metadata enriching & filteringA theory of  Metadata enriching & filtering
A theory of Metadata enriching & filtering
 
Metadata
MetadataMetadata
Metadata
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...
 
Structural Metadata in RDF (IS575)
Structural Metadata in RDF (IS575)Structural Metadata in RDF (IS575)
Structural Metadata in RDF (IS575)
 
Web 3 final(1)
Web 3 final(1)Web 3 final(1)
Web 3 final(1)
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
 
DataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE Education Module 07: Metadata
DataONE Education Module 07: Metadata
 
Memorix and SHACL
Memorix and SHACLMemorix and SHACL
Memorix and SHACL
 
NHSPUG June 2015 - Must Love Term Sets: The New and Improved Managed Metadat...
NHSPUG June 2015  - Must Love Term Sets: The New and Improved Managed Metadat...NHSPUG June 2015  - Must Love Term Sets: The New and Improved Managed Metadat...
NHSPUG June 2015 - Must Love Term Sets: The New and Improved Managed Metadat...
 
Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com
Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com
Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com
 
Vectors in Search - Towards More Semantic Matching
Vectors in Search - Towards More Semantic MatchingVectors in Search - Towards More Semantic Matching
Vectors in Search - Towards More Semantic Matching
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
 
Delivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphsDelivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphs
 
Haystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon HughesHaystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon Hughes
 
Searching with vectors
Searching with vectorsSearching with vectors
Searching with vectors
 
L07 metadata
L07 metadataL07 metadata
L07 metadata
 

Recently uploaded

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 

Recently uploaded (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

An introduction to Metadata Application Profiles

  • 5. What are application profiles? • Record your institution or project's choices • Form a basis for developing a consensus around your own data • Express specific practices, rules • Tell data consumers what to expect
  • 6. Why do we need them? • How can someone else understand your data well enough to make use of it? • Not unlike open source problem: you can declare your code ‘open’ and wish people ‘good luck’ or you can provide support.
  • 7. Who needs them? • Creators: anyone providing data • Users • anyone who can/is allowed to access the data • both people AND machines - not an either/or, but should be both
  • 8. What are they? • Basic structure of the data • the story that the data tells; what you are trying to say • what are the things? how are they described? • What are the properties and the rules for property use? • What are the values?
  • 9. How are they? • What will a profile be? How can it be implemented? • Documents (PDF) • Spreadsheets • Code (RDF, JSON, XML)
  • 10. What does an application profile look like?
  • 11.
  • 12.
  • 13.
  • 14.
  • 16. Dublin Core Singapore Framework for Application Profiles (2007)
  • 17.
  • 18.
  • 19.
  • 20.
  • 23.
  • 24. Functional requirements • Before developing any solutions, define problems • Decide which problems you can solve • State the requirements for success
  • 25.
  • 26. Vocabularies • Profiles reuse vocabularies • Profiles can select from a single vocabulary • Profiles can extend a vocabulary • Profiles can combine vocabularies
  • 27.
  • 28.
  • 29. Term reuse & semantics • Reuse can narrow semantics but should never contradict how the term is defined at its origin • Terms with strict definitions (e.g. OWL constraints, limits on valid values, disjoint with other terms) are the hardest to reuse • Base vocabularies are best if they employ minimum semantic commitment
  • 30.
  • 31. Components of a profile • Vocabulary • Definitions • Usage rules • Cardinality of terms and values • Examples • Validation rules This is not a full list!
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38. Validation rules • Can have foaf:name or (foaf:foreName + foaf:familyName) • dct:date cannot be > 2020 • Subjects must be from http://id.loc.gov/authorities/subjects/
  • 39. Validation • Non-RDF (e.g. XML schema) • SHACL – W3C recommendation (SHApes Constraint Language) • https://www.w3.org/TR/shacl/ • ShEx – W3C community group (Shape Expressions) • http://shex.io/
  • 40. Validation • Non-RDF (e.g. XML schema) • SHACL – W3C recommendation (SHApes Constraint Language) • https://www.w3.org/TR/shacl/ • ShEx – W3C community group (Shape Expressions) • http://shex.io/
  • 41. Validation • Non-RDF (e.g. XML schema) • SHACL – W3C recommendation (SHApes Constraint Language) • https://www.w3.org/TR/shacl/ • ShEx – W3C community group (Shape Expressions) • http://shex.io/ my:IssueShape { ex:state [ex:unassigned ex:assigned]; }
  • 42. Not everything can be validated • "Recommended" "Mandatory if applicable" • Names, resource titles, other string-based data
  • 44. Profile maintenance • Who maintains the profile? • How will new terms be added? • What can be changed? • How can the profile be extended?
  • 45. What we need so that we can (easily) create profiles
  • 46. Some profile-related efforts • Dublin Core (since the late 1990's) based on Singapore Framework • http://dublincore.org/documents/singapore-framework/ • http://dublincore.org/documents/profile-guidelines/ • DXWG – Data eXchange Working Group, W3C, application profile guidance (2017, due 2019) • https://www.w3.org/2017/dxwg/wiki/Main_Page
  • 47. Standard profile language(s) • Core for the simplest needs, or for getting started • shows domain model • lists vocabulary terms • can express basic rules for vocabulary members, especially cardinality & values • documentation for human readers
  • 48. Generic domain model - DC Profile Resource Property Value "things" "terms or elements" "data"
  • 49. MyBookCase Profile: MyBookCase Resource: Book Resource: Person http://dublincore.org/documents/profile-guidelines/
  • 50. MyBookCase Profile: MyBookCase Resource: Book Property: title Property: author Property: size Resource: Person Property: name
  • 51. MyBookCase Profile: MyBookCase Resource: Book Property: title min:1, max:1 Property: author min:0, max:3 Property: size min:1, max:1 Resource: Person Property: name
  • 52. MyBookCase Profile: MyBookCase Resource: Book Property: title min:1, max:1 value type: literal Property: author min:0, max:3 value type: IRI Property: size min:1, max:1 value type: integer Resource: Person Property: name
  • 53.
  • 55. Can we make validation "easy"? • Valid properties ✔ • Valid values ✔ • Value types • Value lists (text or URIs) • Conditional rules  • If A not B • A or (B & C)
  • 56. Validation – bridging the gap • Profile may need validation pseudo-code • Pseudo-code -> validation standard (SHACL, ShEx)? • What to do with non-actionable statements of validation (“mandatory if applicable”)?
  • 57. Summary: Functions of a profile • Consensus-building • Documentation • Input/output control • Validation (input and output and sharing)

Editor's Notes

  1. We' e all gotten the message about data silos and how bad they are. They prevent data sharing, they keep us from taking advantage of the work of others.
  2. We think of data silos as being the result of using different standards or different data formats.
  3. But just because we use the same basic standard, it doesn't mean that we are producing identical data. As anyone who has tried to consume data from another institution or project knows, there are always local variations – variations in which fields or terms are used, variations in how the data has been recorded. Even when we think we are not creating data in a silo, in many cases we are. It's difficult to avoid have your data be your data. I'm going to talk about how application profiles can help with data sharing and data reuse.
  4. They are many things
  5. Rather like the schema.org focus in terms of data providers = anyone with data on the web. These folks are not always coders, and may have experience limited to a only a few technologies. Any solution has to fit into their toolbox; we can’t require them to re-tool to use this solution.
  6. Today there is no standard format for application profiles
  7. Profiles exist today, and take many forms. This is an example of a profile based on the Dataset Catalog (DCAT) vocabulary. There a number of different application profiles based on this vocabulary, which is was developed to support government open data in the European Union, although it also used elsewhere. Many countries have created application profiles that are specific to their country because they each had some specific needs. Those choices are recorded in the application profiles which generally take the form of documents.
  8. BIBFRAME has a number of profiles.
  9. This is a profile based on the BIBFRAME vocabulary.
  10. There is information about each term, including what type of value is expect (whether it will be a URI for a thing or will be text).
  11. Dublin Core has had the ide aof application profiles since the late 1990s. This fits in well with the fact that DC is intended as a highly reusable vocabulary that can fit many different situations and needs.
  12. First in 2007. In keeping with the use of the place name to name things, this was first presented in Singapore.
  13. It's the picture that you have in your head that tells you what aspect of the world your data covers, what are the "parts" or entities that make up your description of that world, and how the parts fit together. Domain models can vary in their complexity and detail, and may have different levels of detail depending on the view that is needed at some moment in time.
  14. Profiles generally are a reuse of vocabularies. Like BIBFRAME, the profile can be a selection from a single vocabulary. In this case the profile's message is: I am a profile of BIBFRAME. I use some but not all of the BIBFRAME-defined vocabulary. Profiles can make use of all or part of a vocabulary, but can add terms that they need but that are missing from the base vocabulary. This is the case with the DCAT profiles: they are based on the DCAT vocabulary, but in some cases the users of DCAT needed to add some information that was not covered by that vocabulary. If you have worked with the Europeana Data Model you may be familiar with profiles that have some Europeana elements but that also add their own terms or equivalents. The DCAT profiles have much overlap between them but each one has information that is not included in DCAT. But profiles can also be not related to any single vocabulary. They can be a mix and match that essentially creates a new vocabulary. Oftentimes these vocabularies are not treated as profiles, and the line between a new vocabulary made up of existing terms and a profile is not distinct. We can say, however, that a profile does not have to be primarily based on any one vocabulary. Even Europeana and DCAT vocabularies make use of terms from Dublin Core,
  15. bibframe is an example of a profile that is a selection from a single vocabulary.
  16. DCAT uses terms from its own vocabulary, from dublin core terms and foaf, among others.
  17. You have to pay attention to how a term is defined before you reuse it.
  18. When you have all of this together, then it is time to create your profile.
  19. Examples
  20. The problem is the validation code usually is pretty complex. This is an extremely simple example that say that my property "status" can be one of two things: "assigned" or "unassigned". So you an imagine how much code it takes to say something much more complicated. It isn't reasonable to assume that everyone who creates metadata is capable of writing the needed validation code.
  21. There are a lot of things in data that cannot be validated, especially for those in the cultural heritage area where much of there metadata consists of text, and is based on decisions made by human beings, not on calculations. So although validation is important, validation alone won't describe a profile.
  22. These are community decisions. If you do not include in your development of profiles the means to maintain and evolve the profiles, within a short time they cease being living solutions to your metadata needs. https://pro.europeana.eu/project/creation-and-governance-of-edm-mappings-profiles-and-extensions-task-force http://makxdekkers.com/DXWG/DCAT-AP.pdf
  23. Just to mention here that I am co-chairing the W3C group representing DC. That group will be creating a kind of "best practices" document but nothing so specific as code. The Dublin Core work is more detailed, but has not yet yielded a usable schema for profiles. Hopefully that is in progress.
  24. This looks like an entity-relation diagram at this point. But more is needed.
  25. This looks like an entity-relation diagram at this point. But more is needed.
  26. In a sense, an application profile for an application profile. Not yet "finished", but may demonstrate that we can use something as simple as a spreadsheet to allow people to easily create application profiles that can be converted to a form that would allow for ingest and validation. All without the metadata schema creator having to write code. Note in particular that very few of the elements here are required. (Those with an initial zero in the third column are optional.) An application profile could be simply a list of terms that are used for one or more resources, with a resource being a document, a person, a place, a subject. Whatever you want it to be.
  27. Spreadsheets – which then become CSV files (comma separated files).
  28. It would be ideal to be able to express all of the rules that would be necessary to validate your data. That can get to be quite complex.
  29. Can we include in this simple view of profiles an easy way to include validation rules, or at least the most basic validation rules? That would go a long way to aiding interoperability of datasets.