Semantics and the Humanities
some lessons from my journey 2000-2012
Guus Schreiber
VU University Amsterdam
The Web:
resources and links
URL URL
Web link
The Semantic Web:
typed resources and links
URL URL
Web link
ULAN
Henri Matisse
Dublin Core
creator
Painting
“Woman with hat”
SFMOMA
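The typed link on this slide can be read as a subject-predicate-object statement. A minimal sketch in plain Python, assuming placeholder identifiers: the example.org URIs are hypothetical and are not real ULAN or SFMOMA identifiers; only the Dublin Core and RDF property URIs are real.

```python
# Real property URIs (Dublin Core "creator", RDF "type").
DC_CREATOR = "http://purl.org/dc/elements/1.1/creator"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Hypothetical placeholder URIs for the resources on the slide.
PAINTING = "http://example.org/sfmoma/woman-with-hat"
MATISSE = "http://example.org/ulan/henri-matisse"
PAINTING_CLASS = "http://example.org/vocab/Painting"

# Each triple is (subject, predicate, object): a typed link between two resources.
triples = {
    (PAINTING, RDF_TYPE, PAINTING_CLASS),
    (PAINTING, DC_CREATOR, MATISSE),
}

def objects(subject, predicate):
    """Return all resources linked from `subject` by a link of type `predicate`."""
    return {o for s, p, o in triples if s == subject and p == predicate}

creators = objects(PAINTING, DC_CREATOR)
```

Because the link carries a type, a program can ask for the creator of the painting rather than for "anything this page links to".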
2000
photo annotation study
Ape use case
• A person searches for photos
of an “orange ape”
• An image collection of animal
photographs contains
snapshots of orang-utans.
• The search engine finds the
photos, despite the fact that
the words “orange” and “ape”
do not appear in annotations
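The use case above can be sketched as a search that expands each annotation with terms from a background vocabulary. This is an illustrative toy under stated assumptions, not the system used in the study: the vocabulary entries, the photo names and the helper functions are all hypothetical.

```python
# Hypothetical toy vocabulary: each concept lists its broader concepts and
# the terms (labels and attributes) associated with it.
VOCAB = {
    "orang-utan": {"broader": ["great ape"], "terms": {"orang-utan", "orange"}},
    "great ape": {"broader": ["ape"], "terms": {"great ape"}},
    "ape": {"broader": [], "terms": {"ape"}},
    "zebra": {"broader": [], "terms": {"zebra", "striped"}},
}

# Hypothetical annotations: each photo is annotated with a single concept.
PHOTOS = {"photo1.jpg": "orang-utan", "photo2.jpg": "zebra"}

def expanded_terms(concept):
    """Collect the terms of a concept and of all its broader concepts."""
    terms, seen, todo = set(), set(), [concept]
    while todo:
        current = todo.pop()
        if current in seen:
            continue
        seen.add(current)
        terms |= VOCAB[current]["terms"]
        todo.extend(VOCAB[current]["broader"])
    return terms

def search(query):
    """Return photos whose expanded annotation covers every query word."""
    words = query.lower().split()
    return sorted(
        photo for photo, concept in PHOTOS.items()
        if all(word in expanded_terms(concept) for word in words)
    )
```

Here `search("orange ape")` finds the orang-utan photo even though neither query word occurs in its annotation.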
Annotation with vocabulary (Iconclass)
Qualitative comparison with
image search engines
• AltaVista image finder:
– “great + ape”: 13 hits, 6 of which are images of
apes
– “great ape”: 45,000 hits. Precision 32% (estimate)
• Gettyone.Com
– Uses some partial hierarchies
• Good results on “ape”
• No results on “great ape”
– Poor results on ape features
• Result with “ape scratching”
• No result with “ape AND hand AND head”
– Knowledge of indexing scheme is necessary for
successful complex queries
2003
Annotating paintings
with help of multiple vocabularies
Image content issues
                           Conceptual level
Scope   Characteristic     General  Specific  Abstract  Total
Object  event                7.4%     0.1%      0.7%     8.3%
        place                1.9%     0.7%      -        2.6%
        time                 0.2%     -         0.2%     0.4%
        relations            1.7%     -         0.1%     1.8%
        uncharacterized     40.1%    14.2%      2.8%    57.1%
        Subtotal            51.5%    15.0%      3.8%    70.3%
Scene   event                4.7%     -         0.3%     5.0%
        place                7.7%     0.9%      1.2%     9.9%
        time                 4.3%     0.3%      1.1%     5.8%
        relations            0.3%     -         -        0.3%
        uncharacterized      5.8%     0.1%      2.8%     8.8%
        Subtotal            22.9%     1.4%      5.4%    29.7%
Total                       74.4%    16.4%      9.2%   100.0%
Annotation and search architecture
Knowledge
corpora
AAT
ICONCLASS
WordNet
Annotation
Template
VRA 3.0
Scene descriptors
Annotation
& search tool
RDF
Schema
RDF image
annotations
Semantic search attempts
Knowledge representation builds on a
long tradition of vocabulary construction
The myth of a unified vocabulary
• In large virtual collections there are always multiple
vocabularies
– In multiple languages
• Every vocabulary has its own perspective
– You can’t just merge them
• But you can use vocabularies jointly by defining a limited
set of links
– “Vocabulary alignment”
• It is surprising what you can do with just a few links
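A minimal sketch of how a single alignment link lets two vocabularies be used jointly without merging them. The two mini-vocabularies, their concept identifiers and the one link between them are hypothetical examples chosen for illustration.

```python
# Hypothetical mini-vocabularies, each with its own "broader" hierarchy.
BROADER_EN = {"en:windsor-chair": "en:chair", "en:chair": "en:furniture"}
BROADER_NL = {"nl:keukenstoel": "nl:stoel", "nl:stoel": "nl:meubel"}
BROADER = {**BROADER_EN, **BROADER_NL}

# A single alignment link between the two vocabularies.
ALIGNMENT = {("en:chair", "nl:stoel")}

def equivalents(concept):
    """Return the concept plus any concept it is aligned with."""
    result = {concept}
    for a, b in ALIGNMENT:
        if a == concept:
            result.add(b)
        if b == concept:
            result.add(a)
    return result

def ancestors(concept):
    """All concepts reachable via broader links and alignment links."""
    seen, todo = set(), [concept]
    while todo:
        current = todo.pop()
        if current in seen:
            continue
        seen.add(current)
        if current in BROADER:
            todo.append(BROADER[current])
        todo.extend(equivalents(current) - {current})
    return seen

def matches(annotation, query):
    """True if an object annotated with `annotation` answers a query for `query`."""
    return query in ancestors(annotation)
```

With only one link, an object annotated with the Dutch concept `nl:keukenstoel` is found by a query for the English concept `en:furniture`, while both hierarchies stay intact.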
Ontological commitment
• Ontologists tend to be "trigger-happy"
– i.e. define as many axioms as possible
• Over-commitment makes ontologies less
usable
– The art of being minimal
– “In der Beschränkung zeigt sich der Meister” (“It is in limitation that the master shows himself”)
• Design criterion SKOS: minimal commitment
WordNet
2006
Culture Web
Natural-lang proc.
automatic annotation
text strings → concepts
Distributed
cultuurwijzer.nl collections
OAI-based access
Reasoning support
time/space reasoning
Web interface
support for web collections
Presentation facilities
semantic presentation
device-specific
Interoperability
XML/RDF/OWL
Scalability
> 10,000,000 triples
Ontologies
WordNet, AAT, TGN
ULAN, Dutch labels
Search strategies
sibling search
semantic distance
Dublin Core
specializations
dumb-down
semantic annotation
DIGITAL HERITAGE
COLLECTIONS
semantic search
BASELINE
ENHANCED FEATURES
NEW FEATURES
Semantic annotation
Relatively easy for
people and places
Term disambiguation is key issue in semantic
search
• Pre-query
– Ask user to disambiguate by displaying list of
possible meanings
– Interface is more complex, but more search
functionality can be offered
• Post-query
– Sort search results based on different meanings of
the search term
– Mimics Google-type search
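The two disambiguation strategies can be contrasted in a short sketch. The index below is a hypothetical stand-in: the term, its meanings and the work identifiers are invented for illustration.

```python
# Hypothetical index: the concepts a search term may denote,
# and the works annotated with each concept.
MEANINGS = {"venus": ["deity:Venus", "planet:Venus"]}
ANNOTATED = {
    "deity:Venus": ["work:12", "work:40"],
    "planet:Venus": ["work:7"],
}

def pre_query_options(term):
    """Pre-query: list the possible meanings so the user can pick one first."""
    return MEANINGS.get(term.lower(), [])

def post_query_results(term):
    """Post-query: search on every meaning and group the results per meaning."""
    return {meaning: ANNOTATED.get(meaning, []) for meaning in pre_query_options(term)}
```

In the pre-query variant the interface shows `pre_query_options("Venus")` before searching; in the post-query variant the user types the term as usual and receives the grouped results directly.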
Semantic search: result clustering based on
retrieval path
Keyword search with semantic clustering
1. B-tree of literals plus Porter stem and
metaphone index
2. Find resources with matching labels
• Default resources are “Work”s
3. Find related resources by one-way graph
traversal
• OWL use: inverse, symmetric,
transitive, sameAs
• Threshold used for constraining search
4. Cluster results (group instances)
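Steps 2 to 4 above can be sketched roughly as follows. This is a simplified illustration, not the actual implementation: the toy graph, the `studentOf` relation and the second person are hypothetical, label matching is a plain substring test standing in for the stem and metaphone index, and OWL reasoning is left out.

```python
from collections import defaultdict

# Hypothetical toy graph of (subject, predicate, object) triples.
TRIPLES = [
    ("work:1", "creator", "person:matisse"),
    ("work:2", "depicts", "person:matisse"),
    ("work:3", "creator", "person:x"),
    ("person:x", "studentOf", "person:matisse"),
]
LABELS = {"person:matisse": "Henri Matisse", "person:x": "Hypothetical Pupil"}
WORKS = {"work:1", "work:2", "work:3"}

def match_labels(keyword):
    """Step 2: resources whose label contains the keyword (case-insensitive)."""
    needle = keyword.lower()
    return [res for res, label in LABELS.items() if needle in label.lower()]

def search(keyword, threshold=2):
    """Steps 3-4: walk edges one way (object to subject) for at most
    `threshold` steps and cluster the works reached by predicate path."""
    clusters = defaultdict(set)
    frontier = [(res, ()) for res in match_labels(keyword)]
    for _ in range(threshold):
        next_frontier = []
        for node, path in frontier:
            for s, p, o in TRIPLES:
                if o != node:
                    continue
                new_path = path + (p,)
                if s in WORKS:
                    clusters[new_path].add(s)  # reached a Work: record it
                else:
                    next_frontier.append((s, new_path))  # keep walking
        frontier = next_frontier
    return dict(clusters)
```

Each key of the result is a retrieval path, so works found via `creator` are shown separately from works found via `depicts` or via a longer path; lowering `threshold` prunes the longer paths.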
Distributed vs. centralized collection data
• In the beginning we wanted to do it all
distributed
– accessed through protocol such as OAI
• In practice, external metadata access is
cumbersome
– Centralized (cached) metadata
– Distributed data (images, ..)
Mobile museum tour
Supporting annotation: automatically deriving
spatial relations
Object1
left
Object2
?
Other search paradigms:
relation search
2009
Europeana
• “Europeana enables people to explore the digital
resources of Europe's museums, libraries, archives and
audio-visual collections.”
www.europeana.eu
From portal… …to data aggregator.
Europeana Data Model (EDM)
requirements
1. Distinction between “provided object” (painting, book,
program) and digital representation
2. Distinction between object and metadata record
describing an object
3. Allow for multiple records for same object, containing
potentially contradictory statements about an object
4. No information loss
5. Standard metadata format that can be specialized
6. Standard vocabulary format that can be specialized
7. EDM should be based on existing standards
“not yet another standard”!
EDM basics
• OAI ORE for organization of metadata about
an object
– Requirements 1-4
• Dublin Core for metadata representation
– Requirement 5
• SKOS for vocabulary representation
– Requirement 6
OAI ORE, Dublin Core and SKOS together fulfil
Requirement 7!
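A rough sketch of how requirements 1 to 3 play out in data. The class and property names (`edm:ProvidedCHO`, `ore:Aggregation`, `ore:proxyFor`, `ore:proxyIn`, `edm:aggregatedCHO`, `edm:isShownBy`) come from EDM and OAI ORE; every `ex:` identifier and both date values are hypothetical, and prefixed names are kept as plain strings rather than parsed RDF.

```python
# (subject, predicate, object) statements; all ex: names and dates are hypothetical.
triples = [
    ("ex:painting", "rdf:type", "edm:ProvidedCHO"),
    # Provider A: its own aggregation, digital representation and record (proxy).
    ("ex:aggA", "rdf:type", "ore:Aggregation"),
    ("ex:aggA", "edm:aggregatedCHO", "ex:painting"),
    ("ex:aggA", "edm:isShownBy", "ex:imageA.jpg"),
    ("ex:proxyA", "ore:proxyFor", "ex:painting"),
    ("ex:proxyA", "ore:proxyIn", "ex:aggA"),
    ("ex:proxyA", "dc:date", "1905"),
    # Provider B: a second record about the same object, with a different date.
    ("ex:aggB", "rdf:type", "ore:Aggregation"),
    ("ex:aggB", "edm:aggregatedCHO", "ex:painting"),
    ("ex:proxyB", "ore:proxyFor", "ex:painting"),
    ("ex:proxyB", "ore:proxyIn", "ex:aggB"),
    ("ex:proxyB", "dc:date", "1906"),
]

def statements_about(cho, prop):
    """Collect each provider's values for `prop`, keyed by aggregation."""
    result = {}
    for proxy, p, o in triples:
        if p == "ore:proxyFor" and o == cho:
            agg = next(o2 for s2, p2, o2 in triples
                       if s2 == proxy and p2 == "ore:proxyIn")
            result[agg] = [o3 for s3, p3, o3 in triples
                           if s3 == proxy and p3 == prop]
    return result

dates = statements_about("ex:painting", "dc:date")
```

The two records disagree on the date, and both statements survive side by side because each is attached to a provider's proxy rather than to the object itself.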
Data conversion process
XML
DB-dumps
OAI
Servers
Legacy formats
Initial RDF
EDM in RDF
Generic
conversion
Legacy
conversion
Graph
Rewrite
Provenance
Graph
• All steps (nodes) are visualized live in a strategy/
provenance graph
• Nodes include
• input vocabularies
• lexical matchers
• (arity) selectors
• set operations (subtraction, union,…)
• mapping sets (intermediary and final)
• evaluation processes
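A small sketch of three of the node types listed above chained together: a lexical matcher, an arity selector and set operations. The two input vocabularies and the function names are hypothetical, and the matcher is a deliberately naive case-insensitive label comparison.

```python
from collections import Counter

def lexical_match(source, target):
    """Lexical matcher: pair concepts whose labels are equal, ignoring case."""
    return {(s, t)
            for s, s_label in source.items()
            for t, t_label in target.items()
            if s_label.lower() == t_label.lower()}

def select_one_to_one(mappings):
    """Arity selector: keep mappings whose source and target occur only once."""
    source_count = Counter(s for s, _ in mappings)
    target_count = Counter(t for _, t in mappings)
    return {(s, t) for s, t in mappings
            if source_count[s] == 1 and target_count[t] == 1}

# Hypothetical input vocabularies: concept identifier -> label.
SOURCE = {"a:1": "Chair", "a:2": "Table", "a:3": "Bank"}
TARGET = {"b:1": "chair", "b:2": "bank", "b:3": "Bank"}

candidates = lexical_match(SOURCE, TARGET)            # intermediary mapping set
final = select_one_to_one(candidates)                 # final mapping set
ambiguous = candidates - final                        # set subtraction
unmapped = set(SOURCE) - {s for s, _ in candidates}   # concepts still to align
```

Keeping `candidates`, `ambiguous` and `unmapped` as separate sets mirrors the idea that intermediary results are nodes in their own right and can be inspected or fed to a further matcher.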
Problems in alignment
• We have not agreed on an adequate alignment
vocabulary
– misuse of owl:sameAs
• We have no adequate methodology for
evaluating alignments (Tordai et al., 2011)
– How to handle lack of gold standard
– Inter-observer agreement metrics
• In particular, people do not agree on how
different classes align
– and this is not because they don’t do it “right”
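One commonly used inter-observer agreement metric is Cohen's kappa, which corrects raw agreement for agreement expected by chance. The slide does not say which metric was used, so the sketch below is only an example of the kind of measure meant; the two rating lists are hypothetical.

```python
from collections import Counter

def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters who judged the same items.

    Raises ZeroDivisionError when chance agreement is already 1
    (both raters used one and the same category throughout).
    """
    n = len(ratings_a)
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    categories = count_a.keys() | count_b.keys()
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    return (observed - expected) / (1 - expected)

# Hypothetical judgements by two raters on four candidate mappings.
rater_1 = ["exact", "exact", "broader", "related"]
rater_2 = ["exact", "broader", "broader", "related"]
kappa = cohen_kappa(rater_1, rater_2)
```

In this toy case the raters agree on three of four mappings, yet kappa is noticeably lower than 0.75 because part of that agreement is expected by chance.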
Limitations of categorical thinking
• The set theory on which ontology languages are
built is inadequate for modelling how people
think about categories (Lakoff)
– Category boundaries are not hard: cf. art styles
– People think of prototypes; some examples are very
prototypical, others less
• We also need to make meta-distinctions explicit
– organizing class: “furniture”
– base-level class: “chair”
– domain-specific: “Windsor chair”
Other technical issues
• Information retrieval as graph search
– more semantics => more paths
– finding optimal graph patterns
• Information extraction
– recognizing people, locations, …
– identity resolution
• Multi-lingual resources
Organizational issues
• Cultural heritage organizations find it
difficult to “give away” their data
– concerns for quality
• Re-orientation: Web is not derivative of
physical presence; they should stand side-
by-side
• Universal access: everyone should be
able to enjoy the Rijksmuseum Amsterdam
Economic issues
• Primary access free
• Secondary services cost money
– virtual museum shop can offer much larger
collection
– access to high-resolution images
– tourist services on mobile devices
2012
Winner EuroITV Competition
Best Archives on the Web Award
http://waisda.nl
Events
from piracy reports & Web sources
Demonstrator from Poseidon:
Visualising piracy events
Mockup Prototype (II)
BiographyNet: Linking the world of History
eScience internal review – Thursday, 18 September 2014
Links with other datasets
• Database with data about all Dutch ministers
in the period 1575-1815 (Fred van Lieburg)
• Data converted for interoperability with other
sources
• Geographical mobility computed with help of
original place names and Geonames data
(http://www.geonames.org)
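One way such a mobility figure could be computed is to look up coordinates for each place name and sum the great-circle distances between successive postings. The slides do not describe the actual method, so this is a sketch under assumptions: the small gazetteer holds approximate, hand-entered coordinates rather than values fetched from Geonames, and the career sequence is hypothetical.

```python
import math

# Approximate (latitude, longitude) pairs entered by hand for illustration;
# a real pipeline would resolve historical place names against Geonames.
GAZETTEER = {"Amsterdam": (52.37, 4.89), "Leiden": (52.16, 4.49)}

EARTH_RADIUS_KM = 6371.0

def haversine_km(a, b):
    """Great-circle distance in kilometres between two (lat, lon) points."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*a, *b))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))

def total_mobility_km(places):
    """Sum the distances between successive places in a career."""
    coords = [GAZETTEER[name] for name in places]
    return sum(haversine_km(a, b) for a, b in zip(coords, coords[1:]))

# Hypothetical career: three successive postings.
career_km = total_mobility_km(["Amsterdam", "Leiden", "Amsterdam"])
```

The hard part in practice is the lookup step, since original place names have to be matched to modern gazetteer entries before any distance can be computed.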
Mobility of Dutch ministers 1575-1815
Acknowledgements
• Many co-workers, from ICES-KIS, MultimediaN
E-Culture, CHOICE, CHIP, Agora, MuNCH,
BiographyNet, Europeana Connect,
PrestoPrime, Dutch Ships & Sailors

Enjoy ➥8448380779▻ Call Girls In Sector 62 Noida Escorts Delhi NCR
 
Italia Lucca 1 Un tesoro nascosto tra le sue mura
Italia Lucca 1 Un tesoro nascosto tra le sue muraItalia Lucca 1 Un tesoro nascosto tra le sue mura
Italia Lucca 1 Un tesoro nascosto tra le sue mura
 
Apply Indian E-Visa Process Online (Evisa)
Apply Indian E-Visa Process Online (Evisa)Apply Indian E-Visa Process Online (Evisa)
Apply Indian E-Visa Process Online (Evisa)
 
5S - House keeping (Seiri, Seiton, Seiso, Seiketsu, Shitsuke)
5S - House keeping (Seiri, Seiton, Seiso, Seiketsu, Shitsuke)5S - House keeping (Seiri, Seiton, Seiso, Seiketsu, Shitsuke)
5S - House keeping (Seiri, Seiton, Seiso, Seiketsu, Shitsuke)
 
Where to Stay in Lagos, Portugal.pptxasd
Where to Stay in Lagos, Portugal.pptxasdWhere to Stay in Lagos, Portugal.pptxasd
Where to Stay in Lagos, Portugal.pptxasd
 
Aeromexico Airlines Flight Name Change Policy
Aeromexico Airlines Flight Name Change PolicyAeromexico Airlines Flight Name Change Policy
Aeromexico Airlines Flight Name Change Policy
 
Hoi An Ancient Town, Vietnam (越南 會安古鎮).ppsx
Hoi An Ancient Town, Vietnam (越南 會安古鎮).ppsxHoi An Ancient Town, Vietnam (越南 會安古鎮).ppsx
Hoi An Ancient Town, Vietnam (越南 會安古鎮).ppsx
 
(8264348440) 🔝 Call Girls In Nand Nagri 🔝 Delhi NCR
(8264348440) 🔝 Call Girls In Nand Nagri 🔝 Delhi NCR(8264348440) 🔝 Call Girls In Nand Nagri 🔝 Delhi NCR
(8264348440) 🔝 Call Girls In Nand Nagri 🔝 Delhi NCR
 
Inspirational Quotes About Italy and Food
Inspirational Quotes About Italy and FoodInspirational Quotes About Italy and Food
Inspirational Quotes About Italy and Food
 
Moroccan Architecture presentation ( Omar & Yasine ).pptx
Moroccan Architecture presentation ( Omar & Yasine ).pptxMoroccan Architecture presentation ( Omar & Yasine ).pptx
Moroccan Architecture presentation ( Omar & Yasine ).pptx
 
"Fly with Ease: Booking Your Flights with Air Europa"
"Fly with Ease: Booking Your Flights with Air Europa""Fly with Ease: Booking Your Flights with Air Europa"
"Fly with Ease: Booking Your Flights with Air Europa"
 
8377087607 Full Enjoy @24/7 Call Girls in INA Market Dilli Hatt Delhi NCR
8377087607 Full Enjoy @24/7 Call Girls in INA Market Dilli Hatt Delhi NCR8377087607 Full Enjoy @24/7 Call Girls in INA Market Dilli Hatt Delhi NCR
8377087607 Full Enjoy @24/7 Call Girls in INA Market Dilli Hatt Delhi NCR
 
Exploring Sicily Your Comprehensive Ebook Travel Guide
Exploring Sicily Your Comprehensive Ebook Travel GuideExploring Sicily Your Comprehensive Ebook Travel Guide
Exploring Sicily Your Comprehensive Ebook Travel Guide
 
69 Girls ✠ 9599264170 ✠ Call Girls In East Of Kailash (VIP)
69 Girls ✠ 9599264170 ✠ Call Girls In East Of Kailash (VIP)69 Girls ✠ 9599264170 ✠ Call Girls In East Of Kailash (VIP)
69 Girls ✠ 9599264170 ✠ Call Girls In East Of Kailash (VIP)
 
Revolutionalizing Travel: A VacAI Update
Revolutionalizing Travel: A VacAI UpdateRevolutionalizing Travel: A VacAI Update
Revolutionalizing Travel: A VacAI Update
 
Haitian culture and stuff and places and food and travel.pptx
Haitian culture and stuff and places and food and travel.pptxHaitian culture and stuff and places and food and travel.pptx
Haitian culture and stuff and places and food and travel.pptx
 
question 2: airplane vocabulary presentation
question 2: airplane vocabulary presentationquestion 2: airplane vocabulary presentation
question 2: airplane vocabulary presentation
 
Enjoy ➥8448380779▻ Call Girls In Sector 74 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 74 Noida Escorts Delhi NCREnjoy ➥8448380779▻ Call Girls In Sector 74 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 74 Noida Escorts Delhi NCR
 
How Safe Is It To Witness Whales In Maui’s Waters
How Safe Is It To Witness Whales In Maui’s WatersHow Safe Is It To Witness Whales In Maui’s Waters
How Safe Is It To Witness Whales In Maui’s Waters
 

Semantics and the Humanities: some lessons from my journey 2000-2012

  • 1. Semantics and the Humanities: some lessons from my journey 2000-2012. Guus Schreiber, VU University Amsterdam
  • 2. The Web: resources and links URL URL Web link
  • 3. The Semantic Web: typed resources and links URL URL Web link ULAN Henri Matisse Dublin Core creator Painting “Woman with hat” SFMOMA
  • 5. Photo annotation study: ape use case • A person searches for photos of an “orange ape” • An image collection of animal photographs contains snapshots of orang-utans. • The search engine finds the photos, despite the fact that the words “orange” and “ape” do not appear in annotations
  • 7. Qualitative comparison with image search engines • AltaVista image finder: – “great + ape”: 13 hits, 6 of which are images of apes – “great ape”: 45,000 hits. Precision 32% (estimate) • Gettyone.Com – Uses some partial hierarchies • Good results on “ape” • No results on “great ape” – Poor results on ape features • Result with “ape scratching” • No result with “ape AND hand AND head” – Knowledge of indexing scheme is necessary for successful complex queries
  • 9. Annotating paintings with help of multiple vocabularies
  • 10. Image content issues (columns: conceptual level; rows: scope and characteristic)
        Scope   Characteristic    general  specific  abstract    Total
        Object  event                7.4%      0.1%      0.7%     8.3%
                place                1.9%      0.7%                2.6%
                time                 0.2%                0.2%     0.4%
                relations            1.7%                0.1%     1.8%
                uncharacterized     40.1%     14.2%      2.8%    57.1%
                Subtotal            51.5%     15.0%      3.8%    70.3%
        Scene   event                4.7%                0.3%     5.0%
                place                7.7%      0.9%      1.2%     9.9%
                time                 4.3%      0.3%      1.1%     5.8%
                relations            0.3%                          0.3%
                uncharacterized      5.8%      0.1%      2.8%     8.8%
                Subtotal            22.9%      1.4%      5.4%    29.7%
        Total                       74.4%     16.4%      9.2%   100.0%
  • 11. Annotation and search architecture Knowledge corpora AAT ICONCLASS WordNet Annotation Template VRA 3.0 Scene descriptors Annotation & search tool RDF Schema RDF image annotations
  • 13. Knowledge representation builds on a long tradition of vocabulary construction
  • 14. The myth of a unified vocabulary • In large virtual collections there are always multiple vocabularies – In multiple languages • Every vocabulary has its own perspective – You can’t just merge them • But you can use vocabularies jointly by defining a limited set of links – “Vocabulary alignment” • It is surprising what you can do with just a few links
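To make the "few links" point concrete, here is a minimal sketch of joint use of two vocabularies through a single alignment link. The vocabulary prefixes, concept names, photo names and the `expand`/`search` helpers are all hypothetical; the slides do not show an implementation.

```python
# Hypothetical mini-vocabularies: every term and link below is invented for illustration.
NARROWER = {
    "vocabA:ape": ["vocabA:great ape"],
    "vocabA:great ape": ["vocabA:orang-utan", "vocabA:gorilla"],
}

# A single alignment link between vocabulary A and vocabulary B.
EXACT_MATCH = {"vocabA:orang-utan": "vocabB:Pongo"}

# Annotations made with whichever vocabulary the collection happened to use.
ANNOTATIONS = {
    "photo1.jpg": {"vocabB:Pongo"},
    "photo2.jpg": {"vocabA:gorilla"},
    "photo3.jpg": {"vocabB:Felis"},
}


def expand(concept):
    """Return the concept, its narrower concepts, and any aligned counterparts."""
    seen = set()
    stack = [concept]
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.add(current)
        stack.extend(NARROWER.get(current, []))
        if current in EXACT_MATCH:
            stack.append(EXACT_MATCH[current])
    return seen


def search(concept):
    """Find photos annotated with any concept in the expanded set."""
    wanted = expand(concept)
    return sorted(photo for photo, tags in ANNOTATIONS.items() if tags & wanted)


print(search("vocabA:ape"))
```

The query for "ape" reaches the photo annotated only in vocabulary B because one link connects the two hierarchies; neither vocabulary had to be merged into the other.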
  • 15. Ontological commitment • Ontologists tend to be "trigger-happy" – i.e. define as many axioms as possible • Over-commitment makes ontologies less usable – The art of being minimal – “In der Beschränkung zeigt sich der Meister” • Design criterion SKOS: minimal commitment
  • 18. 2006
  • 20. Natural-lang proc. automatic annotation text strings → concepts Distributed cultuurwijzer.nl collections OAI-based access Reasoning support time/space reasoning Web interface support for web collections Presentation facilities semantic presentation device-specific Interoperability XML/RDF/OWL Scalability > 10,000,000 triples Ontologies WordNet, AAT, TGN ULAN, Dutch labels Search strategies sibling search semantic distance Dublin Core specializations dumb-down semantic annotation DIGITAL HERITAGE COLLECTIONS semantic search BASELINE ENHANCED FEATURES NEW FEATURES
  • 21. Semantic annotation: relatively easy for people and places
  • 23. Term disambiguation is key issue in semantic search • Pre-query – Ask user to disambiguate by displaying list of possible meanings – Interface is more complex, but more search functionality can be offered • Post-query – Sort search results based on different meanings of the search term – Mimics Google-type search
  • 24. Semantic search: result clustering based on retrieval path
  • 25. Keyword search with semantic clustering 1. Btree of literals plus Porter stem and metaphone index 2. Find resources with matching labels • Default resources are “Work”s 3. Find related resources by one-way graph traversal • OWL use: inverse, symmetric, transitive, same as • Threshold used for constraining search 4. Cluster results (group instances)
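The four steps can be sketched roughly as follows. This is an illustration under stated assumptions, not the system described in the slides: a plain lowercase token match stands in for the Btree, Porter stem and metaphone index, the step limit `max_steps` stands in for the threshold, and all resource names, properties and data are hypothetical.

```python
from collections import defaultdict, deque

# Hypothetical data: labels, types and outgoing links of a tiny graph.
LABELS = {
    "ex:matisse": "Henri Matisse",
    "ex:fauvism": "Fauvism",
    "ex:work1": "Woman with hat",
}
TYPES = {"ex:work1": "Work", "ex:work2": "Work"}
EDGES = {
    "ex:matisse": [("creatorOf", "ex:work1"), ("memberOf", "ex:fauvism")],
    "ex:fauvism": [("styleOf", "ex:work2")],
}


def find_by_label(keyword):
    """Steps 1-2: find resources whose label contains the keyword as a token."""
    kw = keyword.lower()
    return [r for r, label in LABELS.items() if kw in label.lower().split()]


def search(keyword, max_steps=2):
    """Steps 3-4: traverse outward from the matches and cluster Works by path."""
    clusters = defaultdict(list)
    for start in find_by_label(keyword):
        queue = deque([(start, ())])
        visited = {start}
        while queue:
            node, path = queue.popleft()
            if TYPES.get(node) == "Work":
                # The sequence of properties followed is the cluster key.
                clusters[path].append(node)
                continue
            if len(path) >= max_steps:
                continue  # threshold reached: stop expanding this path
            for prop, target in EDGES.get(node, []):
                if target not in visited:
                    visited.add(target)
                    queue.append((target, path + (prop,)))
    return dict(clusters)


print(search("matisse"))
```

Clustering by retrieval path means a user can see why each work was returned, for example "created by the matched person" versus "in a style the matched person belonged to".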
  • 26. Distributed vs. centralized collection data • In the beginning we wanted to do it all distributed – accessed through protocol such as OAI • In practice, external metadata access is cumbersome – Centralized (cached) metadata – Distributed data (images, ..)
  • 28. Supporting annotation: automatically deriving spatial relations Object1 left Object2
  • 30. 2009
  • 31. Europeana • “Europeana enables people to explore the digital resources of Europe's museums, libraries, archives and audio-visual collections.” www.europeana.eu From portal… …to data aggregator.
  • 33. Europeana Data Model (EDM) requirements 1. Distinction between “provided object” (painting, book, program) and digital representation 2. Distinction between object and metadata record describing an object. 3. Allow for multiple records for same object, containing potentially contradictory statements about an object 4. No information loss 5. Standard metadata format that can be specialized 6. Standard vocabulary format that can be specialized 7. EDM should be based on existing standards: “not yet another standard”!
  • 34. EDM basics • OAI ORE for organization of metadata about an object – Requirements 1-4 • Dublin Core for metadata representation – Requirement 5 • SKOS for vocabulary representation – Requirement 6 OAI ORE, Dublin Core and SKOS together fulfil Requirement 7!
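A small sketch of how requirements 1-3 might look as triples. The class and property names (`edm:ProvidedCHO`, `edm:WebResource`, `ore:Aggregation`, `ore:Proxy`-style `ore:proxyFor` and `ore:proxyIn`, `edm:aggregatedCHO`, `edm:isShownBy`) follow my understanding of the published EDM and OAI ORE vocabularies and are not taken from the slides; the `ex:` resources, the dates and the `statements_about` helper are hypothetical.

```python
# Triples as plain (subject, predicate, object) tuples; no RDF library needed.
TRIPLES = [
    # Requirement 1: the provided object and its digital representation are distinct.
    ("ex:painting", "rdf:type", "edm:ProvidedCHO"),
    ("ex:image.jpg", "rdf:type", "edm:WebResource"),
    # Provider A: aggregation plus a proxy carrying A's statements (requirement 2).
    ("ex:aggrA", "rdf:type", "ore:Aggregation"),
    ("ex:aggrA", "edm:aggregatedCHO", "ex:painting"),
    ("ex:aggrA", "edm:isShownBy", "ex:image.jpg"),
    ("ex:proxyA", "ore:proxyFor", "ex:painting"),
    ("ex:proxyA", "ore:proxyIn", "ex:aggrA"),
    ("ex:proxyA", "dc:date", "1905"),
    # Provider B: a second record for the same object (requirement 3).
    ("ex:aggrB", "rdf:type", "ore:Aggregation"),
    ("ex:aggrB", "edm:aggregatedCHO", "ex:painting"),
    ("ex:proxyB", "ore:proxyFor", "ex:painting"),
    ("ex:proxyB", "ore:proxyIn", "ex:aggrB"),
    ("ex:proxyB", "dc:date", "1906"),
]


def statements_about(obj, prop):
    """Collect the values each provider's record states for one property."""
    result = {}
    proxies = [s for s, p, o in TRIPLES if p == "ore:proxyFor" and o == obj]
    for proxy in proxies:
        aggregation = next(
            o for s, p, o in TRIPLES if s == proxy and p == "ore:proxyIn"
        )
        for s, p, o in TRIPLES:
            if s == proxy and p == prop:
                result.setdefault(aggregation, []).append(o)
    return result


print(statements_about("ex:painting", "dc:date"))
```

Because each statement hangs off a provider-specific proxy rather than the object itself, the two conflicting dates coexist without either being lost, which is the point of requirements 3 and 4.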
  • 35. Data conversion process XML DB-dumps OAI Servers Legacy formats Initial RDF EDM in RDF Generic conversion Legacy conversion Graph Rewrite
  • 37. Provenance Graph • All steps (nodes) live visualized in a strategy/ provenance graph • Nodes include • input vocabularies • lexical matchers • (arity) selectors • set operations (subtraction, union,…) • mapping sets (intermediary and final) • evaluation processes
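As a rough illustration of such a strategy, the sketch below chains a lexical matcher, an arity selector and a set subtraction, and records each step. The function names, the exact-label matching rule and the toy vocabularies are assumptions made for this example, not the tool shown in the slides.

```python
from collections import Counter

# Hypothetical input vocabularies: concept id -> preferred label.
SOURCE = {"a:1": "Chair", "a:2": "Bank", "a:3": "Table"}
TARGET = {"b:1": "chair", "b:2": "bank", "b:3": "BANK", "b:4": "desk"}


def lexical_matcher(source, target):
    """Propose a mapping wherever two labels are equal ignoring case."""
    index = {}
    for target_id, label in target.items():
        index.setdefault(label.lower(), []).append(target_id)
    return {
        (source_id, target_id)
        for source_id, label in source.items()
        for target_id in index.get(label.lower(), [])
    }


def one_to_one_selector(mappings):
    """Arity selector: keep mappings whose source and target occur only once."""
    source_count = Counter(s for s, _ in mappings)
    target_count = Counter(t for _, t in mappings)
    return {
        (s, t) for s, t in mappings if source_count[s] == 1 and target_count[t] == 1
    }


def run_strategy(source, target):
    """Run the steps in order and log (step name, mapping count) as provenance."""
    provenance = []
    candidates = lexical_matcher(source, target)
    provenance.append(("lexical matcher", len(candidates)))
    unambiguous = one_to_one_selector(candidates)
    provenance.append(("1-1 selector", len(unambiguous)))
    ambiguous = candidates - unambiguous  # set subtraction
    provenance.append(("subtraction", len(ambiguous)))
    return unambiguous, ambiguous, provenance


final, to_review, log = run_strategy(SOURCE, TARGET)
print(final, to_review, log)
```

The ambiguous remainder is kept as its own mapping set, so it can be fed to a further matcher or to manual evaluation instead of being silently dropped.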
  • 38. Problems in alignment • We have not agreed on an adequate alignment vocabulary – misuse of owl:sameAs • We have no adequate methodology for evaluating alignments (Tordai et al., 2011) – How to handle lack of gold standard – Inter-observer agreement metrics • In particular, people do not agree on how different classes align – and this is not because they don’t do it “right”
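The slides do not say which inter-observer agreement metric was used. As one common choice, Cohen's kappa for two raters can be sketched as below; the rating labels are hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters over the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must judge the same non-empty item list")
    n = len(rater_a)
    # Fraction of items on which the raters gave the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected if each rater labelled at random with their own label rates.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if expected == 1:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1 - expected)


# Two raters judging the same four candidate mappings (labels are invented).
rater_1 = ["exact", "exact", "broader", "none"]
rater_2 = ["exact", "broader", "broader", "none"]
print(cohens_kappa(rater_1, rater_2))
```

Here the raters agree on three of four mappings (0.75 raw agreement), but after correcting for chance the score drops to about 0.64, which illustrates why raw agreement alone overstates consensus.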
  • 39. Limitations of categorical thinking • The set theory on which ontology languages are built is inadequate for modelling how people think about categories (Lakoff) – Category boundaries are not hard: cf. art styles – People think of prototypes; some examples are very prototypical, others less • We also need to make meta-distinctions explicit – organizing class: “furniture” – base-level class: “chair” – domain-specific: “Windsor chair”
  • 40. Other technical issues • Information retrieval as graph search – more semantics => more paths – finding optimal graph patterns • Information extraction – recognizing people, locations, … – identity resolution • Multi-lingual resources
  • 41. Organizational issues • Cultural heritage organizations find it difficult to “give away” their data – concerns for quality • Re-orientation: Web is not derivative of physical presence; they should stand side by side • Universal access: everyone should be able to enjoy the Rijksmuseum Amsterdam
  • 42. Economic issues • Primary access free • Secondary services cost money – virtual museum shop can offer much larger collection – access to high-resolution images – tourist services on mobile devices
  • 43. 2012
  • 45. Winner EuroITV Competition, Best Archives on the Web Award http://waisda.nl
  • 46. Events from piracy reports & Web sources
  • 50. Mockup Prototype (II) BiographyNet: Linking the world of History eScience internal review – Thursday, 18 September 2014
  • 51. Links with other datasets • Database with data about all Dutch ministers in the period 1575-1815 (Fred van Lieburg) • Data converted for interoperability with other sources • Geographical mobility computed with help of original place names and Geonames data (http://www.geonames.org)
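The slides do not describe how mobility was computed. One plausible measure is the total great-circle distance between the successive places where a minister served, sketched below with the haversine formula. The `COORDS` table is a hand-entered stand-in for a Geonames lookup with approximate coordinates, and the career sequence is hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0

# Approximate coordinates (latitude, longitude), standing in for a Geonames lookup.
COORDS = {
    "Amsterdam": (52.37, 4.90),
    "Utrecht": (52.09, 5.12),
    "Leiden": (52.16, 4.49),
}


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    a = (
        math.sin(d_phi / 2) ** 2
        + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2
    )
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def career_mobility_km(places):
    """Sum the distances between consecutive places in a career."""
    total = 0.0
    for origin, destination in zip(places, places[1:]):
        total += haversine_km(*COORDS[origin], *COORDS[destination])
    return total


print(round(career_mobility_km(["Amsterdam", "Utrecht", "Leiden"]), 1))
```

In practice the hard part is the step before this one: resolving historical place names to modern coordinates, which is why the original place names are matched against Geonames.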
  • 52. Mobility of Dutch ministers 1575-1815
  • 53. Acknowledgements • Many co-workers, from ICES-KIS, MultimediaN E-Culture, CHOICE, CHIP, Agora, MuNCH, BiographyNet, Europeana Connect, PrestoPrime, Dutch Ships & Sailors