SlideShare a Scribd company logo
1 of 46
Digital Medieval Data Curation
CLIR Postdoctoral Fellowship Seminar
Bryn Mawr, 2013
Benjamin Albritton, Stanford University Libraries
blalbrit@stanford.edu
@bla222
Current State: A World of Silos
Roman de la Rose Parker on the Web e-codices And so on…
Data Interoperability
• Break down silos
• Separate data from applications
• Share data models and
programming interfaces
• Enable interactions at the tool and
repository level
Designing Modular Repositories and
Tools
Image Data (Canonical)
Image
Viewer
Discovery
Annotation
Non-image data (Canonical)
Transcription
Image Viewer
Image
Analysis
Discovery Tool X?
Repository
Repository
User
Interface
3rd-Party
Tools
Image Data (Canonical)
Image
Viewer
Discovery
Annotation
Non-image data (Canonical)
Transcription
Image Viewer
Image
Analysis
Discovery Tool X?
Repository
Repository
User
Interface
3rd-Party
Tools
Designing Modular Repositories and
Tools
Image Data (Canonical)
Image
Viewer
Discovery
Annotation
Non-image data (Canonical)
Transcription
Image Viewer
Image
Analysis
Discovery Tool X?
Designing Modular Repositories and
Tools
Iterative Interactions
Multiple Data Sources
• Existing structured data (catalogs)
• User-added
– Comments
– Transcriptions
– Etc.
• Digital images
• Machine processing
Motivating Questions
What does this mean for medieval data?
• How do we rethink medieval object data in a
shared, distributed, global space?
• How do we enable collaboration and encourage
engagement?
• How do we deal with tools that are producing
new data on digital surrogates that are
implicitly about a real world object?
Transcribing from Digital Surrogates
La Terre de Secille
Naïve Approach: Attach Transcription to Image
One problem example: Multiple Representations
CCC 26 f. iiiR
Naïve Approach: Attach Transcription to Image
One problem example: Multiple Representations
CCC 26 f. iiiR Fold A Open
Naïve Approach: Attach Transcription to Image
One problem example: Multiple Representations
CCC 26 f. iiiR Fold A Open Fold A and B Open
Naïve Approach: Attach Transcription to Image
One problem example: Multiple Representations
CCC 26 f. iiiR Fold A Open Fold A and B Open f. iiiV
The Shared Canvas
• Represents a real world thing we
want to “talk” about
• Has a unique name
• http://dms-data.stanford.edu/Parker/CCC026/canvas-12
Data Model: SharedCanvas
http://www.shared-canvas.org
Data is “about” a real thing
Canvas Paradigm
• A Canvas is an empty space in which to build up a display
• Makes explicit that the image is a surrogate
Open Annotation Model
• Annotation (a document)
• Body (the ‘comment’ of the annotation)
• Target (the resource the Body is ‘about’)
Model: Annotations to Paint Canvas
• The Canvas represents the empty page
• Annotation links Image with Canvas
Model: Annotations to Paint Canvas
• Annotation links Text with Canvas
Model: Annotations to Paint Canvas
Model: Missing Pages
Medieval Data Use-Cases: A Sampler
• Structured data from existing sources
• Transcription and glyphs
• Structured data from new sources
Structured Data from Existing Sources
A Catalog of the Manuscripts of
Salisbury Cathedral Library
Drives Discovery
Transcription:
T-PEN (Saint Louis University) http://t-pen.org
• Transcription tool
• Provides image parsing
– Columns
BNF fr. 9221 – column parsing
T-PEN (Saint Louis University)
http://t-pen.org
• Transcription tool
• Provides image parsing
– Columns
– Lines
BNF fr. 9221 – line parsing
T-PEN (Saint Louis University)
http://t-pen.org
BNF fr. 9221 – transcription view
Drives Full-Text Search
http://t-pen.org/TPEN
… and other interfaces
http://stanford.edu/~blalbrit/v-machine-2/samples/DamedequiRF5.xml
T-PEN’s PaleoTool
BNF fr. 1586 – glyph parsing
Results for “matching” glyphs
Glyphs with multiple letters
Comparing results across manuscripts
BNF fr. 1586 CCCC 324
User-created Structured Data
Beinecke MS 310, f. 1r
• Each row = 1 day (January 1, here)
• Lists the feast of the Circumcision
• Optionally provides additional information
Distributed Resources /
Distributed Environments
Data capture in T-PEN
http:t-pen.org – Saint Louis University
Front-end: Exhibit
http://guillaumedemachaut.com/kalendar/sharedkalendar.html
Simple (really simple) Exhibit based on kalendar transcriptions
(Exhibit: http://www.simile-widgets.org/exhibit/)
For each record:
Enabling rapid comparison
Two mss. include the entry “Thimotheus apostel”
Distributed Resources /
Distributed Environments
SharedCanvas Demo Implementation
http://www.shared-canvas.org/impl/demodh
SharedCanvas Demo Implementation
http://www.shared-canvas.org/impl/demodh
SharedCanvas Demo Implementation
http://www.shared-canvas.org/impl/demodh
A Sea of Manuscript Data
• Thousands of manuscripts currently available
interoperably, with more coming rapidly
• Discovery data is a mixed bag
• Tools provide data back into the system that
can be re-used
• New data drives new discovery, new
interfaces, and new visualization challenges
• Management and manipulation of that “wild”
data is a serious challenge

More Related Content

What's hot

Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...
The European Library
 

What's hot (20)

Your research as open science
Your research as open scienceYour research as open science
Your research as open science
 
Deriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF DataDeriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF Data
 
AINL 2016: Kozerenko
AINL 2016: Kozerenko AINL 2016: Kozerenko
AINL 2016: Kozerenko
 
The Progress of BIBFRAME, by Angela Kroeger
The Progress of BIBFRAME, by Angela KroegerThe Progress of BIBFRAME, by Angela Kroeger
The Progress of BIBFRAME, by Angela Kroeger
 
POSTDATA: Towards publishing European Poetry as Linked Open Data
POSTDATA: Towards publishing European Poetry as Linked Open DataPOSTDATA: Towards publishing European Poetry as Linked Open Data
POSTDATA: Towards publishing European Poetry as Linked Open Data
 
AINL 2016: Kuznetsova
AINL 2016: KuznetsovaAINL 2016: Kuznetsova
AINL 2016: Kuznetsova
 
co:op-READ-Convention Marburg - Günter Mühlberger
co:op-READ-Convention Marburg - Günter Mühlbergerco:op-READ-Convention Marburg - Günter Mühlberger
co:op-READ-Convention Marburg - Günter Mühlberger
 
Linked open data: standardization, interoperability and multilingual challeng...
Linked open data: standardization, interoperability and multilingual challeng...Linked open data: standardization, interoperability and multilingual challeng...
Linked open data: standardization, interoperability and multilingual challeng...
 
co:op-READ-Convention Marburg - Basilis Gatos
co:op-READ-Convention Marburg - Basilis Gatosco:op-READ-Convention Marburg - Basilis Gatos
co:op-READ-Convention Marburg - Basilis Gatos
 
One Discovery Layer, Eight Front Doors: Implementing Blacklight @ IU
One Discovery Layer, Eight Front Doors: Implementing Blacklight @ IUOne Discovery Layer, Eight Front Doors: Implementing Blacklight @ IU
One Discovery Layer, Eight Front Doors: Implementing Blacklight @ IU
 
Introduction to persistency and Berkeley DB
Introduction to persistency and Berkeley DBIntroduction to persistency and Berkeley DB
Introduction to persistency and Berkeley DB
 
Semantic Web in the Digital Humanities
Semantic Web in the Digital HumanitiesSemantic Web in the Digital Humanities
Semantic Web in the Digital Humanities
 
RDF Graph Data Management in Oracle Database and NoSQL Platforms
RDF Graph Data Management in Oracle Database and NoSQL PlatformsRDF Graph Data Management in Oracle Database and NoSQL Platforms
RDF Graph Data Management in Oracle Database and NoSQL Platforms
 
IIIF for CNI Spring 2014 Membership Meeting
IIIF for CNI Spring 2014 Membership MeetingIIIF for CNI Spring 2014 Membership Meeting
IIIF for CNI Spring 2014 Membership Meeting
 
Presentation of the INVENiT Expert Meeting on Monday 16 February 2015
Presentation of the INVENiT Expert Meeting on Monday 16 February 2015Presentation of the INVENiT Expert Meeting on Monday 16 February 2015
Presentation of the INVENiT Expert Meeting on Monday 16 February 2015
 
How the Web can change social science research (including yours)
How the Web can change social science research (including yours)How the Web can change social science research (including yours)
How the Web can change social science research (including yours)
 
co:op-READ-Convention Marburg - Sebastian Colutto
co:op-READ-Convention Marburg - Sebastian Coluttoco:op-READ-Convention Marburg - Sebastian Colutto
co:op-READ-Convention Marburg - Sebastian Colutto
 
Session 03 acquiring data
Session 03 acquiring dataSession 03 acquiring data
Session 03 acquiring data
 
A non-technical introduction to text mining for information specialists
A non-technical introduction to text mining for information specialists A non-technical introduction to text mining for information specialists
A non-technical introduction to text mining for information specialists
 
Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...
 

Viewers also liked (8)

Normativa del Sistema de Contabilidad General de la Nación
Normativa del Sistema de Contabilidad General de la NaciónNormativa del Sistema de Contabilidad General de la Nación
Normativa del Sistema de Contabilidad General de la Nación
 
Dust Collector
Dust CollectorDust Collector
Dust Collector
 
Dust collector
Dust collectorDust collector
Dust collector
 
Guía Uso de Plataforma
Guía Uso de Plataforma Guía Uso de Plataforma
Guía Uso de Plataforma
 
Cooperative education in tourism industry bonnie group
Cooperative education in tourism industry  bonnie groupCooperative education in tourism industry  bonnie group
Cooperative education in tourism industry bonnie group
 
Maa
MaaMaa
Maa
 
Confianza legítima 2
Confianza legítima 2Confianza legítima 2
Confianza legítima 2
 
Dto 854 02-dic-2004
Dto 854 02-dic-2004Dto 854 02-dic-2004
Dto 854 02-dic-2004
 

Similar to Digital Medieval Data Curation

Class 5-introto dl
Class 5-introto dlClass 5-introto dl
Class 5-introto dl
madhuvardhan
 
Class 5-introto dl
Class 5-introto dlClass 5-introto dl
Class 5-introto dl
madhuvardhan
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
Carole Goble
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian Art
Jon Stroop
 

Similar to Digital Medieval Data Curation (20)

Florence2
Florence2Florence2
Florence2
 
Facsimiles of Text and Music from Distributed Resources
Facsimiles of Text and Music from Distributed ResourcesFacsimiles of Text and Music from Distributed Resources
Facsimiles of Text and Music from Distributed Resources
 
A Comparative Kalendar - DH2013 Presentation
A Comparative Kalendar - DH2013 PresentationA Comparative Kalendar - DH2013 Presentation
A Comparative Kalendar - DH2013 Presentation
 
Shared Canvas presentation at the LIBER conference
Shared Canvas presentation at the LIBER conferenceShared Canvas presentation at the LIBER conference
Shared Canvas presentation at the LIBER conference
 
Class 5-introto dl
Class 5-introto dlClass 5-introto dl
Class 5-introto dl
 
Class 5-introto dl
Class 5-introto dlClass 5-introto dl
Class 5-introto dl
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
Overview of Lincoln Paper Design
Overview of Lincoln Paper DesignOverview of Lincoln Paper Design
Overview of Lincoln Paper Design
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian Art
 
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
From ontology to wiki
From ontology to wikiFrom ontology to wiki
From ontology to wiki
 
Interpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open ContextInterpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open Context
 
From Workflows to Transparent Research Objects and Reproducible Science Tales
From Workflows to Transparent Research Objects and Reproducible Science TalesFrom Workflows to Transparent Research Objects and Reproducible Science Tales
From Workflows to Transparent Research Objects and Reproducible Science Tales
 
Doing DH in Theological Libraries
Doing DH in Theological LibrariesDoing DH in Theological Libraries
Doing DH in Theological Libraries
 
DL-architecture.ppt
DL-architecture.pptDL-architecture.ppt
DL-architecture.ppt
 
The Data-Intensive Visual Analytics (DIVA) project
The Data-Intensive Visual Analytics (DIVA) projectThe Data-Intensive Visual Analytics (DIVA) project
The Data-Intensive Visual Analytics (DIVA) project
 
Digital libraries
Digital librariesDigital libraries
Digital libraries
 
"Data Provenance: Principles and Why it matters for BioMedical Applications"
"Data Provenance: Principles and Why it matters for BioMedical Applications""Data Provenance: Principles and Why it matters for BioMedical Applications"
"Data Provenance: Principles and Why it matters for BioMedical Applications"
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge Graphs
 

Recently uploaded

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
MateoGardella
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Recently uploaded (20)

APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 

Digital Medieval Data Curation

Editor's Notes

  1. Allows filtering by date, item, and manuscript, as well as search across the items