Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Paola Mazzucchi, AIE and mEDRA Project Manager, Converging metadata for converging media @ Converging Media 2014
1. Converging metadata for converging
media
Good metadata help selling more books.
But that’s not the end of the story
Paola Mazzucchi
With the support of
Converging Media Conference, Gent, 24th September 2014
2. Technology and Innovation for Smart Publishing
TISP is the platform for the publishing industry and the ICT industry to discuss
2
about innovation, collaboration and partnerships, within an international network.
TISP helps publishing and ICT converging on specific business needs and on the
strategic view for the future.
TISP is an EU funded Thematic Network
Over TISP:
Twitter: @tispnetwork
Linkedin group: TISP - Technology and Innovation for Smart Publishing
www.smartbook-tisp.eu
3. Scenario setting: convergence
3
• «What is a product» in the digital environment?
• Granularity and Complexity
• Beyond the product: works and abstractions
• Relations among stuff
People make Stuff.
People use Stuff.
People do deals about Stuff.
• Same people, multiple names and identities
• Same people multiple roles in multiple
products
• Relations among people
• Events
• Actions people do with stuff
• Relations among people and stuff
4. 4
These basic principles
help us to further elaborate
on the challenges in today digital environment
5. Scenario setting: convergence
5
People make Stuff.
People use Stuff.
People do deals about Stuff.
Identify unambiguously
the relevant entities
Describe relevant
features of the relevant
entities (rich metadata)
Create semantic relationships
among different entities
(products, works, people,
parties, etc.)
Respond to the demand
by users to re-use
content
Enrich the experience
of the end user
Provide information about
how end users can get
appropriate permission to
Increase discoverability re-use content
of content (SEO)
Optimise supply chains
operations, from royalties
distribution to and sales
management
6. 6
It’s all about metadata!
The possibility to create and maintain relationships among entities relies on a
network of metadata records and persistent identifiers, ensuring interoperability
across market-segments
This applies to each of the media sectors as a self-contained domain
This applies more and more to a cross-media environment
7. Our tool kit: Identifiers & Metadata
7
ISBN
DOI
ISTC
ONIX4B
ONIX4DOI
Marc
Dublin Core
GTIN
GRid & ISRC
ISWC
DDEX
ISAN
EIDR
EIDR
PLUS ID
IPTC
PLUS
XMP & Exif
8. 8
So far so good, as far as the theory goes…
but
Let’s come back to our daily job
9. Metadata in everyday life in the publishing industry/1
9
Publishers
Distributors
E-book
platforms
BIP
Libraries
Search
engines
Social
networks
s
Online
retailers
Wholesalers
Publishers
Quality check and data Quality check
enrichment
Quality check e data
enrichment
RRO
Bookshops
Metadata creation Metadata management
and supply
ISBN assignment
Metadata or
book
Metadata
or book
Metadata
Metadata consumption
10. Metadata in everyday life in the publishing industry/2
10
Good metadata help selling more books
hope you all know by now…
Basic metadata Enhanced metadata
11. 11
But that’s not the end of the story
Let’s have a look at some initiatives and services that
make use of the good metadata that help selling more books
in a different context to enable a different range of services
Publishing
industry
Converging
metadata
Converging
media sectors
12. Rights information management services
12
ARROW is a system to streamline
“rights information discovery” in a
book or collection of books to
lawfully digitise and make available
the European cultural heritage
ARROW operations and algorithms
are powered by metadata:
- national bibliographies (stuff)
- authors authority files (people)
- book supply chain data (BIP) (stuff)
- rights management data (RRO and
CMO) (events)
Media sector: books and audio-visual
FORWARD will build a rights
discovery service for the audio-visual
sector through an automated
system that will search, harvest and
process metadata from film archives
and producers.
FORWARD and the ARROW system
will be fully interoperable and
accessible to queries from all users
across the EU.
13. Accessibility for the visually impaired services
LIA (Accessible Italian Books) is a dedicated
service to increase the number of accessible
e-books available on the Italian market for
blind and visually impaired readers.
LIA operations are powered by metadata:
- Title metadata from publishers (stuff)
- Supply chain metadata (BIP) (stuff)
- Accessibility metadata created in the
certification service (stuff and event)
Media sector: books
13
PRODUCTION CATALOGUING DISTRIBUTION USE
Semantic
Mark-up
Metadata
management
Metadata
supply
Mark-up and
Metadata use
In LIA accessibility metadata are merged
with title information from E-Kitab (the
BIP of Italian e-books)
The resulting ONIX 3.0 metadata convey
information on ebook accessibility all
along the publishing supply chain:
stores, libraries, aggregators.
14. Improve discoverability and SEO
The implementation of Schema.org mark-up on LIA website was tested, mapping
product and accessibility metadata from ONIX 3.0 records to the schema Book
✔︎ ✔︎ ✖︎
microdata RDFa JSON-LD
Unfortunately the STANCA act that regulates the accessible web in Italy excludes the use
of HTML5 to develop accessible websites.
14
✖︎ ✖︎ ✔︎
microdata RDFa JSON-LD
14
15. Streamline online rights transaction services
RDI is a project, EU-funded under the CIP
framework, aimed at demonstrating how to
efficiently manage and trade intellectual property
rights online for any and all types of usage, across
any and all types of content, in any and all media.
RDI operations are powered by metadata
provided by different sources:
- Content metadata (stuff)
- Rightsholders metadata: authors/creators and
15
publishers/producers (people)
- Rights and licensing metadata (RRO/CMO)
(events)
At the core of RDI is the creation of an
interoperable communication layer
between data sources and users, for
example consumers looking for a license to
use a piece of content, or “B2B” users
looking for permission to re-sell or re-purpose
existing content to create new
content.
RDI implements the principles of the Linked
Content Coalition to facilitate and expand
the legitimate use of content in the digital
network through the effective use of
interoperable identifiers and metadata
Media sector: books and journals; film and audio-visual; music; images
16. Use case in the books and journal sector
16
Users, other aggregators, other services
17. DOI
DOI (Digital Object identifier) is persistent, cross-media and resolvable identifier.
Resolution is the process of going from an identifier to information about the
identified entity (metadata) and in some cases the entity itself. DOI has been
made interoperable with other identifiers (ex. the ISBN), therefore can support
the use online of other identifiers, to access metadata and services associated to
the content identified.
One of the most well-known DOI-based services is in the in the academic and research
environment where the DOI makes resolvable the semantic relations among different content
types.
Crosslinking and citation services between journals, datasets, researchers and funders are
powered by DOI resolution and DOI metadata.
17
Media sector: books and journals; film and audio-visual; datasets; PSI
Media sector: books and journals; datasets
18. DOI Context-aware multiple resolution
18
Metadata
services
DOI
Registration
and
bibliographic
metadata
Rights
information
metadata
Accessibility
metadata
DOI kernel
metadata
Content
services
Reuse
(licensing/permission)
Buy
Access
Other
resolutions
Alternative version
about the content
about the contributors
Get metadata
in RDF xml
Get metadata
in citation format
Get metadata
in Turtle
Get metadata
in xml
Media sector: books and journals; film and audio-visual; datasets; PSI
19. Content negotiation of DOI metadata
19
Content Negotiation is an application to make available and disseminate DOI
metadata in different formats regardless the Registration Agency where the DOI
has been registered and where associated metadata are stored.
Media sector: books and journals; datasets
20. Formatting service of content negotiated DOI metadata
Media sector: books and journals; datasets
20
22. 22
The four Gospels
What do
these have in
common?
Columbo
(aka Peter Falk)
Harry Potter
and the Deathly Hallows
Part 1
Einsturzende
Neubauten
What
else?
23. 23
The four Gospels
ISBN
9781847678355
Nick Cave and the
Bad Seeds
ISNI 0000 0001
1958 9618
Columbo
(aka Peter Falk)
ISNI 0000 0000
7823 8642
Harry Potter
and the Deathly
Hallows
Part 1
ISAN 0000-0002-C755-
0000-D-0000-0002-V
(DVD)
Einsturzende
Neubauten
ISNI 0000 0001
2291 0804
Related products
ISBN 9780862417963
Related work
ISTC A02-2012-000013F3-7
Related product
ISBN 9781408835029
Related work
ISTC A02-2013-000003E3-0
Related work
ISAN 0000-0000-32FE-0000-
T-0000-0000-O
(Der Himmel über Berlin)
Related Party
0000 0000 7839 5824
(Blixa Bargeld)
Related version
ISAN 0000-0002-C755-0000-D-
0000-0001-X (Blue Ray)
Related work
ISAN 0000-0002-C755-
0000-D-0000-0000-Z
EIDR 10.5240/FE76-07CC-CACA-7BE7-C49E-K
24. My metadata With the support of
Name: Paola
Surname: Mazzucchi
Professional Affiliation: AIE/mEDRA
Role: Project Manager
Email: paola.mazzucchi@aie.it
Twitter: @MetalGoddess
Linkedin: https://www.linkedin.com/in/paolamazzucchi