The document summarizes a webinar on granular discovery presented by NISO. It includes:
- An introduction to Alexander Street Press and their roots in granular content like film scripts, journal letters, and archival collections. They discuss challenges in granular discovery like different definitions of playback and what content levels can be sold.
- A presentation by figshare on making open data discoverable through services like DataCite for citations and their API. They discuss partnerships with publishers and search engine optimization.
- A discussion by ProQuest of the complexities of granular discovery like changes needed to systems to support new granularity levels, conflicts in metadata between content providers, and the importance of metadata quality.
Measures of Dispersion and Variability: Range, QD, AD and SD
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? Part 2: The Business Complexities of Granular Discovery
1. NISO Two Part Webinar:
Is Granularity the Next Discovery Frontier?
Part 2: The Business Complexities
of Granular Discovery
Wednesday, March 18, 2015
Speakers:
Andrea Eastman-Mullins, Chief Operating Officer,
Alexander Street Press
Dan Valen, Product Specialist,
figshare
Dave Hovenden, Content Operations Manager,
Summon® Service, ProQuest
http://www.niso.org/news/events/2015/webinars/granularity_pt2/
2. Granular Discovery: A Discipline-Based Approach
Andrea Eastman-Mullins, COO
March 18, 2015
www.alexanderstreet.com
3. Agenda
• About Alexander Street Press
• Granularity for teaching and research
• Discipline context
• Challenges & Solutions
4. Alexander Street Press
• One of the first and largest vendors of streaming video and
music into libraries
• Publisher of 100+ high quality collections
• Based in Alexandria, Virginia
• About 120 people
5. Individual Faculty
Broad range of Libraries
Harvard
University
University
of Tokyo
University of
Phoenix
National U.
of Singapore
Columbia
University
New York
Public Library
Manhattan
Community
College
Okanagan
College
Albuquerque
Public Library
British
Library
We serve 40m faculty and students
in more than 30,000 institutions.
7. • Anthropology
• Counseling
• History
• Music
• Performing Arts
• Diversity Studies
• Education
• Feature Film
• Business
• Health Sciences
Award-Winning Collections
8. Our roots are in granular content…
If the source is We index
Feature Film Script Segments
Journal Letter
Diary Daily Entry
Play Scenes
Local History Text Photographs
Archival Collection Item
Classical Work Movement
12. “I’ll research this on YouTube?”
• Anthropology: Examine marriage rituals across
cultures from 1975-2010.
• History: What words did JFK use most frequently in
his early career vs. his later career?
• Counseling: Show me every time Albert Ellis used
the word “emotion” in his counseling demonstrations.
• Music: Show me performances of Strauss by Kiri Te
Kanawa.
• Drama: Examine performances of Shakespearean
jokes over time.
27. Usage: What is a playback?
• Each segment
• Extras
• Each full opera or play
• End-user created segments
• Overlapping segments
28. What level is for sale?
• Patron Driven Acquisition playback triggers sale
• Single title sale
• Total content count in a collection
• Free preview
Answers must serve:
Libraries Faculty Content Partners
33. Work smart. Discover more.
Supported by
What is figshare?
Citable
Shareable
Discoverable
34. How figshare makes content discoverable
4 ways…
• Persistent identifiers through DataCite
• Institutional offering and API access
• Working alongside publishers
• SEO through figshare.com
35. Why data citations?
Pepe, Alberto; Goodman, Alyssa; Muench, August; Crosas, Merce; Erdmann, Christopher (2014): Some
descriptive statistics about top domains linked in astronomy publications. Table_1.xls. PLOS ONE.
10.1371/journal.pone.0104798.t001.
Pepe, Alberto; Goodman, Alyssa; Muench, August; Crosas, Merce;
Erdmann, Christopher (2014): Volume of potential data links in
astronomy publications. Figure_1.tif. PLOS ONE.
10.1371/journal.pone.0104798.g001.
36. • Enabling easy reuse and verification of data
• Allowing the impact of data to be tracked
• Creating a scholarly structure that recognizes
and rewards data producers
Source: https://www.datacite.org/services/cite-your-data.html
Why DataCite?
37. Number of DataCite DOI registrations
figshare-hosted file views
figshare
Open data repository 1
Open data repository 2
figshare content on DataCite
38. ✔
Reporting Dashboard
Impact and Usage Reporting
Administrative Workflow Portal
A portal where administrators can
manage curation of files to be made
public, storage space allocation and
user rights
Public Digital Research Repository
A customizable public portal with all
digital files made public at an
institutional, departmental and group
level
Research Data Management
Private, controlled storage and
collaborative spaces
for every academic at the institution
figshare for institutions: key pieces of functionality
✔
✔
✔
39. Work smart. Discover more.
Supported by
figshare’s API
The figshare API allows you to push data to
figshare, or pull data out
This allows you to build applications on top of
academic research
40. PLOS and discovery
• figshare hosts the
supplemental data for
all seven PLOS journals
• You can see inside the
SI files (datasets, text
files, documents,
animations, videos and
presentation files)
without leaving the
article or opening
them
• When you spot
interesting files you
can download them
singly or altogether
41. figshare as a recommendation engine
• PLOS’ data
availability policy
• Recommendations
delivered by
figshare extend
beyond research
outputs attached to
PLOS publications
• Not all scholarly
outputs may fit in
an article, but
might very well be
critically
instrumental to
others’ research
42. Google Scholar
• We mark-up all
figshare.com
content for
inclusion in
Google Scholar
• Google bots
recognize our
content as open
and academic
• Faster way to get
to content–
proof of
research v.
paper
44. Work smart. Discover more.
Supported by
Thank you so much for
your time!
dan@figshare.com
@figshare
http://www.figshare.com
45. WHEN GRANULARITY MET
DISCOVERY: THE COMPLEXITIES
OF GRANULAR CONTENT
DISCOVERY
NISO Webinar: Is Granularity the Next Discovery Frontier? Part 2: The
Business Complexities of Granular Discover
Dave Hovenden, Content Operations Manager – Summon
March 18, 2015
46. When Granularity Met Discovery
3/18/2015
When Harry Met Sally. Digital image. Wonder Magazine. Wonder Magazine. 4 Mar. 2015.
47. What Do We Mean When We Talk About Granular Content?
3/18/2015 47
Journal
Journal
Issue
Journal
Article
Subsection
Table
48. Granular Content Discovery – How it Should Work When All
the Pieces are in Place
3/18/2015 48
Granular
Content
Discovery
Quick
Painless
Simple
Without a
lot of ‘noise’
(i.e.,
irrelevant
results)
Gets you
want you
need the
first time
49. How it Should Work!
3/18/2015 49
Book Chapter Example
q= “On the City Walls and Built Environment of Pavia (1330)"
51. The Complexities of Granular Content Discovery
3/18/2015 51When Harry Met Sally Quotes. Digital image. Bustle. Bustle.com. 4 Mar. 2015.
52. The Complexities of Granular Content Discovery
3/18/2015 52When Harry Met Sally Quotes. Digital image. Bustle. Bustle.com. 4 Mar. 2015.
53. What are Some of the Complexities?
• System Changes to Accommodate New Levels of Granularity
– The changes can be done, but is the level of investment worth it
in all cases?
• Granularity Conflicts between Content Providers when Content
is Aggregated
– Disagreements on the granularity level of the content exist
between various content providers when they aggregate the
same content
• Metadata Quality
– Content can be granular, but if the metadata quality is
inconsistent or poor, it can limit discoverability
3/18/2015 53
54. Encountering New Levels of Granularity
3/18/2015 54When Harry Met Sally. Digital image. G-Pop. G-Pop. 11 Mar. 2015.
56. Introducing Brand New Levels of Granularity Discovery
3/18/2015 56
Granular
Content
Discovery
Schema
Changes
UI
Changes
API
Changes
Match &
Merge
Changes Relevancy
Changes
Content
Mapping
Changes
Link
Resolver
Changes
57. • What’s the scope/scale of this
granular content? Do we have
a lot of content coming in at
this level of granularity, or is it
only a small amount?
• Is this in high demand for
librarians and their patrons?
• What other priorities do we
have? Could the resources,
time, and money be put to
better use focusing on other
product enhancements and
improvements?
3/18/2015 57
Weighing the Business Considerations
58. Granularity Conflicts between Content Sources
3/18/2015 58
Source A
Source B
Archival News
by Taves, Brian
Cinema Journal, 03/1999, Volume 38, Issue 3. pg. 109
Video
by Taves, Brian
Cinema Journal, 03/1999, Volume 38, Issue 3, pg. 113
59. 3/18/2015 59
The Consequences of Granularity Conflicts
• When the user tries to access the
content, the metadata is sent to the
link resolver to generate an
OpenURL link
• Because the library has access to
the journal from multiple sources,
the user is presented with several
options to decide from which source
the linker should resolve to
• If the user tries to access the
content from Source A, the link
resolver will fail. This is because
Source A’s metadata is slightly
different because they’ve indexed
the content at a different level of
granularity
60. What Metadata the Link Resolver Uses to Construct the
OpenURL Link to Source A’s Platform
• rft_val_fmt=info:ofi/fmt:kev:mtx:journal&
• rft.genre=article&
• rft.atitle=Video&
• rft.jtitle=Cinema+Journal&
• rft.au=Taves%2C+Brian&
• rft.date=1999-03-22&
• rft.pub=University+of+Texas+at+Austin+%28University+of+Texas+Press%29&
• rft.issn=0009-7101&
• rft.eissn=1527-2087&
• rft.volume=38&
• rft.issue=3&
• rft.spage=113&
• rft.externalDBID=n%2Fa&
• rft.externalDocID=160417287¶mdict=en-US
3/18/2015 60
61. When Conflicts in Granular Content Can be Confusing Due to
Conflicting Metadata
3/18/2015 61
Reading list on Turkey
by Akyol, Mustafa
The World Today, ISSN 0043-9134, 08/2013,
Volume 69, Issue 4
62. The Original Article
3/18/2015 62
Reading list on Turkey
by Akyol, Mustafa
The World Today, ISSN 0043-9134, 08/2013,
Volume 69, Issue 4
63. Source A – Book Review Level of Granularity
3/18/2015 63
64. Source B – Magazine Article Level of Granularity
3/18/2015 64
65. Granular Content, Metadata Conflict
3/18/2015 65
Source A
Source B
Different start page
Different start page
Wrong issue
number
66. Granularity Conflicts Have the Potential to Become More
Common
3/18/2015 66
• Approx. 40% of the
Summon Index is
made up of content
from aggregation
databases
• More and more content
is being made available
to aggregation
databases from
Publishers and Open
Access resources
• Aggregation databases
are still popular among
libraries due to the
breadth of content they
provide
67. The Need for Content Granularity Standards for Discovery
3/18/2015 67
68. How Summon Enables Granular Content Discovery
3/18/2015 68
Book Chapter Example
q= “On the City Walls and Built Environment of Pavia (1330)"
69. Closing Thoughts
3/18/2015 69
• Accurate, detailed, and consistent metadata
for the granular content is important
• Explore what standards for granular content
discovery are needed
• The goal is to make it easier for people to find
granular content – that’s the whole point of
discovery!
70. 3/18/2015 70
The End
When Harry Met Sally. Digital image. Parade Magazine. Parade Publications, INc.. 15 Mar. 2015.
71. NISO Webinar • March 11, 2015
Questions?
All questions will be posted with presenter answers on
the NISO website following the webinar:
http://www.niso.org/news/events/2015/webinars/granularity_pt1/
NISO Two-Part Webinar
Is Granularity the Next Discovery Frontier?
Part 1: Supporting Direct Access to Increasingly Granular
Chunks of Content
72. Thank you for joining us today.
Please take a moment to fill out the brief online survey.
We look forward to hearing from you!
THANK YOU