SlideShare a Scribd company logo
1 of 33
A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers Adam Chandler Cornell University Library Cornell University Library, Metadata Working Group Forum 16 October 2009
OpenURL model
OpenURL model cont.  incoming OpenURL http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004&url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange&rft.auinit=c&rft.aulast=merk&rft.date=2009&rft.epage=162&rft.genre=article&rft.issn=0737-8831&rft.issue=1&rft.place=bingley&rft.pub=emerald+group+publishing+limited&rft.spage=151&rft.stitle=libr+hi+tech&rft.title=library+hi+tech&rft.volume=27&rfr_id=info:sid/www.isinet.com:wok:wos&rft.au=scholze,+f&rft.au=windisch,+n&rft_id=info:doi/10.1108%2f07378830910942991/ in our knowledge base? title: Library hi tech     issn: 0737-8831   start date: 19970101    end date:  link-to syntax for Emerald http://www.emeraldinsight.com/rpsv/cgi-bin/cgi?body=linker&reqidx=#@ISSN-HYPHEN#(#@DATE#)#@VOLUME#:#@ISSUE#L.#@SPAGE#
OpenURL is pervasive Cornell link resolver alone: July 1, 2008 – June 30, 2009: 402,000 OpenURL service requests. 402,000 * 123(ARL libraries) = 49 million
Cornell’s top 10 OpenURL sources Web of Knowledge WorldCat Local Google Scholar Webfeat (our “Find Articles” service) EBSCOHost OCLC FirstSearch SilverPlatter Weill Cornell Medical Center SciFinder Scholar  PubMed
example OpenURL http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004&url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange&rft.auinit=c&rft.aulast=merk&rft.date=2009&rft.epage=162&rft.genre=article&rft.issn=0737-8831&rft.issue=1&rft.place=bingley&rft.pub=emerald+group+publishing+limited&rft.spage=151&rft.stitle=libr+hi+tech&rft.title=library+hi+tech&rft.volume=27&rfr_id=info:sid/www.isinet.com:wok:wos&rft.au=scholze,+f&rft.au=windisch,+n&rft_id=info:doi/10.1108%2f07378830910942991/
example OpenURL (1) http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004 &url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx &rft_val_fmt=info:ofi/fmt:kev:mtx:journal &rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange &rft.auinit=c &rft.aulast=merk &rft.date=2009 &rft.epage=162 &rft.genre=article &rft.issn=0737-8831
example OpenURL (2) &rft.issue=1 &rft.place=bingley &rft.pub=emerald+group+publishing+limited &rft.spage=151 &rft.stitle=libr+hi+tech &rft.title=library+hi+tech &rft.volume=27 &rfr_id=info:sid/www.isinet.com:wok:wos &rft.au=scholze,+f &rft.au=windisch,+n &rft_id=info:doi/10.1108%2f07378830910942991/
 … but quality of experience is difficult to benchmark Wrong start end date in the local library's holdings knowledge base (see NISO KBART) Semantically inaccurate metadata from the OpenURL origin (wrong ISSN, for example)  Wrong link-to syntax in link resolver Fragile handling of incoming links by content provider
 … but quality of experience is difficult to benchmark Inaccurate or missing Crossref DOI URL (sometimes the DOI registration process is out of sync with the mounting of articles) Subscription errors (especially with the start of a new calendar year) Syntactically incorrect or missing metadata from the OpenURL origin
Literature review I can identify no systematic study designed and carried out to benchmark the quality of linking. The OpenURL standard was introduced some ten years ago.
Wakimoto, Walker, and Dabbour (2006) Main finding: Users just expect full-text. When they do not get it they are disappointed. Jina Choi Wakimoto, David S. Walker, and Katherine S. Dabbour (2006). "The Myths and Realities of SFX in Academic Libraries." The Journal of Academic Librarianship 32 (2): 127–136
Wakimoto, Walker, and Dabbour (2006) "Where does SFX start and where does it end? If an SFX request does not result in a full-text link, does the problem lie with the source database’s metadata, the construction of the OpenURL request, the SFX KnowledgeBase, the SFX software, the resulting target resource, or even the local library’s collection development plan?" (p. 134) Jina Choi Wakimoto, David S. Walker, and Katherine S. Dabbour (2006). "The Myths and Realities of SFX in Academic Libraries." The Journal of Academic Librarianship 32 (2): 127–136
Blake and Knudson (2002) “Increased awareness of bibliographic/citation standards by authors. Increased submission of publications with bibliographical references reflecting the accepted standards.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
Blake and Knudson (2002) “Increased outreach by librarians to authors emphasizing and promoting the importance of citation standards for electronic document retrieval.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
Blake and Knudson (2002) “Increased communication between primary publishers and secondary publishers. Metadata corrections and updates need to be better coordinated.” (NISO KBART role) Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
Blake and Knudson (2002) “Increased consistency in metadata within a single database and across databases. This would result in a higher success rate of linking and would allow the algorithms to be simpler. Simpler algorithms are easier to maintain and modify.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
Hughes (2004) Hughes describes an initiative of the Open Language Archives Community (OLAC), a consortium of linguistic data archives, to create an infrastructure to support metadata quality assessment within a specialized Open Archives Initiative (OAI) community.  . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Hughes (2004) Metadata quality should be evaluated on a per record and per collection basis and assessed against the baseline of broader community practice. Metadata quality requires both structural and semantic validation.  . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Hughes (2004) Goals:  establish a baseline against which future instances can be compared;  provide assistance to data providers;  evaluate a set of domain-grounded controlled vocabularies. . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Hughes’ approach Each metadata record score from 0 - 10.  There are two parts, a "Code Existence Score and an Element Absence Penalty," with weighting.  The Code Existence Score is specific to the OLAC communities use of Dublin Core extensions.  The Element Absence Penalty is based on the premise that the usefullness of a given metadata decreases in the absence of core metadata fields.  The absence of a core element results in a negative 0.2 penalty. Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Hughes’ approach From this simple approach, an array of metrics are derived:   archive diversity;  metadata quality;  core elements per record;  core element usage;  code usage;  code and element usage;  star rating. From these metrics a score is computed for each metadata record, each archive, and the community as a whole. Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Mellon funded planning grant for L'Année philologique  1. Canonical Citation Linking: http://cwkb.org In collaboration with Eric Rebillard, Professor, Classics and History, and David Ruddy, Cornell University Library 2. OpenURL Quality Is it possible to build a tool for evaluating the quality of OpenURLs from a content provider?
Key findings from 2008 Mellon OpenURL quality investigation Hughes’ approach to metadata evaluation is excellent  scaffolding  to help build a model for OpenURL metadata evaluation, but it does not match the problem exactly.
Constant: Core elements used by content providers in their link-to targets title - 64% spage - 64% volume - 61% issue - 60% date - 48% aulast - 47% issn - 35% atitle - 35% DOI - 14% ISBN – 5% Based on an analysis of link-tos in the Cornell instance of the III WebBridge link resolver product.
Variable: Frequency of element string patterns for all sources
aulast  First author's family name. This may be more than one word. In many citations, the author's family name is recorded first and is followed by a comma, e.g. Smith, Fred James is recorded as "aulast=smith"
aulast   if ($e =~ /aulast/) {       $patterns{$neworigin}{$newsid}{$e}++;       if ($elementhash{$e} =~ /^[A-Za-z]+$/) { $patterns{$neworigin}{$newsid}{"aulast_simple"}++; } elsif ($elementhash{$e} =~ /^[A-Za-z]+, .+$/) { $patterns{$neworigin}{$newsid}{"aulast_comma"}++; } elsif ($elementhash{$e} =~ /^[A-Z][a-z]+( [A-Z])+$/) { $patterns{$neworigin}{$newsid}{"aulast_simpleplusinitial"}++;} else { $patterns{$neworigin}{$newsid}{"aulast_other"}++; }     }
aulast_other examples Ryan S Miller Louise D Bryant DAVID J MCKENZIE %C4%90okovi%C4%87 Indu B Ahluwalia Carreras-Sangr%c3%a0 Bautista-Casta%C3%B1o O%27Shea Melissa Ventura Marra Guan XueYing%3B Yu Nan%3B ShangguanXiaoXia
spage First page number of a start/end (spage-epage) pair. Note that pages are not always numeric.
spage      if ($e =~ /spage/) {       $patterns{$neworigin}{$newsid}{$e}++;       if ($elementhash{$e} =~ /^+$/) { $patterns{$neworigin}{$newsid}{"spage_number"}++; } elsif ($elementhash{$e} =~ /^+-+$/) { $patterns{$neworigin}{$newsid}{"spage_number_number"}++; } elsif ($elementhash{$e} =~ /[A-Za-z].+/) { $patterns{$neworigin}{$newsid}{"spage_string_w_number"}++; } else { $patterns{$neworigin}{$newsid}{"spage_other"}++; }     }
spage_other examples 1033 (6 pages) 85(19) 575 (11 pages) 283...290 PHYS GLRM 58,+VI
date The publication date of the item or bundle encoded in the "Complete date" variant of ISO8601 (see http://www.w3.org/TR/NOTE-datetime). This format is YYYYMM- DD where YYYY is the four-digit year, MM is the month of the year between 01 (January) and 12 (December), and DD is the day of the month between 01 and 28 or 29 or 30 or 31, depending on length of the month and whether it is a leap year.

More Related Content

What's hot

Open Annotation Collaboration Introduction
Open Annotation Collaboration IntroductionOpen Annotation Collaboration Introduction
Open Annotation Collaboration IntroductionTimothy Cole
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for DiscoveryOCLC
 
LoCloud - D1.3 Content and Metadata Analysis
LoCloud - D1.3 Content and Metadata AnalysisLoCloud - D1.3 Content and Metadata Analysis
LoCloud - D1.3 Content and Metadata Analysislocloud
 
A Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
A Linked Data Prototype for the Union Catalog of Digital Archives TaiwanA Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
A Linked Data Prototype for the Union Catalog of Digital Archives Taiwanandrea huang
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTHerbert Van de Sompel
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueHerbert Van de Sompel
 
Visualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscapeVisualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscapeJonathan Yu
 
TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31Dag Endresen
 

What's hot (10)

Open Annotation Collaboration Introduction
Open Annotation Collaboration IntroductionOpen Annotation Collaboration Introduction
Open Annotation Collaboration Introduction
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for Discovery
 
LoCloud - D1.3 Content and Metadata Analysis
LoCloud - D1.3 Content and Metadata AnalysisLoCloud - D1.3 Content and Metadata Analysis
LoCloud - D1.3 Content and Metadata Analysis
 
A Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
A Linked Data Prototype for the Union Catalog of Digital Archives TaiwanA Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
A Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning Issue
 
Visualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscapeVisualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscape
 
TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31
 
Creating Pockets of Persistence
Creating Pockets of PersistenceCreating Pockets of Persistence
Creating Pockets of Persistence
 

Viewers also liked

How does your media product represent particular social
How does your media product represent particular socialHow does your media product represent particular social
How does your media product represent particular sociallucymcdonnell5
 
Quesitonaire pie
Quesitonaire pieQuesitonaire pie
Quesitonaire piehalyma120
 
Five Key Principles to Embrace & Engage the Multi-Channel Consumer
Five Key Principles to Embrace & Engage the Multi-Channel ConsumerFive Key Principles to Embrace & Engage the Multi-Channel Consumer
Five Key Principles to Embrace & Engage the Multi-Channel ConsumerAdroit Digital
 
You Are My All in All
You Are My All in AllYou Are My All in All
You Are My All in Allladybag
 
BIBINGEORGETHOMAS.docx
BIBINGEORGETHOMAS.docxBIBINGEORGETHOMAS.docx
BIBINGEORGETHOMAS.docxBibin Thomas
 
Print Ad Marketing Plan
Print Ad Marketing PlanPrint Ad Marketing Plan
Print Ad Marketing Planabcd3
 
Газета Мала Батьківщина №1 (2012)
Газета Мала Батьківщина №1 (2012)Газета Мала Батьківщина №1 (2012)
Газета Мала Батьківщина №1 (2012)Sergii Illiukhin
 
少子化對大學校院的衝擊與因應之道~黃聰亮教授
少子化對大學校院的衝擊與因應之道~黃聰亮教授 少子化對大學校院的衝擊與因應之道~黃聰亮教授
少子化對大學校院的衝擊與因應之道~黃聰亮教授 vincent8899
 
考試沒教的事
考試沒教的事考試沒教的事
考試沒教的事ADAN CHEN
 
3.àrees funcionals decisions financeres
3.àrees funcionals   decisions financeres3.àrees funcionals   decisions financeres
3.àrees funcionals decisions financeresddaude
 
Conclusion ecc 2012 marc pattinson
Conclusion ecc 2012 marc pattinsonConclusion ecc 2012 marc pattinson
Conclusion ecc 2012 marc pattinsonClusterExcellence
 
Dispositivos de multimedia
Dispositivos de multimediaDispositivos de multimedia
Dispositivos de multimediasashiaisela
 
Synthesis multimedia learning
Synthesis multimedia learningSynthesis multimedia learning
Synthesis multimedia learningkylealee
 
US Energy Consumption by State as of 2005
US Energy Consumption by State  as of 2005US Energy Consumption by State  as of 2005
US Energy Consumption by State as of 2005Bruce LaCour
 

Viewers also liked (20)

Día de san valentín
Día de san valentínDía de san valentín
Día de san valentín
 
How does your media product represent particular social
How does your media product represent particular socialHow does your media product represent particular social
How does your media product represent particular social
 
Quesitonaire pie
Quesitonaire pieQuesitonaire pie
Quesitonaire pie
 
Five Key Principles to Embrace & Engage the Multi-Channel Consumer
Five Key Principles to Embrace & Engage the Multi-Channel ConsumerFive Key Principles to Embrace & Engage the Multi-Channel Consumer
Five Key Principles to Embrace & Engage the Multi-Channel Consumer
 
Self talk
Self talkSelf talk
Self talk
 
You Are My All in All
You Are My All in AllYou Are My All in All
You Are My All in All
 
BIBINGEORGETHOMAS.docx
BIBINGEORGETHOMAS.docxBIBINGEORGETHOMAS.docx
BIBINGEORGETHOMAS.docx
 
Print Ad Marketing Plan
Print Ad Marketing PlanPrint Ad Marketing Plan
Print Ad Marketing Plan
 
Газета Мала Батьківщина №1 (2012)
Газета Мала Батьківщина №1 (2012)Газета Мала Батьківщина №1 (2012)
Газета Мала Батьківщина №1 (2012)
 
T. suchman plenary friday & saturday building a culture
T. suchman plenary friday & saturday building a cultureT. suchman plenary friday & saturday building a culture
T. suchman plenary friday & saturday building a culture
 
Roses
RosesRoses
Roses
 
少子化對大學校院的衝擊與因應之道~黃聰亮教授
少子化對大學校院的衝擊與因應之道~黃聰亮教授 少子化對大學校院的衝擊與因應之道~黃聰亮教授
少子化對大學校院的衝擊與因應之道~黃聰亮教授
 
Bautista - PICARD 2011 Presentation
Bautista - PICARD 2011 PresentationBautista - PICARD 2011 Presentation
Bautista - PICARD 2011 Presentation
 
UIC Thesis Cancare
UIC Thesis CancareUIC Thesis Cancare
UIC Thesis Cancare
 
考試沒教的事
考試沒教的事考試沒教的事
考試沒教的事
 
3.àrees funcionals decisions financeres
3.àrees funcionals   decisions financeres3.àrees funcionals   decisions financeres
3.àrees funcionals decisions financeres
 
Conclusion ecc 2012 marc pattinson
Conclusion ecc 2012 marc pattinsonConclusion ecc 2012 marc pattinson
Conclusion ecc 2012 marc pattinson
 
Dispositivos de multimedia
Dispositivos de multimediaDispositivos de multimedia
Dispositivos de multimedia
 
Synthesis multimedia learning
Synthesis multimedia learningSynthesis multimedia learning
Synthesis multimedia learning
 
US Energy Consumption by State as of 2005
US Energy Consumption by State  as of 2005US Energy Consumption by State  as of 2005
US Energy Consumption by State as of 2005
 

Similar to A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers

The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research ObjectsCarole Goble
 
VRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffVRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffHeather Seneff
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Lucy McKenna
 
Authors' and Publications' Citations knowledge base
Authors' and Publications' Citations knowledge base Authors' and Publications' Citations knowledge base
Authors' and Publications' Citations knowledge base Leila Zemmouchi-Ghomari
 
Current metadata landscape in the library world Getaneh Alemu
Current metadata landscape in the library world Getaneh AlemuCurrent metadata landscape in the library world Getaneh Alemu
Current metadata landscape in the library world Getaneh AlemuGetaneh Alemu
 
Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...Nancy Pontika
 
Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Figoblog
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
 
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...innovatics
 
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...Open Science Fair
 
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyPRELIDA Project
 
RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
 RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
RDA implementation: the new cataloguing standard in Europe - Dilyana DuchevaLISDISConference
 
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Trish Rose-Sandler
 
eResources in Academic Libraries
eResources in Academic LibrarieseResources in Academic Libraries
eResources in Academic Librariesottumtk
 

Similar to A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers (20)

The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
Ji cv6n2
Ji cv6n2Ji cv6n2
Ji cv6n2
 
VRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffVRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_Seneff
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
"In the Early Days of a Better Nation": Enhancing the power of metadata today...
"In the Early Days of a Better Nation": Enhancing the power of metadata today..."In the Early Days of a Better Nation": Enhancing the power of metadata today...
"In the Early Days of a Better Nation": Enhancing the power of metadata today...
 
Authors' and Publications' Citations knowledge base
Authors' and Publications' Citations knowledge base Authors' and Publications' Citations knowledge base
Authors' and Publications' Citations knowledge base
 
FAIRy Stories
FAIRy StoriesFAIRy Stories
FAIRy Stories
 
Current metadata landscape in the library world Getaneh Alemu
Current metadata landscape in the library world Getaneh AlemuCurrent metadata landscape in the library world Getaneh Alemu
Current metadata landscape in the library world Getaneh Alemu
 
Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
Scholze imcw 2014-11-25
Scholze imcw 2014-11-25Scholze imcw 2014-11-25
Scholze imcw 2014-11-25
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
 
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
 
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
 
RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
 RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
 
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
 
eResources in Academic Libraries
eResources in Academic LibrarieseResources in Academic Libraries
eResources in Academic Libraries
 

Recently uploaded

Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptxMusic 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptxleah joy valeriano
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 

Recently uploaded (20)

FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptxMusic 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
Music 9 - 4th quarter - Vocal Music of the Romantic Period.pptx
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 

A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers

  • 1. A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers Adam Chandler Cornell University Library Cornell University Library, Metadata Working Group Forum 16 October 2009
  • 3. OpenURL model cont. incoming OpenURL http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004&url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange&rft.auinit=c&rft.aulast=merk&rft.date=2009&rft.epage=162&rft.genre=article&rft.issn=0737-8831&rft.issue=1&rft.place=bingley&rft.pub=emerald+group+publishing+limited&rft.spage=151&rft.stitle=libr+hi+tech&rft.title=library+hi+tech&rft.volume=27&rfr_id=info:sid/www.isinet.com:wok:wos&rft.au=scholze,+f&rft.au=windisch,+n&rft_id=info:doi/10.1108%2f07378830910942991/ in our knowledge base? title: Library hi tech issn: 0737-8831 start date: 19970101 end date: link-to syntax for Emerald http://www.emeraldinsight.com/rpsv/cgi-bin/cgi?body=linker&reqidx=#@ISSN-HYPHEN#(#@DATE#)#@VOLUME#:#@ISSUE#L.#@SPAGE#
  • 4. OpenURL is pervasive Cornell link resolver alone: July 1, 2008 – June 30, 2009: 402,000 OpenURL service requests. 402,000 * 123(ARL libraries) = 49 million
  • 5. Cornell’s top 10 OpenURL sources Web of Knowledge WorldCat Local Google Scholar Webfeat (our “Find Articles” service) EBSCOHost OCLC FirstSearch SilverPlatter Weill Cornell Medical Center SciFinder Scholar PubMed
  • 7. example OpenURL (1) http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004 &url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx &rft_val_fmt=info:ofi/fmt:kev:mtx:journal &rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange &rft.auinit=c &rft.aulast=merk &rft.date=2009 &rft.epage=162 &rft.genre=article &rft.issn=0737-8831
  • 8. example OpenURL (2) &rft.issue=1 &rft.place=bingley &rft.pub=emerald+group+publishing+limited &rft.spage=151 &rft.stitle=libr+hi+tech &rft.title=library+hi+tech &rft.volume=27 &rfr_id=info:sid/www.isinet.com:wok:wos &rft.au=scholze,+f &rft.au=windisch,+n &rft_id=info:doi/10.1108%2f07378830910942991/
  • 9. … but quality of experience is difficult to benchmark Wrong start end date in the local library's holdings knowledge base (see NISO KBART) Semantically inaccurate metadata from the OpenURL origin (wrong ISSN, for example) Wrong link-to syntax in link resolver Fragile handling of incoming links by content provider
  • 10. … but quality of experience is difficult to benchmark Inaccurate or missing Crossref DOI URL (sometimes the DOI registration process is out of sync with the mounting of articles) Subscription errors (especially with the start of a new calendar year) Syntactically incorrect or missing metadata from the OpenURL origin
  • 11. Literature review I can identify no systematic study designed and carried out to benchmark the quality of linking. The OpenURL standard was introduced some ten years ago.
  • 12. Wakimoto, Walker, and Dabbour (2006) Main finding: Users just expect full-text. When they do not get it they are disappointed. Jina Choi Wakimoto, David S. Walker, and Katherine S. Dabbour (2006). "The Myths and Realities of SFX in Academic Libraries." The Journal of Academic Librarianship 32 (2): 127–136
  • 13. Wakimoto, Walker, and Dabbour (2006) "Where does SFX start and where does it end? If an SFX request does not result in a full-text link, does the problem lie with the source database’s metadata, the construction of the OpenURL request, the SFX KnowledgeBase, the SFX software, the resulting target resource, or even the local library’s collection development plan?" (p. 134) Jina Choi Wakimoto, David S. Walker, and Katherine S. Dabbour (2006). "The Myths and Realities of SFX in Academic Libraries." The Journal of Academic Librarianship 32 (2): 127–136
  • 14. Blake and Knudson (2002) “Increased awareness of bibliographic/citation standards by authors. Increased submission of publications with bibliographical references reflecting the accepted standards.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
  • 15. Blake and Knudson (2002) “Increased outreach by librarians to authors emphasizing and promoting the importance of citation standards for electronic document retrieval.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
  • 16. Blake and Knudson (2002) “Increased communication between primary publishers and secondary publishers. Metadata corrections and updates need to be better coordinated.” (NISO KBART role) Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
  • 17. Blake and Knudson (2002) “Increased consistency in metadata within a single database and across databases. This would result in a higher success rate of linking and would allow the algorithms to be simpler. Simpler algorithms are easier to maintain and modify.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
  • 18. Hughes (2004) Hughes describes an initiative of the Open Language Archives Community (OLAC), a consortium of linguistic data archives, to create an infrastructure to support metadata quality assessment within a specialized Open Archives Initiative (OAI) community. . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 19. Hughes (2004) Metadata quality should be evaluated on a per record and per collection basis and assessed against the baseline of broader community practice. Metadata quality requires both structural and semantic validation. . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 20. Hughes (2004) Goals: establish a baseline against which future instances can be compared; provide assistance to data providers; evaluate a set of domain-grounded controlled vocabularies. . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 21. Hughes’ approach Each metadata record score from 0 - 10. There are two parts, a "Code Existence Score and an Element Absence Penalty," with weighting. The Code Existence Score is specific to the OLAC communities use of Dublin Core extensions. The Element Absence Penalty is based on the premise that the usefullness of a given metadata decreases in the absence of core metadata fields. The absence of a core element results in a negative 0.2 penalty. Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 22. Hughes’ approach From this simple approach, an array of metrics are derived: archive diversity; metadata quality; core elements per record; core element usage; code usage; code and element usage; star rating. From these metrics a score is computed for each metadata record, each archive, and the community as a whole. Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 23. Mellon funded planning grant for L'Année philologique 1. Canonical Citation Linking: http://cwkb.org In collaboration with Eric Rebillard, Professor, Classics and History, and David Ruddy, Cornell University Library 2. OpenURL Quality Is it possible to build a tool for evaluating the quality of OpenURLs from a content provider?
  • 24. Key findings from 2008 Mellon OpenURL quality investigation Hughes’ approach to metadata evaluation is excellent scaffolding to help build a model for OpenURL metadata evaluation, but it does not match the problem exactly.
  • 25. Constant: Core elements used by content providers in their link-to targets title - 64% spage - 64% volume - 61% issue - 60% date - 48% aulast - 47% issn - 35% atitle - 35% DOI - 14% ISBN – 5% Based on an analysis of link-tos in the Cornell instance of the III WebBridge link resolver product.
  • 26. Variable: Frequency of element string patterns for all sources
  • 27. aulast First author's family name. This may be more than one word. In many citations, the author's family name is recorded first and is followed by a comma, e.g. Smith, Fred James is recorded as "aulast=smith"
  • 28. aulast if ($e =~ /aulast/) { $patterns{$neworigin}{$newsid}{$e}++; if ($elementhash{$e} =~ /^[A-Za-z]+$/) { $patterns{$neworigin}{$newsid}{"aulast_simple"}++; } elsif ($elementhash{$e} =~ /^[A-Za-z]+, .+$/) { $patterns{$neworigin}{$newsid}{"aulast_comma"}++; } elsif ($elementhash{$e} =~ /^[A-Z][a-z]+( [A-Z])+$/) { $patterns{$neworigin}{$newsid}{"aulast_simpleplusinitial"}++;} else { $patterns{$neworigin}{$newsid}{"aulast_other"}++; } }
  • 29. aulast_other examples Ryan S Miller Louise D Bryant DAVID J MCKENZIE %C4%90okovi%C4%87 Indu B Ahluwalia Carreras-Sangr%c3%a0 Bautista-Casta%C3%B1o O%27Shea Melissa Ventura Marra Guan XueYing%3B Yu Nan%3B ShangguanXiaoXia
  • 30. spage First page number of a start/end (spage-epage) pair. Note that pages are not always numeric.
  • 31. spage if ($e =~ /spage/) { $patterns{$neworigin}{$newsid}{$e}++; if ($elementhash{$e} =~ /^+$/) { $patterns{$neworigin}{$newsid}{"spage_number"}++; } elsif ($elementhash{$e} =~ /^+-+$/) { $patterns{$neworigin}{$newsid}{"spage_number_number"}++; } elsif ($elementhash{$e} =~ /[A-Za-z].+/) { $patterns{$neworigin}{$newsid}{"spage_string_w_number"}++; } else { $patterns{$neworigin}{$newsid}{"spage_other"}++; } }
  • 32. spage_other examples 1033 (6 pages) 85(19) 575 (11 pages) 283...290 PHYS GLRM 58,+VI
  • 33. date The publication date of the item or bundle encoded in the "Complete date" variant of ISO8601 (see http://www.w3.org/TR/NOTE-datetime). This format is YYYYMM- DD where YYYY is the four-digit year, MM is the month of the year between 01 (January) and 12 (December), and DD is the day of the month between 01 and 28 or 29 or 30 or 31, depending on length of the month and whether it is a leap year.
  • 34. date if ($e =~ /date/) { $patterns{$neworigin}{$newsid}{$e}++; if ($elementhash{$e} =~ /^{4}$/) { $patterns{$neworigin}{$newsid}{"date_dddd"}++; } elsif ($elementhash{$e} =~ /^{4}-{2}$/) { $patterns{$neworigin}{$newsid}{"date_dddd-dd"}++; } elsif ($elementhash{$e} =~ /^{4}-{2}-{2}$/) { $patterns{$neworigin}{$newsid}{"date_dddd-dd-dd"}++; } elsif ($elementhash{$e} =~ /^{4}-{4}$/) { $patterns{$neworigin}{$newsid}{"date_dddd-dddd"}++; } elsif ($elementhash{$e} =~ /^{8}$/) { $patterns{$neworigin}{$newsid}{"date_dddddddd"}++; } else {$patterns{$neworigin}{$newsid}{"date_dateother"}++; } }
  • 35. date_other examples 1956 July %7E1994 June 5%2C 2002 JUN 30 05 2006%282007%29 1922,+April+25th %5B%5B1943-06-19%5D%5D
  • 36. issn International Standard Serials Number (ISSN). The issn may contain a hyphen, e.g. "1041-5653"
  • 37. issn if ($e =~ /issn/) { $patterns{$neworigin}{$newsid}{$e}++; if ($elementhash{$e} =~ /^{4}-{3}./) { $patterns{$neworigin}{$newsid}{"issn_number_number"}++; } elsif ($elementhash{$e} =~ /^{7}./) { $patterns{$neworigin}{$newsid}{"issn_number"}++; } else { $patterns{$neworigin}{$newsid}{"issn_other"}++; } }
  • 38. issn_other examples 0065-2598%28print%29 0018-5345+%28ISSN+print%29 ISSN ISBN 0-9525091-5-6. 0021-8375%28print%29%7C1439-0361%28electronic%29 1471-2164+%28ISSN+online%29 0191-8699%3B0191-8699 0741-8329 (Print)%3B NLM Unique Journal Identifier%3A 8502311
  • 39. How often out of 402,000 Cornell OpenURLs?
  • 40. flat file output logsourceyear quarter origin sid metric count cornell 2009 Q1 csacsa:commabs-set-c atitle 154 cornell 2009 Q1 csacsa:commabs-set-c atitle_colon 101 cornell 2009 Q1 csacsa:commabs-set-c atitle_other 53 cornell 2009 Q1 csacsa:commabs-set-c aulast 159 cornell 2009 Q1 csacsa:commabs-set-c aulast_other 4 cornell 2009 Q1 csacsa:commabs-set-c aulast_simple 155 cornell 2009 Q1 csacsa:commabs-set-c date 159 cornell 2009 Q1 csacsa:commabs-set-c date_dddd 110 cornell 2009 Q1 csacsa:commabs-set-c date_dddd-dd 49 cornell 2009 Q1 csacsa:commabs-set-c isbn 6 cornell 2009 Q1 csacsa:commabs-set-c isbn_10 6 cornell 2009 Q1 csacsa:commabs-set-c issn 135 cornell 2009 Q1 csacsa:commabs-set-c issn_number-number 135 cornell 2009 Q1 csacsa:commabs-set-c issue 136 cornell 2009 Q1 csacsa:commabs-set-c issue_number 132 cornell 2009 Q1 csacsa:commabs-set-c issue_number_dash_number2 cornell 2009 Q1 csacsa:commabs-set-c issue_other 2 cornell 2009 Q1 csacsa:commabs-set-c spage 153 cornell 2009 Q1 csacsa:commabs-set-c spage_number 153 cornell 2009 Q1 csacsa:commabs-set-c title 160 cornell 2009 Q1 csacsa:commabs-set-c total 160 cornell 2009 Q1 csacsa:commabs-set-c volume 139 cornell 2009 Q1 csacsa:commabs-set-c volume_number 139
  • 42. Next steps create a NISO structure to wrap around the metrics: “NISO OpenURL Quality Index” add non-Cornell data from libraries and link resolver vendors (model is agnostic to source) confirm and publicize key elements used by target syntaxes can the quality of the global OpenURL network be modeled mathematically?
  • 43. How to stay in the loop http://openurlquality.blogspot.com/ Adam ChandlerDatabase Management and Electronic Resources Research LibrarianCentral Library OperationsCornell University Librarytel: 607-255-5760email: alc28@cornell.edu