请输入您要查询的百科知识:

 

词条 Open Archives Initiative Protocol for Metadata Harvesting
释义

  1. History

  2. Registries

  3. Uses

  4. Software

  5. Archives

  6. Workshops

  7. See also

  8. Notes

  9. References

  10. External links

{{more footnotes|date=October 2008}}

The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a protocol developed for harvesting metadata descriptions of records in an archive so that services can be built using metadata from many archives. An implementation of OAI-PMH must support representing metadata in Dublin Core, but may also support additional representations.[1]

The protocol is usually just referred to as the OAI Protocol.

OAI-PMH uses XML over HTTP. Version 2.0 of the protocol was released in 2002; the document was last updated in 2015. It has a Creative Commons license BY-SA.

History

In the late 1990s, Herbert Van de Sompel (Ghent University) was working with researchers and librarians at Los Alamos National Laboratory (US) and called a meeting to address difficulties related to interoperability issues of e-print servers and digital repositories. The meeting was held in Santa Fe, New Mexico, in October 1999. A key development from the meeting was the definition of an interface that permitted e-print servers to expose metadata for the papers it held in a structured fashion so other repositories could identify and copy papers of interest with each other. This interface/protocol was named the "Santa Fe Convention".[1]

Several workshops were held in 2000 at the ACM Digital Libraries conference[2] and elsewhere to share the ideas from the Santa Fe Convention. It was discovered at the workshops that the problems faced by the e-print community were also shared by libraries, museums, journal publishers, and others who needed to share distributed resources. To address these needs, the Coalition for Networked Information[3] and the Digital Library Federation[4] provided funding to establish an Open Archives Initiative (OAI) secretariat managed by Herbert Van de Sompel and Carl Lagoze. The OAI held a meeting at Cornell University (Ithaca, New York) in September 2000 to improve the interface developed at the Santa Fe Convention. The specifications were refined over e-mail.

OAI-PMH version 1.0 was introduced to the public in January 2001 at a workshop in Washington D.C., and another in February in Berlin, Germany. Subsequent modifications to the XML standard by the W3C required making minor modifications to OAI-PMH resulting in version 1.1. The current version, 2.0, was released in June 2002. It contained several technical changes and enhancements and is not backward compatible.

Registries

The OAI Protocol was adopted by many digital libraries, institutional repositories, and digital archives. Although registration is not mandatory, it is encouraged.

There are several large registries of OAI-compliant repositories:

  1. The Open Archives list of registered OAI repositories
  2. The OAI registry at University of Illinois at Urbana-Champaign
  3. [https://web.archive.org/web/20060302071423/http://celestial.eprints.org/ The Celestial OAI registry]
  4. [https://web.archive.org/web/20060322163348/http://archives.eprints.org/ Eprint’s Institutional Archives Registry]
  5. [https://web.archive.org/web/20090628103036/http://www.openarchives.eu/ Openarchives.eu The European Guide to OAI-PMH compliant repositories in the world]
  6. ScientificCommons.org A worldwide service and registry
  7. [https://www.finna.fi Finna.fi the material library of Finnish archives, libraries and museums]

Uses

Some commercial search engines use OAI-PMH to acquire more resources. Google initially included support for OAI-PMH when launching sitemaps, however decided to support only the standard XML Sitemaps format in May 2008.[5] In 2004, Yahoo! acquired content from OAIster (University of Michigan) that was obtained through metadata harvesting with OAI-PMH. Wikimedia uses an OAI-PMH repository to provide feeds of Wikipedia and related site updates for search engines and other bulk analysis/republishing endeavors.[6] Especially when dealing with thousands of files being harvested every day, OAI-PMH can help in reducing the network traffic and other resource usage by doing incremental harvesting.[7] NASA's Mercury metadata search system uses OAI-PMH to index thousands of metadata records from Global Change Master Directory (GCMD) every day.[8]

The mod_oai project is using OAI-PMH to expose content to web crawlers that is accessible from Apache Web servers.

Software

OAI-PMH is based on a client–server architecture, in which "harvesters" request information on updated records from "repositories". Requests for data can be based on a datestamp range, and can be restricted to named sets defined by the provider. Data providers are required to provide XML metadata in Dublin Core format, and may also provide it in other XML formats.

A number of software systems support the OAI-PMH, including Fedora, EThOS from the British Library, GNU EPrints from the University of Southampton, Open Journal Systems from the Public Knowledge Project, Desire2Learn, DSpace from MIT, HyperJournal from the University of Pisa, Digibib from Digibis, MyCoRe, Primo, DigiTool, Rosetta and MetaLib from Ex Libris, ArchivalWare from PTFS, DOOR [9] from the eLab[10] in Lugano, Switzerland, panFMP from the PANGAEA (data library),[11] SimpleDL from Roaring Development, and jOAI.[12]

Archives

A number of large archives support the protocol including arXiv and the CERN Document Server.

Workshops

A dedicated workshop, The CERN Workshop on Innovations in Scholarly Communication, has been held at CERN in Geneva on a regular basis since 2001. It is now co-organised by University of Geneva and CERN every two years in June. OAI8 was held on June 19th-21st, 2013; [https://indico.cern.ch/event/332370/ OAI9] was held on June 17–19, 2015; and [https://indico.cern.ch/event/405949/ OAI10] was held on June 21 to 23, 2017.

See also

  • Data format management
  • Digital curation
  • Digital preservation
  • File format
  • Dublin Core, an ISO metadata standard
  • National Digital Information Infrastructure and Preservation Program (NDIIPP)
  • National Digital Library Program (NDLP)
  • Metadata Encoding and Transmission Standard (METS) maintained by the Library of Congress
  • Implementation Strategies (PREMIS)
  • LOCKSS
  • Search as a service
  • Web archiving

Notes

1. ^{{Cite journal |author= Marshall Breeding |date= September 2002 |title= Understanding the Protocol for Metadata Harvesting of the Open Archives Initiative |work= Computers in Libraries |volume=8 |number= 24 |pages= 24–29 |url= http://www.librarytechnology.org/ltg-displaytext.pl?RC=9944 |accessdate= October 11, 2013 }}
2. ^ACM Digital Libraries conference
3. ^Coalition for Networked Information
4. ^Digital Library Federation
5. ^Google Webmaster blog
6. ^{{Cite document |title=Wikimedia update feed service |url=http://meta.wikimedia.org/wiki/Wikimedia_update_feed_service |publisher=Wikimedia Meta-Wiki |accessdate=14 July 2013}}
7. ^incremental harvesting
8. ^{{Cite journal |title=Data sharing and retrieval uses OAI-PMH |author1=R. Devarakonda |author2=G. Palanisamy |author3=J. Green |author4=B. Wilson |year=2010 |journal=Earth Science Informatics |volume=4 |issue=1 |pages=1–5 |publisher=Springer Berlin / Heidelberg |doi=10.1007/s12145-010-0073-0}}{{inconsistent citations}}
9. ^DOOR
10. ^eLab
11. ^panFMP
12. ^jOAI

References

  • {{cite conference |last1=Lagoze |first1=Carl |last2=Van de Sompel |first2=Herbert |author-link2=Herbert Van de Sompel |year=2001 |title=The Open Archives Initiative: Building a Low-Barrier Interoperability Framework |url=http://www.openarchives.org/documents/jcdl2001-oai.pdf |booktitle=Proceedings of the first ACM/IEEE-CS Joint Conference on Digital Libraries |conference=JCDL'01 |pages=54–62 |doi=10.1145/379437.379449 |isbn=1-58113-345-6 |citeseerx=10.1.1.161.6800}}
  • {{cite journal |last1=Lynch |first1=Clifford A. |author-link1=Clifford Lynch |date=August 2001 |title=Metadata harvesting and the open archives initiative |url=http://old.arl.org/resources/pubs/br/br217/br217mhp.shtml |journal=ARL: A Bimonthly Report |publisher=ARL |issue=217 |pages=1–9}}
  • {{cite journal |last1=McCown |first1=Frank |last2=Liu |first2=Xiaoming |last3=Nelson |first3=Michael L. |last4=Zubair |first4=Mohammed |date=March–April 2006 |title=Search Engine Coverage of the OAI-PMH Corpus |journal=IEEE Internet Computing |volume=10 |issue=2 |pages=66–73 |url=http://library.lanl.gov/cgi-bin/getfile?LA-UR-05-9158.pdf |doi=10.1109/MIC.2006.41}}
  • {{cite journal |last1=Van de Sompel |first1=Herbert |author-link1=Herbert Van de Sompel |last2=Lagoze |first2=Carl |year=2000 |title=The Santa Fe Convention of the Open Archives Initiative |journal=D-Lib Magazine |volume=6 |issue=2 |doi=10.1045/february2000-vandesompel-oai}}
  • {{cite journal |last1=Van de Sompel |first1=Herbert |author-link1=Herbert Van de Sompel |last2=Young |first2=Jeffrey A. |last3=Hickey |first3=Thomas B. |year=2003 |title=Using the OAI-PMH ... Differently |journal=D-Lib Magazine |volume=9 |issue=7/8 |doi=10.1045/july2003-young}}
  • {{cite journal |title=Data sharing and retrieval uses OAI-PMH |last1=Devarakonda |first1=Ranjeet |last2=Palanisamy |first2=Giri |last3=Green |first3=Jim |last4=Wilson |first4=Bruce |year=2010 |journal=Earth Science Informatics |volume=4 |issue=1 |pages= 1–5 |publisher=Springer Berlin / Heidelberg |doi = 10.1007/s12145-010-0073-0}}

External links

  • [https://web.archive.org/web/20100314214736/http://oai.sdu.edu.tr/ Suleyman Demirel University Open Archives Harvester]
  • Protocol specification
  • [https://www.loc.gov/library/libarch-digital.html National Library of Congress, Digital Collections and Programs]
  • Library of Congress, National Digital Information Infrastructure and Preservation Program
  • [https://www.loc.gov/webcapture/ Library of Congress, Web Capture]
  • [https://play.google.com/store/apps/details?id=de.appwerft.aiopmh Android App]
{{open access navbox}}OAI-PMH

5 : Digital libraries|Internet protocols|Metadata|Open access projects|Archival science

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/12 2:45:54