The content on this wiki is being preserved for historical purposes, but is not being maintained and is probably no longer accurate.

For current information about DPLA, see the DPLA main site.

What types of metadata should be collected by a DPLA and what can be done to increase its interoperability

From Digital Library of America Project
Jump to: navigation, search
DPLA Wiki Navigation
About the DPLA
DPLA Website
Main PageBerkman Center
Board of Directors
Steering Committee
Dev portal
Ongoing Work
Workstreams
Audience and ParticipationContent and Scope
Financial/Business ModelsGovernance
Legal IssuesTechnical Aspects
Additional Activities
Beta SprintWorkshopsEvents
Media and Blog Mentions
Possible Models
List of Models
Concept Note
Get Involved
Community PortalSign on
Join the listservListserv archives
Weekly listserv recapsSuggested Resources

The DPLA will have to acquire a huge amount of metadata, and has the opportunity to generate much more. It needs that metadata in order to provide core services to its users, but it could also be made available for the development of applications by external developers, and to be mashed up with existing applications. For example, if given access to anonymized usage data and user ratings, a developer might create a highly tuned recommendation engine. Or, an existing application such as Wikipedia might want to link to books in the DPLA's collection.

Possible types of metatdata to collect

Potential types of metadata to collect.

  • Catalog data: Title, creator, license, url, year, language, etc.
  • Usage data about the DPLA site: Which works are played or checked out? Which links are clicked?
  • Usage data about works: Which works are played all the way to the end?
  • Marginalia, annotations
  • Circulation data from physical libraries
  • User ratings and reviews
  • User-created links among the works
  • Semantic relationships among objects (created by users, harvested from existing sites and collections, generated via analysis)
  • Geographic data about the subject of works, their creation, and where they are physically held


Issues

  • Privacy
  • Data standards for interoperability
  • Make the data available via bulk download and/or via APIs?
  • Identifying works that are "the same" at the relevant levels of the FRBR standard
  • Building a functional, useful API
Personal tools