The content on this wiki is being preserved for historical purposes, but is not being maintained and is probably no longer accurate.

For current information about DPLA, see the DPLA main site.

Build history/2012-02-17

From Digital Library of America Project
(Redirected from 17-02-2012)
Jump to: navigation, search

Contents

Disclaimers

Notice: The data and metadata currently deposited in the DPLA platform by DPLA partner institutions is provided for the purposes of experimentation. It is not to be used in any production system that is available publicly or privately, except for research and development purposes. Harvard University and the DPLA make no claim regarding the accuracy of the data or its copyright status, and are not responsible for any claims or damages which may result from its use.

Also: At this stage do not stand behind the current build. We cannot guarantee that it will work or, if it works that the inputs, outputs, and API will not change in ways that will ruin whatever you build. We will, however, document changes, and we encourage you to communicate with the dev core team in any of the ways listed on this wiki's homepage.

Notice: Data from the University of Illinois Urbana/Champaign is made available under terms described here.

Current Build

Release Notes

Data

  1. Added:
    1. Web content, including TED Talks, authors@Google, NPR episodes
  2. Biodiversity Heritage collection (c.50,000 records)
    1. biodiversitylibrary_org
  3. The Bancroft Library, from UC Berkeley (c.30,000 records)
    1. cdlib_org
  4. Pathetic dummy library item data
    1. 100K records, divided into 3 source collections:
      1. data_source_1
      2. data_source_2
      3. data_source_3
    2. Dummy fields, except for:
      1. Lib Congress Subject Headings
      2. Format
      3. Language
      4. Pub year
      5. Data type (item or collection)

Why dummy records? Because we want to avoid any and all discussion of the provenance of the information.

Functionality

  1. Added:
    1. Data mapped to DC++ schema, and unmapped data is stored as key:value pairs
      1. Unmapped values can be addressed via "dark_"+ key
    2. Switched to Mongo
  2. RestfulAPI
    1. Query against all data fields
    2. Returns JSON
  3. Item data model
    1. Simplified, Dublin-Core-ish
  4. Collections metadata provisionally absorbed into item data structure
    1. Marked as collections

Tools and Apps

  1. Documentation
  2. Bookshelf visualizer (thank you Annie Cain!)
Personal tools