In the spirit of the Collections as Data project, Leyburn Library is publishing datasets derived from our archival collections. These datasets are intended for computationally informed research and teaching, such as text analysis or data visualization.

In addition to our archival data, you will find a listing of datasets available through our licensed resources, such as Gale Primary Sources.

This material is made available for use in research, teaching, and private study, pursuant to U.S. copyright law. The user assumes full responsibility for any use of the materials, including but not limited to infringement of copyright and publication rights of reproduced materials. Any materials used should be fully credited to the source.

Questions? Contact Mackenzie Brooks (brooksm [at] or Paula Kiser (kiserp [at]

Archival Data

Alumni Magazine

Date: 1941-2020. Published quarterly.

Format: Text corpus.

Download the dataset on GitHub

View this publication in the Digital Archive.

Front page of Ring-tum Phi newspaper

Ring-tum Phi

Date: 1897-2013. Published weekly (varies).

Format: Text corpus.

Download the dataset on GitHub

View this publication in the Digital Archive.
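As a starting point for text analysis, here is a minimal Python sketch that counts word frequencies across a downloaded text corpus. It assumes the corpus has been saved locally as a folder of plain-text `.txt` files; the folder name in the usage comment and the function name `word_counts` are illustrative, not part of the datasets, and the actual file layout on GitHub may differ.

```python
from collections import Counter
from pathlib import Path
import re


def word_counts(corpus_dir):
    """Count lowercase word tokens across all .txt files under corpus_dir.

    Assumption: the corpus is a folder (possibly nested) of UTF-8 text files.
    """
    counts = Counter()
    for path in sorted(Path(corpus_dir).rglob("*.txt")):
        # errors="replace" keeps going if a file contains stray bytes from OCR
        text = path.read_text(encoding="utf-8", errors="replace")
        # Very simple tokenizer: runs of ASCII letters and apostrophes
        counts.update(re.findall(r"[a-z']+", text.lower()))
    return counts


# Example usage (hypothetical local folder name):
# counts = word_counts("ring-tum-phi")
# print(counts.most_common(20))
```

The tokenizer here is deliberately crude. For real research you will likely want to handle OCR errors, hyphenation, and stop words with a dedicated text analysis library.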

Cover of Shenandoah magazine from Winter 1975. Black and white image of tree in front of abandoned house.

Shenandoah

  1. Index of magazine contents from 1950-2015.
  2. Index of editors and their role for each issue.

Format: Tabular data in CSV.

Download the dataset on GitHub

Visit the Shenandoah website. Shenandoah has not been fully digitized, but can be accessed in print in Leyburn Library.
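If you would like to explore the CSV files before opening them in a spreadsheet or visualization tool, the following Python sketch reads a file and reports its column names and rows. The file name in the usage comment and the function name `load_rows` are hypothetical; check the GitHub repository for the actual file names and columns.

```python
import csv


def load_rows(csv_path):
    """Return (column_names, rows) for a CSV file with a header row.

    Each row is a dict keyed by column name. No particular columns are assumed.
    """
    # utf-8-sig reads plain UTF-8 and also strips a byte-order mark if present
    with open(csv_path, newline="", encoding="utf-8-sig") as f:
        reader = csv.DictReader(f)
        return reader.fieldnames, list(reader)


# Example usage (hypothetical file name):
# columns, rows = load_rows("shenandoah_contents.csv")
# print(columns)
# print(len(rows), "rows")
```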

Local History Data

Oak Grove Cemetery Data

The records of more than 6,400 graves can be searched, sorted, browsed, and downloaded (as a comma-separated values file) on the Oak Grove Cemetery Data project website.

Licensed Data

Gale Digital Collections

Through our licensing of content from Gale Digital Collections, we have access to text corpora for many of our primary source databases. Use of this material for text and data mining is governed by Gale’s terms of use.

Format: text corpus (XML).

Access to this data is limited to members of the W&L community.