The Caselaw Access Project makes 360 years of U.S. case law available as a machine-readable text corpus. In developing a research community around the dataset, we’ve been creating and sharing resources for getting started.
- Return Cases from 100 Years Ago Today with the CAP API
- Retrieve Cases by Citation with the CAP Case Browser
- Get Opinion Author
- Creating a Data Analysis Workspace with Voyant and the CAP API
The Programming Historian shares peer-reviewed tutorials for computational workflows in the humanities. Here are a group of their guides for working with text data, from processing to analysis:
- Cleaning Data with OpenRefine
- Reshaping JSON with jq
- Basic Text Processing in R
- Getting Started with Topic Modeling and MALLET
- Corpus Analysis with Antconc
- Analyzing Documents with TF-IDF
We want to share and build ways to start working with Caselaw Access Project data. Do you have an idea for a future tutorial? Drop us a line to let us know!