On The Books

On The Books

Current

How do we identify racist language in legal documents? Instead of proliferating racist ideas, can algorithms help us better study the history of race and advocate for justice? An interdisciplinary team of UNC researchers, scholars, and experts–including several Research Hub librarians–developed a text mining project to answer these questions.

On the Books: Jim Crow and Algorithms of Resistance is a project of the University of North Carolina at Chapel Hill Libraries that used text mining and machine learning to discover Jim Crow and racially-based legislation signed into law in North Carolina between Reconstruction and the Civil Rights Movement. The team developed:

A publicly accessible, plain-text corpus of North Carolina Session Laws from 1866-1967 for general legal and historical research, and a list of Jim Crow laws discovered.
A public git repository containing general scripts, open source software, and documentation for the benefit of similar projects.
A short white paper describing their methods and workflows for accurate, large-scale OCR text conversion and text analysis for future teams seeking to create large-scale digital corpora and/or experiment with data-driven discovery.
A website for educators and researchers interested in Southern and African American History that lists and contextualizes the North Carolina segregation laws identified.


See something wrong with this page or something that should be updated? Click here to let us know!