Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement

Bibliographic Details
Published in: The Code4Lib Journal, no. 46
Main Author: Erin Wolfe
Format: Journal Article
Language: English
Published: Code4Lib, 01.11.2019
ISSN: 1940-5758

Summary: The Black Book Interactive Project at the University of Kansas (KU) is developing an expanded corpus of novels by African American authors, with an emphasis on lesser-known writers and a goal of expanding research in this field. Each novel is analyzed against a custom metadata schema that emphasizes race-related elements, covering areas such as literary style, targeted content analysis, and historical context. Librarians at KU have developed a variety of computational text analysis processes to assist with specific aspects of this metadata collection, including text mining and natural language processing, automated subject extraction based on word sense disambiguation, and harvesting data from Wikidata.