IU's HathiTrust Works with KU on Text Analysis Project

A new text analysis project at Indiana University, which just won a $500,000 grant from Andrew W. Mellon Foundation, will focus initially on a collection of Black fiction compiled at the University of Kansas. The award will allow experts in IU's HathiTrust Digital Library to create reusable worksets and research models for analyzing digital collections. The purpose of "Scholar-Curated Worksets for Analysis, Reuse & Dissemination" (SCWAReD, pronounced "squared") is to come up with new methods for working with digital collections that emphasize content tied to "historically under-resourced and marginalized textual communities," as the campus explained in a press release.

The broad mission for HathiTrust is to provide tools and services to support computational research on a growing collection of digital texts. SCWAReD will apply human expertise with advanced technologies to identify, recover and curate texts by writers "hidden among vast digital library collections."

The Illinois team will be led by Stephen Downie, co-director of HathiTrust and associate dean for research in the School of Information Sciences at the University of Illinois, Urbana-Champaign.

The first model to be produced by SCWAReD will involve a joint enterprise with the University of Kansas' Project on the History of Black Writing (HBW), begun in 1983 by Maryemma Graham, a professor of English. After compiling a dedicated archive of Black fiction, HBW created the Black Book Interactive Project (BBIP) at KU, to increase the visibility of and research on Black-authored materials. The BBIP team will work with HathiTrust to produce a workset on the HBW corpus. Results will include an analysis of the texts, generation of "derived data," documentation and a project whitepaper. Graham will also serve a role in selecting three other competitively chosen scholar-curated collections to be funded under SCWAReD.

"This partnership allows us to realize the original intent of what many call the 'digital turn': an ability to share knowledge more broadly and to advance scholarship through collaborative opportunities enhanced by technology," Graham noted. "Unfortunately, the legacy of racialized practices has followed us and made too much of our knowledge invisible. While it might not be possible to start on a level playing field, we can work together to develop a model for building more inclusive databases and content-specific worksets that derive from them. This is an unusual, but much needed partnership that can be replicated across the digital landscape: We both bring something to the table, we both care about research, and we both care about what a rigorous investigation into a more diverse knowledge network can tell us."

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.

Featured

  • student reading a book with a brain, a protective hand, a computer monitor showing education icons, gears, and leaves

    4 Steps to Responsible AI Implementation

    Researchers at the University of Kansas Center for Innovation, Design & Digital Learning (CIDDL) have published a new framework for the responsible implementation of artificial intelligence at all levels of education.

  • glowing digital brain interacts with an open book, with stacks of books beside it

    Federal Court Rules AI Training with Copyrighted Books Fair Use

    A federal judge ruled this week that artificial intelligence company Anthropic did not violate copyright law when it used copyrighted books to train its Claude chatbot without author consent, but ordered the company to face trial on allegations it used pirated versions of the books.

  • server racks, a human head with a microchip, data pipes, cloud storage, and analytical symbols

    OpenAI, Oracle Expand AI Infrastructure Partnership

    OpenAI and Oracle have announced they will develop an additional 4.5 gigawatts of data center capacity, expanding their artificial intelligence infrastructure partnership as part of the Stargate Project, a joint venture among OpenAI, Oracle, and Japan's SoftBank Group that aims to deploy 10 gigawatts of computing capacity over four years.

  • laptop displaying a phishing email icon inside a browser window on the screen

    Phishing Campaign Targets ED Grant Portal

    Threat researchers at cybersecurity company BforeAI have identified a phishing campaign spoofing the U.S. Department of Education's G5 grant management portal.