IU's HathiTrust Works with KU on Text Analysis Project

A new text analysis project at Indiana University, which just won a $500,000 grant from Andrew W. Mellon Foundation, will focus initially on a collection of Black fiction compiled at the University of Kansas. The award will allow experts in IU's HathiTrust Digital Library to create reusable worksets and research models for analyzing digital collections. The purpose of "Scholar-Curated Worksets for Analysis, Reuse & Dissemination" (SCWAReD, pronounced "squared") is to come up with new methods for working with digital collections that emphasize content tied to "historically under-resourced and marginalized textual communities," as the campus explained in a press release.

The broad mission for HathiTrust is to provide tools and services to support computational research on a growing collection of digital texts. SCWAReD will apply human expertise with advanced technologies to identify, recover and curate texts by writers "hidden among vast digital library collections."

The Illinois team will be led by Stephen Downie, co-director of HathiTrust and associate dean for research in the School of Information Sciences at the University of Illinois, Urbana-Champaign.

The first model to be produced by SCWAReD will involve a joint enterprise with the University of Kansas' Project on the History of Black Writing (HBW), begun in 1983 by Maryemma Graham, a professor of English. After compiling a dedicated archive of Black fiction, HBW created the Black Book Interactive Project (BBIP) at KU, to increase the visibility of and research on Black-authored materials. The BBIP team will work with HathiTrust to produce a workset on the HBW corpus. Results will include an analysis of the texts, generation of "derived data," documentation and a project whitepaper. Graham will also serve a role in selecting three other competitively chosen scholar-curated collections to be funded under SCWAReD.

"This partnership allows us to realize the original intent of what many call the 'digital turn': an ability to share knowledge more broadly and to advance scholarship through collaborative opportunities enhanced by technology," Graham noted. "Unfortunately, the legacy of racialized practices has followed us and made too much of our knowledge invisible. While it might not be possible to start on a level playing field, we can work together to develop a model for building more inclusive databases and content-specific worksets that derive from them. This is an unusual, but much needed partnership that can be replicated across the digital landscape: We both bring something to the table, we both care about research, and we both care about what a rigorous investigation into a more diverse knowledge network can tell us."

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.

Featured

  • laptop on a clean desk with colorful image icons dynamically emanating from the screen

    Stability AI Releases Stable Diffusion 3.5 Text-to-Image Generation Model

    Stability AI, developer of open source models focused on text-to-image generation, has released Stable Diffusion 3.5, the latest version of its deep learning, text-to-image model.

  • happy woman sitting in front of computer

    Delightful Progress: Kuali's Legacy of Community and Leadership

    CEO Joel Dehlin updates us on Kuali today, and how it has thrived as a software company that succeeds in the tech marketplace while maintaining the community values envisioned in higher education years ago.

  • An abstract depiction of a virtual reality science class featuring two silhouetted figures wearing VR headsets

    University of Nevada Las Vegas to Build VR Learning Hub for STEM Courses

    A new immersive learning center at the University of Nevada, Las Vegas is tapping into the power of virtual reality to support STEM engagement and student success. The institution has partnered with Dreamscape Learn on the initiative, which will incorporate the company's interactive VR platform into introductory STEM courses.

  • landscape photo with an AI rubber stamp on top

    California AI Watermarking Bill Garners OpenAI Support

    ChatGPT creator OpenAI is backing a California bill that would require tech companies to label AI-generated content in the form of a digital "watermark." The proposed legislation, known as the "California Digital Content Provenance Standards" (AB 3211), aims to ensure transparency in digital media by identifying content created through artificial intelligence. This requirement would apply to a broad range of AI-generated material, from harmless memes to deepfakes that could be used to spread misinformation about political candidates.