U Illinois Grant To Tame Unstructured Data for Research

The Andrew Mellon Foundation last week awarded a $1.2 million grant to the Graduate School of Library and Information Science at the University of Illinois at Urbana-Champaign to help find ways to solve the so-called "80 percent problem."

The challenge is to develop tools for accessing the 80 percent of all information that is unstructured, does not reside in easily accessible computer formats, or is open source-based or not secret.

The grant, which will be shared with the National Center for Supercomputing Applications, will build on NCSA's D2K software--which analyzes data in a variety of research and business domains--and IBM's Unstructured Information Management Architecture.

The goal is to develop what the researchers call a Software Environment for the Advancement of Scholarly Research (SEASR), which will help bridge unstructured and structured data.

"There are trillions and trillions of bytes of data available, but the collections are dispersed and finding the relevant material is time consuming," said Michael Welge, the principal investigator on the project. "Someone who wants to research 19th century novels or the work of Cervantes has a wealth of information available to them, but without tools to help them they'll spend a long time searching that haystack for their particular needle."

About the Author

Paul McCloskey is contributing editor of Syllabus.
