Big Data Could Pose Unsustainable Challenges to Universities

Big data research operations in higher education could hit a wall. While universities are "meeting many current needs," according to a new research project, big data work is taxing institutions' technology, human and financial resources. On many campuses, the infrastructure supporting big data is highly decentralized, running in individual labs and depending more on personal interactions than on structured, coordinated programs. Considering the value of research for those schools, both financially and as a major ingredient of their brands, the stakes for creating sustainable research infrastructure and practices are high.

That's the conclusion of a new report from Ithaka S+R, which teamed up with librarians from 20 colleges and universities to understand how well schools can support their research efforts now and into the future. Participants interviewed more than 200 faculty members, exploring how researchers work with big data and identifying the challenges they faced.

According to "Big Data Infrastructure at the Crossroads: Support Needs and Challenges for Universities," the challenges are many:

  • There's a tension between disciplinary and interdisciplinary mentalities. While big data is primarily an interdisciplinary enterprise, "divergent incentive structures, cultures and unequal access to funding can affect disciplinary participation in big data research projects."
  • Managing complex data isn't easy. As the report noted, "the work of acquiring, cleaning, and organizing data is typically the most labor-intensive aspect of big data projects."
  • The structure for collaboration often emphasizes "local, lab-based" IT over centralized IT operations.
  • There's confusion about sharing of data, both formal and informal.
  • The ethical aspects of big data research are still in flux, creating uncertainty about what the best practices are.
  • Researchers favor informal training for those involved in projects over "formal training in big data methods." That leaves "the potential for blind spots" in their research efforts.

The report also offered numerous recommendations useful to university research leaders, libraries, computing centers, IT and information professionals, faculty and staff who engage in big data research, along with the publishers, funders and others with stakes in research infrastructures.

For example, the authors suggested that institutions create protocols for regular assessment of on-campus big data infrastructure, including mapping resources and assembling working groups across IT, libraries, high-performance computing, research offices and other relevant divisions, "to coordinate support services, identify gaps and reduce redundancies." The report also suggested that universities produce a formal catalog of data services and resources for circulation to researchers.

Individual departments were encouraged to hire people who could be embedded in research teams to provide data science, data management, statistical and computational expertise.

Libraries could create and update guides to datasets of interest to their research communities, perhaps in collaboration with other academic libraries. They could also host events where researchers share their work across fields.

"As big data grows, the difficulty of supporting the research mission of universities — already a substantial challenge for administrators — will increase," the authors noted in their conclusion. "Making big data sustainable, if that is possible (its carbon costs are daunting), will require coordinated action by universities, something that is difficult to accomplish at institutions with decentralized bureaucracies and cultures."

The report is openly available on the Ithaka S+R website.

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.
