Big Data Could Pose Unsustainable Challenges to Universities

Big data research operations in higher education could hit a wall. While universities are "meeting many current needs," according to a new research project, big data work is taxing institutions' technological, human and financial resources. On many campuses, the infrastructure supporting big data is highly decentralized, running in individual labs and depending more on personal interactions than on structured, coordinated programs. Considering the value of research for those schools, both financially and as a major ingredient of their brands, the stakes for creating sustainable research infrastructure and practices are high.

That's the conclusion of a new report from Ithaka S+R, which teamed up with librarians from 20 colleges and universities to understand how well schools can support their research efforts now and into the future. Participants interviewed more than 200 faculty members, exploring how researchers work with big data and identifying the challenges they faced.

According to "Big Data Infrastructure at the Crossroads: Support Needs and Challenges for Universities," the challenges are many:

  • There's a tension between disciplinary and interdisciplinary mentalities. While big data is primarily an interdisciplinary enterprise, "divergent incentive structures, cultures and unequal access to funding can affect disciplinary participation in big data research projects."
  • Managing complex data isn't easy. As the report noted, "the work of acquiring, cleaning, and organizing data is typically the most labor-intensive aspect of big data projects."
  • The structure for collaboration often emphasizes "local, lab-based" IT over the centralized IT operations.
  • There's confusion about sharing of data, both formal and informal.
  • The ethical aspects of big data research are still in flux, creating uncertainty about what the best practices are.
  • Researchers favor informal training for those involved in projects over "formal training in big data methods." That leaves "the potential for blind spots" in their research efforts.

The report also offered numerous recommendations useful to university research leaders, libraries, computing centers, IT and information professionals, faculty and staff who engage in big data research, along with the publishers, funders and others with stakes in research infrastructures.

For example, the authors suggested that institutions create protocols for regular assessment of on-campus big data infrastructure, including mapping resources and assembling working groups across IT, libraries, high-performance computing, research offices and other relevant divisions, "to coordinate support services, identify gaps and reduce redundancies." The report also suggested that universities produce a formal catalog of data services and resources for circulation to researchers.

Individual departments were encouraged to hire people who could be embedded in research teams to provide data science, data management, statistical and computational expertise.

Libraries could create and update guides to datasets of interest to their research communities, perhaps in collaboration with other academic libraries, and host events where researchers can share their work across fields.

"As big data grows, the difficulty of supporting the research mission of universities — already a substantial challenge for administrators — will increase," the authors noted in their conclusion. "Making big data sustainable, if that is possible (its carbon costs are daunting), will require coordinated action by universities, something that is difficult to accomplish at institutions with decentralized bureaucracies and cultures."

The report is openly available on the Ithaka S+R website.

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.
