Regional Project Targets Data Storage for Research Cloud

The on-going challenge of figuring out how to store and share the data generated in research projects will receive $4.9 million worth of attention in a project undertaken at four institutions. The University of Michigan, Michigan State U, Wayne State U and Indiana U are collaborating on a new kind of petabyte-scale data system.

Multi-Institutional Open Storage Research InfraStructure (MI-OSiRIS) will combine object-based software-defined storage technology with a network infrastructure to give users a distributed storage system that they can access from any of their institutions. It'll run on commodity hardware.

The data system will run Ceph, a massively scalable open source storage platform that supports multiple storage interface approaches and allows for configuration of access, replication, distribution and integrity on a domain basis. The software-defined storage aspects of Ceph will enable the system to self-manage operations, such as pinpointing the "optimal network path" between a scientist and the data, wherever it may be stored.

The infrastructure will use each school's existing authentication to let researchers give controlled access to their data across universities. Along the way, the details of design, code and operational experiences will be documented and published to allow MI-OSiRIS to be replicated elsewhere by other groups of cross-institutional collaborators.

"What we're trying to do here is expedite the time to discovery," said Shawn McKee, a research scientist in physics at U Michigan's College of Literature, Science and the Arts. "Scientists should be able to focus on their science without having to become experts in data management."

McKee is one of numerous researchers who will test the system in projects exploring ocean modeling, biostatistics, cancer, degenerative diseases and aquatic biology. Another is Hiroko Dodge, professor of neurology at the U Michigan Medical School, and colleagues at Wayne State, who will employ MI-OSiRIS in their study of Alzheimer's. Sensors in the homes of seniors gather 24/7 information about their walking speed, sleep patterns and computer and phone usage. That data is combined with additional data from cognitive tests, MRIs and other areas.

"Direct access to data between our sister institutions will eliminate hours and even days lost copying massive files from one place to another," said Patrick Gossman, deputy CIO for research at Wayne State. "The end result will be improved research productivity in health, aging, the environment and other areas important to us all."

The new infrastructure for MI-OSiRIS will be built by U Michigan's Advanced Research Computing-Technology Services. The work is being funded by the National Science Foundation.

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.

Featured

  • large group of college students sitting on an academic quad

    Student Readiness: Learning to Learn

    Melissa Loble, Instructure's chief academic officer, recommends a focus on 'readiness' as a broader concept as we try to understand how to build meaningful education experiences that can form a bridge from the university to the workplace. Here, we ask Loble what readiness is and how to offer students the ability to 'learn to learn'.

  • Graphic of connected devices protected by digital padlocks

    Veeam Launches Agent Commander to Help Detect Enterprise AI Risk

    Veeam Software has introduced Agent Commander, a new platform designed to help enterprises detect AI risk, protect AI systems, and undo AI mistakes.

  • abstract coding

    Anthropic's New AI Model Targets Coding, Enterprise Work

    Anthropic has released Claude Opus 4.6, introducing a million-token context window and automated agent coordination features as the AI company seeks to expand beyond software development into broader enterprise applications.

  • globe surrounded by network connections

    AI Adoption Is Surging, but Infrastructure and Language Gaps Persist

    Artificial intelligence may be spreading faster than previous waves of consumer tech, but a report from Microsoft's AI Economy Institute suggests its benefits are concentrating in a relatively small set of countries, with infrastructure and language emerging as major dividing lines.