NSF-Funded Hubs Power Big Data Project 'Spokes'

The National Science Foundation has begun filling in the details of its national plan for big data research. Less than a year ago the NSF introduced funding for four regional hubs focused on data science innovation, led by consortia of research universities. Now, the federal agency has announced $11 million in planning activities and awards to 10 "big data spokes" projects in areas identified as important by each of those hubs. Similar to the hubs, the spokes are expected to serve in a "convening and coordinating" role; however, while the hubs extend broadly into multiple fields, each spoke will reflect a specific mission.

For example, one spoke will work on developing a new standardized modular data-sharing license and a platform that makes data sharing easier and helps to enforce the license. Partners on that project include researchers from a team of institutions — Brown University, MIT and Drexel — along with publishers  Elsevier and Thomson Reuters; technology companies Intel, Microsoft Research and Oracle; as well as Rhode Island Hospital.

Under the grant, which totals about a million dollars, the researchers will create a generalized, modular licensing agreement that covers the most common data-sharing stumbling blocks. Then they'll design a data platform and toolset, named "ShareDB," which uses and enforces the stipulations of the licensing agreement.

"The hope is that by standardizing data-sharing agreements and providing infrastructure to better enforce the contract, we can significantly increase the chance for a university and a company to reach a successful agreement in a shorter amount of time," said Tim Kraska, a computer science professor at Brown, in a prepared statement.

Kraska's MIT colleague in the Northeast Big Data Hub project said he considers the hub and spoke initiative "a great way to connect those of us working in big data in the northeast." As Samuel Madden,  a professor of electrical engineering and computer science, explained, "As a computer scientist focused on software for sharing data, I've been able to connect to a diverse group of researchers and leaders interested in a wide range of broader issues, ranging from hardware infrastructure to software architectures to the legal, ethical and societal implications of data sharing."

Other northeast projects examine the use of data in education and the integration of environmental factors and causal reasoning approaches for large-scale observational health research.

The South Data Hub is awarding funding to projects involving smart grid use of big data, the use of big data for environmental sustainability and the use of large-scale medical informatics for improving patient care coordination and engagement with fitness trackers and environmental monitors.

Projects from the Midwest Hub will examine digital agriculture with unstaffed aircraft systems and the development of neuroscientific computational resources for better understanding neural operations in the human brain.

Projects in the West Hub are investigating the use of big data in urban spaces for "knowledge discovery and real-time interventions" and development of a "next generation, online computational model library" to bring transparency and accessibility to scientific research.

"The [big data] spokes advance the goals and regional priorities of each [big data] hub, fusing the strengths of a range of institutions and investigators and applying them to problems that affect the communities and populations within their regions," said Jim Kurose, assistant director of NSF's Computer and Information Science and Engineering Directorate. "We are pleased to be making this substantial investment today to accelerate the nation's big data R&D innovation ecosystem."

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.

Featured

  • glowing digital brain made of blue circuitry hovers above multiple stylized clouds of interconnected network nodes against a dark, futuristic background

    Report: 85% of Organizations Are Using Some Form of AI

    Eighty-five percent of organizations today are leveraging some form of AI, according to the latest State of AI in the Cloud 2025 report from Wiz. While AI's role in innovation and disruption continues to expand, security vulnerabilities and governance challenges remain pressing concerns.

  • modern college building with circuit and brain motifs

    Anthropic Launches Claude for Education

    Anthropic has announced a version of its Claude AI assistant tailored for higher education institutions. Claude for Education "gives academic institutions secure, reliable AI access for their entire community," the company said, to enable colleges and universities to develop and implement AI-enabled approaches across teaching, learning, and administration.

  • college building with a central domed rotunda, arched windows, and columns, overlaid with glowing blue circuit patterns

    Kishwaukee College Moves to Ellucian Colleague SaaS

    Illinois's Kishwaukee College is modernizing its administrative systems with an Ellucian Colleague SaaS rollout that will bring AI-powered tools to human resources, finance, and student management.

  • interconnected glowing nodes and circuits in blue and green, forming a neural network on a dark background with a futuristic design

    Tech Giants Launch $100 Billion AI Infrastructure Network Project

    OpenAI, SoftBank, and Oracle have unveiled a new venture, Stargate, through which they aim to build a massive AI infrastructure network across the United States. The initiative, which was announced at the White House with President Donald Trump, has been described as the "largest AI infrastructure project in history."