Big Data Freeway to Connect Researchers along West Coast

Researchers in Pacific coast states are gaining access to a networking infrastructure that connects the science "DMZs" of multiple research institutions. A DMZ is a secure network "enclave" for high-speed data transfer frequently needed in data-intensive research projects. The Pacific Research Platform (PRP), as it's named, will eventually allow participating universities and other research organizations to move data a thousand times faster than the Internet that currently links campuses. The new high-capacity, regional "freeway system" is being built with a $5 million grant given by the National Science Foundation to the Universities of California in San Diego and Berkeley.

Participants of the PRP include most of the research universities on the West Coast: the 10 University of California campuses, San Diego State University, the California Institute of Technology, the University of Southern California, Stanford and the University of Washington. These institutions are already connected via the 100-gigabit CENIC. The new initiative also extends to include several other campuses outside of California, the Lawrence Berkeley National Laboratory and four national supercomputer centers, among other participants.
Participants of the PRP include most of the research universities on the West Coast.

The PRP builds on prior NSF and U.S. Department of Energy investments. In the last three years, NSF has funded more than 100 American campuses to upgrade their network capacity for greatly enhanced science data access, creating science DMZs within each. The PRP partnership extends that to a regional model.

The drivers of the PRP are 15 multi-campus data-intensive application teams, providing feedback to technical design staff that includes a large group of network engineers, network providers and measurement programmers at the PRP institutions. The application areas include particle physics, astronomy/astrophysics, earth sciences, biomedicine and scalable multimedia, which in turn will provide models for other applications.

"PRP will enable researchers to use standard tools to move data to and from their labs and their collaborators' sites, supercomputer centers and data repositories distant from their campus IT infrastructure, at speeds comparable to accessing local disks," explained PRP co-principal investigator Thomas DeFanti, a research scientist at UC San Diego.

The project came out of a retreat of researchers and a subsequent 2014 workshop that led to a decision to explore the feasibility of the PRP. Network engineers from a number of PRP member campuses worked for 10 weeks during the early part of 2015 on a proof-of-concept demonstration that used elements of the proposed infrastructure that already existed. The test involved disk-to-disk data transfers from within one campus science DMZ to another. After tuning and optimization tests demonstrated data transfer speeds of 9.6Gb/per second out of 10 from four UC sites to San Diego's campus; two transfers reached 36Gb/s out of 40 from UCLA and Caltech to UCSD. One PRP-optimized test moved 1.6 terabytes of data in four minutes; the default campus Internet required three hours to transfer just 0.1 terabytes.

"Research in data-intensive fields is increasingly multi-investigator and multi-institutional, depending on ever more rapid access to ultra-large heterogeneous and widely distributed datasets," said UC San Diego Chancellor Pradeep Khosla in a prepared statement. "The Pacific Research Platform will make it possible for PRP researchers to transfer large datasets to where they work from their collaborators' labs or from remote data centers."

The PRP will be rolled out in two phases. In the first phase, the work will focus on getting all member campuses running on its data-sharing architecture, PRPv1. Once that's done, the consortium will develop and offer PRPv2, an IPv6-based application with enhanced security and software-defined networking features.

NSF has awarded funding to hold a PRP design workshop at UC San Diego in October. This event will bring together application driver researchers, distributed computer architects, network engineers and multi-institutional IT/telecom administrators to refine the PRP implementation.

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.

Featured

  • person signing a bill at a desk with a faint glow around the document. A tablet and laptop are subtly visible in the background, with soft colors and minimal digital elements

    California Governor Signs AI Content Safeguards into Law

    California Governor Gavin Newsom has officially signed off on a series of landmark artificial intelligence bills, signaling the state’s latest efforts to regulate the burgeoning technology, particularly in response to the misuse of sexually explicit deepfakes. The legislation is aimed at mitigating the risks posed by AI-generated content, as concerns grow over the technology's potential to manipulate images, videos, and voices in ways that could cause significant harm.

  • close-up illustration of a hand signing a legislative document

    California Passes AI Safety Legislation, Awaits Governor's Signature

    California lawmakers have overwhelmingly approved a bill that would impose new restrictions on AI technologies, potentially setting a national precedent for regulating the rapidly evolving field. The legislation, known as S.B. 1047, now heads to Governor Gavin Newsom's desk. He has until the end of September to decide whether to sign it into law.

  • illustration of a VPN network with interconnected nodes and lines forming a minimalist network structure

    Report: Increasing Number of Vulnerabilities in OpenVPN

    OpenVPN, a popular open source virtual private network (VPN) system integrated into millions of routers, firmware, PCs, mobile devices and other smart devices, is leaving users open to a growing list of threats, according to a new report from Microsoft.

  • interconnected cubes and circles arranged in a grid-like structure

    Hugging Face Gradio 5 Offers AI-Powered App Creation and Enhanced Security

    Hugging Face has released version 5 of its Gradio open source platform for building machine learning (ML) applications. The update introduces a suite of features focused on expanding access to AI, including a novel AI-powered app creation tool, enhanced web development capabilities, and bolstered security measures.