Research Libraries Collaborate on Shared Digital Repository

A group of the nation's largest research libraries are collaborating to create a repository of their digital collections, including millions of books. These holdings will be archived and preserved in a single repository called the HathiTrust. Materials in the public domain will be available for reading online.

The initiative was launched jointly by the 12-university consortium known as the Committee on Institutional Cooperation (CIC) and the 11 university libraries of the University of California system. The newest member of the collaboration is the University of Virginia. UC's participation will be coordinated by the California Digital Library (CDL), which has been involved in multiple digitization projects.


"This effort combines the expertise and resources of some of the nation's foremost research libraries and holds even greater promise as it seeks to grow beyond the initial partners," said John Wilkin, associate university librarian of the University of Michigan and the newly named executive director of HathiTrust. Hathi (pronounced HAH-tee), the Hindi word for elephant incorporated into the repository's name, underscores the immensity of this undertaking, Wilkin said.

As of today, HathiTrust contains two million volumes and about 750 million pages, 16 percent of which are in the public domain. Public domain materials will be available for reading online. Materials protected by copyright, although not available for reading online, are given digital archiving services to provide a reliable means to preserve collections. Organizers also expect to use those materials in the research and development of the trust.

Creation of the HathiTrust supports the digitization efforts of the CIC and the University of California, each of which has entered into collective agreements with Google to digitize portions of the collections of their libraries, more than 10 million volumes in total, as part of the Google Book Search project. Materials digitized through other means will also be made available through HathiTrust.

"The CIC Libraries have always worked at a large scale, with big collections, big user communities and high expectations for service," said Mark Sandler, director of the CIC Center for Library Initiatives. "They are not intimidated by big challenges, and will bring their comfort with this to the development of the shared digital repository."

"Researchers will benefit from the expert curation and consistent access they have long associated with the CIC research libraries," said Michael McRobbie, president of Indiana University. "Great libraries have long been essential to outstanding scholarship, and the HathiTrust collaboration among the CIC institutions, the University of California and others provides an essential tool for 21st-century scholars."

"Before this collaboration," Wilkin said, "the collections in each library existed in isolation. Now we are bringing them together, pooling resources and eliminating redundancies, and producing a valuable research tool that will be greater than the sum of its parts."

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.

Featured

  • The AI Show

    Register for Free to Attend the World's Greatest Show for All Things AI in EDU

    The AI Show @ ASU+GSV, held April 5–7, 2025, at the San Diego Convention Center, is a free event designed to help educators, students, and parents navigate AI's role in education. Featuring hands-on workshops, AI-powered networking, live demos from 125+ EdTech exhibitors, and keynote speakers like Colin Kaepernick and Stevie Van Zandt, the event offers practical insights into AI-driven teaching, learning, and career opportunities. Attendees will gain actionable strategies to integrate AI into classrooms while exploring innovations that promote equity, accessibility, and student success.

  • cloud, database stack, computer screen, binary code, and flowcharts interconnected by lines and arrows

    Salesforce to Acquire Data Management Firm Informatica

    Salesforce has announced plans to acquire data management company Informatica for $8 billion. The deal is aimed at strengthening Salesforce's AI foundation and expanding its enterprise data capabilities.

  • stylized AI code and a neural network symbol, paired with glitching code and a red warning triangle

    New Anthropic AI Models Demonstrate Coding Prowess, Behavior Risks

    Anthropic has released Claude Opus 4 and Claude Sonnet 4, its most advanced artificial intelligence models to date, boasting a significant leap in autonomous coding capabilities while simultaneously revealing troubling tendencies toward self-preservation that include attempted blackmail.

  • NVIDIA DGX line

    NVIDIA Intros Personal AI Supercomputers

    NVIDIA has introduced a new lineup of AI-powered computing solutions designed to accelerate enterprise workloads.