Integration Brings Cerebras Inference Capabilities to Hugging Face Hub

AI hardware company Cerebras has teamed up with Hugging Face, the open source platform and community for machine learning, to integrate its inference capabilities into the Hugging Face Hub. This collaboration provides more than 5 million developers with access to models running on Cerebras' CS-3 system, the companies said in a statement, with reported inference speeds significantly higher than conventional GPU solutions.

Cerebras Inference, now available on Hugging Face, processes more than 2,000 tokens per second. Recent benchmarks indicate that models such as Llama 3.3 70B running on Cerebras' system can reach speeds exceeding 2,200 tokens per second, offering a performance increase compared to leading GPU-based solutions.

"By making Cerebras Inference available through Hugging Face, we are enabling developers to access alternative infrastructure for open source AI models," said Andrew Feldman, CEO of Cerebras, in a statement.

For Hugging Face's 5 million developers, this integration provides a streamlined way to leverage Cerebras' technology. Users can select "Cerebras" as their inference provider within the Hugging Face platform, instantly accessing one of the industry's fastest inference capabilities.

The demand for high-speed, high-accuracy AI inference is growing, especially for test-time compute and agentic AI applications. Open source models optimized for Cerebras' CS-3 architecture enable faster and more precise AI reasoning, the companies said, with speed gains ranging from 10 to 70 times compared to GPUs.

"Cerebras has been a leader in inference speed and performance, and we're thrilled to partner to bring this industry-leading inference on open source models to our developer community," commented Julien Chaumond, CTO of Hugging Face.

Developers can access Cerebras-powered AI inference by selecting supported models on Hugging Face, such as Llama 3.3 70B, and choosing Cerebras as their inference provider.

About the Author

John K. Waters is the editor in chief of a number of Converge360.com sites, with a focus on high-end development, AI and future tech. He's been writing about cutting-edge technologies and culture of Silicon Valley for more than two decades, and he's written more than a dozen books. He also co-scripted the documentary film Silicon Valley: A 100 Year Renaissance, which aired on PBS.  He can be reached at [email protected].

Featured

  • illustration of a football stadium with helmet on the left and laptop with ed tech icons on the right

    The 2025 NFL Draft and Ed Tech Selection: A Strategic Parallel

    In the fast-evolving landscape of collegiate football, the NFL, and higher education, one might not immediately draw connections between the 2025 NFL Draft and the selection of proper educational technology for a college campus. However, upon closer examination, both processes share striking similarities: a rigorous assessment of needs, long-term strategic impact, talent or tool evaluation, financial considerations, and adaptability to a dynamic future.

  • illustration of a futuristic building labeled "AI & Innovation," featuring circuit board patterns and an AI brain motif, surrounded by geometric trees and a simplified sky

    Cal Poly Pomona Launches AI and Innovation Center

    In an effort to advance AI innovation, foster community engagement, and prepare students for careers in STEM fields and business, California State Polytechnic University, Pomona has teamed up with AI, cloud, and advisory services provider Avanade to launch a new Avanade AI & Innovation Center.

  • interconnected geometric shapes with digital lines, representing community colleges

    New Education Design Lab Initiative Convenes Five Community Colleges to Reimagine Their Future

    Education Design Lab, a nonprofit devoted to designing, prototyping, and testing education-to-workforce models, has announced the inaugural cohort of its Reimagining Community Colleges Design Challenge.

  • an online form with checkboxes, a shield icon for security, and a lock symbol for privacy, set against a clean, monochromatic background

    Educause HECVAT Vendor Assessment Tool Gets an Upgrade

    Educause has announced HECVAT 4, the latest update to its Higher Education Community Vendor Assessment Toolkit.