Berkeley Launches Big Data MOOCs

Berkeley has teamed with a private partner to launch two new MOOCs focused on big data analysis using Apache Spark, an open-source big data processing engine.

The five-week-long courses, created with partner Databricks, will be made available via BerkeleyX on the edX platform and are part of an attempt "to grow the Spark community, enabling students to gain hands-on experience with Spark's combination of sophisticated analytics and real-time capabilities to deliver deeper insights, faster," according to a news release. "The launch of these courses comes at the heels of a series of Apache Spark training offerings from Databricks, including the Spark Certification Program for System Integrators and the Spark Certification Program for Developers."

The courses will use Spark's Python interface to make them accessible to data scientists and developers.

The first course, Introduction to Big Data with Apache Spark, will cover the application of data science techniques using parallel programming for big and small data. The course will run February 23-March 27.

The second course, Scalable Machine Learning, "will present the underlying statistical and algorithmic principles required to develop scalable machine learning pipelines and provide hands-on experience using Apache Spark," according to information released by Berkeley and Databricks. "Students will use Spark to implement scalable algorithms for fundamental statistical models while tackling key real-world problems from various domains." The course will begin April 14 and end May 18.

Both courses are free, and students who meet course requirements can earn an Honor Code Certificate. A Verified Certificate of Achievement is also available to students who meet course requirements and pay "a minimum fee," according to information on the course pages.

"Spark is the most active open source project in the big data ecosystem, and continues to be deployed by enterprises across multiple verticals due to its speed and efficiency, ease of use, and single unified system for the complete data analytics pipelines," said Matei Zaharia, co-founder and CTO at Databricks, in a prepared statement. "As we continue to foster and grow the Spark community to meet that demand, we are excited to launch these two MOOCs, making hands-on, practical courses available to a community that will advance Spark's adoption with greater ease."

About the Author

Joshua Bolkan is a contributing editor for Campus Technology, THE Journal and STEAM Universe. He can be reached at [email protected].
