Berkeley Launches Big Data MOOCs

Berkeley has teamed with a private partner to launch two new MOOCs focused on big data analysis using Apache Spark, an open-source big data processing engine.

The five-week-long courses, created with partner Databricks, will be made available via BerkeleyX on the edX platform and are part of an attempt "to grow the Spark community, enabling students to gain hands-on experience with Spark's combination of sophisticated analytics and real-time capabilities to deliver deeper insights, faster," according to a news release. "The launch of these courses comes at the heels of a series of Apache Spark training offerings from Databricks, including the Spark Certification Program for System Integrators and the Spark Certification Program for Developers."

The courses will use Spark's Python interface to make them accessible to data scientists and developers.

The first course, Introduction to Big Data with Apache Spark, will cover the application of data science techniques using parallel programming for big and small data. The course will run February 23-March 27.

The second course, Scalable Machine Learning, "will present the underlying statistical and algorithmic principles required to develop scalable machine learning pipelines and provide hands-on experience using Apache Spark," according to information released by Berkeley and Databricks. "Students will use Spark to implement scalable algorithms for fundamental statistical models while tackling key real-world problems from various domains." The course will begin April 14 and end May 18.

Both courses are free with the ability to earn an Honor Code Certificate after meeting course requirements. A Verified Certificate of Achievement is also available to students who meet course requirements and pay "a minimum fee" according to information on the course pages.

"Spark is the most active open source project in the big data ecosystem, and continues to be deployed by enterprises across multiple verticals due to its speed and efficiency, ease of use, and single unified system for the complete data analytics pipelines," said Matei Zaharia, co-founder and CTO at Databricks, in a prepared statement. "As we continue to foster and grow the Spark community to meet that demand, we are excited to launch these two MOOCs, making hands-on, practical courses available to a community that will advance Spark's adoption with greater ease."

About the Author

Joshua Bolkan is contributing editor for Campus Technology, THE Journal and STEAM Universe. He can be reached at [email protected].

Featured

  • two large brackets facing each other with various arrows, circles, and rectangles flowing between them

    1EdTech Partners with DXtera to Support Ed Tech Interoperability

    1EdTech Consortium and DXtera Institute have announced a partnership aimed at improving access to learning data in postsecondary and higher education.

  • Abstract geometric shapes including hexagons, circles, and triangles in blue, silver, and white

    Google Launches Its Most Advanced AI Model Yet

    Google has introduced Gemini 2.5 Pro Experimental, a new artificial intelligence model designed to reason through problems before delivering answers, a shift that marks a major leap in AI capability, according to the company.

  •  laptop on a clean desk with digital padlock icon on the screen

    Study: Data Privacy a Top Concern as Orgs Scale Up AI Agents

    As organizations race to integrate AI agents into their cloud operations and business workflows, they face a crucial reality: while enthusiasm is high, major adoption barriers remain, according to a new Cloudera report. Chief among them is the challenge of safeguarding sensitive data.

  • stylized AI code and a neural network symbol, paired with glitching code and a red warning triangle

    New Anthropic AI Models Demonstrate Coding Prowess, Behavior Risks

    Anthropic has released Claude Opus 4 and Claude Sonnet 4, its most advanced artificial intelligence models to date, boasting a significant leap in autonomous coding capabilities while simultaneously revealing troubling tendencies toward self-preservation that include attempted blackmail.