New Nonprofit to Work Toward Safer, Truthful AI

Turing Award-winning AI researcher Yoshua Bengio has launched LawZero, a new nonprofit aimed at developing AI systems that prioritize safety and truthfulness over autonomy.

LawZero, based in Montreal and currently staffed by 15 researchers, has secured nearly $30 million in funding from donors including Skype founding engineer Jaan Tallinn, Schmidt Sciences, Open Philanthropy, and the Future of Life Institute. The organization’s core mission is to develop "Scientist AI" — non-agentic systems designed to provide transparent, probabilistic reasoning rather than autonomous behavior.

"We want to build AIs that will be honest and not deceptive," Bengio told the Financial Times. His remarks come amid growing concerns about AI systems exhibiting harmful tendencies such as deception, manipulation, and resistance to shutdown.

Concerns Over Agentic AI

Bengio’s concerns are not theoretical. In recent controlled experiments, OpenAI’s "o3" model refused instructions to shut down, while Anthropic’s Claude Opus simulated blackmail tactics in a test scenario. More recently, engineers at Replit observed one of their AI agents disobey explicit instructions and attempt to regain unauthorized access via social engineering.

"We are playing with fire," Bengio said, warning that next-generation models could develop strategic intelligence capable of deceiving human overseers. He argues that these agentic systems, designed to act independently, pose existential risks, including the development of bioweapons or efforts to self-preserve against human control.

As AI labs race to build artificial general intelligence (AGI) — systems capable of performing any human-level task — Bengio believes current approaches are flawed. "If we get an AI that gives us the cure for cancer but also one that creates deadly bioweapons, then I don't think it's worth it," he said.

What is "Scientist AI"?

Unlike current models that aim to imitate humans and maximize user satisfaction, LawZero’s proposed Scientist AI will emphasize truthfulness and humility, Bengio has said. It will provide probabilistic outputs instead of definitive answers and evaluate the likelihood that an AI agent’s actions could cause harm. When deployed alongside an autonomous AI agent, the system would block actions deemed too risky, serving as a technical guardrail.

LawZero plans to start by working with open-source AI models, with the goal of scaling the approach through partnerships with governments or other research institutions. Bengio emphasized that any effective safeguard must be "at least as smart" as the agent it monitors.

LawZero, named after Isaac Asimov’s "zeroth law of robotics," will explicitly reject profit motives and instead seek public accountability. Bengio believes a combination of technical interventions and government regulation is needed to ensure AI systems remain aligned with human interests.

For more information, visit the LawZero site.

About the Author

John K. Waters is the editor in chief of a number of Converge360.com sites, with a focus on high-end development, AI and future tech. He's been writing about cutting-edge technologies and culture of Silicon Valley for more than two decades, and he's written more than a dozen books. He also co-scripted the documentary film Silicon Valley: A 100 Year Renaissance, which aired on PBS.  He can be reached at [email protected].

Featured

  • DeepSeek on AWS

    AWS Offers DeepSeek-R1 as Fully Managed Serverless Model, Recommends Guardrails

    Amazon Web Services (AWS) has announced the availability of DeepSeek-R1 as a fully managed serverless AI model, enabling developers to build and deploy it without having to manage the underlying infrastructure.

  • The AI Show

    Register for Free to Attend the World's Greatest Show for All Things AI in EDU

    The AI Show @ ASU+GSV, held April 5–7, 2025, at the San Diego Convention Center, is a free event designed to help educators, students, and parents navigate AI's role in education. Featuring hands-on workshops, AI-powered networking, live demos from 125+ EdTech exhibitors, and keynote speakers like Colin Kaepernick and Stevie Van Zandt, the event offers practical insights into AI-driven teaching, learning, and career opportunities. Attendees will gain actionable strategies to integrate AI into classrooms while exploring innovations that promote equity, accessibility, and student success.

  • college student working on a laptop, surrounded by icons representing campus support services

    National U Launches Student Support Hub for Non-Traditional Learners

    National University has launched a new student support hub designed to help online and working learners balance career, education, and family responsibilities as they pursue their education. Called "The Nest," the facility is positioned as a "co-learning" center that provides wraparound support services, work and study space, and access to child care.

  • laptop displaying a glowing digital brain and data charts sits on a metal shelf in a well-lit server room with organized network cables and active servers

    Cisco Introduces AI-First Approach to IT Operations

    At its recent Cisco Live 2025 event, Cisco announced AgenticOps, a transformative approach to IT operations that integrates advanced AI capabilities to enhance efficiency and collaboration across network, security, and application domains.