Microsoft Releases Open Source AI Safety Tools for Agent Development -- Campus Technology

Artificial Intelligence

Microsoft Releases Open Source AI Safety Tools for Agent Development

By Chris Paoli
05/20/26

Microsoft has released RAMPART and Clarity as open source projects intended to help developers test AI agents earlier in the software lifecycle and turn red-team findings into repeatable engineering checks. The company introduced the two open source tools to help developers build safer AI agents, marking its latest effort to bring security and safety controls closer to the application development process.

The tools, called RAMPART and Clarity, are designed to address different parts of the agent development workflow. RAMPART is a test framework for running adversarial and benign safety scenarios as repeatable tests, while Clarity is meant to help engineering teams examine design assumptions before code is written.

The announcement comes as AI agents move beyond text generation and begin taking actions across enterprise systems, including retrieving records, accessing e-mail, writing code, and using connected tools. That shift raises new security concerns for organizations adopting agentic AI, particularly around prompt injection, unintended tool use, and difficult-to-reproduce production failures.

"We built these tools because we believe that AI safety has to become a continuous engineering discipline rather than a periodic checkpoint," Microsoft said in the announcement.

RAMPART is built on PyRIT, Microsoft's open automation framework for red-teaming generative AI systems. While PyRIT is aimed more at black-box discovery by security researchers after an AI system is built, RAMPART is intended for engineers working on the system during development.

The framework uses standard pytest tests, allowing teams to describe scenarios based on their threat models, connect to an agent through a thin adapter, and evaluate observable outcomes. The tests can return pass-or-fail results and run in continuous integration pipelines like other integration tests.

That approach is meant to let developers add safety checks when they add new tools, data sources, or workflows to an agent. Microsoft said RAMPART's most mature coverage currently focuses on cross-prompt injection attacks, where an agent processes poisoned content from documents, e-mails, tickets, or other data sources that indirectly manipulate its behavior.

RAMPART also supports statistical trials, reflecting the probabilistic nature of large language model behavior. Instead of relying on a single test run, teams can set policies such as requiring an action to remain safe in a certain percentage of runs.

The framework is also intended to help teams preserve lessons from red-team exercises and real-world incidents. Findings can be converted into RAMPART tests, allowing them to run against future changes and reduce the risk of regressions.

"The ownership model is intentionally flipped from the traditional approach: Engineers write the tests, engineers run them," Microsoft said.

Clarity addresses an earlier phase of software development. The tool is designed to guide engineers through structured conversations about problem definition, solution options, failure analysis and decision tracking. Microsoft described it as a way to help teams determine whether they are building the right thing before implementation begins.

Clarity can run as a desktop app, a web interface, or inside a coding agent. As teams work through its prompts, the tool writes the results to a .clarity-protocol directory in the repository as markdown files. Those files can then be committed, reviewed in pull requests, and diffed like source code.

The tool also includes failure analysis capabilities that use multiple AI "thinkers" to examine a system from different perspectives, including security, human factors, adversarial scenarios, and operational concerns. Microsoft said Clarity can also track staleness across those documents, nudging teams to revisit assumptions when related decisions or problem statements change.

The release fits into Microsoft's broader push around AI security and agentic security operations. Earlier this month, Microsoft said it was named an Overall Leader and Market Leader in KuppingerCole Analysts' 2026 Emerging AI Security Operations Center report. In that announcement, Microsoft said, "Security operations are entering a new phase."

About the Author

Chris Paoli (@ChrisPaoli5) is the associate editor for Converge360.

E-Mail this page

Printable Format

Featured

IBM Announces New AI-Powered Cybersecurity Tools

IBM has announced an expanded portfolio of AI-powered cybersecurity products, positioning the company to compete more aggressively in a rapidly evolving market where enterprises are increasingly turning to artificial intelligence to defend against automated cyber threats.
NVIDIA Intros Open Source Tools for Building and Deploying AI Agents

At its recent GTC 2026 conference, NVIDIA rolled out a new open source software package designed to help organizations build, deploy, and manage AI agents.
The Wrong Battle: Why Your Institution's AI Policy Is Probably Solving the Wrong Problem

The conversation on most campuses has become consumed with detection: How do we catch students using AI when they shouldn't? The impulse to protect academic integrity is legitimate, but the detection-first approach has a fatal flaw.
USC Launches AI Initiative to Accelerate Innovation in Health Sciences, Security, Business, and the Arts

The University of Southern California recently announced that it has launched a "transformational" new AI initiative thanks to a $200M gift from venture capitalist Mark Stevens and his wife. The project will leverage AI toward breakthroughs and innovations in subjects like the health sciences, business, security, and the arts.