Microsoft Releases Open Source AI Safety Tools for Agent Development

Microsoft has released RAMPART and Clarity as open source projects intended to help developers test AI agents earlier in the software lifecycle and turn red-team findings into repeatable engineering checks. The company introduced the two open source tools to help developers build safer AI agents, marking its latest effort to bring security and safety controls closer to the application development process.

The tools, called RAMPART and Clarity, are designed to address different parts of the agent development workflow. RAMPART is a test framework for running adversarial and benign safety scenarios as repeatable tests, while Clarity is meant to help engineering teams examine design assumptions before code is written.

The announcement comes as AI agents move beyond text generation and begin taking actions across enterprise systems, including retrieving records, accessing e-mail, writing code, and using connected tools. That shift raises new security concerns for organizations adopting agentic AI, particularly around prompt injection, unintended tool use, and difficult-to-reproduce production failures.

"We built these tools because we believe that AI safety has to become a continuous engineering discipline rather than a periodic checkpoint," Microsoft said in the announcement.

RAMPART is built on PyRIT, Microsoft's open automation framework for red-teaming generative AI systems. While PyRIT is aimed more at black-box discovery by security researchers after an AI system is built, RAMPART is intended for engineers working on the system during development.

The framework uses standard pytest tests, allowing teams to describe scenarios based on their threat models, connect to an agent through a thin adapter, and evaluate observable outcomes. The tests can return pass-or-fail results and run in continuous integration pipelines like other integration tests.

That approach is meant to let developers add safety checks when they add new tools, data sources, or workflows to an agent. Microsoft said RAMPART's most mature coverage currently focuses on cross-prompt injection attacks, where an agent processes poisoned content from documents, e-mails, tickets, or other data sources that indirectly manipulate its behavior.

RAMPART also supports statistical trials, reflecting the probabilistic nature of large language model behavior. Instead of relying on a single test run, teams can set policies such as requiring an action to remain safe in a certain percentage of runs.

The framework is also intended to help teams preserve lessons from red-team exercises and real-world incidents. Findings can be converted into RAMPART tests, allowing them to run against future changes and reduce the risk of regressions.

"The ownership model is intentionally flipped from the traditional approach: Engineers write the tests, engineers run them," Microsoft said.

Clarity addresses an earlier phase of software development. The tool is designed to guide engineers through structured conversations about problem definition, solution options, failure analysis and decision tracking. Microsoft described it as a way to help teams determine whether they are building the right thing before implementation begins.

Clarity can run as a desktop app, a web interface, or inside a coding agent. As teams work through its prompts, the tool writes the results to a .clarity-protocol directory in the repository as markdown files. Those files can then be committed, reviewed in pull requests, and diffed like source code.

The tool also includes failure analysis capabilities that use multiple AI "thinkers" to examine a system from different perspectives, including security, human factors, adversarial scenarios, and operational concerns. Microsoft said Clarity can also track staleness across those documents, nudging teams to revisit assumptions when related decisions or problem statements change.

The release fits into Microsoft's broader push around AI security and agentic security operations. Earlier this month, Microsoft said it was named an Overall Leader and Market Leader in KuppingerCole Analysts' 2026 Emerging AI Security Operations Center report. In that announcement, Microsoft said, "Security operations are entering a new phase."

About the Author

Chris Paoli (@ChrisPaoli5) is the associate editor for Converge360.

Featured

  • abstract illustration of artificial intelligence

    CSU Shares AI Learnings in Systemwide Survey

    In a systemwide survey of more than 94,000 faculty, staff, and students, California State University recently documented widespread AI use across its 22 campuses.

  • AI logo near computer equipment

    White House Releases National Policy Framework for AI

    The White House has released a four-page AI policy framework aimed at setting a national approach to AI, with priorities including child safety, intellectual property protections, truth and accuracy guardrails, and worker training for an AI-driven economy.

  • Dana Brunson facilitates a roundtable discussion with research and higher education IT leaders

    Internet2: Closing the Access Gap for Research Cyberinfrastructure

    Internet2's Research Engagement Team brings CIOs and other campus technology leadership together with research computing and data facilitators, forming a community that enables research cyberinfrastructure at institutions of all types and sizes.

  • Silhouettes of business professionals stand against a blurred futuristic city skyline at night, with a glowing digital network data connection

    It's Time for Higher Ed to Get Serious About AI Strategy

    Without a coordinated strategy that involves multiple academic and administrative units across the entire campus, colleges risk wasting resources, duplicating efforts, and ultimately failing to deliver on the promise of deploying technology to improve learning and operations.