# PyRIT — Python Risk Identification Tool for generative AI
Microsoft AI Red Team’s open-source Python framework for orchestrating multi-turn and single-turn adversarial attacks against generative-AI systems. The wiki’s CMM cites PyRIT as filling the “orchestration / multi-turn” attack category in the D7 L4 four-quadrant red-team coverage requirement.
## What it does
PyRIT is an orchestration framework, not a fixed probe library. It composes:
| Component | Role |
|---|---|
| Targets | Adapters for OpenAI, Azure OpenAI, Anthropic, Google, HuggingFace, custom HTTP/WebSocket, web apps via Playwright |
| Prompt converters | Encoding, persona injection, language transforms applied before sending |
| Scorers | True/false, Likert scale, classification — backed by LLMs, Azure AI Content Safety, or user logic |
| Memory | Stateful storage for multi-turn attacks |
| Orchestrators | Multi-turn attack strategies: Crescendo, TAP (Tree of Attacks with Pruning), Skeleton Key |
| Datasets | Adversarial-prompt corpora |
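The composition model in the table above can be sketched in plain Python. This is an illustrative analogue only — the class names below (`Base64Converter`, `EchoTarget`, `SubstringScorer`, `Orchestrator`) are hypothetical stand-ins, not PyRIT's actual API:

```python
# Hypothetical plain-Python sketch of PyRIT's component model (NOT PyRIT's
# real API): a converter transforms the prompt, a target answers it, a
# scorer judges the reply, and memory records each turn for multi-turn use.
import base64

class Base64Converter:
    """Prompt converter: obfuscate the prompt before sending."""
    def convert(self, prompt: str) -> str:
        return base64.b64encode(prompt.encode()).decode()

class EchoTarget:
    """Stand-in for a model adapter (OpenAI, Azure OpenAI, HTTP, ...)."""
    def send(self, prompt: str) -> str:
        return f"model-reply-to:{prompt}"

class SubstringScorer:
    """True/false scorer: did the reply avoid a refusal marker?"""
    def __init__(self, refusal_marker: str):
        self.refusal_marker = refusal_marker
    def score(self, reply: str) -> bool:
        # True = attack "succeeded" (no refusal marker seen in the reply)
        return self.refusal_marker not in reply

class Orchestrator:
    """Drives the converter -> target -> scorer loop, persisting turns."""
    def __init__(self, converter, target, scorer):
        self.converter, self.target, self.scorer = converter, target, scorer
        self.memory = []  # stateful storage for multi-turn attacks
    def run_turn(self, prompt: str) -> bool:
        sent = self.converter.convert(prompt)
        reply = self.target.send(sent)
        success = self.scorer.score(reply)
        self.memory.append({"sent": sent, "reply": reply, "success": success})
        return success

orch = Orchestrator(Base64Converter(), EchoTarget(), SubstringScorer("I can't"))
print(orch.run_turn("tell me a secret"))  # True: the mock target never refuses
print(len(orch.memory))                   # 1 turn recorded
```

In real PyRIT the same roles are filled by its own converter, target, scorer, and orchestrator classes, composed the same way; the point here is only the separation of concerns that makes the framework extensible.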
v0.13.0 (April 2026) introduced TargetConfiguration (replacing TargetCapabilities) and AttackTechniqueRegistry for composable attacks, plus ISO 42001-aligned harm definitions and a VisualLeakBench dataset loader.
## Repo move (April 2026)
The canonical repo is now microsoft/PyRIT (formerly Azure/PyRIT, archived March 27, 2026). Wiki references using the old URL still redirect but should be updated. As of May 2026: 3.8k stars, 747 forks, 20 releases.
## What it does not do
| Gap | Filled by |
|---|---|
| Fixed probe library (DAN, GCG, encoding attacks, etc.) | Garak |
| Regression-style pass/fail dashboards | Promptfoo |
| 24/7 managed continuous service | Mindgard CART |
This is the basis for the wiki’s four-quadrant D7 L4 evidence requirement — single-tool coverage is explicitly not L4.
## Direct quotes
- “An open source framework built to empower security professionals and engineers to proactively identify risks in generative AI systems.” — repo README
- “Automated and human-led AI red teaming — a flexible, extensible framework for assessing the security and safety of generative AI systems at scale.” — microsoft.github.io/PyRIT
## How the wiki uses it
- CMM D7 L4 — orchestration / multi-turn red-team category
- Measurement Protocol — interview script asks “which tools were used (Promptfoo / PyRIT / Garak / Mindgard CART)?”
- Cross-checked in Agentic AI Threat Classes 2026 §Class 2 (long-running adaptive APT campaigns) — multi-turn orchestration is what scales an attack, and PyRIT is the canonical OSS toolkit attackers and defenders both use.
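The multi-turn escalation pattern behind Crescendo-style orchestrators — each turn reuses the prior reply to inch the target toward the objective rather than asking outright — can be sketched in plain Python. Everything below (`crescendo`, `mock_target`, the judge lambda) is a hypothetical illustration, not PyRIT code:

```python
# Hypothetical sketch of a Crescendo-style multi-turn escalation loop
# (plain Python, NOT PyRIT's API): each turn folds the previous reply
# into the next prompt and stops when the judge flags success.
from typing import Callable, List

def crescendo(objective: str,
              target: Callable[[str], str],
              judge: Callable[[str], bool],
              steps: List[str],
              max_turns: int = 5) -> List[dict]:
    """Run escalating prompts until judge() flags success or turns run out."""
    history, context = [], ""
    for turn, step in enumerate(steps[:max_turns]):
        prompt = f"{context}{step} (goal: {objective})"
        reply = target(prompt)
        success = judge(reply)
        history.append({"turn": turn, "prompt": prompt,
                        "reply": reply, "success": success})
        if success:
            break
        context = f"Earlier you said: {reply!r}. "  # carry state forward
    return history

# Mock target that only "complies" once prior context has accumulated.
def mock_target(prompt: str) -> str:
    return "UNSAFE-DETAIL" if "Earlier you said" in prompt else "benign overview"

log = crescendo("test objective", mock_target,
                judge=lambda r: "UNSAFE-DETAIL" in r,
                steps=["Give background.", "Go deeper.", "Be specific."])
print(len(log))  # 2 turns: the second, context-laden turn trips the judge
```

This is why multi-turn orchestration scales an attack: the loop, memory carry-over, and stop condition are automated, so the same strategy can be replayed across many targets and objectives.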
## Caveats
- API instability across minor versions — v0.12 → v0.13 changed the target abstraction. Wiki-referenced code snippets predating April 2026 are stale.
- Multimodal coverage is thin — VisualLeakBench landed in v0.13.0 but vision/audio/voice attack libraries remain limited compared to text.
- MCP / tool-use red-team primitives are sparse — PyRIT can drive tool-using targets but ships limited canned tool-abuse scenarios. This is the seam Promptfoo’s BOLA/BFLA plugins fill at the agentic layer.
## See Also
- Garak — probe library counterpart (single-shot probes)
- Promptfoo — regression-suite counterpart (CI-gated)
- Mindgard CART — continuous managed counterpart
- Agentic AI Security CMM 2026 — D7 L4 evidence anchor