Moderations API — POST /v1/moderations with BR guardrail layer
2026-03-22
What We Built
OpenAI-compatible moderation endpoint at POST /v1/moderations with BR-specific guardrail categories layered on top. Returns standard OpenAI moderation format with additional br_jailbreak and br_pii_detected categories.
BR categories use pattern-based detection from existing guardrail checks (jailbreak patterns, PII regex) — no external API call needed, sub-millisecond latency.
Why It Matters
Unique in the AI gateway space — no competitor offers moderation with built-in jailbreak and PII detection as additional signal. Customers get both provider moderation AND BR's own security analysis in one API call.
Lockstep Checklist
- [x] API Routes:
src/api/routes/moderations.ts+ mount inserver.ts - [x] TS SDK:
moderations.create() - [x] Python SDK:
moderations.create()sync + async - [x] MCP Schemas: N/A — not agent-facing
- [x] Master Record: N/A — auto-updated