Guardrails + Audit Included in Every Plan

Compliance-First AI Inference

Content safety, PII detection, and audit logging come standard. Start free, scale with managed models. No add-on subscriptions, no surprises.

Free

Free$5 credit included

Try the API with $5 of free credit on our shared cluster. No credit card required. Or self-host with our open-source stack.

Guardrails + PII detection + audit logging
OpenAI-compatible API endpoint
LLMs, embeddings, transcription
$5 one-time API credit (Pro rates)
20 RPM / 40K TPM rate limits
7-day audit log retention
Community support
Self-hosted containers available (GHCR)

Pro

$99/month + per-token usage

Managed model catalog with full compliance stack. Per-token billing — pay only for what you use.

Everything in Free
Prompt injection prevention
7-framework compliance exports (HIPAA, SOC 2, PCI DSS, GDPR, CCPA, NIST, FERPA)
Semantic cache + batch API
Full managed model catalog
300 RPM / 500K TPM rate limits
30-day audit log retention
Per-token usage billing via Stripe
Dashboard + API key management
Email support (48hr SLA)
99.5% uptime SLA

Start Pro

Business

$499/month + per-token usage

BYOB (Bring Your Own Backend) support, policy templates, and BAA available. Full compliance reporting for auditors.

Everything in Pro
BYOB — bring your own provider keys
Network policy enforcement
RAG, routing, and A/B testing
Pre-built compliance policy templates
1-year audit log retention + exports
BAA available (HIPAA)
1,000 RPM / 5M TPM rate limits
Email support (24hr SLA)
99.9% uptime SLA

Talk to an Engineer

Enterprise

Customcommitted monthly spend

Customer-deployed inference project, private endpoints, custom models. Full compliance documentation and support.

Everything in Business
Customer-deployed inference project
Private endpoints in your VNet
Custom models + fine-tuning support
Unlimited audit retention + legal holds
Custom guardrail rules + policies
7-framework compliance documentation
Dedicated solutions engineer
Slack + phone support (1hr SLA)
99.99% uptime SLA

Talk to an Engineer

Compute Pricing

All tiers pay per token. Enterprise gets committed capacity at a flat monthly rate.

Per-Model Rates

Free + Pro + Business

Model	Input	Output	Tiers
GPT-4o	$3.50 / 1M tokens	$14.00 / 1M tokens	Pro+
GPT-4o mini	$0.20 / 1M tokens	$0.80 / 1M tokens	Free+
GPT-4.1	$2.50 / 1M tokens	$10.00 / 1M tokens	Pro+
GPT-4.1 mini	$0.50 / 1M tokens	$2.00 / 1M tokens	Free+
o4-mini	$1.50 / 1M tokens	$6.00 / 1M tokens	Business+
DeepSeek R1	$0.70 / 1M tokens	$2.80 / 1M tokens	Pro+
Llama 4 Scout	$0.30 / 1M tokens	$0.90 / 1M tokens	Pro+
Phi-4	$0.14 / 1M tokens	$0.28 / 1M tokens	Free+
Llama 3.1 8B	$0.06 / 1M tokens	$0.12 / 1M tokens	Free+
Mistral Small	$0.15 / 1M tokens	$0.45 / 1M tokens	Free+
Mistral Large	$2.60 / 1M tokens	$7.80 / 1M tokens	Business+
Grok 3 Mini	$0.50 / 1M tokens	$1.50 / 1M tokens	Pro+
text-embedding-3-large	$0.17 / 1M tokens	—	All
text-embedding-3-small	$0.03 / 1M tokens	—	All
Whisper large-v3	—	$0.13/min	Pro+

Free tier $5 credit burns at these rates. Volume discounts available on Business+.

Cost Calculator

Estimate your monthly spend based on model and usage.

Model

Monthly Tokens1M

100K500K1M5M10M50M100M

Output Token Ratio30%

10% (short replies)70% (long generation)

Estimated Monthly Cost

Free

$0.38

$0.38 tokens

Pro

$99.38

$99 base + $0.38 tokens

Business

$499.38

$499 base + $0.38 tokens

Included in Every Plan

Compliance Built In, Not Bolted On

Every ACAI deployment includes production-grade guardrails and audit logging at no extra cost. Because compliance shouldn't be a premium feature.

Content Safety

Content safety scoring for hate, violence, self-harm, and sexual content on every request.

Included free

PII Detection & Redaction

10 built-in patterns (SSN, credit cards, PHI, MRN, DOB) with detect, redact-logs, or redact-all modes.

Included free

Prompt Injection Prevention

4-layer detection pipeline: encoding analysis, pattern matching, heuristic scoring, and Prompt Shield API.

Included free

Audit Logging & Compliance

Every request logged with full audit trail. 7 compliance export formats (HIPAA, SOC 2, PCI DSS, GDPR, CCPA, NIST 800-53, FERPA). Legal holds and retention policies.

Included free

Feature Comparison

Everything scales with your tier. No add-on subscriptions.

Feature	Free	Pro	Business	Enterprise
Content safety + PII detection	✓	✓	✓	✓
Audit logging	7 days	30 days	1 year	Unlimited
Prompt injection prevention	—	✓	✓	✓
7-framework compliance exports	—	✓	✓	✓
Semantic cache	—	✓	✓	✓
Batch API	—	✓	✓	✓
RAG (knowledge bases)	—	—	5 GB	Unlimited
Routing + A/B testing	—	—	✓	✓
Custom guardrail rules	—	—	—	✓
Custom models + fine-tuning	—	—	—	✓
Legal holds	—	—	—	✓
BAA (HIPAA)	—	—	✓	✓
Compute isolation	Shared	Shared	BYOB	VNet
SLA	—	99.5%	99.9%	99.99%
Support	Community	Email 48hr	Email 24hr	Slack 1hr

Build It Yourself vs. ACAI

What it takes to build the same compliance pipeline in-house.

Build It Yourself

✗PII detection engine — regex patterns, NER model integration, testing across data types
✗Content safety scoring — hate, violence, self-harm, sexual content classification
✗Prompt injection detection — encoding analysis, pattern matching, heuristic scoring
✗Data classification system — 4-level taxonomy, per-key enforcement, routing rules
✗Tamper-proof audit logging — dual sinks, correlation IDs, retention policies
✗7 compliance report renderers — HIPAA, SOC 2, PCI DSS, GDPR, CCPA, NIST, FERPA control mappings
✗API gateway — auth, rate limiting, model routing, streaming proxy
✗Dashboard — API key management, audit trail viewer, report generation
✗Ongoing maintenance — new regulations, pattern updates, security patches

Typically requires 2-3 engineers for 3-6 months to build and maintain.

ACAI

✓Change your OpenAI base_url to api.agilecloud.ai — same SDK, same code
✓14+ PII patterns + Azure AI Language NER — pre-configured, no ML ops
✓Content safety + prompt injection — 4-layer pipeline, active on every request
✓4 data classification levels — per-key floors, automatic enforcement
✓Tamper-proof audit trail — PostgreSQL + immutable Blob, correlation IDs included
✓7 compliance frameworks — evidence reports mapped to specific regulation controls
✓37 models from 10 providers — managed catalog, zero provisioning
✓Dashboard included — API keys, audit viewer, reports, cost tracking
✓We maintain it — new regulations, pattern updates, infrastructure, uptime SLA

Pro starts at $99/month + per-token usage. First $5 free, no credit card.

Frequently Asked Questions

What compliance features are included?

Every plan — including Free — includes content safety filtering, PII detection and redaction, prompt injection prevention, and full audit logging. Compliance exports for all 7 frameworks (HIPAA, SOC 2, PCI DSS, GDPR, CCPA, NIST 800-53, FERPA) are available on Pro and above. BAA execution is available on Business and above.

How does per-token billing work?

Free and Pro tiers bill per token processed. Rates vary by model — premium models like GPT-4o cost more per token than lightweight models like Llama 3.1 8B or Phi-4. Your Free $5 credit burns at the same per-model rates. See the pricing table above for exact rates.

What's the difference between Pro and Business?

Pro uses our managed model catalog — great for development and moderate production workloads. Business adds BYOB (Bring Your Own Backend) support so you can route through your own provider keys while keeping the full compliance layer. Business also unlocks BAA execution for HIPAA compliance.

Can I start with Pro and upgrade later?

Yes. Most customers start on Pro to validate their use case, then upgrade to Business when they need dedicated compute or BAA. The API is identical — same endpoints, same SDKs, zero migration effort.

What about self-hosting?

Free self-hosted containers are available on GitHub Container Registry (ghcr.io/devoptimum/acai-api and acai-web). Pull the images, bring your own inference backend, and run the full compliance pipeline locally. Self-hosted evidence packs are marked ‘unattested’ — upgrade to SaaS for ACAI-signed evidence packs with chain-of-custody verification.

What models can I run?

All tiers access a managed model catalog (GPT-4o, GPT-4.1, DeepSeek R1, Llama 4, Phi-4, Mistral, and 37 models total). Business adds BYOB — bring your own OpenAI, Anthropic, or any provider keys. Enterprise adds custom fine-tuned models.

What regions do you support?

Free and Pro run in US East and US South Central. Business can deploy to additional regions. Enterprise supports sovereign and government cloud regions.

Compute Pricing

All tiers pay per token. Enterprise gets committed capacity at a flat monthly rate.

Per-Model Rates

Free + Pro + Business

Model	Input	Output	Tiers
GPT-4o	$3.50 / 1M tokens	$14.00 / 1M tokens	Pro+
GPT-4o mini	$0.20 / 1M tokens	$0.80 / 1M tokens	Free+
GPT-4.1	$2.50 / 1M tokens	$10.00 / 1M tokens	Pro+
GPT-4.1 mini	$0.50 / 1M tokens	$2.00 / 1M tokens	Free+
o4-mini	$1.50 / 1M tokens	$6.00 / 1M tokens	Business+
DeepSeek R1	$0.70 / 1M tokens	$2.80 / 1M tokens	Pro+
Llama 4 Scout	$0.30 / 1M tokens	$0.90 / 1M tokens	Pro+
Phi-4	$0.14 / 1M tokens	$0.28 / 1M tokens	Free+
Llama 3.1 8B	$0.06 / 1M tokens	$0.12 / 1M tokens	Free+
Mistral Small	$0.15 / 1M tokens	$0.45 / 1M tokens	Free+
Mistral Large	$2.60 / 1M tokens	$7.80 / 1M tokens	Business+
Grok 3 Mini	$0.50 / 1M tokens	$1.50 / 1M tokens	Pro+
text-embedding-3-large	$0.17 / 1M tokens	—	All
text-embedding-3-small	$0.03 / 1M tokens	—	All
Whisper large-v3	—	$0.13/min	Pro+

Free tier $5 credit burns at these rates. Volume discounts available on Business+.

Feature Comparison

Everything scales with your tier. No add-on subscriptions.

Feature	Free	Pro	Business	Enterprise
Content safety + PII detection	✓	✓	✓	✓
Audit logging	7 days	30 days	1 year	Unlimited
Prompt injection prevention	—	✓	✓	✓
7-framework compliance exports	—	✓	✓	✓
Semantic cache	—	✓	✓	✓
Batch API	—	✓	✓	✓
RAG (knowledge bases)	—	—	5 GB	Unlimited
Routing + A/B testing	—	—	✓	✓
Custom guardrail rules	—	—	—	✓
Custom models + fine-tuning	—	—	—	✓
Legal holds	—	—	—	✓
BAA (HIPAA)	—	—	✓	✓
Compute isolation	Shared	Shared	BYOB	VNet
SLA	—	99.5%	99.9%	99.99%
Support	Community	Email 48hr	Email 24hr	Slack 1hr

Build It Yourself vs. ACAI

What it takes to build the same compliance pipeline in-house.

Build It Yourself

✗PII detection engine — regex patterns, NER model integration, testing across data types
✗Content safety scoring — hate, violence, self-harm, sexual content classification
✗Prompt injection detection — encoding analysis, pattern matching, heuristic scoring
✗Data classification system — 4-level taxonomy, per-key enforcement, routing rules
✗Tamper-proof audit logging — dual sinks, correlation IDs, retention policies
✗7 compliance report renderers — HIPAA, SOC 2, PCI DSS, GDPR, CCPA, NIST, FERPA control mappings
✗API gateway — auth, rate limiting, model routing, streaming proxy
✗Dashboard — API key management, audit trail viewer, report generation
✗Ongoing maintenance — new regulations, pattern updates, security patches

Typically requires 2-3 engineers for 3-6 months to build and maintain.

ACAI

✓Change your OpenAI base_url to api.agilecloud.ai — same SDK, same code
✓14+ PII patterns + Azure AI Language NER — pre-configured, no ML ops
✓Content safety + prompt injection — 4-layer pipeline, active on every request
✓4 data classification levels — per-key floors, automatic enforcement
✓Tamper-proof audit trail — PostgreSQL + immutable Blob, correlation IDs included
✓7 compliance frameworks — evidence reports mapped to specific regulation controls
✓37 models from 10 providers — managed catalog, zero provisioning
✓Dashboard included — API keys, audit viewer, reports, cost tracking
✓We maintain it — new regulations, pattern updates, infrastructure, uptime SLA

Pro starts at $99/month + per-token usage. First $5 free, no credit card.

Frequently Asked Questions

What compliance features are included?

How does per-token billing work?

What's the difference between Pro and Business?

Can I start with Pro and upgrade later?

What about self-hosting?

What models can I run?

What regions do you support?

Free and Pro run in US East and US South Central. Business can deploy to additional regions. Enterprise supports sovereign and government cloud regions.