ACAI
ProductEvidenceDocsPricing
ACAI

Continuous compliance for AI. Every call scanned, classified, audit-logged, and evidence-ready.

Product

  • AI Layer
  • Sample Reports
  • Pricing
  • Documentation
  • Quickstart
  • Start Free

Company

  • About
  • Talk to an Engineer
  • Security
  • Support

Legal

  • Privacy Policy
  • Terms of Service
Service-Disabled Veteran-Owned Small Business
© 2026 Agile Cloud & AI LLC. All rights reserved.
OverviewQuick StartMigration GuideCompliance Quick StartNext Steps

User Guide

AuthenticationChat CompletionsEmbeddingsTranscriptionModelsGuardrailsRate LimitsError HandlingBYOK / Passthrough

Features

Batch APISemantic CacheRAGPromptsSmart RoutingRealtime APIAudit & Compliance

Developer

ArchitectureSelf-HostingAPI ReferenceInteractive DocsConfigurationContributing
Back to site

Documentation

Everything you need to integrate ACAI into your applications. OpenAI-compatible endpoints, compliance features built in, zero vendor lock-in.

Getting Started

Quick Start

Get your first API call running in under 2 minutes.

Authentication

API keys, Bearer tokens, and key management.

Migration Guide

Switch from OpenAI, Azure, or Anthropic in 2 lines.

Compliance Quick Start

Go from zero to audit-ready in 30 minutes.

API Endpoints

Chat Completions

Generate text with LLMs. Streaming and non-streaming.

Embeddings

Generate vector embeddings for search and RAG.

Audio Transcription

Transcribe audio files to text with Whisper.

Text to Speech

Convert text to natural-sounding speech audio.

Image Generation

Generate images from text with DALL-E 3 and GPT Image 1.

Video Generation

Generate videos from text prompts with Sora.

Platform

Models

Available models, aliases, and capabilities.

Guardrails

Content safety, PII detection, injection prevention.

Rate Limits

Per-tier rate limits and usage quotas.

BYOK / Passthrough

Use your own API keys for frontier models.

Features

Batch API

Async bulk inference at 50% off real-time rates.

Semantic Cache

Deduplicate similar requests for cost savings.

RAG

Managed vector search and grounded generation.

Prompt Management

Versioned templates with Jinja2 and A/B testing.

Smart Routing

A/B testing, budget controls, fallback chains.

Realtime API

WebSocket streaming for audio and text.

Audit & Compliance

HIPAA/SOC 2 exports, retention, legal holds.

Developer

Architecture

System design, inference engines, and scaling.

Self-Hosting

Deploy ACAI in your own infrastructure.

API Reference

Full endpoint reference for all APIs.

Configuration

Environment variables and deployment config.

Contributing

How to contribute to the open-source project.