ACAI

Continuous compliance for AI. Every call scanned, classified, audit-logged, and evidence-ready.

Service-Disabled Veteran-Owned Small Business
© 2026 Agile Cloud & AI LLC. All rights reserved.

Quick Start

Get your first API call running in under 2 minutes.

1. Get an API Key

Sign in to the dashboard and create an API key. Copy it — you won't see it again.
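
Since the key cannot be viewed again, it helps to keep it outside your source code. The following is a minimal sketch, assuming you export the key in an environment variable. The variable name `ACAI_API_KEY` and the helper `load_api_key` are hypothetical choices for illustration, not names ACAI defines.

```python
import os


def load_api_key(env_var: str = "ACAI_API_KEY") -> str:
    """Return the API key stored in an environment variable.

    The default variable name is an assumption made for this sketch.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        # Fail early with a clear message instead of sending an empty key.
        raise RuntimeError(
            f"Set {env_var} to the API key you copied from the dashboard"
        )
    return key
```

The returned value can then be passed wherever the examples in this guide use `YOUR_API_KEY`.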

2. Make Your First Request

ACAI is OpenAI-compatible. Point any OpenAI SDK at the base URL https://api.agilecloud.ai/v1, or send HTTP requests to endpoints under it, and pass your API key as a Bearer token.

cURL

curl https://api.agilecloud.ai/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {"role": "user", "content": "Hello, world!"}
    ]
  }'

Python (OpenAI SDK)

from openai import OpenAI

# Point the SDK at ACAI instead of the default OpenAI endpoint.
client = OpenAI(
    base_url="https://api.agilecloud.ai/v1",
    api_key="YOUR_API_KEY",  # the key created in step 1
)

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello, world!"}],
)

print(response.choices[0].message.content)

JavaScript (OpenAI SDK)

import OpenAI from "openai";

// Point the SDK at ACAI instead of the default OpenAI endpoint.
const client = new OpenAI({
  baseURL: "https://api.agilecloud.ai/v1",
  apiKey: "YOUR_API_KEY", // the key created in step 1
});

const response = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "Hello, world!" }],
});

console.log(response.choices[0].message.content);

3. Explore Endpoints

ACAI exposes three primary inference endpoints, plus passthrough access to frontier models:

  • Chat Completions — generate text with LLMs (streaming + non-streaming)
  • Embeddings — generate vector embeddings for search and RAG
  • Audio Transcription — transcribe audio files with Whisper
  • Frontier Models (BYOK) — route to GPT-4o, Claude, Gemini with your own API keys
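
Because the endpoints follow the OpenAI API shape, the SDK client from step 2 should also cover streaming and embeddings. The helpers below are a sketch under that assumption. `stream_text` and `embed` are illustrative names, and the embedding model ID is left to the caller because it depends on what the model catalog offers.

```python
from typing import Any, Iterator, List


def stream_text(client: Any, prompt: str, model: str = "gpt-4o-mini") -> Iterator[str]:
    """Yield text fragments from a streaming chat completion."""
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,  # ask for incremental chunks instead of one response
    )
    for chunk in stream:
        # Some chunks carry no choices or an empty delta; skip those.
        if chunk.choices and chunk.choices[0].delta.content:
            yield chunk.choices[0].delta.content


def embed(client: Any, texts: List[str], model: str) -> List[List[float]]:
    """Return one embedding vector per input text, in input order."""
    response = client.embeddings.create(model=model, input=texts)
    return [item.embedding for item in response.data]
```

With the `client` object from step 2, looping over `stream_text(client, "Hello, world!")` and printing each fragment shows the reply as it arrives.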

4. What's Included

Every request through ACAI automatically gets compliance and platform features at no extra cost:

  • Content safety guardrails and PII detection
  • Prompt injection prevention
  • Audit logging with correlation IDs
  • HIPAA and SOC 2 compliance exports
  • Token-bucket rate limiting
  • Semantic caching — deduplicate similar requests for cost savings
  • Batch API — async bulk inference at 50% off
  • RAG — managed vector search and grounded generation
  • Smart routing — A/B testing, budget controls, and fallback chains
  • Prompt management — versioned templates with Jinja2 rendering
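
For readers unfamiliar with the token-bucket rate limiting mentioned in the list above, here is a conceptual sketch of the general algorithm. It is not ACAI's implementation, and the class name, capacity, and refill rate are illustrative assumptions. A bucket holds up to `capacity` tokens, refills at `rate` tokens per second, and each request spends one token. Short bursts pass while sustained traffic is held to the refill rate.

```python
import time
from typing import Callable


class TokenBucket:
    """Generic token bucket (illustrative only, not ACAI's code)."""

    def __init__(
        self,
        capacity: float,
        rate: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.capacity = capacity      # maximum burst size, in tokens
        self.rate = rate              # tokens added per second
        self._clock = clock           # injectable so behaviour can be tested
        self._tokens = capacity       # start full so an initial burst passes
        self._last = clock()

    def allow(self, cost: float = 1.0) -> bool:
        """Spend `cost` tokens if available; return whether the request passes."""
        now = self._clock()
        # Refill for the elapsed time, never exceeding capacity.
        elapsed = now - self._last
        self._tokens = min(self.capacity, self._tokens + elapsed * self.rate)
        self._last = now
        if self._tokens >= cost:
            self._tokens -= cost
            return True
        return False
```

A client that is denied would typically wait and retry, which is why a rate-limited API call is usually worth wrapping in a backoff loop.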

Ready for More?

Now that you've made your first call, set up compliance policies, explore the model catalog, install the SDK, and generate your first audit report.
