Quick Start

Get your first API call running in under 2 minutes.

1. Get an API Key

2. Make Your First Request

ACAI is OpenAI-compatible. Point any OpenAI SDK or HTTP client at https://api.agilecloud.ai and use your API key.

cURL

curl https://api.agilecloud.ai/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {"role": "user", "content": "Hello, world!"}
    ]
  }'

Python (OpenAI SDK)

from openai import OpenAI

client = OpenAI(
    base_url="https://api.agilecloud.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello, world!"}],
)

print(response.choices[0].message.content)

JavaScript (OpenAI SDK)

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.agilecloud.ai/v1",
  apiKey: "YOUR_API_KEY",
});

const response = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "Hello, world!" }],
});

console.log(response.choices[0].message.content);

3. Explore Endpoints

ACAI exposes three primary inference endpoints, plus passthrough access to frontier models:

Chat Completions — generate text with LLMs (streaming + non-streaming)
Embeddings — generate vector embeddings for search and RAG
Audio Transcription — transcribe audio files with Whisper
Frontier Models (BYOK) — route to GPT-4o, Claude, Gemini with your own API keys

4. What's Included

Every request through ACAI automatically gets compliance and platform features at no extra cost:

Content safety guardrails and PII detection
Prompt injection prevention
Audit logging with correlation IDs
HIPAA and SOC 2 compliance exports
Token-bucket rate limiting
Semantic caching — deduplicate similar requests for cost savings
Batch API — async bulk inference at 50% off
RAG — managed vector search and grounded generation
Smart routing — A/B testing, budget controls, and fallback chains
Prompt management — versioned templates with Jinja2 rendering

Ready for More?

Now that you've made your first call, set up compliance policies, explore the model catalog, install the SDK, and generate your first audit report.