Documentation
Everything you need to integrate DirectAI into your applications. OpenAI-compatible endpoints, compliance features built in, zero vendor lock-in.
Getting Started
API Endpoints
Platform
Features
Batch API
Async bulk inference at 50% off real-time rates.
Semantic Cache
Deduplicate similar requests for cost savings.
RAG
Managed vector search and grounded generation.
Prompt Management
Versioned templates with Jinja2 and A/B testing.
Smart Routing
A/B testing, budget controls, fallback chains.
Realtime API
WebSocket streaming for audio and text.
Audit & Compliance
HIPAA/SOC 2 exports, retention, legal holds.