One API. Every model.Arbitrage and optimizeyour inference

Quant-trading-grade optimization to make your inference cheaper, faster, and more reliable across models and providers.

Supported model providers

OpenAIOpenAI
AnthropicAnthropic
Google AI StudioGoogle AI Studio
GrokxAI
FireworksFireworks AI
together.aiTogether AI
DeepSeekDeepSeek



Start optimizing in seconds

Change a few lines in your code. Then optimization kicks in.

1from auriko import Client
2 
3client = Client()
4response = client.chat.completions.create(
5 model="gpt-4o",
6 messages=[{"role": "user", "content": "Hello!"}],
7 routing={
8 "optimize": "cost",
9 "max_ttft_ms": 200,
10 "data_policy": "zdr",
11 }
12)

Works with OpenAI compatible API. Learn more


Route in milliseconds, scale without limits