Pricing
Simple per-token pricing.
No minimums. No contracts. No platform fees. Every model, same key.
| Model | Provider | Context | Input $/Mtok | Cache Read $/Mtok | Output $/Mtok |
|---|---|---|---|---|---|
| DeepSeek-R1-0528 | DeepSeek | 164K | $1.35 | - | $5.4 |
| DeepSeek-V3.1 | DeepSeek | 164K | $0.6 | $0.06 | $1.7 |
| DeepSeek-V3.2 | DeepSeek | 164K | $0.56 | $0.06 | $1.68 |
| DeepSeek-V4-Flash | DeepSeek | 1M | $0.19 | - | $0.51 |
| DeepSeek-V4-Pro | DeepSeek | 1M | $1.93 | $0.17 | $3.83 |
| gemma-4-26B-A4B-it | Google | 262K | $0.15 | $0.01 | $0.6 |
| Llama-3.3-70B-Instruct | Meta | 128K | $0.72 | - | $0.72 |
| MiniMax-M2 | MiniMax | 197K | $0.3 | $0.03 | $1.2 |
| kimi-k2-thinking | Moonshot | 262K | $0.6 | $0.06 | $2.5 |
| Kimi-K2.5 | Moonshot | 262K | $0.6 | - | $3 |
| Kimi-K2.6 | Moonshot | 262K | $1.04 | $0.18 | $4.4 |
| gpt-oss-120b | OpenAI | 131K | $0.09 | - | $0.36 |
| gpt-oss-20b | OpenAI | 131K | $0.07 | - | $0.25 |
| Qwen3-235B-A22B-Instruct-2507 | Qwen / Alibaba | 262K | $0.22 | - | $0.88 |
| Qwen3-Coder-480B-A35B-Instruct | Qwen / Alibaba | 262K | $0.22 | $0.02 | $1.8 |
| Qwen3-Next-80B-A3B-Instruct | Qwen / Alibaba | 262K | $0.15 | - | $1.2 |
| Qwen3-Next-80B-A3B-Thinking | Qwen / Alibaba | 262K | $0.15 | - | $1.2 |
| GLM-4.7 | Z.ai | 200K | $0.6 | - | $2.2 |
| GLM-5 | Z.ai | 200K | $1 | $0.1 | $3.2 |
| GLM-5.1 | Z.ai | 203K | $1.54 | $0.29 | $4.84 |
| GLM-5.2 | Z.ai | 262K | $1.49 | $0.27 | $4.62 |
Prices are in USD per million tokens. All inference is processed on US-owned infrastructure with zero data retention.
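
To make the per-million-token arithmetic concrete, here is a minimal sketch of a cost estimate for a single request. The function name `estimate_cost` and the `RATES` dictionary are hypothetical and only carry two rows copied from the table above. The billing rules are assumptions, not confirmed by this page: cached input tokens are assumed to be charged at the Cache Read rate instead of the Input rate, and models with no listed Cache Read price are assumed to charge all input at the Input rate.

```python
from decimal import Decimal

# Rates in USD per million tokens, copied from the pricing table.
# None marks a model with no listed cache-read price.
RATES = {
    "DeepSeek-V3.2": {
        "input": Decimal("0.56"),
        "cache_read": Decimal("0.06"),
        "output": Decimal("1.68"),
    },
    "gpt-oss-120b": {
        "input": Decimal("0.09"),
        "cache_read": None,
        "output": Decimal("0.36"),
    },
}

TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost(model, input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens is the portion of input_tokens served from
    cache and billed at the cache-read rate. If the model lists no
    cache-read rate, those tokens are billed at the normal input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    rate = RATES[model]
    cache_rate = rate["cache_read"] if rate["cache_read"] is not None else rate["input"]
    fresh_tokens = input_tokens - cached_tokens

    total = (
        fresh_tokens * rate["input"]
        + cached_tokens * cache_rate
        + output_tokens * rate["output"]
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    cost = estimate_cost("DeepSeek-V3.2", 1_000_000, 500_000, cached_tokens=400_000)
    print(f"${cost:.2f}")
```

Under these assumptions, 1M input tokens (400K of them cached) plus 500K output tokens on DeepSeek-V3.2 comes to 0.6 × $0.56 + 0.4 × $0.06 + 0.5 × $1.68 = $1.20. `Decimal` is used instead of floats so that small per-token amounts do not pick up rounding noise.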
Questions? Email hello@tera.gw. API docs and integration guides are available at tera.gw.