60% of the official price
Same GLM 5.2 API surface, billed at 60% of what you would pay official. Transparent per-token pricing, no minimums, pay as you go.
GLM 5.2 API for builders who need lower cost
Run the model everyone is talking about through flatkey: 60% of the official price, on dedicated self-hosted compute. Compatible with OpenAI SDK and Claude Code CLI. One line to switch.
No card to start · OpenAI SDK + Claude Code CLI compatible · Switch in one line
https://router.flatkey.ai/v1/messages# ~/.claude/settings.json
{
"env": {
"ANTHROPIC_BASE_URL": "https://router.flatkey.ai",
"ANTHROPIC_AUTH_TOKEN": "YOUR_FLATKEY_KEY",
"ANTHROPIC_DEFAULT_HAIKU_MODEL": "glm-5.2",
"ANTHROPIC_DEFAULT_SONNET_MODEL": "glm-5.2",
"ANTHROPIC_DEFAULT_OPUS_MODEL": "glm-5.2",
"CLAUDE_CODE_AUTO_COMPACT_WINDOW": "1000000",
"CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC": "1",
"API_TIMEOUT_MS": "3000000"
}
}
Why flatkey for GLM 5.2
Same GLM 5.2 API surface, billed at 60% of what you would pay official. Transparent per-token pricing, no minimums, pay as you go.
We run GLM 5.2 on owned GPU capacity, not a thin reseller layer. That gives teams a clearer operating path before they move campaign traffic into production.
Pricing
Exact same GLM 5.2 request shape. You just pay less on every call.
Official GLM 5.2
GLM 5.2 on flatkey
Pricing claim follows the campaign offer. Recheck the live billing configuration before launch.
Drop-in code
Keep the OpenAI SDK or Claude Code CLI workflow. Point traffic to flatkey and set the model name.
from openai import OpenAI
client = OpenAI(
base_url="https://router.flatkey.ai/v1",
api_key="YOUR_FLATKEY_KEY",
)
client.chat.completions.create(
model="glm-5.2",
messages=[{"role": "user", "content": "..."}],
)# ~/.claude/settings.json
{
"env": {
"ANTHROPIC_BASE_URL": "https://router.flatkey.ai",
"ANTHROPIC_AUTH_TOKEN": "YOUR_FLATKEY_KEY",
"ANTHROPIC_DEFAULT_HAIKU_MODEL": "glm-5.2",
"ANTHROPIC_DEFAULT_SONNET_MODEL": "glm-5.2",
"ANTHROPIC_DEFAULT_OPUS_MODEL": "glm-5.2",
"CLAUDE_CODE_AUTO_COMPACT_WINDOW": "1000000",
"CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC": "1",
"API_TIMEOUT_MS": "3000000"
}
}Built for paid traffic that converts
Use chat completions in code, or route Claude Code CLI usage through the same flatkey account.
A dedicated GLM lane gives volume campaigns more predictable headroom.
Market the value clearly: flatkey bills GLM 5.2 at 60% of official.
Simple top-ups and account billing without chasing separate provider accounts.
One key can test GLM, GPT, Claude, Gemini, video, image, and more.
Let developers create a key and make the first GLM 5.2 call quickly.
Create one flatkey API key, point your OpenAI-compatible client at our router, and start testing GLM 5.2 on dedicated compute.
Get your GLM 5.2 keyThe page targets GLM 5.2 compatible API usage through flatkey. Confirm exact runtime availability in the dashboard before production rollout.
No. Use the OpenAI-compatible base URL, keep the SDK, and set the model to glm-5.2. Claude Code CLI traffic can also route through flatkey.
Developers arriving from Google, Reddit, and regional campaigns who want a cheaper GLM 5.2 API key with a familiar integration path.