Z.ai: GLM 4.5

Deprecated

z-ai/glm-4.5

Released Jul 25, 2025131,000 context

$0.55/M input tokens$2/M output tokens

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment. It supports a hybrid inference mode with two options, a "thinking mode" designed for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

Z.ai: GLM 4.5

z-ai/glm-4.5

Z.ai: GLM 4.5

z-ai/glm-4.5

Effective Pricing for GLM 4.5

Actual cost per million tokens across providers over the past hour

Effective Pricing for GLM 4.5

Actual cost per million tokens across providers over the past hour