Z.ai’s GLM-4.7 family brings two complementary models to DeepMask: the 358B flagship GLM-4.7 for deep reasoning, preserved thinking across long agentic workflows, and high-fidelity UI generation; and the lightweight GLM-4.7 Flash for high-volume, real-time automation where hundreds of small decisions are needed per minute. Both models offer strong bilingual (English/Chinese) performance and native support for interleaved thinking.Documentation Index
Fetch the complete documentation index at: https://documentation.deepmask.io/llms.txt
Use this file to discover all available pages before exploring further.
- GLM-4.7
- GLM-4.7 Flash
About
GLM-4.7 is the 358B parameter flagship model from Z.ai. It achieves coding scores aligned with Claude Sonnet 4.5 and features “Preserved Thinking” for agentic workflows — maintaining a complex logical plan across hundreds of individual tool calls without losing track of the goal. It is particularly strong at bilingual English/Chinese reasoning, full-stack prototype generation, and high-fidelity UI/UX code generation.GLM-4.7 is an open-source model from Z.ai. Its 200K context window and preserved thinking architecture make it a strong choice for long-horizon agentic tasks.
Key Capabilities
Agentic Coding
Focuses on task completion rather than snippets — builds whole executable frameworks and app skeletons.
UI/UX Generation
Strong understanding of UI/UX principles, producing well-structured and visually polished web layouts.
Bilingual Mastery
Leading performance in technical and legal English/Chinese translation and cross-language reasoning.
Long-Horizon Planning
Executes 300+ sequential tool calls without losing track of the original goal or accumulated context.
Use Cases
- Full-stack prototype generation — Create structurally complete, ready-to-run application skeletons from a description or diagram.
- Multi-document content creation — Generate 16:9 presentations and posters with coherent visual and logical structure.
- Technical research — Synthesize cross-border research papers across multiple languages into unified summaries.
- Complex workflow automation — Execute long multi-step agent workflows involving search, code execution, and document generation.
Specifications
| Specification | Value |
|---|---|
| Model Provider | Z.ai |
| Main Use Cases | Expert Coding, Complex Workflow Automation, STEM Research |
| Reasoning Effort | Adaptive (Standard/High) |
| GPQA Diamond | 85.7% |
| Max Context | 200K Tokens |
| Latency (TTFT) | 0.65s |
| Throughput | 76 Tokens/sec |