Forked from zai-org/glm-4.7-flash
Description
GLM 4.7 Flash is a 30B-parameter mixture-of-experts (MoE) model from Z.ai with roughly 3B active parameters per token (A3B). It supports a context length of 128k tokens and achieves strong performance on coding benchmarks among models of similar scale.
Last updated: 20 days ago
Custom Fields
Special features defined by the model author
- Enable Thinking (boolean, default=true): controls whether the model thinks before replying
- Clear Thinking (boolean, default=false): controls whether thinking content is cleared from history
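As a minimal sketch of how these two custom fields might be wired into a request, the snippet below builds an OpenAI-compatible chat payload with them attached. The field names `enable_thinking` and `clear_thinking`, and the idea of passing them as top-level request options, are assumptions for illustration, not a documented API for this model.

```python
# Hypothetical sketch: attaching GLM 4.7 Flash's two boolean custom fields
# to an OpenAI-compatible chat-completion payload. Field names and placement
# are assumptions, not a documented interface.

def build_request(prompt: str,
                  enable_thinking: bool = True,
                  clear_thinking: bool = False) -> dict:
    """Assemble a chat payload with the model card's custom fields.

    Defaults mirror the card: thinking enabled, history not cleared.
    """
    return {
        "model": "glm-4.7-flash",
        "messages": [{"role": "user", "content": prompt}],
        "enable_thinking": enable_thinking,
        "clear_thinking": clear_thinking,
    }

payload = build_request("Summarize MoE routing in one paragraph.")
```

Keeping the fields as plain booleans in the payload makes it easy to flip thinking off for latency-sensitive calls while leaving the defaults intact elsewhere.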
Parameters
Custom configuration options included with this model
Sources
The underlying model files this model uses