- cross-posted to:
  - fosai@lemmy.world
  - hackernews
cross-posted from: https://piefed.zip/c/fosai/p/958141/30b-a3b-glm-4-7-flash-released
Small, fast model with an MIT license for local use.
Benchmarks look good for the size, but IMO these smaller models aren't consistent enough to live up to their promises.
Anyone get this working in llama.cpp yet?
I know flash attention and PyTorch have patchy support.
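If you want to poke at it before full support lands, here's a minimal sketch using llama-cpp-python, assuming a GGUF conversion exists. The filename is hypothetical, and `flash_attn` may be ignored or fail for a new architecture until a recent enough llama.cpp build supports it:

```python
# Minimal sketch: loading a hypothetical GGUF conversion of GLM-4.7-Flash
# with llama-cpp-python. The model filename is an assumption; flash_attn
# may not work for this architecture until llama.cpp adds support.
from llama_cpp import Llama

llm = Llama(
    model_path="GLM-4.7-Flash-30B-A3B-Q4_K_M.gguf",  # hypothetical filename
    n_ctx=8192,          # context window to allocate
    n_gpu_layers=-1,     # offload all layers to GPU if available
    flash_attn=True,     # assumption: may be unsupported for this arch
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```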
Oo. I use Qwen3-30B-A3B-Thinking-2507 as my generic “workhorse” local LLM, so this looks like it might be a nice upgrade with exactly the same basic specs. I’ll try it out.




