Close Menu
Tech Nova Mindset – Empower Innovation and Forward Thinking

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Anthropic becomes first AI startup to join the Frontier carbon removal coalition

    June 17, 2026

    How to watch ‘Million Dollar Nannies’ online from anywhere

    June 17, 2026

    I Found 22 Early Prime Day Deals That Are Worth Shopping Now

    June 17, 2026
    Facebook X (Twitter) Instagram
    Trending
    • Anthropic becomes first AI startup to join the Frontier carbon removal coalition
    • How to watch ‘Million Dollar Nannies’ online from anywhere
    • I Found 22 Early Prime Day Deals That Are Worth Shopping Now
    • IEEE’s 2026 Education Week Emphasized Lifelong Learning
    • Seattle migraine startup Vedana Therapeutics launches with $46M
    • Webinar Today: How Modern Breaches Bypass MFA and Evade Detection
    • Google is already killing off this helpful Gemini-powered tool
    • The White House Wants Anthropic to Block All Jailbreaks. That May Not Be Possible
    Tech Nova Mindset – Empower Innovation and Forward Thinking
    • Home
    • Gadgets
    • Reviews
    • Tech News
    • Future Tech
    • AI & Robotics
    • How-To Guides
    • More
      • Cybersecurity
      • Startups & Innovation
    Tech Nova Mindset – Empower Innovation and Forward Thinking
    Home»Startups & Innovation»Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost
    Startups & Innovation

    Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost

    kirklandc008@gmail.comBy kirklandc008@gmail.comJune 17, 2026No Comments7 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Today, Chinese AI startup Z.ai (formerly Zhipu AI) announced the immediate release of GLM-5.2, a 753-billion parameter open-weights large language model (LLM) engineered specifically to dominate “long-horizon” autonomous coding and engineering tasks.

    Available immediately on Hugging Face, the Z.ai API, and more than 20 third-party coding environments, the model boasts a highly stable 1-million-token context window alongside enterprise subscription tiers starting at just $12.60 per month.

    In excellent news for cost and security-conscious businesses, z.ai has released GLM-5.2’s core weights under an unrestricted MIT open-source license, allowing enterprises to download the model freely from Hugging Face, customize or fine-tune it to their liking, and run it potentially locally or via virtual machines for only the cost of their compute and electricity.

    This is an increasingly appealing option for enterprises, as state-of-the-art American proprietary models face an uncertain and potentially interrupted regulatory future, following the Trump Administration’s export control directive last week prohibiting foreign nationals from using Anthropic’s new Claude Fable 5 model (which that company responded to by taking the models in question entirely offline for all users).

    For enterprise technical decision-makers, z.ai’s GLM-5.2 provides a highly capable path to host frontier-level AI locally, entirely bypassing the geographic fencing and commercial limitations.

    IndexShare re-uses one indexer for every four sparse attention layers, reducing compute needs

    Under the hood, GLM-5.2 operates with 753 billion parameters and introduces a major architectural optimization called “IndexShare”.

    In standard massive language models, recalculating attention mechanisms across long documents is computationally exorbitant. IndexShare solves this by reusing the identical indexer across every four sparse attention layers.

    At the maximum 1-million-token context length, this single innovation reduces per-token compute FLOPs by a massive 2.9 times.

    The model also features an upgraded Multi-Token Prediction (MTP) layer for speculative decoding, which boosts accepted token length by up to 20% during inference.

    Additionally, Z.ai has implemented flexible, selectable “Thinking Modes”. Users can toggle the model’s reasoning effort between “Max,” designed to push the limits of logical problem-solving, or “High,” which strikes a careful balance between high-end performance and latency-sensitive token efficiency.

    State-of-the-art benchmarks for an open model, and matching, even beating proprietary leaders on some categories

    On industry-standard third-party benchmark tests, GLM-5.2 performs above most open source flagship models, even DeepSeek v4 and scores near or above its closed-weights rivals, OpenAI’s GPT-5.5 and Anthropic’s Claude Opus 4.8.

    GLM-5.2 benchmark comparison bar charts. Credit: z.ai

    The model particularly shines in agentic tool use and long-horizon software engineering tasks:

    • SWE-bench Pro: GLM-5.2 scored 62.1, decisively beating GPT-5.5 (58.6) and its own predecessor, GLM-5.1 (58.4).

    • FrontierSWE (Dominance): Designed to test long-horizon task completion, GLM-5.2 hit 74.4%, surpassing GPT-5.5 (72.6%) and finishing in a near-tie with Claude Opus 4.8 (75.1%).

    GLM-5.2 long horizon task benchmarks. Credit: z.ai

    • MCP-Atlas: On this tool-usage evaluation, GLM-5.2 achieved a 77.0, outscoring GPT-5.5 (75.3) and performing just shy of Claude Opus 4.8 (77.8).

    • Humanity’s Last Exam (w/ Tools): When equipped with external tools, GLM-5.2 reached a score of 54.7, coming out ahead of GPT-5.5 (52.2) and tracking closely behind Claude Opus 4.8 (57.9).

    • PostTrainBench & SWE-Marathon: In extended, multi-hour engineering workloads, GLM-5.2 consistently topped GPT-5.5, scoring 34.3% against GPT-5.5’s 25.0% on PostTrainBench, and 13.0% against GPT-5.5’s 12.0% on SWE-Marathon.

    While GLM-5.2 trails Claude Opus 4.8 and GPT-5.5 slightly on raw Terminal-Bench 2.1 scores (81.0 versus 85.0 and 84.0, respectively), it significantly outscores Google’s Gemini 3.1 Pro (74.0).

    Beyond traditional coding metrics, GLM-5.2 took an impressive first place on the crowdsourced design task benchmark Design Arena, beating out even the aforementioned state-of-the-art Claude Fable 5 with an ELO score of 1360.

    Furthermore, the impact of Z.ai’s new selectable “thinking modes” is clearly visible in the data: under the “Max” effort level, GLM-5.2 pushes to peak intelligence, but utilizes nearly 85k output tokens per task. Switching to the “High” effort setting sacrifices only a few points in performance while effectively halving the required token output, providing a crucial optimization lever for latency-sensitive applications.

    Available via Coding Plans and API

    To operationalize the model, Z.ai launched the GLM Coding Plan, aiming squarely at developer workflows rather than simple chat interfaces.

    The plan offers out-of-the-box support for third-party U.S. and global agentic coding harnesses and tools including Claude Code, OpenClaw, Cline, Kilo Code, Crush, and Factory, among others. The Coding Plan pricing tiers (when billed annually) are highly competitive:

    • Lite: $12.60 per month ($151.20 per year starting in the 2nd year), geared toward lightweight iteration on small repositories.

    • Pro: $50.40 per month for day-to-day development on mid-sized repositories, offering 5x the usage allowance of the Lite plan.

    • Max: $112.00 per month for heavy workloads, offering 20x the Lite usage and dedicated resources during peak hours.

    For enterprise developers integrating the raw model into their own applications, Z.ai’s API pricing undercuts its Western rivals significantly while matching the exact rates of the previous GLM-5.1 generation.

    GLM-5.2 API access is priced at $1.40 per million input tokens and $4.40 per million output tokens, making it a mid-priced model globally, but about

    Sorted by total cost (input + output) from least to most expensive. Pricing shown is standard pay-as-you-go pricing per 1 million tokens.

    Model

    Input

    Output

    Total Cost

    Source

    MiMo-V2.5 Flash

    $0.10

    $0.30

    $0.40

    Xiaomi MiMo

    deepseek-v4-flash

    $0.14

    $0.28

    $0.42

    DeepSeek

    deepseek-v4-pro

    $0.435

    $0.87

    $1.305

    DeepSeek

    MiniMax-M3

    $0.30

    $1.20

    $1.50

    MiniMax

    Gemini 3.1 Flash-Lite

    $0.25

    $1.50

    $1.75

    Google

    Qwen3.7-Plus

    $0.40

    $1.60

    $2.00

    Alibaba Cloud

    MiMo-V2.5

    $0.40

    $2.00

    $2.40

    Xiaomi MiMo

    Grok 4.3 (low context)

    $1.25

    $2.50

    $3.75

    xAI

    MiMo-V2.5 Pro (≤256K)

    $1.00

    $3.00

    $4.00

    Xiaomi MiMo

    Kimi-K2.6

    $0.95

    $4.00

    $4.95

    Moonshot/Kimi

    GLM-5.2

    $1.40

    $4.40

    $5.80

    Z.ai

    Grok 4.3 (high context)

    $2.50

    $5.00

    $7.50

    xAI

    MiMo-V2.5 Pro (>256K)

    $2.00

    $6.00

    $8.00

    Xiaomi MiMo

    Qwen3.7-Max

    $2.50

    $7.50

    $10.00

    Alibaba Cloud

    Gemini 3.5 Flash

    $1.50

    $9.00

    $10.50

    Google

    Gemini 3.1 Pro Preview (≤200K)

    $2.00

    $12.00

    $14.00

    Google

    GPT-5.4

    $2.50

    $15.00

    $17.50

    OpenAI

    Gemini 3.1 Pro Preview (>200K)

    $4.00

    $18.00

    $22.00

    Google

    Claude Opus 4.8

    $5.00

    $25.00

    $30.00

    Anthropic

    GPT-5.5

    $5.00

    $30.00

    $35.00

    OpenAI

    Claude Fable 5 / Claude Mythos 5

    $10.00

    $50.00

    $60.00

    Anthropic

    To further optimize costs for long-context workloads, Z.ai offers a cached input rate of just $0.26 per million tokens, alongside a limited-time offer for free cached input storage.

    The stark contrast between open-weights innovators and proprietary Western labs has not gone unnoticed by the developer community.

    On X, prolific AI observer Lisan al Gaib (@scaling01) argued that “frontier labs are absolutely scamming you on API pricing”.

    The post noted that while massive open models like the 744-billion-parameter GLM-5.2 charge $4.40 per million output tokens and DeepSeek-V4-Pro (1.6 trillion parameters) charges just $0.87, proprietary models demand heavy premiums: Anthropic’s Sonnet 4.6 and Opus 4.8 charge $15.00 and $25.00 respectively, while OpenAI’s GPT-5.5 costs $30.00 for output.

    Highlighting that open-model developers are operating profitably without relying on the newest “fancy Blackwell chips,” the commentator suggested that leading proprietary labs are “probably at 90%+ margins at this point”.

    The beauty of the unmodified MIT License for enterprise use

    The most disruptive aspect of the GLM-5.2 release is its licensing. Z.ai released the model’s weights under an MIT open-source license, establishing it as a “Pure Open” system.

    The company’s technical documentation explicitly notes that this license guarantees “no regional limits” and allows “technical access without borders”.

    For enterprise technology leaders, an MIT license means the software can be used, modified, and commercialized without paying royalties or adhering to restrictive “acceptable use” governance policies common to dual-use licenses.

    It allows engineering teams to host frontier-level AI on their own sovereign infrastructure, entirely eliminating vendor lock-in.

    Warm reception among AI developers and toolmakers

    The developer reaction to the release has been immediate and overwhelmingly positive.

    The team behind Kilo Code confirmed day-one integration, posting on X: “GLM-5.2 runs in Kilo Code on day one. The 1M context window and Max effort mode are both live. Point your config at it and go!”.

    Open-source coding environment Cline IDE echoed this sentiment on X, noting the economic advantage: “GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench, and beats every other open model available. It also beats Gemini, making it a frontier-level model for a fraction of the cost. Open weights is back. This model is a game changer. Available in Cline now!”.

    Similarly, rival open source coding desktop agent Eigent AI also tested the model’s new capabilities on complex agentic workflows, noting on X: “threw a real long-horizon task: research 30 companies across 6 sectors of the AI infrastructure stack, structure it into JSON, then build an interactive HTML report… where 5.2 pulls ahead: -> plans…”.

    16th beats benchmarks coding cost GLM5.2 GPT5.5 longhorizon multiple openweights Z.ais
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    kirklandc008@gmail.com
    • Website

    Related Posts

    Seattle migraine startup Vedana Therapeutics launches with $46M

    June 17, 2026

    Your smart home subscriptions are about to cost a lot more money—here’s what you should do about it

    June 17, 2026

    AI, Quantum And The New Cybersecurity Framework Imperative

    June 17, 2026
    Leave A Reply Cancel Reply

    Top Posts

    Google DeepMind Plans to Track AGI Progress With These 10 Traits of General Intelligence

    March 21, 20263 Views

    The AirPods 4 and Lego’s brick-ified Grogu are our favorite deals this week

    October 12, 20253 Views

    Nothing CEO says phone prices are going to keep going up

    June 12, 20262 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Recent Posts
    • Anthropic becomes first AI startup to join the Frontier carbon removal coalition
    • How to watch ‘Million Dollar Nannies’ online from anywhere
    • I Found 22 Early Prime Day Deals That Are Worth Shopping Now
    • IEEE’s 2026 Education Week Emphasized Lifelong Learning
    • Seattle migraine startup Vedana Therapeutics launches with $46M

    Anthropic becomes first AI startup to join the Frontier carbon removal coalition

    June 17, 2026

    How to watch ‘Million Dollar Nannies’ online from anywhere

    June 17, 2026

    I Found 22 Early Prime Day Deals That Are Worth Shopping Now

    June 17, 2026

    IEEE’s 2026 Education Week Emphasized Lifelong Learning

    June 17, 2026
    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    © 2026 TechNovaMindset. Designed by By Pro.

    Type above and press Enter to search. Press Esc to cancel.