Qwen 3.7 open-weight watch — what is confirmed, what is still rumor, what to actually do now
Qwen 3.7-Max launched on the API two days ago, but the open-weight release the local-LLM crowd is waiting for has no announced date. Here is what is actually known, what is reasonable to project from Alibaba's past cadence, and what to run right now.
Two days after Qwen 3.7-Max landed on the API, the loudest question on r/LocalLLaMA and the Qwen GitHub issues isn’t “is it good” — it’s “when can I download it and run it on my own hardware?”
Short answer as of May 22, 2026: nobody outside Alibaba knows for sure, and the GitHub evidence is unambiguous. Here’s the Qwen org’s Pinned repos right now:
What’s confirmed
- Qwen 3.7-Max API: live on Alibaba Cloud Model Studio / DashScope since May 20-21, 2026 — see our Qwen 3.7-Max walkthrough for the real screenshots and how to use it via the free Qwen Chat
- Max stays proprietary: per Alibaba’s release patterns and third-party reporting, the flagship Max variant is API-only. Smaller variants are what historically get open-weighted
- Apache 2.0 is Alibaba’s consistent open-weight license — Qwen 3.5-27B, Qwen 3.6-35B-A3B (MoE), and Qwen 3.6-27B (dense) all shipped under Apache 2.0
What’s a reasonable projection (not confirmed)
Alibaba’s pattern for the Qwen 3.6 family was: API first, open weights about 3 weeks later.
| Milestone | Qwen 3.6 (last cycle) | Qwen 3.7 (this cycle, projected) |
|---|---|---|
| API launch | Late March 2026 | May 20-21, 2026 ← confirmed |
| First open-weight (MoE) | April 17, 2026 (~3 weeks later) | ~mid June 2026 (projected) |
| Dense variant follow-up | April 23, 2026 (+6 days) | ~late June 2026 (projected) |
These are projections, not promises. Alibaba has published no Qwen 3.7 open-weight timeline. If the cadence breaks (either way), the projection above is wrong. Don’t commit production migrations to it.
What’s pure speculation right now
- Specific parameter counts (community guesses range from “another 35B MoE” to “70B dense” — none confirmed)
- HuggingFace repo URLs (no
Qwen/Qwen3.7-*placeholder has been created) - Whether a smaller (sub-10B) variant will ship for laptop / Mac unified memory
- Whether Qwen 3.7 will require new tokenizer / inference-stack changes vs Qwen 3.6
What to actually do right now
Three options ordered by pragmatism:
- Run Qwen 3.6-35B-A3B (MoE) locally today. It’s already on HuggingFace, Apache 2.0, runs in ~21GB VRAM at Q4. It’s not 3.7 but it’s what’s available, and the agentic workflow improvements Alibaba shipped in 3.7 mostly require the API anyway (Max-only).
- Use Qwen 3.7-Max via API now for the agent-frontier capability. $1-3 per million tokens (final pricing not yet posted on DashScope); test via chat.qwen.ai free first.
- Watch the Qwen GitHub org weekly. A new top-level
Qwen3.7repo appearing is the unambiguous signal that an open-weight release is imminent. Subscribe to GitHub releases on github.com/QwenLM/Qwen3.6 — the next major bump there often telegraphs the new family.
For builders deciding between locking in on Claude Haiku 4.5 (closed, API-only) vs Qwen 3.7 (closed Max API today, but a future open variant): see our Gemini 3.5 Flash vs Claude Haiku 4.5 comparison for the frame, then add Qwen as “Apache-2.0-licensed-eventually” — which is a real differentiator if your stack needs self-hosting.
Sources
- Qwen GitHub organization — the screenshot above
- Qwen3.6 GitHub repo — current open-weight family
- HuggingFace: Qwen/Qwen3.6-35B-A3B — the largest currently-open Qwen
- HuggingFace: Qwen/Qwen3.6-27B — current dense open variant
- aimadetools — How to Run Qwen 3.7 Locally — projected timeline analysis
- r/LocalLLaMA discussion thread — where this question is being asked
Source: Qwen GitHub org