2026-05-21 09:16 UTCIn-site rewrite1 min readUpdated: 2026-06-30 13:03 UTC

Qwen 3.7 Tops Chinese AI Models Globally, Ranks Fifth Worldwide

According to the latest Artificial Analysis ranking, Alibaba's Qwen3.7-Max scored 56.6, ranking first among Chinese models and fifth globally, approaching the performance of GPT, Claude, and Gemini. The model is designed for agentic AI, achieving breakthroughs in coding, agent, and reasoning capabilities, and will soon be available on Alibaba Cloud Bailian.

Source量子位Author: 量子位的朋友们

On May 21, 2026, independent AI benchmarking platform Artificial Analysis released its latest global large model ranking. Alibaba's flagship model, Qwen3.7-Max, achieved a score of 56.6, surpassing all other Chinese models including Kimi-K2.6, DeepSeek-v4-Pro-Max, and GLM5.1, and securing the fifth position worldwide. This marks a significant milestone for Chinese AI development, as the model's performance is now comparable to top-tier models such as GPT-5.4 (xhigh), Gemini3.1 Pro Preview, and Claude-Opus4.7 (max). The ranking is widely recognized in the industry as one of the most authoritative and influential benchmarks for large language models.

Qwen3.7-Max is specifically designed for agentic AI, featuring substantial breakthroughs in core capabilities like coding, agent behavior, and reasoning. The model integrates seamlessly with various agent frameworks, including Claude Code, OpenClaw, Hermes Agent, and Qwen Code. It can independently execute complex long-horizon tasks involving up to 35 hours of continuous work and over 1,000 tool invocations, delivering production-grade results suitable for enterprise-level challenges. This represents a leap forward in autonomous AI agent capabilities.

According to sources close to the matter, Qwen3.7-Max will soon be available as an API service on Alibaba Cloud Bailian, further expanding its accessibility for developers and businesses. This achievement continues Alibaba's dominance in the Chinese AI landscape, following the previous milestone of Qwen3.6-Max-Preview, which had set the best performance for Chinese models just one month earlier. The rapid improvement underscores the intense competition and innovation in the Chinese AI industry, with Alibaba consistently pushing the boundaries of what is possible with large language models.