Here is a number that should make every tech executive in Silicon Valley put down their coffee: 6%.
That is not a discount code. It is not a rounding error. It is the percentage of the industry‘s typical pre-training cost that Baidu’s brand-new ERNIE 5.1 model burned to achieve performance that now sits behind only three models on the entire planet in search capability. Only OpenAI‘s GPT-5.5 and Anthropic’s Claude Opus 4.6 and 4.7 rank higher. Everyone else — Google, Meta, xAI, Mistral — is looking up at a model that cost roughly the price of a nice office renovation to train, while their own models demanded the financial equivalent of a small nation‘s GDP.
This story broke at Baidu Create 2026, the company’s annual AI developer conference held in Beijing on May 13 and 14, and it landed like a thunderclap. Baidu‘s Hong Kong-listed shares surged more than 7% to a three-month high the morning after the keynote. The market was not just reacting to a single model. It was reacting to a thesis — one that has been quietly building for eighteen months and is now impossible to ignore: Chinese AI companies, hemmed in by American semiconductor export controls, are not collapsing. They are getting leaner, meaner, and far more dangerous.
The story of ERNIE 5.1 begins with a technical philosophy Baidu calls “Multi-dimensional Elastic Pre-training,” but the plain-English version is more provocative. Instead of training each model size from scratch — the brute-force approach that has turned AI development into an arms race of GPU hoarding — Baidu built a framework called “Once-For-All.” In a single training run, the system simultaneously optimizes a vast family of sub-models that share weights but differ in depth, width, and the number of activated expert blocks. Think of it as baking one giant cake from which you can slice dozens of perfectly formed smaller cakes, rather than baking each one in a separate oven.
From the massive ERNIE 5.0 architecture — roughly 2.4 trillion parameters, launched in January 2026 — Baidu extracted an optimized subnet to create ERNIE 5.1. The result is startling: total parameters compressed to about one-third, activation parameters cut to roughly one-half, and a pre-training compute bill that landed at approximately 6% of what comparable models cost elsewhere. This is not some marketing sleight-of-hand claiming “we built a Ferrari for the price of a bicycle.” It is a statement about efficiency: at the same parameter scale and performance tier, Baidu used only 6% of the compute that the rest of the industry burns. The word “efficiency” hardly does it justice.
The numbers on the leaderboard are what turn this from an engineering curiosity into a strategic event. On the Arena Search leaderboard — the global benchmarking platform that pits AI models against each other in blind, crowd-sourced evaluations — ERNIE 5.1 scored 1,223 points, securing fourth place globally and first among all Chinese models. It is the only Chinese model on that list. The three models above it — GPT-5.5, Claude Opus 4.6, and Claude Opus 4.7 — are products of companies with essentially unrestricted access to the world‘s most advanced GPUs. Baidu operates under a different reality.
Since October 2022, the United States has imposed increasingly tight export controls on advanced semiconductors bound for China. The explicit goal was to slow down China’s AI progress by starving its labs of cutting-edge hardware. For a while, the assumption in Western capitals was that this would work — that Chinese models would inevitably fall behind as the compute gap widened. ERNIE 5.1 suggests something quite different happened instead. Facing hardware scarcity, Chinese labs turned necessity into a competitive advantage. They stopped trying to out-spend and started trying to out-think. The result is a model that performs at the frontier while using a fraction of the resources. If anything, the sanctions may have accelerated China‘s push toward algorithmic efficiency — and in doing so, may have produced a capability that is far more scalable and commercially threatening than a brute-force model could ever be.
The agentic capabilities of ERNIE 5.1 tell a parallel story. On the τ³-bench and SpreadsheetBench-Verified evaluation tasks, ERNIE 5.1 surpassed DeepSeek-V4-Pro, with agentic abilities now approaching the level of leading closed-source models. Its creative writing matches Gemini 3.1 Pro. On the AIME26 mathematics competition using tools, it scored 99.6, second only to Gemini 3.1 Pro. These are not niche metrics — they represent the core competencies that enterprises care about when deciding which AI to embed into their workflows.
And that is where Baidu Create 2026 got truly interesting. CEO Robin Li did not spend the keynote doing a victory lap on model benchmarks. Instead, he introduced a new metric for the AI era: Daily Active Agents, or DAA. His argument was blunt. Tokens measure input and cost; DAA measures output and value. How many AI agents are actually out there doing work and delivering results for humans? Li predicted that global DAA could easily surpass 10 billion in the near future. This is not a technologist obsessing over parameters. This is a CEO repositioning his company around outcomes.
Baidu rolled out four major agent products to back up the rhetoric. DuMate is a general-purpose AI agent that can autonomously handle emails, analyze sales data, generate procurement recommendations, create promotional materials, and even build mini-applications on the fly — it already hits state-of-the-art levels on multiple international agent benchmarks. Miaoda, a coding agent now in version 3.0, lets users build applications by simply describing what they want; during the keynote, an eight-year-old child built a working app on stage. Baidu Yijing upgrades the company’s digital human technology into what it calls the world‘s first full-scenario multi-agent digital human platform, now expanding internationally with multi-language support. And Famo 2.0, a self-evolving decision-making agent, has already delivered a 10.21% efficiency improvement for the Port of Qingdao’s automated terminal operating system. These are not demos. They are deployed products touching real economic activity.
The infrastructure underneath all of this is equally telling. Baidu‘s homegrown Kunlunxin P800 chip has completed scaled verification, with multiple ten-thousand-card clusters delivered since 2025. An entirely domestic Kunlunxin cluster successfully trained a key version of ERNIE 5.1 with a 97% effective training rate and linear scalability exceeding 85% at the ten-thousand-card level. The Tianchi 256-card supernode launches in June, promising a 25% throughput improvement and 50% better inference efficiency for mainstream models. The “chips, cloud, models, agents” stack — what Baidu calls its full-stack AI capability — is now demonstrably real.
To be clear, some skepticism is warranted. Industry analysts have noted that the 6% figure is likely an exaggerated marketing number, and there is a healthy debate about whether ERNIE 5.1’s performance on narrow benchmarks translates to broad real-world superiority. But even if the true cost figure is 10% or 15%, the directional signal remains the same: China has figured out how to do more with less, and that capability compounds over time.
For a global audience that has spent two years fixated on the OpenAI-versus-Google rivalry, Baidu Create 2026 should serve as a wake-up call. The AI race is not a two-player game. It is a multi-front competition in which resource constraints are breeding a different kind of innovation — one that prizes efficiency over scale, deployment over demos, and agentic autonomy over chatbot cleverness. The company often dismissed as “China‘s Google” just showed the world that it can train a top-four global model for pocket change. What happens when that efficiency advantage gets applied to the next generation of models, and the one after that?
The answer, increasingly, looks like a world where the AI frontier is not defined by who can afford the most GPUs, but by who can squeeze the most intelligence out of every single one. Right now, the company making the strongest case for that title is headquartered in Beijing.