DeepSeek R1-0528: Chinaโ€™s AI Breakthrough Challenges Silicon Valleyโ€™s Dominance

May 28, 2025
DeepSeek quietly released DeepSeek R1-0528, an upgraded version of its reasoning model that the Chinese startup describes as a โ€œminor trial upgradeโ€. Yet the improvements are anything but minor, delivering performance gains that position this open-source model dangerously close to proprietary alternatives from OpenAI and Google.
For enterprises navigating the complex AI ecosystem, this development signals a critical inflection point where cost-effective, open-source models are rapidly approachingโ€”and in some cases surpassingโ€”their expensive closed-source counterparts.

Major Performance Leaps in Mathematical Reasoning

In the latest update, DeepSeek R1 has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training. The results speak volumes about the modelโ€™s enhanced capabilities.
For instance, in the AIME 2025 test, the modelโ€™s accuracy has increased from 70% in the previous version to 87.5% in the current version. This 17.5-point improvement in mathematical reasoning represents one of the most dramatic performance gains seen in recent AI model updates.
The model has demonstrated outstanding performance across various benchmark evaluations, including mathematics, programming, and general logic. Its overall performance is now approaching that of leading models, such as O3 and Gemini 2.5 Pro.

The Power of Deeper Reasoning

Whatโ€™s driving these improvements? The model demonstrates deeper chain-of-thought reasoning, using nearly double the tokens per query on challenging problems (averaging 23K tokens of โ€œthinkingโ€ vs 12K before). This intensive computational approach allows the model to work through complex problems more thoroughly, resulting in significantly higher accuracy rates.
โ€œDeepSeekโ€™s latest upgrade is sharper on reasoning, stronger on math and code, and closing in on top-tier models like Gemini and O3,โ€ according to Adina Yakefu, AI researcher at Hugging Face. The upgraded model has โ€œmajor improvements in inference and hallucination reductionโ€.

Game-Changing Distilled Models

Perhaps even more significant for businesses is DeepSeekโ€™s release of distilled models. DeepSeek also released a smaller, โ€œdistilledโ€ version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks. The smaller updated R1, which was built using the Qwen3-8B model Alibaba launched in May as a foundation, performs better than Googleโ€™s Gemini 2.5 Flash on AIME 2025.
According to the cloud platform NodeShift, Qwen3-8B requires a GPU with 40GB-80GB of RAM to run (e.g., an Nvidia H100). DeepSeekโ€™s distilled new R1 AI model can run on a single GPU, putting it within reach of hobbyists.
This accessibility democratizes advanced AI reasoning capabilities, allowing smaller organizations to deploy sophisticated models without massive infrastructure investments.

Competitive Positioning Against Industry Leaders

How does DeepSeek R1-0528 stack up against the competition? The upgraded DeepSeek R1 model is just behind OpenAIโ€™s o4-mini and o3 reasoning models on LiveCodeBench, a site that benchmarks models against different metrics.
Notably, o3โ€™s superior scores sometimes rely on running in a costly โ€œhigh effortโ€ mode (using extended thinking or tool use), which is computationally expensive. R1โ€“0528 nearly matches o3โ€™s performance without such extreme settings.
For coding tasks, the o4-mini-high mode achieved ~69% on the Aider-Polyglot test, essentially on par with DeepSeek R1โ€“0528โ€™s 71โ€“72% on that test.

Open Source Advantages and Commercial Licensing

DeepSeek-R1-0528-Qwen3-8B is available under a permissive MIT license, meaning it can be used commercially without restriction. This licensing model removes barriers for businesses looking to integrate advanced AI reasoning capabilities into their products and services.
The use of DeepSeek-R1 models is also subject to MIT License. DeepSeek-R1 series (including Base and Chat) supports commercial use and distillation, providing unprecedented flexibility for enterprise deployment.

Enhanced Capabilities and Reduced Hallucinations

In May 2025, they released DeepSeek-R1-0528, an upgraded version with better benchmark performance, fewer hallucinations, and new capabilities like function calling and JSON output support. This update introduces several key improvements: improved benchmark performance across both reasoning and factual tasks, enhanced front-end capabilities for smoother interaction in chat platforms, reduced hallucinations, increasing factual reliability, and support for JSON output and function calling.
These improvements make the model more suitable for production environments where reliability and structured output formats are critical business requirements.

Strategic Implications for AI Cost Management

The implications for AI cost management are profound. Organizations using multi-model AI platforms can now access reasoning capabilities that rival proprietary models at dramatically reduced costs. The availability of multiple model sizesโ€”from 8 billion to 671 billion parametersโ€”allows businesses to optimize their AI infrastructure based on specific use case requirements.
Where OpenAI o1 costs $15 per million input tokens and $60 per million output tokens, DeepSeek Reasoner, which is based on the R1 model, costs $0.55 per million input and $2.19 per million output tokensโ€”representing cost savings of over 90%.

The Future of Open Source AI Leadership

This version shows DeepSeek is not just catching up, itโ€™s competing. The rapid advancement of open-source models like DeepSeek R1-0528 suggests that the future of AI may be more distributed and accessible than many anticipated. For enterprises, this trend toward high-performance, cost-effective open-source models reinforces the value of unified AI platforms that provide access to multiple model vendors. As the competitive landscape continues to evolve rapidly, the ability to switch between models based on performance, cost, and specific requirements becomes a critical strategic advantage. The DeepSeek R1-0528 upgrade represents more than just incremental improvementโ€”it signals a fundamental shift in AI economics where exceptional performance no longer requires premium pricing or proprietary access.
Ready to optimize your AI costs without sacrificing performance? StickyPrompts gives you instant access to DeepSeek R1-0528 alongside leading models from OpenAI, Anthropic, and Googleโ€”all through one unified interface with transparent, usage-based pricing. Start comparing models and cutting costs today with our free trial.
Start your free Sticky Prompts trial now! ๐Ÿ‘‰ ๐Ÿ‘‰ ๐Ÿ‘‰