Perhaps even more significant for businesses is DeepSeekโs release of distilled models. DeepSeek also released a smaller, โdistilledโ version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks. The smaller updated R1, which was built using the Qwen3-8B model Alibaba launched in May as a foundation, performs better than Googleโs Gemini 2.5 Flash on AIME 2025.
According to the cloud platform NodeShift, Qwen3-8B requires a GPU with 40GB-80GB of RAM to run (e.g., an Nvidia H100). DeepSeekโs distilled new R1 AI model can run on a single GPU, putting it within reach of hobbyists.
This accessibility democratizes advanced AI reasoning capabilities, allowing smaller organizations to deploy sophisticated models without massive infrastructure investments.