0次浏览 发布时间:2025-08-20 11:27:00
TMTPOST -- Chinese AI company DeepSeek on Tuesday released DeepSeek V3.1, a powerful open-source language model that insiders say may match the capabilities of the most advanced proprietary systems from U.S. tech giants—with one critical difference: it’s free and open to all.
A major breakthrough in DeepSeek V3.1 was uncovered by "Rookie", a moderator of the DeepSeek and LocalLLaMA community forums. Rookie claims to have identified four special tokens embedded within the model architecture, two of which have garnered particular attention:
Search Token: Enables the model to connect to the internet in real time, retrieving the most up-to-date information.
Thinking Token: Allows the model to engage in internal reasoning and multistep “thought” processes, improving depth and coherence in its responses.
“These aren’t just upgrades—they solve fundamental pain points that other hybrid systems have been struggling with for years,” Rookie wrote in a widely shared forum post. “With V3.1, DeepSeek has created a true ‘hexagonal warrior’—a model balanced across all dimensions: speed, reasoning, context length, and accessibility.”
The release timing appears anything but accidental. DeepSeek V3.1 arrived just weeks after the launch of OpenAI’s GPT-5 and Anthropic’s Claude 4—two flagship models positioned as the pinnacle of Western AI development. Yet while these models remain tightly guarded behind commercial APIs and usage limits, DeepSeek’s offering is fully open-source.
This divergence isn’t just technical—it’s philosophical.
“When U.S. companies treat advanced AI as proprietary gold mines,” observed journalist Poe Zhao, “China’s DeepSeek hands out the pickaxes.”
At its core, the rivalry between these models reflects a deeper ideological split:
The American Model: Closed-source, centralized, and monetized through high-cost APIs and restricted access.
The Chinese Model: Open-source, decentralized, and geared toward widespread adoption as a public good.
By removing the "R1" label and consolidating all access under the unified V3.1 interface that features a 128k context window, consistent formatting, and uniform response behavior, DeepSeek appears to be streamlining its ecosystem in a move observers see as a response to model fragmentation in China’s AI scene.
While GPT-5 and Claude 4 remain at the forefront in terms of private sector innovation, DeepSeek V3.1’s release signals a strategic realignment. By democratizing access to cutting-edge AI, DeepSeek may be rewriting the rules of global AI competition and reshaping who gets to participate in building the future of intelligence.
As open-source communities around the world rush to experiment with and deploy DeepSeek V3.1, the implications of this release are only beginning to unfold. One thing is clear: the AI race is no longer just about performance and it’s about access as well.