DeepSeek Releases New Version of Model Behind Its AI Chatbot – PYMNTS.com

Chinese artificial intelligence startup DeepSeek has released a new version of the open-source AI model behind its controversial chatbot.
Citing a post on DeepSeek’s official WeChat group, Bloomberg reported that DeepSeek V3.1 is ready for testing.
The new version has a longer context window, or space for prompting, of 128,000 tokens. That’s roughly 96,000 words or about two 200-page English novels.
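The word estimate is consistent with a common rule of thumb of about 0.75 English words per token. A minimal sketch of that arithmetic follows; the ratio and the `estimate_words` helper are assumptions for illustration and do not reflect the actual behavior of DeepSeek's tokenizer.

```python
# Rule-of-thumb ratio (an assumption, not DeepSeek's real tokenizer ratio).
WORDS_PER_TOKEN = 0.75


def estimate_words(tokens: int, words_per_token: float = WORDS_PER_TOKEN) -> int:
    """Estimate how many English words fit in a given number of tokens."""
    return round(tokens * words_per_token)


context_window = 128_000  # tokens, as reported for V3.1
print(estimate_words(context_window))  # 96000
```

Actual counts vary with the language and the kind of text, so the figure is best read as an order-of-magnitude guide.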
DeepSeek’s V3 model caused a stir in January when the startup claimed it cost only $5.6 million to train, using about 2,000 of Nvidia’s slower chips.
That’s far less than the sums reportedly spent to train frontier models from OpenAI, Google, Anthropic and others. The news wiped $600 billion of market value from Nvidia in one day. But some governments soon banned the use of the DeepSeek chatbot out of concern that user data would be kept on Chinese servers.
While the startup didn’t share much more on WeChat, a post on Reddit said the latest version of the chatbot is “very, very verbose,” and also observed that the “r1 in the think button” has disappeared, indicating V3.1 could be a mixed reasoning model.
R1 is a reasoning model that DeepSeek also developed. It is offered through the three major U.S. hyperscalers: AWS, Microsoft Azure and Google Cloud. The cloud providers have said the model is hosted locally, so data would not be sent to China.
Developers are still waiting for R2, the successor to R1, according to Bloomberg.
In the global AI race, only China is able to compete effectively with the U.S., Bloomberg reported. Chinese companies such as Alibaba, DeepSeek and Moonshot have developed AI models with capabilities approaching those of the best U.S. models.
While the U.S. has banked on largely closed, proprietary AI models, China has pushed open-source models that generally are free to download and use. China is sacrificing short-term profits to ensure Chinese AI is adopted globally, according to Bloomberg. China’s 14th five-year blueprint for development, released in 2020, favored the open-source approach. Some Chinese artificial intelligence startup managers also believe the fastest way to enter new markets and compete with U.S. models is to offer open AI models.