China's DeepSeek Rumoured To Launch R2 Model, Here's What To Expect

A Reuters report in March said DeepSeek was preparing to launch R2 as soon as this month. But the company is yet to confirm the date.

Edited by: Bhavya Sukheja
Feature
Apr 30, 2025 09:46 am IST
- Published On Apr 30, 2025 09:40 am IST
- Last Updated On Apr 30, 2025 09:46 am IST

Read Time: 3 mins

Twitter
WhatsApp
Facebook
Reddit
Email

China's DeepSeek Rumoured To Launch R2 Model, Here's What To Expect

Experts have claimed that R2 is a "better vision" than R1.

DeepSeek is set to release its advanced AI model, DeepSeek-R2.
R2 aims to outperform OpenAI's GPT-4, being 97.3% cheaper to develop.
The model will utilize a hybrid mixture-of-experts architecture for efficiency.

Did our AI summary help?

Let us know.

Chinese artificial intelligence startup DeepSeek is ready with an advanced model, which is expected to be released in the coming days. According to the South China Morning Post (SCMP), DeepSeek-R2, the successor to the R1, will be cheaper and better, giving tough competition to ChatGPT's maker OpenAI. Notably, these speculations swirling on social media come amid an intensifying US-China tech war. It also comes months after the startup released two advanced open-source AI models, V3 and R1, which were built at a fraction of the cost and computing power that major tech companies typically require for large language model (LLM) projects.

What to expect?

According to SCMP, the new advanced model, R2 is said to have been developed with a so-called hybrid mixture-of-experts (MoE) architecture, making it 97.3 per cent cheaper than OpenAI's GPT-4o model. MoE is a machine-learning approach that divides an AI model into separate sub-networks to jointly perform a task. This will greatly reduce computation costs during pre-training and achieve faster performance during inference time, the outlet reported.

Experts have claimed that R2 is a "better vision" than R1, which had no vision functionality. Additionally, it is expected to feature 1.2 trillion parameters and will be trained on 5.2 petabytes of data.

With this new model, DeepSeek could position Huawei as the first major challenger to NVIDIA, experts said. The AI startup is also planning to take over Meta in dominating the open-source AI category by making its own models free to use, they added.

Also Read | Meet Sukant Singh Suki, First Indian To Complete Three 200-Mile Ultramarathons

A Reuters report in March said DeepSeek was preparing to launch R2 in April. But the company is yet to confirm the date.

DeepSeek-V3-0324

Notably, DeepSeek has rapidly emerged as a notable player in the global AI landscape in recent months, releasing a series of models that compete with Western counterparts while offering lower operational costs.

In March, the company released a major upgrade to its V3 large language model, intensifying competition with US tech leaders like OpenAI and Anthropic. According to Reuters, the new model was made available through AI development platform Hugging Face, marking the company's latest push to establish itself in the rapidly evolving AI market.

At the time, experts said DeepSeek-V3-0324 demonstrates significant improvements in areas such as reasoning and coding capabilities compared to its predecessor, with benchmark tests showing enhanced performance across multiple technical metrics published on Hugging Face.

Show full article

Track Latest News Live on NDTV.com and get news updates from India and around the world

China, DeepSeek, DeepSeek R2

Committed To Closer Ties With India, Says Justin Trudeau Amid Row

In Avoiding Repeat Of Telangana, BJP Pays Price In Tamil Nadu

Man Complains Of Stomach Pain For Years, Doctors Find This Inside His Body

"They Can Speak For...": US On India's Response On Canada's Allegations

China's DeepSeek Rumoured To Launch R2 Model, Here's What To Expect

A Reuters report in March said DeepSeek was preparing to launch R2 as soon as this month. But the company is yet to confirm the date.

What to expect?

DeepSeek-V3-0324