Alibaba updates AI model Qwen, claims superior performance

Jan. 31, 2025.
2 mins. read.

3 Interactions

About the Writer

Giulio Prisco

212.44874 MPXR

Giulio Prisco is Senior Editor at Mindplex. He is a science and technology writer mainly interested in fundamental science and space, cybernetics and AI, IT, VR, bio/nano, crypto technologies.

Alibaba has updated its artificial intelligence (AI) model Qwen and claimed that the new release Qwen 2.5-Max outperforms DeepSeek-V3, Reuters reports.

The latest Alibaba technical report on Qwen is the “Qwen2 Technical report” published on arXiv in September 2024.

Qwen2.5-Max “has been pretrained on over 20 trillion tokens and further post-trained with curated Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) methodologies,” says Alibaba’s announcement. “Today, we are excited to share the performance results of Qwen2.5-Max and announce the availability of its API through Alibaba Cloud. We also invite you to explore Qwen2.5-Max on Qwen Chat!”

The announcement presents the performance results of Qwen2.5-Max alongside leading state-of-the-art models, including DeepSeek V3, GPT-4o, and Claude-3.5-Sonnet.

Chinese AI technology in rapid motion

DeepSeek’s AI models, V3 and the recently released R1, have shaken up the tech world, causing stock prices to drop. Investors now question the high costs of AI development in the US due to DeepSeek’s low-cost strategy.

As reported by Reuters, DeepSeek’s founder Liang Wenfeng said that his company focuses on AGI, or artificial general intelligence. Liang believes big tech companies might not be ideal for future AI due to their high costs and rigid structures. He thinks smaller, more flexible teams like DeepSeek can innovate better. “Large foundational models require continued innovation, tech giants’ capabilities have their limits,” he said.

DeepSeek’s achievements have pushed Chinese competitors like Alibaba to improve their AI. DeepSeek-V2, the predecessor of V3, started an AI price war in China by being very cheap and open-source. This led Alibaba to slash prices on its models by up to 97%. Other companies like Baidu and Tencent also cut costs following suit.

Reuters also reports that TikTok owner ByteDance quickly updated its AI model after DeepSeek-R1’s release, claiming it beat OpenAI’s o1 in a benchmark test known as AIME, which checks how well AI understands complex instructions. DeepSeek-R1 also claimed to match OpenAI’s o1 in several benchmarks.

#LargeLanguageModels(LLMs)

Let us know your thoughts! Sign up for a Mindplex account now, join our Telegram, or follow us on Twitter.

2 thoughts on “Alibaba updates AI model Qwen, claims superior performance”

This is amazing. DeepSkeek triggers every company in the sector to do their best.

1 Like

Dislike

💯 💘 😍 ✨ 🎉 👏
🟨 😴 😡 ❌ 🤮 💩

Giulio Prisco
3 mons ago
212.44874 MPXR

1 interactions

IF DeepSeek really managed to do the same things with much less money, then it is a big thing and other companies in the sector are forced to catch up. However, I think DeepSeek has vastly underreported the real costs. But even if the real cost reduction is not a factor 1000 but a mere factor 10 (as I think it is likely), it is still big.

1 Like

Dislike

💯 💘 😍 ✨ 🎉 👏
🟨 😴 😡 ❌ 🤮 💩

Share

Reply

firehiwot kebede
3 mons ago
15.69978 MPXR

2 interactions

This is amazing. DeepSkeek triggers every company in the sector to do their best.


Dislike

💯 💘 😍 ✨ 🎉 👏
🟨 😴 😡 ❌ 🤮 💩


1. Giulio Prisco
  3 mons ago
  212.44874 MPXR
  
  1 interactions
  
  IF DeepSeek really managed to do the same things with much less money, then it is a big thing and other companies in the sector are forced to catch up. However, I think DeepSeek has vastly underreported the real costs. But even if the real cost reduction is not a factor 1000 but a mere factor 10 (as I think it is likely), it is still big.
  
  
  Dislike
  
  💯 💘 😍 ✨ 🎉 👏
  🟨 😴 😡 ❌ 🤮 💩

Exciting News! Our Mobile App is Here!

Welcome Back

No account? Create One

Join

Already have an account? Sign in

forgot password

Alibaba updates AI model Qwen, claims superior performance

About the Writer

Giulio Prisco

RELATED NEWS

Solving train scheduling and other complex problems with AI

Stargate’s $500 billion AI data center plan considers UK investment

From Tokens to Thought: Meta Chain-of-Thought and the Evolution of Reasoning in AI

New photonic chip speeds up AI training

Chinese AI technology in rapid motion

share

Copy link

Facebook

Twitter

Telegram

Linkedin

Interactions