Alibaba updates AI model Qwen, claims superior performance
Jan. 31, 2025.
Alibaba has updated its artificial intelligence (AI) model Qwen and claims that the new release, Qwen2.5-Max, outperforms DeepSeek-V3, Reuters reports.
The latest Alibaba technical report on Qwen is the “Qwen2 Technical report” published on arXiv in September 2024.
Qwen2.5-Max “has been pretrained on over 20 trillion tokens and further post-trained with curated Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) methodologies,” says Alibaba’s announcement. “Today, we are excited to share the performance results of Qwen2.5-Max and announce the availability of its API through Alibaba Cloud. We also invite you to explore Qwen2.5-Max on Qwen Chat!”
The announcement presents the performance results of Qwen2.5-Max alongside leading state-of-the-art models, including DeepSeek V3, GPT-4o, and Claude-3.5-Sonnet.
Chinese AI technology in rapid motion
DeepSeek’s AI models, V3 and the recently released R1, have shaken up the tech world and sent stock prices lower. DeepSeek’s low-cost strategy has led investors to question the high costs of AI development in the US.
As reported by Reuters, DeepSeek’s founder Liang Wenfeng said that his company focuses on AGI, or artificial general intelligence. Liang believes big tech companies might not be ideal for future AI due to their high costs and rigid structures. He thinks smaller, more flexible teams like DeepSeek can innovate better. “Large foundational models require continued innovation, tech giants’ capabilities have their limits,” he said.
DeepSeek’s achievements have pushed Chinese competitors like Alibaba to improve their AI. DeepSeek-V2, the predecessor of V3, started an AI price war in China by being both very cheap and open-source. This led Alibaba to slash prices on its models by up to 97%. Other companies, such as Baidu and Tencent, followed suit with their own price cuts.
Reuters also reports that TikTok owner ByteDance quickly updated its AI model after DeepSeek-R1’s release, claiming that it beat OpenAI’s o1 on a benchmark test known as AIME, which checks how well AI understands complex instructions. DeepSeek, for its part, has claimed that R1 matches OpenAI’s o1 on several benchmarks.
2 Comments
This is amazing. DeepSeek is pushing every company in the sector to do its best.
If DeepSeek really managed to do the same things with much less money, then it is a big deal, and other companies in the sector are forced to catch up. However, I think DeepSeek has vastly underreported the real costs. But even if the real cost reduction is not a factor of 1,000 but a mere factor of 10 (as I think is likely), it is still big.