Alibaba updates AI model Qwen, claims superior performance

Jan. 31, 2025.
About the Writer

Giulio Prisco

Giulio Prisco is Senior Editor at Mindplex. He is a science and technology writer mainly interested in fundamental science and space, cybernetics and AI, IT, VR, bio/nano, and crypto technologies.

Alibaba has updated its artificial intelligence (AI) model Qwen and claims that the new release, Qwen2.5-Max, outperforms DeepSeek-V3, Reuters reports.

The latest Alibaba technical report on Qwen is the “Qwen2 Technical Report,” published on arXiv in September 2024.

Qwen2.5-Max “has been pretrained on over 20 trillion tokens and further post-trained with curated Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) methodologies,” says Alibaba’s announcement. “Today, we are excited to share the performance results of Qwen2.5-Max and announce the availability of its API through Alibaba Cloud. We also invite you to explore Qwen2.5-Max on Qwen Chat!”

The announcement compares the performance of Qwen2.5-Max with that of leading models, including DeepSeek-V3, GPT-4o, and Claude-3.5-Sonnet.

Chinese AI technology in rapid motion

DeepSeek’s AI models, V3 and the recently released R1, have shaken up the tech world and caused stock prices to drop. In light of DeepSeek’s low-cost strategy, investors now question the high costs of AI development in the US.

As reported by Reuters, DeepSeek’s founder Liang Wenfeng said that his company focuses on AGI, or artificial general intelligence. Liang believes big tech companies might not be ideal for future AI due to their high costs and rigid structures. He thinks smaller, more flexible teams like DeepSeek can innovate better. “Large foundational models require continued innovation, tech giants’ capabilities have their limits,” he said.

DeepSeek’s achievements have pushed Chinese competitors like Alibaba to improve their AI. DeepSeek-V2, the predecessor of V3, started an AI price war in China by being both open-source and very cheap. This led Alibaba to slash prices on its models by up to 97%. Other companies such as Baidu and Tencent followed suit and cut their prices too.

Reuters also reports that TikTok owner ByteDance updated its AI model shortly after DeepSeek-R1’s release, claiming that it beat OpenAI’s o1 on AIME, a benchmark built from problems of the American Invitational Mathematics Examination that tests mathematical reasoning. DeepSeek has likewise claimed that R1 matches OpenAI’s o1 on several benchmarks.

2 Comments

  1. This is amazing. DeepSeek pushes every company in the sector to do its best.

    1. If DeepSeek really managed to do the same things with much less money, then it is a big deal, and other companies in the sector are forced to catch up. However, I think DeepSeek has vastly underreported the real costs. But even if the real cost reduction is not a factor of 1,000 but a mere factor of 10 (as I think is likely), it is still big.
