๐Ÿ‡ญ๐Ÿ‡ฐFreshcollected in 16m

Alibaba enters embodied AI race with Qwen Robot Suite

Alibaba enters embodied AI race with Qwen Robot Suite
PostLinkedIn
๐Ÿ‡ญ๐Ÿ‡ฐRead original on SCMP Technology

๐Ÿ’กAlibaba's first move into embodied AI: See how their Qwen models are being adapted for physical robot control.

โšก 30-Second TL;DR

What Changed

Alibaba launched Qwen Robot Suite to enable robots to perceive and reason in physical spaces.

Why It Matters

This launch positions Alibaba as a major contender in the robotics AI space, potentially accelerating the integration of LLMs into industrial and service robots. It signals a broader trend of big tech companies moving beyond digital chatbots into physical automation.

What To Do Next

Monitor the Tongyi Lab GitHub repository for upcoming documentation or API releases related to the Qwen Robot Suite for potential integration into your robotics projects.

Who should care:Developers & AI Engineers

๐Ÿง  Deep Insight

Web-grounded analysis with 12 cited sources.

๐Ÿ”‘ Enhanced Key Takeaways

  • โ€ขThe Qwen Robot Suite consists of three distinct core models: Qwen-RobotManip for generalizable vision-language-action, Qwen-RobotNav for scalable vision-language navigation, and Qwen-RobotWorld, a video world model for embodied intelligence.
  • โ€ขAlibaba's strategy with the Qwen Robot Suite is to provide an open AI model layer that can be adopted by various hardware partners across different robot form factors, rather than developing its own proprietary robot hardware.
  • โ€ขThe Qwen-RobotManip model, a component of the suite, was trained on over 38,000 hours of open-source data and achieved top performance in the generalist track of the RoboChallenge real-robot benchmark.
  • โ€ขAlibaba anticipates that AI-related product revenue will become the primary growth driver for its cloud segment, signaling a significant strategic shift towards monetizing its AI advancements.
  • โ€ขThe suite is designed to enable robots to adapt to diverse and unfamiliar environments, perform real-world tasks, and execute instructions given in natural language.

๐Ÿ› ๏ธ Technical Deep Dive

  • Qwen Robot Suite Components: The suite comprises three core models: Qwen-RobotManip, Qwen-RobotNav, and Qwen-RobotWorld.
  • Qwen-RobotManip: This is a generalizable vision-language-action (VLA) model built on the Qwen3.5-4B architecture. It was trained on over 38,000 hours of open-source data to handle objects and topped the generalist track of the RoboChallenge real-robot benchmark.
  • Qwen-RobotNav: A vision-language navigation model designed to help machines understand and move through physical spaces. It integrates vision-language capabilities into motion control, unifying instruction following, goal navigation, object tracking, and autonomous driving tasks.
  • Qwen-RobotWorld: A video world model that integrates vision-language capabilities into world dynamics prediction, allowing a single model to forecast physically plausible futures across manipulation, driving, and navigation scenarios.
  • Qwen-VLA (Vision-Language-Action): A general-purpose model built upon the Qwen multimodal backbone, extending visual perception, language understanding, and spatial reasoning into continuous action generation and trajectory prediction.
  • Unified Architecture: Qwen-VLA unifies robotic manipulation, vision-language navigation, and cross-embodiment control, aiming for a single generalist policy model.
  • Training Data: Qwen-VLA's training involves joint pretraining on real robot data, human egocentric data, synthetic simulation data, and general vision-language data.
  • Action Decoder: Qwen-VLA utilizes a 1.15 billion parameter diffusion transformer action decoder for generating continuous actions.
  • Embodiment-Aware Prompts: The models use embodiment-aware prompts, allowing the same weights to control different robot configurations (e.g., single arm, bimanual setup, navigation robot) by adapting behavior through prompt conditioning.
  • Training Pipeline: The Qwen-VLA training recipe includes stages such as Text-to-Action (T2A) pre-training, Continual Pre-training (CPT), Supervised Fine-Tuning (SFT), and Reinforcement Learning (RL).
  • Qwen-RobotClaw: Alibaba also revealed an internal robotic agent framework, Qwen-RobotClaw, which enables Qwen VLM agents to invoke the Qwen-Robot Suite models as tools for physical world interaction and managing long-horizon tasks.

๐Ÿ”ฎ Future ImplicationsAI analysis grounded in cited sources

Alibaba will significantly expand its market share in industrial automation and logistics.
The Qwen Robot Suite's focus on real-world tasks and its pilot testing with Alibaba Cloud enterprise customers in the robotics sector suggest a direct application and competitive advantage in its strong e-commerce and logistics operations.
The open platform approach of the Qwen Robot Suite will foster a broader ecosystem of hardware partners adopting Alibaba's embodied AI.
By emphasizing the AI model layer and positioning Qwen-VLA as an open platform for various robot form factors, Alibaba encourages wider adoption and integration by hardware manufacturers.
AI-related product revenue will become a dominant driver for Alibaba's cloud segment in the near future.
Alibaba's CEO, Eddie Wu, has explicitly stated this expectation, indicating a strong internal strategic focus on monetizing AI offerings through its cloud services.

โณ Timeline

2014
Alibaba Group founded the Institute of Data Science & Technologies (iDST), initiating R&D in core AI technologies.
2017-10
DAMO Academy, Alibaba's global research institution, was officially established.
2022
Tongyi Lab was officially established, and the Qwen project was launched.
2023-04
The Tongyi Qwen large language model series was officially released.
2023-08
Alibaba published its first open-source model, the 7-billion-parameter LLM Qwen-7B.
2026-02
Alibaba's Damo Academy introduced RynnBrain, an open-source foundation model for robotics.

๐Ÿ“Ž Sources (12)

Factual claims are grounded in the sources below. Forward-looking analysis is AI-generated interpretation.

  1. morningstar.com
  2. scmp.com
  3. pandaily.com
  4. moomoo.com
  5. technologymagazine.com
  6. qwen.ai
  7. youtube.com
  8. cleverence.com
  9. aliyun.com
  10. youtube.com
  11. alibabacloud.com
  12. businesstimes.com.sg
๐Ÿ“ฐ

Weekly AI Recap

Read this week's curated digest of top AI events โ†’

๐Ÿ‘‰Related Updates

AI-curated news aggregator. All content rights belong to original publishers.
Original source: SCMP Technology โ†—