Qwen Open-Sources Advanced ASR And Forced Alignment Models With Multi-Language Capabilities

January 29, 2026

Alibaba Cloud announced that it has made its Qwen3-ASR and Qwen3-ForcedAligner AI models open-source, providing advanced tools for speech recognition and forced alignment.

The Qwen3-ASR family includes two all-in-one models, Qwen3-ASR-1.7B and Qwen3-ASR-0.6B, which support language identification and transcription across 52 languages and accents, leveraging large-scale speech data and the Qwen3-Omni foundation model.

Internal testing indicates that the 1.7B model delivers state-of-the-art accuracy among open-source ASR systems, while the 0.6B model balances performance and efficiency, capable of transcribing 2,000 seconds of speech per second of processing time at high concurrency.
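To put the throughput figure in perspective, the short Python sketch below converts "2,000 seconds of speech per second" into a real-time factor and an estimated processing time. The function names are illustrative only and are not part of the Qwen3-ASR framework, and the calculation assumes the quoted rate is sustained aggregate throughput, which the announcement does not spell out.

```python
def real_time_factor(audio_seconds: float, wall_seconds: float) -> float:
    """Wall-clock time spent per second of audio (lower is faster)."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return wall_seconds / audio_seconds


def wall_time_for(audio_seconds: float, speedup: float) -> float:
    """Estimated wall-clock seconds to transcribe audio at a given speedup.

    `speedup` is seconds of audio processed per wall-clock second,
    e.g. 2000 for the figure quoted in the article.
    """
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return audio_seconds / speedup


if __name__ == "__main__":
    # Figure from the article: 2,000 seconds of speech per second.
    rtf = real_time_factor(audio_seconds=2000, wall_seconds=1)
    print(f"real-time factor: {rtf}")

    # Assumption: the rate holds for longer inputs, such as one hour of audio.
    print(f"one hour of audio: {wall_time_for(3600, 2000)} s")
```

Under that assumption, one hour of audio would take just under two seconds of aggregate processing time.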

The Qwen3-ForcedAligner-0.6B model uses a non-autoregressive LLM approach to align text and speech in 11 languages, outperforming leading forced-alignment solutions in both speed and accuracy.

Alibaba Cloud has also released a comprehensive inference framework under the Apache 2.0 license, supporting streaming, batch processing, timestamp prediction, and fine-tuning, aimed at accelerating research and practical applications in audio understanding.

Qwen3-ASR and Qwen3-ForcedAligner are now open source: production-ready speech models designed for messy, real-world audio, with competitive performance and strong robustness.
● 52 languages & dialects with auto language ID (30 languages + 22 dialects/accents)
● Robust in… pic.twitter.com/q7RWjJFXgH

— Qwen (@Alibaba_Qwen) January 29, 2026

Qwen3-ASR And Qwen3-ForcedAligner Models Demonstrate Leading Accuracy And Efficiency

Alibaba Cloud has released performance results for its Qwen3-ASR and Qwen3-ForcedAligner models, demonstrating leading accuracy and efficiency across diverse speech recognition tasks.

The Qwen3-ASR-1.7B model achieves state-of-the-art results among open-source systems, outperforming commercial APIs and other open-source models in English, multilingual, and Chinese dialect recognition, including Cantonese and 22 regional variants.

It maintains reliable accuracy in challenging acoustic conditions, such as low signal-to-noise environments, child or elderly speech, and even singing voice transcription, achieving average word error rates of 13.91% in Chinese and 14.60% in English with background music.
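For readers unfamiliar with the metric, word error rate is conventionally the word-level edit distance between a reference transcript and the system output, divided by the number of reference words. The sketch below implements that textbook definition in Python. It is not Alibaba Cloud's evaluation code, the announcement does not describe its scoring protocol or text normalization, and the example sentences are invented for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Textbook WER: (substitutions + deletions + insertions) / reference words.

    Tokenization here is plain whitespace splitting, which is an assumption;
    real evaluations usually normalize case and punctuation first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the processed reference
    # prefix and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Invented example: one word dropped out of six reference words.
    wer = word_error_rate("the cat sat on the mat", "the cat sat on mat")
    print(f"WER: {wer:.2%}")
```

On this definition, a 14.60% rate corresponds to roughly one word-level error for every seven reference words.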

The smaller Qwen3-ASR-0.6B balances accuracy and efficiency, delivering high throughput and low latency under high concurrency, capable of transcribing up to 5 hours of speech in online asynchronous mode at a concurrency of 128.

Meanwhile, the Qwen3-ForcedAligner-0.6B outperforms leading end-to-end forced alignment models including Nemo-Forced-Aligner, WhisperX, and Monotonic-Aligner, offering superior language coverage, timestamp accuracy, and support for varied speech and audio lengths.

The post Qwen Open-Sources Advanced ASR And Forced Alignment Models With Multi-Language Capabilities appeared first on Metaverse Post.


