HybridCodec: Fast Dual-Stream, Semantically Enhanced Neural Audio Codec

arXiv CS Monday 08 June 2026, 04:00 UTC By Arjun Gangwar, S Umesh 1 min read

Key Points

arXiv:2606.06743v1 Announce Type: new Abstract: The popularity of neural audio codecs as speech tokenizers has surged with the advent of Multimodal Large Language Models. New codec architectures with semantic and acoustic disentanglement have emerged. There are two main approaches to introduce semantic information into codec models: one distills semantic information from SSL representations into the first RVQ layer, while the other maintains separate streams for semantic and acoustic features. We propose HybridCodec, a unified architecture that combines both paradigms. It employs separate semantic and acoustic branches while distilling SSL representations into the semantic stream. This design ensures strong disentanglement without requiring an SSL model during inference. HybridCodec shows superior semantic specialization (RVQ-1) on in-domain test set and competitive reconstruction (RVQ-all). We demonstrate its robustness in out-of-domain and zero-shot cross-lingual settings, achieving a 3x speedup over existing dual-stream models.

Semantically Enhanced Neural Audio Codec (ORG) HybridCodec (ORG) SSL (ORG) RVQ (ORG)

Originally published by arXiv CS Read original →

Starlink rival Qianfan hits satellite milestone, but is it too slow and costly? Constellation now has 201 satellites in orbit but the company is said to be under pressure to ramp up launches The constellation now has 201 satellites after a successful launch on board a Zhuque-2E rocket from the Gobi Desert at 4.23pm Beijing time on Tuesday. The mission delivered Qianfan DTC-01 – a direct-to-cell test satellite – alongside a satellite from China Mobile, state broadcaster CCTV reported.

South China Morning Post 59m ago

Violent Anti-Immigration Protests Erupt Across Northern Ireland

Here Are the Best Ways to Clean Stains and Save Your Money 04:47 Serena Williams Wins After 4 Years Away From Competition 00:25 Pope Leo XIV to Hold Mass at Spain’s Iconic Basilica 02:34 Now Playing Violent Anti-Immigration Protests Erupt Across Northern Ireland 00:26 UP NEXT Who Are the Nuns Praying for the San Antonio Spurs at Games? 01:12

NBC News 1h ago

Wall Street Braces for SpaceX With Stress Test, ‘Watch Parties’

Wall Street Braces for SpaceX With Stress Test, ‘Watch Parties’ Wall Street has spent months debating how much SpaceX is worth. Behind the scenes, a different challenge has occupied the institutions responsible for bringing it public: preparing the plumbing systems needed to support what could become the largest IPO in history. S&P Global Inc.’s Equity Bookbuild group, which helps underwriters capture and allocate investor demand during initial public offerings, has spent weeks expanding the...

Bloomberg Markets 1h ago

NASA names crew for Artemis III lunar lander rehearsal

NASA has named the four astronauts set to fly the Artemis III mission in an announcement that raised as many questions as it answered. The quartet is comprised of a Space Shuttle veteran, Randy Bresnik, as commander, and the European Space Agency's Luca Parmitano, whose helmet filled with water during an International Space Station (ISS) spacewalk. NASA astronauts Frank Rubio and Andre Douglas will serve as mission specialists.

The Register 2h ago

HybridCodec: Fast Dual-Stream, Semantically Enhanced Neural Audio Codec

Related Stories

Starlink rival Qianfan hits satellite milestone, but is it too slow and costly?

Violent Anti-Immigration Protests Erupt Across Northern Ireland

Wall Street Braces for SpaceX With Stress Test, ‘Watch Parties’

NASA names crew for Artemis III lunar lander rehearsal