
A Pocket Offline Model for Simultaneous Speech Translation as CUNI Submission to IWSLT 2026


arXiv:2606.03948v1. Abstract: We add simultaneous translation capability to the offline direct speech-to-text translation model Canary using the state-of-the-art AlignAtt policy, and submit the system to the IWSLT 2026 Simultaneous Speech Translation Shared Task for Czech to English and for English to German and Italian. The strengths of our system are: (1) high translation quality, outperforming similarly sized baselines in both low- and high-latency regimes in computationally unaware simulations; (2) low computational requirements, as the model has only 1B parameters; (3) multilinguality, with support for 25 source and 25 target languages.
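
For readers unfamiliar with AlignAtt, the following is a minimal sketch of the emission rule as the policy is commonly described: a hypothesis token is emitted only if the audio frame it attends to most is not among the most recent frames, and emission stops at the first token that fails this check. This is not the submission's code or any Canary API. The function name, the `guard_frames` parameter, and the assumption that cross-attention has already been averaged into one weight per token and frame are all illustrative.

```python
from typing import Sequence


def alignatt_emit_count(
    attention: Sequence[Sequence[float]], guard_frames: int
) -> int:
    """Return how many leading hypothesis tokens may be emitted.

    attention[i][j] is the (hypothetical, pre-averaged) cross-attention
    weight of hypothesis token i on audio frame j.
    guard_frames is the number of most recent frames treated as unstable.
    """
    if not attention or not attention[0]:
        return 0

    num_frames = len(attention[0])
    # Frames with index >= threshold count as "too recent".
    threshold = num_frames - guard_frames

    emitted = 0
    for row in attention:
        # Index of the audio frame this token attends to most.
        most_attended = max(range(num_frames), key=lambda j: row[j])
        if most_attended >= threshold:
            # Token depends on very recent audio: stop and wait for more input.
            break
        emitted += 1
    return emitted
```

In this sketch, a larger `guard_frames` makes the policy more conservative, so fewer tokens are emitted per audio chunk and latency rises, while a smaller value emits sooner at the risk of committing to unstable output.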
Originally published by arXiv CS.