Incremental BPE Tokenization

arXiv CS Monday 01 June 2026, 04:00 UTC By Shenghu Jiang, Ruihao Gong 1 min read

Key Points

arXiv:2605.30813v1

Abstract: We propose a novel algorithm for incremental Byte Pair Encoding (BPE) tokenization. The algorithm processes each input byte in worst-case $\mathcal{O}(\log^2 t)$ time, leading to an overall complexity of $\mathcal{O}(n \log^2 t)$, where $n$ is the input length and $t$ is the maximum token length. The algorithm incrementally maintains BPE tokenization results for every prefix of the input text, implementing the standard BPE merge procedure defined by a fixed set of merge rules. This enables efficient partial tokenization in streaming settings. Functioning as a drop-in replacement for standard BPE, our approach achieves a speedup of up to ${\sim}3\times$ over Hugging Face's tokenizers, and demonstrates significant latency reductions over OpenAI's tiktoken on pathological inputs. We further introduce an eager output algorithm that enables streaming output, emitting tokens as soon as token boundaries are determined during incremental tokenization. Overall, our results demonstrate that BPE tokenization can be performed incrementally with strong worst-case guarantees, while providing practical latency benefits in modern large language model pipelines.

Code: https://github.com/ModelTC/mtc-inc-bpe
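To make the problem concrete, here is a minimal Python sketch of the standard BPE merge procedure that the abstract refers to, together with a naive baseline that retokenizes every prefix from scratch. This is not the paper's algorithm and does not reproduce its complexity bound. The names `bpe_encode`, `naive_prefix_tokenizations`, and `toy_ranks` are hypothetical, and the merge table is a made-up toy example in which a lower rank means a higher merge priority.

```python
from typing import Dict, List, Tuple

Pair = Tuple[bytes, bytes]

# Hypothetical toy merge table: lower rank = applied earlier.
toy_ranks: Dict[Pair, int] = {
    (b"b", b"c"): 0,
    (b"a", b"b"): 1,
}


def bpe_encode(data: bytes, ranks: Dict[Pair, int]) -> List[bytes]:
    """Reference BPE: repeatedly merge the lowest-rank adjacent pair,
    taking the leftmost occurrence on ties, until no rule applies."""
    tokens = [bytes([b]) for b in data]
    while len(tokens) > 1:
        best_rank, best_i = None, -1
        for i in range(len(tokens) - 1):
            r = ranks.get((tokens[i], tokens[i + 1]))
            if r is not None and (best_rank is None or r < best_rank):
                best_rank, best_i = r, i
        if best_rank is None:
            break  # no merge rule matches any adjacent pair
        tokens[best_i:best_i + 2] = [tokens[best_i] + tokens[best_i + 1]]
    return tokens


def naive_prefix_tokenizations(
    data: bytes, ranks: Dict[Pair, int]
) -> List[List[bytes]]:
    """Naive baseline (an assumption for illustration): tokenize each
    prefix independently. An incremental algorithm aims to maintain the
    same results without redoing this work for every new byte."""
    return [bpe_encode(data[:k], ranks) for k in range(1, len(data) + 1)]
```

With this toy table, `bpe_encode(b"ab", toy_ranks)` gives `[b"ab"]`, while `bpe_encode(b"abc", toy_ranks)` gives `[b"a", b"bc"]`. Appending one byte changes an earlier token boundary, which is why maintaining the tokenization of every prefix takes more than appending to the previous result.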


Originally published by arXiv CS
