Home Knowledge Base VoiceBank

VoiceBank

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

SB-RF: Schr\"odinger Bridge Rectified Flow for One-Step Robust Speech Enhancement

Announce Type: new Abstract: Generative models have shown impressive results in speech enhancement but often suffer from multi-step inference. We propose SB-RF, a one-step generative framework integrating Rectified Flow (RF) with Schr\"odinger Bridge (SB) theory. SB-RF constructs a conditional bridge between clean and noisy speech distributions via entropy-regularized optimal transport.

arXiv CS 5d ago

G-MaP-SE: Guided Speech Enhancement via GMM-Based Prior Matching

arXiv:2606.08580v1 Announce Type: cross Abstract: Using speaker embeddings as conditioning can strengthen speech enhancement, but most methods either require clean enrollment audio or rely on embeddings extracted from noisy speech, which are fragile under noise and domain shift. We propose G-MaP-SE, a guided enhancement framework that builds a clean-speech embedding prior with a Gaussian Mixture Model (GMM) and refines a noisy conditioning embedding by matching it to this prior. The matched...

arXiv CS 1d ago

Speech Enhancement Based on Drifting Models

arXiv:2604.24199v4 Announce Type: replace Abstract: We propose Speech Enhancement based on Drifting Models (DriftSE), a novel generative framework that formulates denoising as an equilibrium problem. Rather than relying on iterative sampling, DriftSE natively achieves one-step inference by evolving the pushforward distribution of a mapping function to directly match the clean speech distribution. This evolution is driven by a Drifting Field, a learned correction vector that guides samples...

arXiv CS 1d ago