Home Knowledge Base FID

FID

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Conditional Collapse in Sign Language Production: A Diagnostic and a Scaling Argument

arXiv:2606.01643v1 Announce Type: new Abstract: Sign Language Production (SLP) is the task of generating avatar sign language motion from natural language text. The quality of the generated motion is typically evaluated by a motion-space Fr\'echet distance (FID) and back-translation (BT) BLEU score on benchmarks such as How2Sign.

arXiv CS 8d ago

Can We Predict The Human Preference For Text-to-Image Content Prior To Generation And Is It Even Useful To Do So?

arXiv:2606.05478v1 Announce Type: new Abstract: Diffusion Models (DM) have revolutionized text-driven generation by enabling the synthesis of high-quality, photorealistic visual content from user prompts. Whereas prior advances in visual generation such as VAEs and GANs were primarily evaluated on perceptual or visual similarity metrics such as FID PSNR, DM advances have fostered the development of more advanced Human Preference Metrics (HPM) that model and quantify human judgment as scalar...

arXiv CS 5d ago

Diffusion Image Generation with Explicit Modeling of Data Manifold Geometry

Announce Type: replace Abstract: Image generative models aim to sample data points from the underlying data manifold, a task that requires learning and decoding a dense, low-dimensional, and compact parameterization space. To achieve this, we propose the Data Manifold-aware Image diffusioN moDel (MIND), a novel framework that explicitly models manifold geometry by integrating discrete patch tokenization into the score function of a continuous diffusion model. This approach successfully...

arXiv CS 1d ago

Israeli director Nadav Lapid, angered by boycott calls, pulls out of Marseille festival

The Israeli filmmaker, a critic of Benjamin Netanyahu exiled in France for five years, wonders 'what the hell I'm supposed to do' here if 'my presence is unacceptable and I can simply be erased or swept aside from a film event'. No one is a prophet in their own land, as the saying goes, but sometimes host countries are hardly any more receptive...

Euronews 1d ago

Learning to Solve Generative ODEs Beyond the Linear Span

Announce Type: new Abstract: Diffusion and flow generative models sample by integrating a learned ODE, but high quality still requires many sequential model evaluations. Solver learning reduces this cost by adapting scalar coefficients, timesteps, or both, while keeping the backbone model fixed. In this work, we identify a structural bottleneck in this update family: each step remains span-limited.

arXiv CS 1d ago

Complexity-Balanced Diffusion Splitting

new Abstract: Standard continuous-time generative models rely on monolithic architectures that must navigate vastly different signal regimes, from isotropic noise to intricate data distributions. While scaling model capacity improves performance, deploying a massive network uniformly across the entire generative timeline is inherently inefficient. In this work, we propose Complexity-Balanced Splitting (CBS), a principled framework for temporal capacity allocation that distributes the...

arXiv CS 5d ago

CoMPAS3D: A Dataset and Benchmark for Interactive Motion

arXiv:2507.19684v2 Announce Type: replace Abstract: Socially interactive humanoid robots must engage with humans through their bodies, adapting in real time to a partner's movement, intent, and abilities. This requires models that understand not just how bodies move, but what movement means in a shared social context. Yet evaluation frameworks for interactive motion generation do not measure whether generated follower motion is legible within a shared movement vocabulary, nor whether it is...

arXiv CS 7d ago

FlowC2S: Flowing from Current to Succeeding Frames for Fast and Memory-Efficient Video Continuation

arXiv:2604.17625v2 Announce Type: replace Abstract: This paper introduces a novel methodology for generating fast and memory-efficient video continuations. Our method, dubbed FlowC2S, fine-tunes a pre-trained text-to-video flow model to learn a vector field between the current and succeeding video chunks. Two design choices are key.

arXiv CS 8d ago

Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching

Announce Type: replace Abstract: Flow matching has recently emerged as a promising alternative to diffusion-based generative models, particularly for text-to-image generation. Despite its flexibility in allowing arbitrary source distributions, most existing approaches rely on a standard Gaussian distribution, a choice inherited from diffusion models, and rarely consider the source distribution itself as an optimization target in such settings.

arXiv CS 8d ago

Fixed-Point Masked Generative Modeling

new Abstract: Masked Generative Models (MGMs) enable parallel decoding and achieve strong performance across modalities, but require full-sequence bidirectional transformers at every step, making training costly and degrading quality under low sampling budgets. Existing work improves efficiency via better samplers or cheaper fixed-depth denoisers, but they still allocate a fixed amount of denoiser computation to each refinement step. We introduce Fixed-Point Masked Generative Models...

arXiv CS 9d ago