Home Knowledge Base Nano Banana 2

Nano Banana 2

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

MMAE: A Massive Multitask Audio Editing Benchmark

arXiv:2606.07229v1 Announce Type: new Abstract: We introduce MMAE, a Massive Multitask Audio Editing benchmark, serving as the first comprehensive evaluation testbed designed for general-purpose instruction-based audio editing. Spurred by the shift toward intelligent creation, interactive editing has rapidly expanded from visual domains, pioneered by models like Nano-banana 2 for images and Gemini-Omni for video, into audio. However, the current evaluation infrastructure lags severely,...

arXiv CS 2d ago

GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration

arXiv:2605.31039v1 Announce Type: new Abstract: Real-world image restoration (IR) is bottlenecked by the scarcity of high-quality paired training data. Synthetic datasets are abundant but often fail to model real-world degradations, while real-world paired datasets are expensive and difficult to capture. As a result, IR models trained on these datasets show limited generalization in real-world scenarios.

arXiv CS 9d ago

GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration

Announce Type: replace Abstract: Real-world image restoration (IR) is bottlenecked by the scarcity of high-quality paired training data. Synthetic datasets are abundant but often fail to model real-world degradations, while real-world paired datasets are expensive and difficult to capture. As a result, IR models trained on these datasets show limited generalization in real-world scenarios.

arXiv CS 6d ago

TECCI: Tricky Edits of Collected and Curated Images

arXiv:2606.01213v1 Announce Type: new Abstract: Despite tremendous recent progress, current text-guided image editing methods still struggle with many aspects of editing involving instruction following, minimally editing the source image, and ensuring high visual quality. These problems are especially apparent when the requested edit is challenging, such as those that involve position, motion, viewpoint, scale and creative edits. To systematically test generative image editors, we propose a...

arXiv CS 8d ago

Gemini in Chrome expands further to Latin America and the Middle East

Gemini in Chrome expands further to Latin America and the Middle East The AI browser feature is now available in nearly every region around the world, with the rather large exception of Europe. Gemini in Chrome continues to roll out and has now landed in Latin America, the Middle East and Africa, Google announced. Like it or not, the AI browser feature is now available in nearly every region around the world, with the rather large exception of Europe.

Engadget 5h ago

Faithful, Enriched, and Precise: Benchmarking Natural-Science Illustration Generation by T2I models

Announce Type: replace Abstract: Scientific illustrations are essential tools for communicating research findings, especially in natural science, where they visualize complex concepts and processes. As Text-to-Image (T2I) models become increasingly capable, researchers have started to use them for scientific illustration generation. However, existing benchmarks often assess outputs at a holistic level, overlooking fine-grained elements, while scientific reasoning ability and output...

arXiv CS 2d ago

Faithful, Enriched, and Precise: Benchmarking Natural-Science Illustration Generation by T2I models

Announce Type: new Abstract: Scientific illustrations are essential tools for communicating research findings, especially in natural science, where they visualize complex concepts and processes. As Text-to-Image (T2I) models become increasingly capable, researchers have started to use them for scientific illustration generation. However, existing benchmarks often assess outputs at a holistic level, overlooking fine-grained elements, while scientific reasoning ability and output conciseness...

arXiv CS 5d ago