Seeing Through the MiRAGE: Evaluating Multimodal Retrieval Augmented Generation

arXiv CS Tuesday 02 June 2026, 04:00 UTC By Alexander Martin, William Walden, Reno Kriz, Dengjia Zhang, Kate Sanders, Eugene Yang, Chihsheng Jin, Benjamin Van Durme

Key Points

arXiv:2510.24870v2. Abstract: We introduce MiRAGE, an evaluation framework for retrieval-augmented generation (RAG) from multimodal sources. As audiovisual media becomes a prevalent source of information online, it is essential for RAG systems to integrate information from these sources into generation. However, existing evaluations for RAG are text-centric, limiting their applicability to multimodal settings. MiRAGE is a claim-centric approach to multimodal RAG evaluation, consisting of InfoF1, which assesses factuality and information coverage, and CiteF1, which assesses citation support and completeness. We show that, when applied by humans, MiRAGE strongly aligns with extrinsic judgments of output quality. We additionally introduce an automatic implementation of MiRAGE as well as multimodal variants of three prominent text-based RAG metrics -- ALCE, ARGUE, and RAGAS -- demonstrating the limitations of text-centric work and laying the groundwork for automatic evaluation. We release open-source implementations and outline evaluation methods for multimodal RAG.
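
The abstract names InfoF1 and CiteF1 but does not give their formulas. As a rough illustration only, the Python sketch below assumes a claim-level F1 in which precision is the share of generated claims judged supported (the factuality side) and recall is the share of reference claims judged covered by the output (the coverage side). The names `ClaimJudgment`, `f1`, and `claim_f1`, and the binary judgments themselves, are hypothetical and are not taken from the MiRAGE release; the paper's actual definitions may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClaimJudgment:
    """One claim with a hypothetical binary judgment (human or automatic)."""
    text: str
    supported: bool


def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def claim_f1(generated: list[ClaimJudgment], reference: list[ClaimJudgment]) -> float:
    """Assumed claim-level F1 (not the paper's confirmed definition).

    precision: fraction of generated claims judged supported by the sources.
    recall: fraction of reference claims judged covered by the generated output.
    """
    precision = sum(c.supported for c in generated) / len(generated) if generated else 0.0
    recall = sum(c.supported for c in reference) / len(reference) if reference else 0.0
    return f1(precision, recall)


if __name__ == "__main__":
    # Made-up judgments purely for illustration.
    generated = [ClaimJudgment(f"generated claim {i}", i < 3) for i in range(4)]
    reference = [ClaimJudgment(f"reference claim {i}", i < 3) for i in range(5)]
    print(round(claim_f1(generated, reference), 3))
```

Under the same assumption, a citation-oriented score could reuse `f1` with precision taken over cited sources that actually support their claim and recall over claims that have at least one supporting citation.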

Originally published by arXiv CS.
