Aligning Dense Retrievers with LLM Utility via Distillation

arXiv CS, Monday 01 June 2026, 04:00 UTC. By Rajinder Sandhu, Di Mu, Cheng Chang, Md Shahriar Tasjid, Himanshu Rai, Maksims Volkovs, Ga Wu

Key Points

arXiv:2604.22722v2. Abstract: Dense vector retrieval is the practical backbone of Retrieval-Augmented Generation (RAG), but similarity search can suffer from precision limitations. Conversely, utility-based approaches that leverage LLM re-ranking often achieve superior performance but are computationally prohibitive and prone to the noise inherent in perplexity estimation. We propose Utility-Aligned Embeddings (UAE), a framework designed to merge these advantages into a practical, high-performance retrieval method. We formulate retrieval as a distribution matching problem, training a bi-encoder to imitate a utility distribution derived from perplexity reduction using a Utility-Modulated InfoNCE objective. This approach injects graded utility signals directly into the embedding space without requiring test-time LLM inference. On the QASPER benchmark, UAE improves retrieval Recall@1 by 30.59%, MAP by 30.16%, and Token F1 by 17.3% over the strong semantic baseline BGE-Base. Crucially, UAE is over 180x faster than efficient LLM re-ranking methods while preserving competitive performance, demonstrating that aligning retrieval with generative utility yields reliable contexts at scale.
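The abstract does not spell out the Utility-Modulated InfoNCE objective, so the following is only a minimal sketch of the general idea it describes: turn per-passage utilities into a target distribution and train the retriever's similarity softmax to match it. The function names, the two temperatures, and the definition of utility as a difference of perplexities are all assumptions made for illustration, not the authors' formulation.

```python
import math


def log_softmax(scores, temperature=1.0):
    """Numerically stable log-softmax over a list of floats."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)
    log_total = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return [s - log_total for s in scaled]


def perplexity_reduction(nll_without, nll_with):
    """Hypothetical utility: drop in answer perplexity once a passage is added.

    Both arguments are mean per-token negative log-likelihoods of the
    reference answer, scored by an LLM at training time only.
    """
    return math.exp(nll_without) - math.exp(nll_with)


def utility_matching_loss(similarities, utilities,
                          tau_model=0.05, tau_utility=1.0):
    """Cross-entropy between a utility-derived target and the retriever softmax.

    similarities: query-passage scores from the bi-encoder (e.g. cosine).
    utilities:    graded utility of each passage for the same query.
    With a one-hot target this reduces to standard InfoNCE; a soft target
    keeps the graded signal. The temperatures are assumed hyperparameters.
    """
    target = [math.exp(lp) for lp in log_softmax(utilities, tau_utility)]
    log_pred = log_softmax(similarities, tau_model)
    return -sum(t * lp for t, lp in zip(target, log_pred))


# Toy example: three candidate passages for one query.
utilities = [3.0, 1.0, 0.0]
aligned = utility_matching_loss([0.9, 0.5, 0.1], utilities)
misaligned = utility_matching_loss([0.1, 0.5, 0.9], utilities)
```

In this toy setup the loss is lower when the similarity ranking agrees with the utility ranking, which is the behavior a distribution-matching objective is meant to reward. At inference time only the bi-encoder is used, consistent with the abstract's claim that no test-time LLM inference is required.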


Originally published by arXiv CS
