Lighting the Way for BRIGHT: Reproducible Baselines with Anserini, Pyserini, and RankLLM

arXiv CS, Tuesday 02 June 2026, 04:00 UTC. By Sahel Sharifymoghaddam, Yijun Ge, Jimmy Lin

Key Points

arXiv:2509.02558v2 (announce type: replace)

Abstract: Retrieval benchmarks for large language models (LLMs) should reflect the long, reasoning-intensive queries typical of retrieval-augmented generation (RAG). We present a systematic study of BRIGHT, a reasoning-focused retrieval benchmark, along with strong, reproducible reference methods integrated into Anserini, Pyserini, and RankLLM. We evaluate lexical, sparse, dense, and fusion-based retrievers, as well as LLM rerankers, under long-query settings. In reproducing BRIGHT's lexical baseline, we identify a key under-documented detail: query-side BM25 (BM25Q), which applies BM25 weighting to the query itself. On long, multi-sentence queries, BM25Q consistently outperforms standard BM25, making it the strongest lexical baseline for reasoning-oriented retrieval. We further audit the BRIGHT corpus, uncovering data quality issues that affect evaluation, and offer mitigations. Finally, we study the generalizability of BM25Q across five additional benchmarks, finding that its gains are largely specific to BRIGHT, while fusion with standard BM25 provides the most consistent improvements across datasets.
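The abstract describes BM25Q only as applying BM25 weighting to the query itself, so the following is a minimal sketch of one plausible reading, not the authors' implementation and not the Anserini or Pyserini API. It assumes that "standard" BM25 weights each query term by its raw query frequency, while the query-side variant passes that frequency through the same saturation and length normalization used for documents. The class `ToyBM25`, the whitespace tokenizer, and the parameter values `k1=0.9` and `b=0.4` are all illustrative choices.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase and split on whitespace.
    return text.lower().split()


class ToyBM25:
    """Tiny in-memory BM25 scorer, for illustration only."""

    def __init__(self, docs, k1=0.9, b=0.4):
        self.k1, self.b = k1, b
        self.docs = [Counter(tokenize(d)) for d in docs]
        self.doc_len = [sum(c.values()) for c in self.docs]
        self.avgdl = sum(self.doc_len) / len(self.docs)
        self.n = len(self.docs)
        # Document frequency: number of documents containing each term.
        self.df = Counter(t for c in self.docs for t in c)

    def idf(self, term):
        df = self.df.get(term, 0)
        return math.log(1 + (self.n - df + 0.5) / (df + 0.5))

    def tf_weight(self, tf, length):
        # BM25 term-frequency saturation with length normalization.
        norm = 1 - self.b + self.b * length / self.avgdl
        return tf * (self.k1 + 1) / (tf + self.k1 * norm)

    def score(self, query, doc_id, query_side=False):
        q = Counter(tokenize(query))
        qlen = sum(q.values())
        doc = self.docs[doc_id]
        total = 0.0
        for term, qtf in q.items():
            if term not in doc:
                continue
            d_w = self.idf(term) * self.tf_weight(doc[term], self.doc_len[doc_id])
            # Assumption: the query-side variant saturates the query term
            # frequency; the standard variant uses the raw count.
            q_w = self.tf_weight(qtf, qlen) if query_side else float(qtf)
            total += q_w * d_w
        return total


docs = [
    "sparse retrieval with bm25",
    "dense retrieval with transformers",
    "cooking pasta at home",
]
index = ToyBM25(docs)
query = "bm25 bm25 bm25 retrieval"
standard = index.score(query, 0)
query_side = index.score(query, 0, query_side=True)
```

Under these assumptions, a term repeated many times in a long query contributes with diminishing returns in the query-side variant instead of linearly, which is one way such weighting could matter for multi-sentence queries.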


Originally published by arXiv CS.

