Home Knowledge Base EVidence

EVidence

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Attend to Evidence: Evidence-Anchored Spatial Attention Supervision for Multimodal RLVR

Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) improves vision-language models (VLMs) by optimizing outcome rewards derived from final answers. However, such outcome-only rewards do not tell the model which image regions justify an answer. For questions that require visual grounding, these rewards cannot distinguish responses supported by relevant visual evidence from those produced by language-prior shortcuts or lucky guesses.

arXiv CS 9d ago

Temporal Evidence Routing with Structured Visual Evidence for TimeLogicQA

Announce Type: new Abstract: TimeLogicQA evaluates whether video question answering systems can reason over temporal relations such as event existence, ordering, persistence, boundary conditions, and overlap. We address this task with a visual evidence routing pipeline that separates perception from symbolic temporal reasoning. The system first parses each question into event targets, answer mode, candidate options, and temporal operators.

arXiv CS 8d ago

The CIFAR Synthetic Evidence Corpus for Detecting AI-Generated Evidence

arXiv:2606.07916v1 Announce Type: new Abstract: The growing ability of generative models to produce realistic documents poses a direct challenge to evidentiary workflows in the justice system and the courts, where decisions increasingly depend on the authenticity of evidence such as receipts, communications, and administrative records. Unlike social media or academic settings, evidentiary documents are often only subtly altered, with small, localized edits that preserve overall plausibility...

arXiv CS 1d ago

A Four-Condition Diagnostic Protocol for Evidence Utilization in Long-Context and Retrieval-Augmented Language Models

Announce Type: new Abstract: Final-answer accuracy, retrieval recall, and citation overlap do not by themselves identify whether a long-context or retrieval-augmented language model used the evidence it was given. A model can answer from parametric memory, fail despite receiving the right passages, or cite evidence without converting it into the requested answer. This paper proposes a matched four-condition evidence-availability protocol--no evidence, full context, retrieved evidence, and...

arXiv CS 2d ago

EMBER: Efficient Memory via Budgeted Evidence Retention for Long-Horizon Agents

Announce Type: new Abstract: Long-horizon agents can archive large histories, but future answers still incur retrieval, rereading, and context costs. When retained memory misses answer-relevant evidence, the system must return to larger portions of the raw history. We study budgeted evidence survival: before the query is known, which source evidence should be retained so that it remains recoverable and usable under a fixed retained source-evidence token budget?

arXiv CS 5d ago

Quantum-Inspired Trace-Augmented Evidence Selection for Reasoning over Structured Hypothesis Spaces

arXiv:2606.06941v1 Announce Type: new Abstract: Large language models (LLMs) now solve a wide range of expert-level exams at or above human level, yet remain brittle on specialised, evidence-intensive domains such as law. On these tasks, errors arise not only from gaps in world knowledge but also from subtle distinctions between pieces of evidence and inconsistent use of supporting evidence. The most common aggregator over sampled chain-of-thought (CoT) traces, majority vote, returns the...

arXiv CS 2d ago

HKVM-RAG: Key-Value-Separated Hypergraph Evidence Organization for Multi-Hop RAG

arXiv:2606.07218v1 Announce Type: new Abstract: Multi-hop RAG poses a data-engineering problem beyond passage matching: under fixed retrieval budgets, a system must organize retrieved text into evidence units that expose answer chains. Dense retrievers score passages independently, while graph-based memories make associations explicit but often rely on pairwise or entity-centered keys that fragment multi-hop evidence. We present HKVM-RAG, a key-value-separated evidence-organization layer.

arXiv CS 2d ago

James Webb Space Telescope finds evidence the mysterious 'little red dots' are black hole stars

James Webb Space Telescope finds evidence the mysterious 'little red dots' are black hole stars "None of the previous little red dots have all of the pieces of evidence in the same place." Astronomers using the James Webb Space Telescope may be close to solving the mystery of "little red dots" in the early universe. The team has studied one of these strange objects, designated GLIMPSE-17775, finding evidence it is a black hole star — a ravenously feeding, growing supermassive black hole...

Space.com 2h ago

EASE-TTT: Evidence-Aligned Selective Test-Time Training for Long-Context Question Answering

arXiv:2606.06906v1 Announce Type: new Abstract: Long-context question answering (QA) remains challenging for smaller language models even when answer-bearing evidence is already present in the input. Existing within-context retrieval methods localize and expose candidate evidence chunks for the question, but they stop at input-level evidence exposure rather than adapting the query-side attention parameters that control how the model allocates attention over full-context positions. In...

arXiv CS 2d ago

When Knowledge Is Not Free: Cost-Aware Evidence Selection in Retrieval-Augmented Generation

Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) typically assumes that external knowledge is free, but many high-quality sources are paywalled, licensed, restricted, or otherwise costly to access. We introduce cost-aware RAG, a setting where retrieved evidence is assigned access-cost tiers and systems must answer under an explicit evidence-access budget. We instantiate this setting by augmenting MS MARCO v2.1 with access-friction tiers and evaluate budgeted evidence...

arXiv CS 8d ago