Reliability

Related Articles from SNS

Reusing Fusion-Time Spectral Reliability for Adaptive Fusion and Expert Routing in RGB-Infrared Object Detection

arXiv:2606.01173v1 Announce Type: new Abstract: RGB-infrared detectors typically discard the statistics generated during cross-modal fusion, leaving downstream modules unaware of whether the current interaction is reliable. We propose to extract a parameter-free, 7-dimensional spectral reliability descriptor -- summarizing band energy, amplitude ratio, phase consistency, and cross-modal correlation -- and to reuse it beyond the fusion stage. The descriptor drives both Spectral Reliability...

arXiv CS 8d ago

Making Embodied AI Reliable: A Community Agenda from Testing to Formal Verification

arXiv:2606.03593v1 Announce Type: new Abstract: Embodied AI systems are increasingly deployed in open-world environments, yet ensuring their reliability remains a fundamental challenge. Drawing on discussions from the AAAI'26 Bridge Program on "Making Embodied AI Reliable with Testing and Formal Verification", this article argues that reliability in embodied AI is inherently a lifecycle assurance problem arising from uncertainty, human interaction, and emergent behaviors across tightly...

arXiv CS 7d ago

Reliability-Guided Depth Fusion for Glare-Resilient Navigation Costmaps

Announce Type: new Abstract: Specular glare on reflective floors, glass boundaries, and glossy indoor surfaces frequently corrupts active-stereo RGB-D depth measurements, producing holes and spikes that accumulate as persistent phantom obstacles in occupancy-grid costmaps. This paper presents a glare-resilient costmap construction method based on explicit depth-reliability modeling. A lightweight Depth Reliability Map network (DRM-Net) predicts per-pixel measurement trustworthiness under...

arXiv CS 7d ago

Diagnosing the Reliability of LLM-as-a-Judge via Item Response Theory

arXiv:2602.00521v2 Announce Type: replace Abstract: While LLM-as-a-Judge is widely used in automated evaluation, existing validation practices primarily operate at the level of observed outputs, offering limited insight into whether LLM judges themselves function as stable and reliable measurement instruments. To address this limitation, we introduce a two-phase diagnostic framework for assessing reliability of LLM-as-a-Judge, grounded in Item Response Theory (IRT). The framework adopts...

arXiv CS 9d ago

Reformulating Energy Storage Capacity Accreditation Problem with Marginal Reliability Impact

arXiv:2601.22096v2 Announce Type: replace Abstract: To enhance the efficiency of capacity markets, many electricity markets in the U.S. are adopting or planning to implement marginal capacity accreditation reforms. This paper provides new insights into energy storage capacity accreditation using Marginal Reliability Impact (MRI). We reformulate the commonly used reliability-based storage dispatch model as an optimization problem, enabling direct calculation of the MRI from the Lagrange...

arXiv CS 2d ago

Wind Turbine Maintenance Log Labelling Framework: LLM-Driven Data Correction and Enrichment via Semantic Extraction of Reliability Intelligence

Announce Type: new Abstract: As wind turbine fleets age, data-driven reliability engineering is essential to optimise their operation and maintenance for service life extension and levelised cost of energy reduction. Failure event descriptions within historical maintenance logs are a source of valuable reliability intelligence.

arXiv CS 9d ago

New study casts doubt on reliability of mental health diagnosis interviews

Diagnostic interviews seen as ‘gold standard’ vary in reliability from condition to condition, study says

Diagnostic interviews – the most common way to diagnose substance use and mental disorders including depression, anxiety, bipolar and personality disorders – vary in reliability from condition to condition, according to a new study in JAMA Network Open. Laura Duncan, a psychiatry professor at McMaster University in Ontario, Canada, and one of the study’s authors, said diagnostic...

The Guardian UK 4d ago

EHRBench: An Automated and Reliable EHR-based Benchmark for Clinical Decision Making with LLMs

arXiv:2605.30637v1 Announce Type: new Abstract: Clinical decision-making (CDM) is central to real-world clinical workflows, where clinicians infer diagnoses, select treatments, or anticipate future health outcomes under incomplete evidence. LLMs are increasingly used to support these decisions due to strong language capabilities, broad biomedical knowledge, and efficiency, yet the reliability of LLMs on real-world clinical decision tasks remains insufficiently understood. To evaluate CDM...

arXiv CS 9d ago

Toward Reliable Semantic Communication: Beyond Average Performance

Announce Type: new Abstract: Semantic communication has emerged as a promising paradigm for improving transmission efficiency by conveying task-relevant semantics rather than raw data. Although recent studies have achieved notable gains in communication efficiency and average task performance, reliability remains a fundamental bottleneck in dynamic and uncertain environments. In particular, most existing designs are still optimized mainly for average-case behavior, while lower-tail...

arXiv CS 8d ago