SEMJ

Related Articles from SNS

When Languages Disagree: Self-Evolving Multilingual LLM Judges

arXiv:2606.08092v1. Abstract: Multilingual LLM-as-a-judge is widely used to evaluate model outputs across languages, but suffers from cross-lingual inconsistency (Fu and Liu, 2025). Existing methods typically treat this inconsistency as noise and mitigate it through voting or aggregation. In this work, we instead show that multilingual inconsistency can provide complementary evaluation signals.

arXiv CS 1d ago