LexRubric
No mentions found
This entity hasn't been tracked yet, or Iris is still building its knowledge base.
Related Articles from SNS
LexRubric: A Rubric-Guided Diagnostic Benchmark for Open-Ended Legal Tasks
arXiv:2606.09389v1 Announce Type: new Abstract: As large language models (LLMs) are increasingly applied to real-world legal tasks, evaluating the reliability of their open-ended legal responses has become essential. These tasks require context-sensitive answers and allow little room for error, motivating fine-grained and diagnostic evaluation that can identify specific sources of response quality failures.