Home Knowledge Base PRMBench

PRMBench

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

The Hidden Bias of Process Reward Models:PRISM for Rewarding the Right Reasoning

arXiv:2606.09078v1 Announce Type: new Abstract: Process Reward Models (PRMs) improve credit assignment for reasoning by providing step-level feedback. However, we identify a hidden bias in PRMs caused by severe imbalance in step-level training data.

arXiv CS 1d ago