PhysDox

Related Articles from SNS

PhysDox: Benchmarking LLMs on Physical Feasibility Auditing of Physiological Sensing Protocols

arXiv:2606.05003v1. Abstract: Large language models (LLMs) increasingly assist in experimental design, yet fluent protocols often remain physically infeasible. We introduce PhysDox, a physical feasibility auditing benchmark for biomedical protocols comprising a 683-sample expert-curated Gold set and a 5,000-sample Silver set across six sensing domains. We formulate the task as a two-stage evaluation: severity detection classifying protocols as valid, minor, or fatal,...

arXiv CS 6d ago