
SimulCost


Related Articles from SNS

SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs

Announce Type: replace

Abstract: Evaluating LLM agents for scientific tasks has focused on token costs while ignoring tool-use costs like simulation time and experimental resources. As a result, metrics like pass@k become impractical under realistic budget constraints. To address this gap, we introduce SimulCost, the first benchmark targeting cost-sensitive parameter tuning in physics simulations.

arXiv Physics 9d ago

SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs

Announce Type: replace-cross

Abstract: Evaluating LLM agents for scientific tasks has focused on token costs while ignoring tool-use costs like simulation time and experimental resources. As a result, metrics like pass@k become impractical under realistic budget constraints. To address this gap, we introduce SimulCost, the first benchmark targeting cost-sensitive parameter tuning in physics simulations.

arXiv CS 9d ago