Home Technology Acquiring Human-Like Data-Efficient Mechanics Prediction...
Technology

Acquiring Human-Like Data-Efficient Mechanics Prediction from Deep Reinforcement Learning

Key Points

Announce Type: replace Abstract: Humans can infer mechanical outcomes by learning from a few observations. This capacity for mechanics intuition is acquired in a data-efficient manner. Here, we propose a reinforcement learning framework to mimic this process, in which an agent encodes continuous physical observation parameters into its state and is trained via episodic switching across closely related observations.

arXiv:2601.21881v2 Announce Type: replace Abstract: Humans can infer mechanical outcomes by learning from a few observations. This capacity for mechanics intuition is acquired in a data-efficient manner. Here, we propose a reinforcement learning framework to mimic this process, in which an agent encodes continuous physical observation parameters into its state and is trained via episodic switching across closely related observations. With merely two or three similar observations, the agent acquires robust mechanics intuition that generalizes over wide parameter ranges beyond the training data. Our method is demonstrated on the brachistochrone, a large-deformation elastic plate, and the quantum harmonic oscillator. We explain this generalization through a unified theoretical view: it is associated with cross-parameter Bellman consistency encouraged by episodic switching across neighboring task parameters, promoting approximate stationarity of the Bellman residual with respect to physical variations. This is consistent with a smooth policy that tracks a low-dimensional solution manifold underlying the continuum of tasks. Our work identifies episodic switching as a practically effective and theoretically motivated route to artificial mechanics intuition and suggests a computational analogy to data-efficient generalization in biological learners.
Data-Efficient Mechanics Prediction (ORG) Deep Reinforcement Learning arXiv:2601.21881v2 (ORG)
Originally published by arXiv Physics Read original →