Home Knowledge Base Meta-World

Meta-World

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

When are LLMs Sufficient Policy Optimizers for Sequential RL Tasks?

arXiv:2605.30719v1 Announce Type: new Abstract: We study when large language models (LLMs) can serve as effective black-box policy optimizers for reinforcement learning (RL) tasks, i.e., when can we replace classical RL algorithms with an LLM? We explore this question by introducing Prompted Policy Optimization (PromptPO), an iterative method that prompts an LLM with Python descriptions of the state space, action space, and reward function, then has it generate and refine executable policies...

arXiv CS 9d ago

OMP: One-step Meanflow Policy with Directional Alignment

arXiv:2512.19347v3 Announce Type: replace Abstract: Robot manipulation has increasingly adopted data-driven generative policy frameworks, yet the field faces a persistent trade-off: diffusion models suffer from high inference latency, while flow-based methods often require complex architectural constraints. Although in image generation domain, the MeanFlow paradigm offers a path to single-step inference, its direct application to robotics is impeded by critical theoretical pathologies,...

arXiv CS 7d ago

ReGIL: Retrieval-Guided Imitation Learning from a Single Demonstration

arXiv:2606.09381v1 Announce Type: new Abstract: Learning robot manipulation policies with deep neural networks from a single demonstration remains highly challenging, as even small deviations from the demonstrated trajectory can quickly compound into failure, while collecting substantial online interaction data is costly. We propose ReGIL, a retrieval-guided imitation learning framework that treats a single demonstration as an external memory. ReGIL repeatedly queries this static memory...

arXiv CS 1d ago