Home › Knowledge Base › Optimal Control Approach

Optimal Control Approach

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Optimal Control Approach for Non-prehensile Ball Juggling Using a 7-DoF Manipulator

Announce Type: new Abstract: Non-prehensile object manipulation skills are important for real-world robot interactions, enabling highly dynamic tasks such as balancing a glass on a tray or the controlled sliding of items on a table. Among such tasks, those characterised by high-speed manipulation requirements and general sensitivity of the resulting hybrid dynamics are particularly hard to accomplish. Within these, juggling can be seen as a highly challenging maneuver to be solved.

arXiv CS 2d ago

Self-Optimizing Control of Continuous Processes Based on Reinforcement Learning

new Abstract: This paper addresses the Self-Optimizing Control (SOC) problem in industrial continuous processes and proposes a Reinforcement-Learning (RL)-based SOC approach to improve dynamic performance under high-frequency disturbances. In the proposed framework, the SOC controlled variable structure is embedded in the Actor network, and reward functions are designed based on economic indicators. Through interaction with the environment, the RL agent optimizes controlled variables while...

arXiv CS 6d ago

Learning Local Optimal Controller for a Class of Nonlinear Systems via Impulse-Supervised Exploration

arXiv:2606.03107v1 Announce Type: new Abstract: This paper develops an impulse-supervised confined exploration framework for learning local optimal controller for a class of nonlinear systems. The proposed approach combines continuous-time approximate dynamic programming (ADP) with an impulsive supervisory layer, where impulsive braking confines the state within a prescribed region in which a local linear approximation of the nonlinear system is valid.

arXiv CS 7d ago

A Single-Loop Bilevel Deep Learning Method for Optimal Control of Obstacle Problems

arXiv:2601.04120v2 Announce Type: replace-cross Abstract: Optimal control of obstacle problems arises in a wide range of applications and is computationally challenging due to its nonsmoothness, nonlinearity, and bilevel structure. Classical numerical approaches rely on mesh-based discretization and typically require solving a sequence of costly subproblems. In this work, we propose a single-loop bilevel deep learning method, which is mesh-free, scalable to high-dimensional and complex...

arXiv CS 7d ago

Trajectory Planning for Non-Communicating Mobile Robots using Inverse Optimal Control

Announce Type: new Abstract: To enable an efficient interaction of non-communicating mobile robots in collision avoidance scenarios, we present a novel combined trajectory planning and prediction algorithm. Inverse optimal control is used to estimate unknown goal states of all robots based on observed past trajectories. Each robot also takes the perspective of other robots in considering self-prediction and solves a joint prediction problem using the estimated goal states.

arXiv CS 9d ago

Large-Scale LLM Inference with Heterogeneous Workloads: Prefill-Decode Contention and Asymptotically Optimal Control

Announce Type: replace Abstract: Large Language Models (LLMs) are rapidly becoming critical infrastructure for enterprise applications, driving unprecedented demand for GPU-based inference services. A key operational challenge arises from the two-phase nature of LLM inference: a compute-intensive \emph{prefill} phase that processes user input, followed by a memory-bound \emph{decode} phase that generates output tokens. When these phases share GPU resources, prefill tasks throttle the...

arXiv CS 5d ago

A Continuification Approach to CAV Control in Mixed Traffic via Variable Speed Limits

arXiv:2606.09534v1 Announce Type: new Abstract: This paper presents a method for controlling traffic via the use of connected and automated vehicles (CAVs) acting as moving bottlenecks. Current methods for moving bottleneck control use a couple PDE-ODE model, based on the Lighthill-Whitham-Richard (LWR) model, to represent the influence of the CAV. Control of the CAV is normally achieved by designing the control on the ODE which models the speed of the moving bottleneck.

arXiv CS 1d ago

Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization

arXiv:2510.05342v2 Announce Type: replace Abstract: Direct Preference Optimization (DPO) has emerged as a simple and effective method for aligning large language models. However, its reliance on a fixed temperature parameter leads to suboptimal training on diverse preference data, causing overfitting on easy examples and under-learning from informative ones. Recent methods have emerged to counter this.

arXiv CS 8d ago

Clipped Affine Policy: Low-Complexity Near-Optimal Online Power Control for Energy Harvesting Communications over Fading Channels

arXiv:2601.07622v2 Announce Type: replace Abstract: This paper studies online power control for battery-limited point-to-point energy harvesting communications over slow block-fading channels. A linear-policy-based approximation is developed for the relative-value function in the Bellman equation of the power control problem. This approximation leads to two fundamental parameterized clipped affine policies: an optimistic policy derived from a certainty-equivalence-type approximation and a...

arXiv CS 2d ago

Optimal Finite-Horizon LQR Control for Traffic Flow via Variable Speed Limits

Announce Type: replace-cross Abstract: This article presents a finite-horizon linear quadratic regulator for the control of the first-order Lighthill-Whitham-Richards traffic model with a triangular fundamental diagram. The in-domain control action is realized through variable speed limits implemented as a source term in the governing hyperbolic partial differential equation. Unlike prior studies on infinite-horizon formulations, this article develops a finite-horizon LQR framework, deriving...

arXiv CS 1d ago