Learning Local Optimal Controller

Related Articles from SNS

Learning Local Optimal Controller for a Class of Nonlinear Systems via Impulse-Supervised Exploration

arXiv:2606.03107v1 Announce Type: new Abstract: This paper develops an impulse-supervised confined exploration framework for learning a local optimal controller for a class of nonlinear systems. The proposed approach combines continuous-time approximate dynamic programming (ADP) with an impulsive supervisory layer, where impulsive braking confines the state within a prescribed region in which a local linear approximation of the nonlinear system is valid.

arXiv CS 7d ago

Learning Predictive Control with Deep Koopman Operators for Autonomous Vehicle Motion Planning

arXiv:2606.08136v1 Announce Type: new Abstract: Model Predictive Control (MPC) is widely used for autonomous-vehicle (AV) motion planning, but its real-time applicability is often limited by the need for accurate models and online solution of nonlinear, nonconvex optimization problems in dynamic road environments. Actor-critic reinforcement learning offers a promising alternative for online policy generation, yet its policy-learning process often lacks explicit control-theoretic structure....

arXiv CS 1d ago

Residual-Controlled Multiplier Learning for Stochastic Constrained Decision-Making

arXiv:2606.07088v1 Announce Type: new Abstract: Stochastic constrained decision-making requires optimizing performance objectives while enforcing statistical requirements such as safety or fairness. However, standard primal-dual methods struggle to update multipliers robustly under stochastic mini-batch feedback, as the noise of mini-batch gradients and constraint estimates can be directly accumulated into the multiplier memory.

arXiv CS 2d ago

Explainable Data-driven Deep Reinforcement Learning Methods for Optimal Energy Management in Buildings

arXiv:2606.02049v1 Announce Type: new Abstract: The increasing integration of renewable energy sources into power systems, particularly in buildings equipped with photovoltaic (PV) panels and energy storage systems, introduces significant complexity in energy systems. Volatile power generation, varying electricity tariffs, and a growing number of entities, e.g., PV systems and heat pumps, have increased the complexity and made the system harder to operate. This leads to the demand for additional...

arXiv CS 8d ago

PairAlign: A Framework for Sequence Tokenization via Self-Alignment with Applications to Audio Tokenization

Announce Type: replace Abstract: Many operations on sensory data -- comparison, memory, retrieval, and reasoning -- are naturally expressed over discrete symbolic structures. In language this interface is given by tokens; in audio, it must be learned. Existing audio tokenizers rely on quantization, clustering, or codec reconstruction, assigning tokens locally, so sequence consistency, compactness, length control, termination, and edit similarity are rarely optimized directly.

arXiv CS 1d ago

Variational Proximal Policy Optimization

Announce Type: cross Abstract: Reinforcement Learning from Human Feedback via Proximal Policy Optimization often suffers from policy mode collapse, brittle exploration loops, and distribution drift. This paper introduces Variational Proximal Policy Optimization (\(\textsc{VP}_2\textsc{O}\)), a particle-based variational inference framework that maps policy optimization to Stein Variational Gradient Descent within a Mixture-of-Experts architecture. By leveraging functional kernels over...

arXiv CS 1d ago

Learning-Based Navigation for Indoor Mobile Robots

arXiv:2605.30468v1 Announce Type: new Abstract: This paper presents a learning-based navigation framework for indoor mobile robots. The proposed method combines a supervised neural global planner, trained from cost-aware A* expert trajectories, with the proposed Learning-Based DWA local planner, which is formulated as discrete candidate selection over the Dynamic Window Approach (DWA) action lattice. For local planning, the policy is first trained by behavior cloning and then refined by...

arXiv CS 9d ago

Human-Like Neural Nets by Catapulting

Speculative proposal to create artificial neural nets with human-like performance by high-learning-rate/regularization training of overparameterized NNs to trigger catapulting/grokking. Over-parameterization as a route to true generalization would resolve many outstanding mysteries of artificial versus natural intelligence. There are many mysteries about deep learning and human intelligence, but we could describe the biggest anomaly this way: why are...

Hacker News 3d ago

Towards Guaranteed Optimal PID Tuning for Uncertain Nonlinear Systems

arXiv:2606.04787v1 Announce Type: new Abstract: Despite the widespread use of PID controllers in engineering practice, designing optimal PID parameters has long been regarded as a challenging problem in both theory and practice, particularly when faced with uncertain nonlinear dynamical systems. Based on the authors' PID control theory established recently for MIMO nonlinear uncertain systems (Zhao and Guo, 2022), which provides a concrete PID parameter set for global stability of PID...

arXiv CS 6d ago

Latent Geometry Beyond Search: Amortizing Planning in World Models

arXiv:2605.08732v2 Announce Type: replace Abstract: Modern vision-based world models can represent observations as compact yet expressive latent manifolds, but fast goal-oriented planning in these spaces remains challenging. This raises a central question: when does a learned representation simplify control, rather than merely enabling prediction? We study this question in a pretrained LeWorldModel, whose latent geometry is regularized for smoothness and uniformity.

arXiv CS 2d ago