Actor–Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems

Abstract

This paper presents a partially model-free adaptive optimal control solution to the deterministic nonlinear discrete-time (DT) tracking control problem in the presence of input constraints. The tracking error dynamics and reference trajectory dynamics are first combined to form an augmented system. Then, a new discounted performance function based on the augmented system is presented for the optimal nonlinear tracking problem. In contrast to the standard solution, which finds the feedforward and feedback terms of the control input separately, the minimization of the proposed discounted performance function gives both feedback and feedforward parts of the control input simultaneously. This enables us to encode the input constraints into the optimization problem using a nonquadratic performance function. The DT tracking Bellman equation and tracking Hamilton-Jacobi-Bellman (HJB) are derived. An actor-critic-based reinforcement learning algorithm is used to learn the solution to the tracking HJB equation online without requiring knowledge of the system drift dynamics. That is, two neural networks (NNs), namely, actor NN and critic NN, are tuned online and simultaneously to generate the optimal bounded control policy. A simulation example is given to show the effectiveness of the proposed method.

Keywords

Hamilton–Jacobi–Bellman equationControl theory (sociology)Optimal controlNonlinear systemFeed forwardReinforcement learningComputer scienceBounded functionTracking errorTracking (education)TrajectoryBellman equationArtificial neural networkDiscrete time and continuous timeMathematical optimizationFunction (biology)MathematicsControl (management)Artificial intelligenceControl engineeringEngineering

Affiliated Institutions

The University of Texas at Arlington US

Related Publications

Contributions to the theory of optimal control

R. E. Kalman

This paper was in fact the first to introduce the RDE as an algorithm for computing the state feedback gain of the optimal controller for a general linear system with a quadrati...

1960 1697 citations

New Results in Linear Filtering and Prediction Theory

R. E. Kalman , R. S. Bucy

A nonlinear differential equation of the Riccati type is derived for the covariance matrix of the optimal filtering error. The solution of this “variance equation” completely sp...

1961 Journal of Basic Engineering 6246 citations

Adaptive finite time filtering

R. Bucy , James W. Follin

A detailed analysis of a particular adaptive filter has been carried out and the required extension of the theory to the general case is indicated. The filter measures the spect...

1962 IRE Transactions on Automatic Control 6 citations

A New Approach to Linear Filtering and Prediction Problems

R. E. Kalman

The classical filtering and prediction problem is re-examined using the Bode-Shannon representation of random processes and the “state-transition” method of analysis of dynamic ...

1960 Journal of Basic Engineering 30005 citations

Adaptive multilevel finite element solution of the Poisson-Boltzmann equation I. Algorithms and examples

Michael Holst , Nathan Baker , F. Wang

This article is the first of two articles on the adaptive multilevel finite element treatment of the nonlinear Poisson–Boltzmann equation (PBE), a nonlinear eliptic equation ari...

2000 Journal of Computational Chemistry 276 citations

Publication Info

Year: 2014
Type: article
Volume: 26
Issue: 1
Pages: 140-151
Citations: 311
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Actor–Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

311

OpenAlex

Cite This

APA Style

                            
                                    Bahare Kiumarsi, 
                                
                                    Frank L. Lewis
                                
                            (2014). 
                            Actor–Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems. 
                            IEEE Transactions on Neural Networks and Learning Systems
                            , 26
                            (1)
                            , 140-151.
                            https://doi.org/10.1109/tnnls.2014.2358227

Identifiers

DOI: 10.1109/tnnls.2014.2358227