Open access peer-reviewed chapter - ONLINE FIRST

An Implementable and Stabilizing Model Predictive Control Strategy for Inverted Pendulum-Like Behaved Systems

By Odilon S.L. de Abreu, Márcio A.F. Martins and Leizer Schnitman

Submitted: November 12th 2019Reviewed: February 5th 2020Published: March 7th 2020

DOI: 10.5772/intechopen.91629

Downloaded: 31

Abstract

In control theory, the inverted pendulum is a class of dynamic systems widely used as a benchmarking for evaluating several control strategies. Such a system is characterized by an underactuated behavior. It is also nonlinear and presents open-loop unstable and integrating modes. These dynamic features make the control more difficult, mainly when the controller synthesis seeks to include constraints and the guarantee of stability of the closed-loop system. This chapter presents a stabilizing model predictive control (MPC) strategy for inverted pendulum-like behaved systems. It has an offset-free control law based on an only optimization problem (one-layer control formulation), and the Lyapunov stability of the closed-loop system is achieved by adopting an infinite prediction horizon. The controller feasibility is also assured by imposing a suitable set of slacked terminal constraints associated with the unstable and integrating states of the system. The effectiveness of the implementable and stabilizing MPC controller is experimentally demonstrated in a commercial-didactic rotary inverted pendulum prototype, considering both cases of stabilization of the pendulum in the upright position and the output tracking of the rotary arm angle.

Keywords

  • rotary inverted pendulum
  • model predictive control
  • nonlinear system
  • Lyapunov stability
  • feasible-optimization problem

1. Motivation

Dynamic inverted pendulum-featured apparatuses are widespread in systems and control theory. These represent a class of nonlinear and underactuated electromechanical systems, which, in turn, are composed of open-loop unstable and integrating modes. The scale-up of inverted pendulum-based conceptual sketches in practical mechanisms and real applications has been in progress, of which one can cite stabilization of rocket launch, robot balance, and segway-like means of transportation, among others [1, 2].

In control theory, inverted pendulum-type systems have enabled extensive studies concerning controller architectures ranging from proportional-integrative-derivative (PID)-like classical strategies to more advanced technique ones, such as optimal and adaptive strategies. In the middle of the advanced control strategies, the so-called class of model predictive control (MPC) strategies has been preferred by systematically handling system constraints. In fact, any MPC algorithm makes explicit use of a system model to predict its outputs and to obtain an optimal control law that minimizes the prediction error and control efforts [3, 4, 5]. Since it requires an online solution to its associated optimization problem, it first became popular in applications of slow dynamic systems, such as those in petroleum refineries and petrochemical industries [6]. However, with advances in hardware development and optimization techniques, MPC applications have been extended to fast dynamic systems, including inverted pendulum-like mechatronic systems [7, 8, 9].

On the other hand, when one seeks to control open-loop unstable systems such as inverted pendulum-like behaved ones, the guarantee of stability associated with control laws plays a crucial role concerning practical implementation purposes. In particular, the synthesis of stabilizing MPC laws deals essentially with terminal state constraints. One of the most heavily studied stability approach is one based on a dual-mode framework, which is composed of two distinct control modes: in the first mode, a conventional MPC law forces the system states to converge to a certain invariant set at the end of the finite prediction horizon, while in the second control mode, a local state-feedback controller takes over and drives the state to the desired operating point within this set [10]. This approach, however, requires the computation of the invariant set parameters, which is obtained from an offline numerical procedure. Although this set can be obtained offline with standard algorithms, undesired convergence and numerical issues may appear as the system dimension increases. Furthermore, the control horizon should be large enough such that the system states at the end of this horizon lie in the invariant set; otherwise the resulting optimization problem becomes infeasible, thus compromising both feasibility and control performance.

Another way to guarantee the closed-loop system stability of MPC controllers is to adopt an infinite prediction horizon, the so-called infinite-horizon model predictive control (IHMPC). Since the infinite-horizon problems cannot be directly handled by an optimization algorithm, the realization of IHMPC controllers is obtained from the combination between a terminal cost term and terminal equality constraints [11]. The terminal cost, associated with open-loop stable modes of the system, is calculated through the solution of the Lyapunov equation, whereas the terminal constraints are necessary to limit the objective function when the system is composed of integrating and unstable modes. However, stability proof is only achieved if the constrained optimization problem is feasible. The feasibility is also a critical issue of this approach, particularly because the domain of attraction of the controller becomes quite reduced by virtue of the associated hard constraints. Although there is already a rich theory in this field, the applications are heavily limited to theoretical works [12]. Among the methods developed to circumvent this issue so far, the approach based on slacked terminal constraints seems to be more adequate for practical implementation purposes, with recent applications reported in the literature, one implementation in a crude oil distillation [13] and the other in an inverted pendulum mechatronic-like fast dynamic system, namely, customized engine control unit [14].

This class of controllers formulates optimization problems that are always feasible through the suitable inclusion of slack variables in the control laws, without compromising their convergence and stabilizing properties. Also, these IHMPC controllers make use of the customized state-space models, obtained from an analytical expression of the step response of the system. Because of this, their formulations have been gradually developed over time. Odloak [15] focused on systems with open-loop stable poles, his work being extended to contemplate simple integrating poles as well in [6], commonly found in systems of the process industry. Then, Santoro and Odloak [16] encompassed time delays to the formulation proposed in [6]. For open-loop stable and unstable time-delay processes, Martins and Odloak [12] synthesized their IHMPC controller. More recently, the master’s dissertation [17] included integrating poles in the last work formulation [12], such that its associated IHMPC controller can be directly applied to rotary inverted pendulum-behaved systems, the case understudy of this chapter. The implementation in a real system of the feasible-optimization problem-based stabilizing MPC controller proposed in [17] has not yet been documented in the literature. This gap will be filled in the present work.

2. System description

The objective of the MPC controller to be explored in this work is to stabilize the pendulum rod in the upright position while it leads the rotary arm angle to the desired positions. To this end, the rotary inverted pendulum used here will be a commercial-didactic prototype manufactured by Quanser. This prototype is installed in the Control Laboratory of the Center for Technological Training in Industrial Automation (CTAI) at the Federal University of Bahia (UFBA). Figure 1 illustrates the features of such a system [18].

Figure 1.

Rotary inverted pendulum prototype manufactured by Quanser (left) and the schematic diagram (right).

The rotary inverted pendulum prototype consists of a servomotor system, whose voltage Vmapplied to it is responsible for generating torque in the rotary arm of angle (θ). The long pendulum rod is connected to the end of the rotary arm, and its angle, α, is zero when it is upright in the vertical position (cf. Figure 1).

The governing mathematical model of this system can be obtained by the Euler-Lagrange formalism, resulting in the following well-known equations [18]:

θ¨=bcsinαα̇2+bdsinαcosαceθ̇+cfVmacb2cos2α,α¨=adsinαb2sinαcosαα̇2becosαθ̇+bfcosαVmacb2cos2α,E1

where θ¨, θ̇, α¨, and α̇represent angular accelerations and velocities associated with rotary arm angle and inverted pendulum angle, respectively. In addition, the parameters abcdefare constants related to the physical dimensions of the various components that make up the inverted pendulum prototype. Information about the modeling and physical parameters of the system can be referred to [18].

Since one of the control objectives aims to the stabilization of the pendulum in the upright position, it is quite adequate to assume that αwill suffer small variations, which implies that sinαα, cosα0, and α̇20. Then, after some algebraic manipulations in Eq. (1), applying the Laplace transform as well, one turns out to be the following transfer function matrix (Gs):

θsαs=fcs2fdsacb2s3+ecs2adsedbfsacb2s3+ecs2adsedGsVms.E2

This model representation of the system in terms of transfer functions is useful to obtain the state-space formulation to be used in the stabilizing MPC control law, as will be shown in the next section.

3. Stabilizing MPC formulation

The stabilizing MPC control law used in this work seeks to solve an infinite-horizon optimization problem, such that its objective function is composed of the following terms:

Jk=j=1yk+jkyspδy,kΨunFunjmδun,kjmΔtδi,kQy2+j=0m1Δuk+jkR2+δy,kSy2+δun,kSun2+δi,kSi2,E3

where mis the control horizon, Δuk+jkRnuis the vector of input moves at time step k+j, QyRny×nyis a positive-definite weighting matrix of the controlled outputs, RRnu×nuis a positive-definite weighting matrix of the input moves, yspRnyis the vector of references of the controlled variables, and yk+jkRnyis the vector of the predicted outputs at time step k+jcomputed at time step k, considering a state-space model obtained from an analytical expression of the step response of the system described as in Eq. (2), namely:

xsk+1xstk+1xunk+1xik+1=Iny0ny×nd0ny×nunΔtIny0nd×nyFst0nd×nun0nd×ny0nun×ny0nun×ndFun0nun×ny0ny0ny×nd0ny×nunInyAxskxstkxunkxik+BsBstBunBiBΔuk,E4
yk=InyΨstΨun0ny×nuCxskxstkxunkxik.E5

In the state-space model defined in the pair of Eqs. (4) and (5), xskRnyare the artificial integrating states introduced by the incremental form of inputs, xstkCnstare the stable states of the system, xunkCnunare the unstable states of the system, and xikRnyare the true integrating states of the system. Inand 0nare identity and null matrices of n×ndimension, respectively. The remaining matrices (Fst, Fun,Bs,Bst,Bun,Bi,Ψste Ψun) are obtained from step-response coefficients of the transfer function matrix of the system, and the details can be referred to [19].

In the objective function, there are also δy,k, δun,k, and δi,kthat are slack variables introduced into the control law so as to provide additional degrees of freedom to the resulting optimization problem, thus assuring the feasibility of the controller. These slack variables are weighted by positive defined matrices SyRny×ny, SunRnun×nun, and SiRni×ni, respectively. In fact, the set of slack variables adopted in the problem formulation is responsible for softening, when necessary, terminal constraints that are imposed to limit the infinite-horizon objective function, owing to the existence of open-loop unstable and integrating modes.

It should be kept in mind that the objective function defined in Eq. (3) can be rewritten as follows:

Jk=j=1myk+jkyspδy,kΨunFunjmδun,kjmΔtδi,kQy2+j=1yk+m+jkyspδy,kΨunFunjδun,kjΔtδi,kQy2+j=0m1Δuk+jkR2+δy,kSy2+δun,kSun2+δi,kSi2.E6

Then, with the aid of the state-space model used to carry out the prediction of the system, it is possible to demonstrate that the objective function becomes:

Jk=j=1myk+jkyspδy,kΨunFunjmδun,kjmΔtδi,kQy2+j=1xsk+mk+jΔtxik+mk+ΨstFstjxstk+mk+ΨunFunjxunk+mkyspδy,kΨunFunjδun,kjΔtδi,kQy2+j=0m1Δuk+jkR2+δy,kSy2+δun,kSun2+δi,kSi2.E7

It is worth emphasizing that if constraints are not imposed at the end of the control horizon, the objective function value will increase unbounded. To this end, the following terminal constraints are imposed on the optimization control problem:

xsk+mkyspδy,k=0,E8
xunk+mkδun,k=0,E9
xik+mkδi,k=0.E10

Furthermore, the term associated with stable modes of the system comprises a convergent series, giving rise to the so-called terminal cost, namely:

j=1ΨstFstjxstk+mkQy2=xstk+mkQ¯2,E11

where Q¯is the terminal weighting matrix obtained from the solution to the Lyapunov equation of the system. In symbols:

Q¯=FstΨstQyΨstFst+FstQ¯Fst.E12

Therefore, the feasible-optimization problem-based stabilizing MPC control law is summarized as follows:

Problem 1.

minΔuk,δy,k,δun,k,δi,kJk=j=1myk+jkyspδy,kΨunFunjmδun,kjmΔtδi,kQy2+xstk+mkQ¯2+j=0m1Δuk+jkR2+δy,kSy2+δun,kSun2+δi,kSi2,

subject to Eqs. (8), (9), and (10), and

Δuk+jkU,j=0,,m1,E13
U=ΔumaxΔuk+jkΔumaxΔuk+jk=0,jmuminuk1+i=0jΔuk+ikumax,E14

where Δuk=ΔukkΔuk+m1kTis the sequence of control moves along the control horizon.

Remark 1. The slack variables play a remarkable role with respect to the feasibility of the control formulation, i.e., the control law of Problem 1 will always provide a feasible solution, either the nominal case (linear model) or plant-model mismatch, an object under study of this work.

Remark 2. The weighting matrices Sy, Sun, and Si(additional tuning parameters when compared to conventional MPC strategies) should be carefully selected. For instance, the values of Syshould be chosen sufficiently large, e.g., orders of magnitude larger than Qy(103Qy), to guarantee that the solution of the slacked optimization problem will only use the slack vector when the terminal constraints need to be softened. While δun,kand δi,kdo not need to be minimized a priori, by issues of achieving the closed-loop stability as fast as possible, one seeks their minimization weighted by large enough values of positive-definite Sunand Sin(102Qy) in order to enforce them to zero in a finite number of steps.

Remark 3. From the stability point of view, the master’s dissertation [17] demonstrates the conditions necessary to prove that the objective function behaves as a Lyapunov function, thus assuring that the control actions obtained from the solution of Problem 1 drive the system asymptotically to the reference value (desired steady state), if it is reachable; otherwise, the system will converge to an equilibrium point (reachable steady state) lying at a minimum distance from the desired steady state.

4. Results and discussion

This section is devoted to present the implementation results of the feasible-optimization problem-based stabilizing MPC controller (Problem 1) in the rotary inverted pendulum prototype described in Section 2. The ultimate goal of the controller is to maintain the pendulum rod in the upright position after it has been swung up to this position by the energy-based swing-up control scheme embedded in the system. In addition, the IHMPC controller is simultaneously designed to track the desired positions to be configured for rotary arm angles. In Quanser apparatus, an unconstrained linear-quadratic regulator (LQR) controller makes up the control system, besides the swing-up control strategy. The existing LQR strategy will be replaced by the IHMPC controller, and this scheme is depicted in Figure 2.

Figure 2.

Schematic representation of the application of the IHMPC controller in the rotary inverted pendulum prototype.

The architecture used for this real-time implementation of the IHMPC controller is summarized in Figure 3. From this figure, it is possible to note the information exchange among software-hardware-equipment mechanisms of the prototype. The control law is solved at each sampling time on the computer i7-8550H with 1.80GHz processor and 16GB of RAM, using Matlab script and Quarc real-time control toolbox. The software-hardware interface is done via USB communication through the Q8-USB acquisition board. This acquisition system acts, in turn, as an interface between the digital part of the system (controller) and the analogic one that is composed of the amplifier (VoltPAQ-X1).

Figure 3.

The architecture used in communication among software-hardware-equipment mechanisms.

For the experimental results presented as follows, we consider a scenario of square wave-type tracking on the rotary arm, while the controller must maintain the pendulum rod around the upright vertical position, even in the existing unmeasured disturbance scenarios. The constraints associated with the control signal and control actions (decision variables) are those established in Table 1. Note that there is a strict condition of ±1V on the control actions. Also, the IHMPC tuning parameters considered were sampling period Δt= 2 milliseconds, Qy=diag16×102, R=9.8×102, Sy=diag105104, Sun=diag1026×104, and Si=102. The state estimator used here was the Kalman filter, whose tuning parameters associated with process noise and measurement noise were the following covariance matrices QKalman=I9×9and RKalman=2.4×106I2×2, respectively. Finally, a control horizon of m=9has been adopted as an appropriate value to attain the desired control performance, which was chosen from a sensitivity analysis, as will also be shown here.

VariablesMinimum value (V)Maximum value (V)
Control signal−1212
Control actions−11

Table 1.

Constraints on system inputs.

The closed-loop system results are depicted in Figures 4 and 5. From Figure 4, one can see that after about 5.5 seconds, the time necessary that the swing-up control acted to lead the pendulum rod to its upright position, the IHMPC controller takes over and performs quite well both tasks associated with the rotary arm angle tracking and the stabilization of the pendulum rod within an acceptable range lying at about ±2. It is also noteworthy that after the execution of the square-wave trajectory on the rotary arm angle, the controller had a great performance concerning impulse-like external disturbances inserted in the pendulum rod since the controlled variables are momentarily moved away from their set points, but soon they are brought back to their original positions.

Figure 4.

Controlled variables: rotary arm angle ( θ ) and pendulum rod angle ( α ).

Figure 5.

Behavior of the control signal (tension applied to the servomotor).

Even though the stability of the IHMPC controller is only related to the nominal case (linear model), it proved to be very sufficient in a realistic plant-model mismatch scenario, including nonlinearities existing in the rotary inverted pendulum apparatus, such as dead zone, friction, backlash, hysteresis, and so on. This model uncertainty scenario was responsible for non-prohibitive oscillations, within a practical implementation purpose, on the constrained control signal (cf. Figure 5), which were reflected in the controlled outputs.

Furthermore, it is worth mentioning that fulfilling a tighter constraint (±1V) by a conventional stabilizing MPC controller, e.g., [11, 12], could result in an unfeasibility scenario; however, since IHMPC controller used here is based on a feasible-optimization formulation, its control law always will provide a feasible solution while the system is controllable, thus becoming it implementable in practice. The IHMPC controller uses its additional degrees of freedom (slack variables), when necessary, in order to comply with the terminal constraints. Figure 6 illustrates the use of the slack variables in the control problem. It is observed that the controller makes use of these variables immediately after a perturbation in the system occurred namely, set-point changes and unmeasured disturbance entrance, situations in which it can be hard to comply with non-slacked terminal constraints. However, as the system goes to an acceptable cyclic steady state around its set point, due to the noise degree intrinsic to the system, the slack variables converge to the origin very fast and systematically.

Figure 6.

Behavior of the slack variables associated with the feasible-optimization formulation of the IHMPC controller.

On the other hand, in order to obtain a satisfactory performance as in the results presented earlier, an effort with respect to the controller tuning was necessary. This tuning task is easier in the stabilizing MPC controllers than the conventional finite-horizon MPC ones [13], as demonstrated in what follows. In this case, it was sufficient to handle only the control horizon. In inverted pendulum-like fast dynamic systems, when one applies a more aggressive control policy, i.e., small control horizon, it can cause undesired overshoots, while adopting large control horizons, it cannot have time sufficient to act with control action properly, thus bringing unnecessary oscillations or even causing the instability of the closed-loop system.

To work around this trade-off, we proceeded with sensitivity analysis on the control horizon, keeping the same remaining tuning parameters shown in the preceding experimental results. Figure 7 summarizes the aforementioned analysis. Note that as the control horizon increases, the oscillations decrease until the control horizon m=9. However, a value greater than m=9makes the closed-loop system go back to having undesired and larger oscillations, thus jeopardizing the use of energy associated with the control signal. Therefore, the use of the IHMPC controller enabled a simple analysis concerning only one tuning parameter, which yielded an appropriate value to meet the desired control performance in the real case.

Figure 7.

Sensitivity analysis on the control horizon of the closed-loop system.

5. Conclusions

In this chapter, we have investigated the application of an implementable and stabilizing model predictive control model strategy in a commercial-didactic rotary inverted pendulum apparatus, hitherto unexplored in the literature. Although the guarantee of stability of the controller is devoted to the nominal case (linear model), its formulation based on a feasible-optimization problem allows it to be used in any plant-model mismatch scenario in practice, such as one series of nonlinearities existing in the real system, namely, dead zone, friction, and backlash, among other unmodelled dynamics. The experimental results showed the effectiveness and robustness of the controller in the aforementioned plant-model mismatch setting by performing quite well its task in the rotary arm angle tracking and stabilization of the pendulum rod around the upright position as well as in the optimum use of energy associated with control efforts.

A simple tuning procedure was adopted by virtue of using a stabilizing MPC controller, which allowed us to handle only one tuning parameter through sensitivity analysis on the control horizon. The value found was quite adequate to attain the control objectives in terms of the trade-off existing between the performance on the controlled variables and the use of energy related to the control signal.

The future direction for this research is to further extend this controller to guarantee the stability of the nonlinear case, including the energy-based swing-up control schemes.

Acknowledgments

The authors would like to thank the Brazilian research agencies CAPES, CNPq, and FAPESB for their financial support.

How to cite and reference

Link to this chapter Copy to clipboard

Cite this chapter Copy to clipboard

Odilon S.L. de Abreu, Márcio A.F. Martins and Leizer Schnitman (March 7th 2020). An Implementable and Stabilizing Model Predictive Control Strategy for Inverted Pendulum-Like Behaved Systems [Online First], IntechOpen, DOI: 10.5772/intechopen.91629. Available from:

chapter statistics

31total chapter downloads

More statistics for editors and authors

Login to your personal dashboard for more detailed statistics on your publications.

Access personal reporting

We are IntechOpen, the world's leading publisher of Open Access books. Built by scientists, for scientists. Our readership spans scientists, professors, researchers, librarians, and students, as well as business professionals. We share our knowledge and peer-reveiwed research papers with libraries, scientific and engineering societies, and also work with corporate R&D departments and government entities.

More About Us