
"Human-Robot Interaction", book edited by Daisuke Chugo, ISBN 978-953-307-051-3. Published: February 1, 2010 under CC BY-NC-SA 3.0 license. © The Author(s).

Chapter 18

Design of a Neural Controller for Walking of a 5-Link Planar Biped Robot via Optimization

By Nasser Sadati, Guy A. Dumont, and Kaveh Akbari Hamed
DOI: 10.5772/8144


Overview

Figure 1. Single support phase and the configuration variables.
Figure 2. The control loop used for the biped walker.
Figure 3. The CPG network and the synaptic connections.
Figure 4. The virtual inverted pendulum.
Figure 5. The snapshots of one step for the biped robot with the best fitness.
Figure 6. The CPG outputs and the joint angle positions of the leg joints during 10 s.
Figure 7. The phase plot of joint angle vs. velocity at the unactuated joint ($q_0$-$\dot{q}_0$ plane) during 10 s.
Figure 8. The phase plots of joint angle vs. velocity at the leg joints during 10 s.
Figure 9. The control signals of the servo controllers during 10 s.
Figure 10. The ground reaction forces at the leg ends during 10 s.
Figure 11. Stick figure of the robot.
Figure 12. The phase plot of joint angle vs. velocity at the unactuated joint.
Figure 13. Maximum amplitude of the pulse vs. pulse duration.


1. Introduction

Underactuation, the impulsive nature of impacts with the environment, the structure of the feet and the large number of degrees of freedom are the basic problems in the control of biped robots. Underactuation is naturally associated with dexterity [1]. For example, headstands are considered dexterous; in this case, the contact point between the body and the ground acts as a pivot without actuation. The impacts between the lower limbs of the biped walker and the environment make the dynamics of the system impulsive. The foot-ground impact is one of the main difficulties in the design of robust control laws for biped walkers [2]. Unlike robotic manipulators, biped robots are always free to detach from the walking surface, and this leads to various types of motion [2]. Finally, the many degrees of freedom in the mechanism of biped robots make the coordination of the links difficult. For these reasons, designing a practical controller for biped robots remains a challenging problem [3]. These features also make traditional stability margins difficult to apply.

In fully actuated biped walkers, where the stance foot remains flat on the ground during the single support phase, well-known criteria such as the Zero Moment Point (ZMP) principle guarantee the stability of the biped robot [4]. The ZMP is defined as the point on the ground about which the net moment generated by the ground reaction forces has zero components along the two axes that lie in the ground plane. Takanishi [5], Shin [6], Hirai [7] and Dasgupta [8] have proposed methods of walking pattern synthesis based on the ZMP. Under this criterion, as long as the ZMP lies strictly inside the support polygon of the foot, the desired trajectories are dynamically feasible. If the ZMP lies on the edge of the support polygon, the trajectories may not be dynamically feasible. The Foot Rotation Indicator (FRI) [9] is a more general form of the ZMP. The FRI is the point on the ground where the net ground reaction force would have to act to keep the foot stationary. If the FRI is within the convex hull of the stance foot, the robot can walk without rolling over the toe or the heel. This kind of walking is called fully actuated walking. If the FRI lies outside the projection of the foot on the ground, the stance foot rotates about the toe or the heel. This is called underactuated walking. For bipeds with point feet [10] and Passive Dynamic Walkers (PDW) [11] with curved feet in the single support phase, the ZMP heuristic is not applicable. Westervelt [12] used the Hybrid Zero Dynamics (HZD) [13], [14] and the Poincaré mapping method [15]-[18] to stabilize RABBIT in the underactuated phase. The controller proposed in this approach is organized around the hybrid zero dynamics so that the stability analysis of the closed-loop system reduces to a one-dimensional Poincaré mapping problem. HZD involves the judicious choice of a set of holonomic constraints that are imposed on the robot via feedback control [19]. Extracting the eigenvalues of the Poincaré return map is commonly used for analyzing PDW robots. However, the use of these eigenvalues assumes periodicity and is valid only for small deviations from the limit cycle [20].

The ZMP criterion has become a very powerful tool for trajectory generation in biped walking. However, it requires stiff joint control of prerecorded trajectories, which leads to poor robustness on unknown rough terrain [20], whereas humans and animals show remarkable robustness when walking on irregular terrain. It is well known in biology that Central Pattern Generators (CPG) in the spinal cord are coupled with the musculoskeletal system [21]-[23]. The CPG and the feedback networks can coordinate the body links of vertebrates during locomotion. Several mathematical models have been proposed for a CPG. Among them, Matsuoka's model [24]-[26] has been studied the most. In this model, a CPG is modeled by a Neural Oscillator (NO) consisting of two mutually inhibiting neurons, each represented by a nonlinear differential equation. This model has been used by Taga [22], [23] and Miyakoshi [27] in biped robots. Kimura [28], [29] has used it at the hip joints of quadruped robots.

The robot studied in this chapter is a 5-link planar biped walker in the sagittal plane with point feet. The model of such a robot is hybrid [30]: it consists of a single support phase and a discrete map that models the frictionless impact and the instantaneous double support phase. The goal of this chapter is to coordinate and control the body links of the robot with a CPG and a feedback network. The outputs of the CPG are the target angles in the joint space, where P controllers at the joints are used as servo controllers. To tune the parameters of the CPG network, the control problem of the biped walker is defined as an optimization problem. It is shown that such a control system can produce a stable limit cycle (i.e. a stride). The chapter is structured as follows. Section 2 models the walking motion, which consists of the single support phase and the impact model. Section 3 describes the CPG model and the tuning of its parameters. In Section 4, a new feedback network is proposed. In Section 5, the walking control problem is defined as an optimization problem for tuning the weights of the CPG network, and the structure of the genetic algorithm used to solve it is described. Section 6 presents simulation results obtained in the MATLAB environment. Finally, Section 7 contains some concluding remarks.

2. Robot model

The overall motion of the biped involves continuous phases separated by abrupt changes resulting from the impact of the lower limbs with the ground. In the single support and double support phases, the biped is a mechanical system subject to unilateral constraints [31]-[33]. In this section, the biped is assumed to be a planar robot consisting of $n$ rigid links connected by parallel, actuated revolute joints to form a tree structure. In the single support phase, the mechanical system has $n+2$ DOF: $n-1$ actuated DOF associated with the joint coordinates, two unactuated DOF associated with the horizontal and vertical displacements of the robot in the sagittal plane, and one unactuated DOF associated with the orientation of the robot in the sagittal plane. With these assumptions, the generalized position vector of the system, $q_e$, can be split into two subsets $q$ and $r$:

$$q_e := (q^T, r^T)^T, \tag{1}$$

where $q := (q_0, q_1, \ldots, q_{n-1})^T$ collects the joint coordinates together with $q_0$, the unactuated DOF between the stance leg and the ground, and $r := (x, y)^T \in \mathbb{R}^2$ contains the Cartesian coordinates of the stance leg end.

A. Single support phase

Figure 1 depicts the single support phase and the configuration variables of a 5-link biped robot ($n=5$).

Figure 1. Single support phase and the configuration variables.

In the single support phase, the second-order dynamical model follows immediately from Lagrange's equation and the principle of virtual work [34]-[36]:

$$M_e(q_e)\,\ddot{q}_e + H_e(q_e,\dot{q}_e) + G_e(q_e) = B_e u - B_e F_e(q_e,\dot{q}_e) + J_e^{st}(q_e)^T F^{ext,st}, \tag{2}$$

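where $M_e(q_e) \in \mathbb{R}^{(n+2)\times(n+2)}$ is the symmetric and positive definite inertia matrix, $H_e(q_e,\dot{q}_e) \in \mathbb{R}^{n+2}$ includes the centrifugal and Coriolis terms and $G_e(q_e) \in \mathbb{R}^{n+2}$ is the vector containing the gravity terms. Also, $u := (u_1, u_2, \ldots, u_{n-1})^T \in \mathbb{R}^{n-1}$ includes the torques applied at the joints of the robot, $B_e \in \mathbb{R}^{(n+2)\times(n-1)}$ is the input matrix, $F_e(q_e,\dot{q}_e) \in \mathbb{R}^{n-1}$ includes the joint frictions modeled by viscous and static friction terms, and $J_e^{st}(q_e) := \partial r^{st}/\partial q_e \in \mathbb{R}^{2\times(n+2)}$ is the Jacobian at the stance leg end. Finally, $F^{ext,st} := (F_x^{ext,st}, F_y^{ext,st})^T \in \mathbb{R}^2$ is the ground reaction force at the stance leg end. Setting $q_e := (q^T, r^T)^T$ in (2), the dynamic equation of the mechanical system can be rewritten in the following form:

$$\begin{bmatrix} M_{11}(q) & M_{12}(q) \\ M_{12}(q)^T & m I_2 \end{bmatrix} \begin{bmatrix} \ddot{q} \\ \ddot{r} \end{bmatrix} + \begin{bmatrix} H_{e1}(q_e,\dot{q}_e) \\ H_{e2}(q_e,\dot{q}_e) \end{bmatrix} + \begin{bmatrix} G_{e1}(q_e) \\ G_{e2}(q_e) \end{bmatrix} = \begin{bmatrix} 0 \\ u \\ 0_{2\times 1} \end{bmatrix} - \begin{bmatrix} 0 \\ F(q,\dot{q}) \\ 0_{2\times 1} \end{bmatrix} + J_e^{st}(q_e)^T \begin{bmatrix} F_x^{ext,st} \\ F_y^{ext,st} \end{bmatrix}, \tag{3}$$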

where $m$ is the total mass of the robot. If we assume that the Cartesian coordinate frame is attached to the stance leg end and that the stance leg end is stationary (i.e. in contact with the ground and not slipping), these assumptions (i.e. $r=0$, $\dot{r}=0$, $\ddot{r}=0$) allow one to solve for the ground reaction force as an explicit function of $(q,\dot{q},u)$ [37], [38]. Under the same assumptions, the dynamic equation in (3) reduces to a lower-dimensional mechanical model that describes the single support phase when the stance leg end is stationary:

$$\begin{aligned} M(q)\,\ddot{q} + H(q,\dot{q}) + G(q) &= \begin{bmatrix} 0 \\ u \end{bmatrix} - \begin{bmatrix} 0 \\ F(q,\dot{q}) \end{bmatrix} \\ F^{ext,st} &= \Psi(q,\dot{q},u), \end{aligned} \tag{4}$$

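where $M(q) = M_{11}(q)$ and $\Psi(\cdot): TQ \times \mathbb{R}^{n-1} \to \mathbb{R}^2$ is a nonlinear mapping of $(q,\dot{q},u)$. Also, $TQ := \{x := (q^T,\dot{q}^T)^T \mid q \in Q,\ \dot{q} \in \mathbb{R}^n\}$ is the state space of the reduced model, where $Q$ is a simply connected, open subset of $[-\pi,\pi)^n$. Note that $q_0$ is an unactuated DOF in (4) and hence $\dim u < \dim q$. It can be shown that

$$\Psi(q,\dot{q},u) = \begin{bmatrix} \sum_{i=1}^{n} m_i \ddot{x}_i \\ \sum_{i=1}^{n} m_i (\ddot{y}_i + g) \end{bmatrix} = \begin{bmatrix} m\,\ddot{x}_{cm} \\ m(\ddot{y}_{cm} + g) \end{bmatrix}, \tag{5}$$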

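where $r_i := (x_i, y_i)^T$ and $r_{cm} := (x_{cm}, y_{cm})^T$ are the coordinates of the mass center of link $i$ and of the mass center of the robot, respectively, $m_i$ is the mass of link $i$ and $g$ is the gravitational acceleration. Writing $x_{cm} = f_1(q)$ and $y_{cm} = f_2(q)$, we have

$$\ddot{r}_{cm}(q,\dot{q},\ddot{q}) = \begin{bmatrix} \partial f_1/\partial q \\ \partial f_2/\partial q \end{bmatrix} \ddot{q} + \begin{bmatrix} \dot{q}^T (\partial^2 f_1/\partial q^2)\, \dot{q} \\ \dot{q}^T (\partial^2 f_2/\partial q^2)\, \dot{q} \end{bmatrix}. \tag{6}$$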

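Substituting $\ddot{q} = M(q)^{-1}(0, \bar{u}^T)^T - M(q)^{-1}(H(q,\dot{q}) + G(q))$, where $\bar{u} := u - F(q,\dot{q})$, from equation (4) into equation (6) and using equation (5), we have

$$\begin{aligned} \Psi(q,\dot{q},u) ={}& m J_c(q) M^{-1}(q) \begin{bmatrix} 0 \\ u \end{bmatrix} - m J_c(q) M^{-1}(q) \begin{bmatrix} 0 \\ F(q,\dot{q}) \end{bmatrix} - m J_c(q) M^{-1}(q) H(q,\dot{q}) \\ & - m J_c(q) M^{-1}(q) G(q) + m \begin{bmatrix} \dot{q}^T H_1(q)\, \dot{q} \\ \dot{q}^T H_2(q)\, \dot{q} \end{bmatrix} + \begin{bmatrix} 0 \\ m g \end{bmatrix}, \end{aligned} \tag{7}$$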

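where $J_c(q) := \partial r_{cm}/\partial q = \begin{bmatrix} \partial f_1/\partial q \\ \partial f_2/\partial q \end{bmatrix} \in \mathbb{R}^{2\times n}$ is the Jacobian matrix at the center of mass, $H_1(q) := \partial^2 f_1/\partial q^2 \in \mathbb{R}^{n\times n}$ and $H_2(q) := \partial^2 f_2/\partial q^2 \in \mathbb{R}^{n\times n}$. The validity of the reduced model in (4) depends on the following two conditions:

$$\begin{aligned} \text{i)}\quad & \ddot{y}_{cm} + g > 0 \\ \text{ii)}\quad & |\ddot{x}_{cm}| < \mu\, |\ddot{y}_{cm} + g|, \end{aligned} \tag{8}$$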

where $\mu$ is the static friction coefficient between the stance leg end and the ground. The first condition in (8) ensures that the stance leg end stays in contact with the walking surface, and the second condition ensures that the stance leg end does not slip [39]. In state-variable form, the dynamic equation (4) is expressed as $\dot{x} = f(x) + g(x)u$, where $x := (q^T,\dot{q}^T)^T \in TQ$ is the state vector. With $x_1 := q$ and $x_2 := \dot{q}$, we get $x = (x_1^T, x_2^T)^T$ and

$$\begin{aligned} f(x) &= \begin{bmatrix} x_2 \\ -M^{-1}(x_1) \left( H(x_1,x_2) + G(x_1) + \begin{bmatrix} 0 \\ F(x_1,x_2) \end{bmatrix} \right) \end{bmatrix} \\ g(x) &= \begin{bmatrix} 0_{n\times(n-1)} \\ M^{-1}(x_1) \begin{bmatrix} 0_{1\times(n-1)} \\ I_{n-1} \end{bmatrix} \end{bmatrix}. \end{aligned} \tag{9}$$
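The ground reaction force in (5) and the validity conditions in (8) can be checked numerically once the acceleration of the center of mass is known. The following is a minimal sketch, not the authors' implementation: the function names are hypothetical, the strict inequalities follow the reconstruction of (8) given here, and the friction coefficient in the usage note is an illustrative value that does not come from the chapter.

```python
G = 9.81  # gravitational acceleration (m/s^2)


def ground_reaction(m, xdd_cm, ydd_cm, g=G):
    """Ground reaction force (Fx, Fy) at the stance leg end, as in (5)."""
    return m * xdd_cm, m * (ydd_cm + g)


def stance_model_valid(xdd_cm, ydd_cm, mu, g=G):
    """Check the two validity conditions in (8).

    i)  ydd_cm + g > 0: the stance leg end stays on the ground
    ii) |xdd_cm| < mu * |ydd_cm + g|: the stance leg end does not slip
    """
    in_contact = ydd_cm + g > 0.0
    no_slip = abs(xdd_cm) < mu * abs(ydd_cm + g)
    return in_contact and no_slip
```

For example, with an assumed friction coefficient of 0.6, `stance_model_valid(1.0, 0.0, 0.6)` holds, while a downward acceleration of the center of mass larger than $g$ violates the contact condition.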

B. Frictionless impact model

In this section, the following assumptions are made for modeling the impact [40]:

A1. the impact is frictionless (i.e. $F(q,\dot{q}) = 0$). The main reason for this assumption is the difficulty introduced by dry friction [2];

A2. the impact is instantaneous;

A3. the reaction forces due to the impact at impact point can be modeled as impulses;

A4. the actuators at joints are not impulsive;

A5. the impulsive forces due to the impact may result in instantaneous change in the velocities, but there is no instantaneous change in the positions;

A6. impact results in no slipping and no rebound of the swing leg; and

A7. the stance foot lifts from the ground without interaction.

With these assumptions, the impact equation can be expressed as

$$M_e(q_e)\,\dot{q}_e(t^+) - M_e(q_e)\,\dot{q}_e(t^-) = J_e^{sw}(q_e)^T\, \delta F^{ext,sw}, \tag{10}$$

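where $\delta F^{ext,sw} := \int_{t^-}^{t^+} F^{ext,sw}(\tau)\, d\tau$ is the impulsive force at the impact point and $J_e^{sw}(q_e) := \partial r^{sw}/\partial q_e \in \mathbb{R}^{2\times(n+2)}$ is the Jacobian matrix at the swing leg end. Assumption A6 implies that the impact is plastic. Hence, the impact equation becomes

$$\begin{aligned} M_e(q_e)\,\dot{q}_e(t^+) - J_e^{sw}(q_e)^T\, \delta F^{ext,sw} &= M_e(q_e)\,\dot{q}_e(t^-) \\ J_e^{sw}(q_e)\,\dot{q}_e(t^+) &= 0. \end{aligned} \tag{11}$$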

This equation is solvable if the coefficient matrix has full rank. The determinant of the coefficient matrix is equal to $\det M_e(q_e) \times \det\left(J_e^{sw}(q_e)\, M_e(q_e)^{-1} J_e^{sw}(q_e)^T\right)$, and it can be shown that the coefficient matrix has full rank iff the robot is not in a singular position. The solution of (11) is given by

$$\begin{bmatrix} \dot{q}_e(t^+) \\ \delta F^{ext,sw} \end{bmatrix} = \Lambda(q_e) \begin{bmatrix} M_e(q_e)\,\dot{q}_e(t^-) \\ 0 \end{bmatrix}, \tag{12}$$

where

$$\Lambda(q_e) := \begin{bmatrix} M_e(q_e) & -J_e^{sw}(q_e)^T \\ J_e^{sw}(q_e) & 0 \end{bmatrix}^{-1}, \tag{13}$$

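and also $q_e(t^-) := (q(t^-)^T, 0, 0)^T$ and $q_e(t^+) := (q(t^+)^T, 0, 0)^T$. The map from $\dot{q}_e(t^-)$ to $\dot{q}_e(t^+)$ without relabeling is

$$\begin{aligned} \dot{q}_e(t^+) &= \Lambda_{11}(q_e)\, M_e(q_e)\,\dot{q}_e(t^-) \\ \delta F^{ext,sw} &= \Lambda_{21}(q_e)\, M_e(q_e)\,\dot{q}_e(t^-). \end{aligned} \tag{14}$$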
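The block system in (11) can be solved directly instead of forming the inverse in (13). The sketch below is illustrative only: `solve_linear` and `impact_map` are hypothetical helper names, matrices are plain nested lists, and a small Gaussian elimination stands in for whatever linear solver the original MATLAB simulation used.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular coefficient matrix")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x


def impact_map(m_e, j_sw, qd_minus):
    """Return (qd_plus, delta_f) from the block system in (11)."""
    n, k = len(m_e), len(j_sw)
    # Coefficient matrix [[M_e, -J^T], [J, 0]].
    top = [list(m_e[i]) + [-j_sw[r][i] for r in range(k)] for i in range(n)]
    bottom = [list(j_sw[r]) + [0.0] * k for r in range(k)]
    # Right-hand side [M_e * qd_minus, 0].
    rhs = [sum(m_e[i][c] * qd_minus[c] for c in range(n)) for i in range(n)]
    rhs += [0.0] * k
    sol = solve_linear(top + bottom, rhs)
    return sol[:n], sol[n:]
```

As a sanity check on a toy case (a point mass of 2 kg whose vertical velocity is constrained to zero after impact), `impact_map([[2.0, 0.0], [0.0, 2.0]], [[0.0, 1.0]], [1.0, -3.0])` keeps the horizontal velocity, removes the vertical velocity and returns the impulse that absorbs the vertical momentum.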

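After solving these equations, it is necessary to change the coordinates, since the former swing leg must now become the stance leg. The switch due to the transfer of the pivot to the point of contact is performed by the relabeling matrix $R \in \mathbb{R}^{n\times n}$ [39], [40]. Hence, we have

$$\begin{aligned} q(t^+) &= R\, q(t^-) \\ \dot{q}(t^+) &= R \begin{bmatrix} I_n & 0_{n\times 2} \end{bmatrix} \Lambda_{11}(q_e)\, M_e(q_e)\,\dot{q}_e(t^-). \end{aligned} \tag{15}$$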

The final result is an expression for $x^+$ in terms of $x^-$, which is written as [39]-[41]

$$x^+ = \Delta(x^-). \tag{16}$$

In equation (16), $\Delta(\cdot): S \to TQ$ is the impact mapping, where $S := \{(q,\dot{q}) \in TQ \mid y^{sw}(q) = 0\}$ is the set of points of the state space where the swing leg touches the ground. $x^+ := (q(t^+)^T, \dot{q}(t^+)^T)^T$ and $x^- := (q(t^-)^T, \dot{q}(t^-)^T)^T$ are the state vectors of the system after and before the impact, respectively. Also, we have

$$\Delta(x^-) := \begin{bmatrix} R\, x_1(t^-) \\ \Sigma(x_1(t^-))\, x_2(t^-) \end{bmatrix}, \tag{17}$$

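where $\Sigma(\cdot): Q \to \mathbb{R}^{n\times n}$ is defined by $\Sigma(x_1(t^-)) := R \begin{bmatrix} I_n & 0_{n\times 2} \end{bmatrix} \Lambda_{11}(q_e)\, M_e(q_e) \begin{bmatrix} I_n \\ 0_{2\times n} \end{bmatrix}$. The ground reaction force due to the impact can be written in the following form:

$$\delta F^{ext,sw} = \Gamma(x_1(t^-))\, x_2(t^-), \tag{18}$$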

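where $\Gamma(\cdot): Q \to \mathbb{R}^{2\times n}$ is defined by $\Gamma(x_1(t^-)) := \Lambda_{21}(q_e)\, M_e(q_e) \begin{bmatrix} I_n \\ 0_{2\times n} \end{bmatrix}$. The validity of the results of equation (17) depends on the following two conditions:

$$\begin{aligned} \text{i)}\quad & \Theta(x_1(t^-))\, x_2(t^-) \geq 0 \\ \text{ii)}\quad & \mu\, |\Gamma_2(x_1(t^-))\, x_2(t^-)| - |\Gamma_1(x_1(t^-))\, x_2(t^-)| \geq 0, \end{aligned} \tag{19}$$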

where $\Theta(x_1(t^-)) := J_y^{sw}(R\, x_1(t^-))\, \Sigma(x_1(t^-)) \in \mathbb{R}^{1\times n}$ and $J_y^{sw}(q) := \partial y^{sw}/\partial q \in \mathbb{R}^{1\times n}$. The first condition ensures that the swing foot lifts from the ground at $t^+$. The second condition ensures that the impact results in no slipping [39]. The valid results are used to re-initialize the model for the next step. Furthermore, the double support phase is assumed to be instantaneous. If we define

$$\Omega := \left\{ x = (x_1^T, x_2^T)^T \in S \;\middle|\; \Theta(x_1)\, x_2 \geq 0,\ \mu\, |\Gamma_2(x_1)\, x_2| - |\Gamma_1(x_1)\, x_2| \geq 0 \right\}, \tag{20}$$

the hybrid model of the mechanical system can be given by

$$\begin{aligned} \dot{x} &= f(x) + g(x)\, u, & x^- &\notin S \\ x^+ &= \Delta(x^-), & x^- &\in \Omega, \end{aligned} \tag{21}$$

where $x^-(t) := \lim_{\tau \to t^-} x(\tau)$. For $x^- \in S \setminus \Omega$, this model is not valid. Note also that the validity conditions in (8) cannot be expressed as a function of $x$ alone; they are a function of $(x,u)$.

3. Control system

Neural control of human locomotion is not yet fully understood, but there is considerable evidence suggesting that the main control of locomotion in vertebrates is performed by neural circuits in the spinal cord called central pattern generators (CPG), which are coupled with the musculoskeletal system. Together with reflexes, these central pattern generators can produce rhythmic movements such as walking, running and swimming.

A. Central pattern generator model

Several mathematical models have been proposed for the CPG. In this section, the neural oscillator model proposed by Matsuoka is used [24], [25]. In this model, each neural oscillator consists of two mutually inhibiting neurons (i.e. an extensor neuron and a flexor neuron). Each neuron is represented by the following nonlinear differential equations:

$$\begin{aligned} \tau\, \dot{u}_{\{e,f\}i} &= -u_{\{e,f\}i} + w_{fe}\, y_{\{f,e\}i} - \beta\, v_{\{e,f\}i} + u_0 + Feed_{\{e,f\}i} + \sum_{j=1}^{n} w_{\{e,f\}ij}\, y_{\{e,f\}j} \\ \tau'\, \dot{v}_{\{e,f\}i} &= -v_{\{e,f\}i} + y_{\{e,f\}i} \\ y_{\{e,f\}i} &= \max(0, u_{\{e,f\}i}), \end{aligned} \tag{22}$$

where the suffixes $f$ and $e$ denote the flexor muscle and the extensor muscle, respectively, and the suffix $i$ denotes the $i$th oscillator. $u_i$ is the inner state of the $i$th neuron, $y_i$ is the output of the $i$th neuron, $v_i$ is a variable that represents the degree of the self-inhibition effect of the $i$th neuron, $u_0$ is a constant external input from the brain and $Feed_i$ is a feedback signal from the mechanical system, which can be an angular position or an angular velocity. Moreover, $\tau$ and $\tau'$ are the time constants associated with $u_i$ and $v_i$, respectively, $\beta$ is a constant representing the degree of the self-inhibition influence on the inner state and $w_{ij}$ is the connecting weight between the $i$th and $j$th neurons. Finally, the output of the neural oscillator is a linear combination of the inner states of the extensor neuron and the flexor neuron:

$$y_{NO,i} = -p_e\, u_{e,i} + p_f\, u_{f,i}. \tag{23}$$

A positive or negative value of $y_{NO,i}$ corresponds to activity of the flexor or the extensor muscle, respectively. The output of the neural oscillator can be used as a reference trajectory, a joint torque or a phase. In this chapter, it is used as a reference trajectory at the joints. The studied robot (see Fig. 1) has four actuated joints (i.e. the hip and knee joints of the legs). We assume that one neural oscillator generates the reference trajectory at each actuated joint.

B. Tuning of the CPG parameters

The walking period is a very important factor, since it strongly influences stability, maximum speed and energy consumption. The walking mechanism has its own natural frequency, determined mainly by the lengths of the links of the legs. It appears that humans exploit the natural frequencies of their arms, swinging them like pendulums at comfortable frequencies equal to the natural frequencies [26]. Human arms can be thought of as masses connected by springs, whose frequency response makes the energy and the control required to move the arm vary with frequency [26]. Humans certainly learn to exploit the dynamics of their limbs for rhythmic tasks [42], [43]. Robotic examples of this idea include open-loop stable systems in which the dynamics are exploited to give systems that require little or no active control for stable operation (e.g. PDW [11]). At the resonant frequency, the controller need only inject a small amount of energy to maintain the vibration of the mass of the arm segment on the spring of the muscles and tendons. Extracting and using the natural frequency of the links is therefore a desirable property of a robot controller. For these reasons, we match the endogenous frequency of each neural oscillator with the resonant frequency of the corresponding link. Moreover, when the swinging or supporting motions of the legs are close to the free motion, no additional acceleration or deceleration is needed and the motion is efficient [44]. The endogenous frequency of the CPG is its frequency when no input is applied, and it is mainly determined by $\tau$ and $\tau'$. In this section, we change the value of $\tau$ while keeping the ratio $\tau/\tau'$ constant. In this case, the endogenous frequency of the CPG is proportional to $1/\tau$. It was pointed out that the proper value of $\tau/\tau'$ for stable oscillation lies within $[0.1, 0.5]$ [42]. After the time constants of the CPG are tuned, the other parameters can be tuned by using the necessary conditions for free oscillation, which can be written as [24], [25]

$$\begin{aligned} \text{i)}\quad & \beta > -w_{fe} - 1 \\ \text{ii)}\quad & w_{fe} < -(1 + \tau/\tau') \\ \text{iii)}\quad & u_0 > 0. \end{aligned} \tag{24}$$

Table 1 specifies the lengths, masses and inertias of the links of the robot studied in this chapter [3]. Using these data, we extract the resonant frequencies of the links and match the endogenous frequency of the CPG with the resonant frequency of each link. In this case, $\tau$ is set to 0.13 s and $\tau' = 1.53\tau = 0.2$ s for all of the neural oscillators. According to the conditions in (24), we tune $\beta$ and $w_{fe}$ to 2 and -2, respectively, and $u_0$ is set to 5. The amplitude of the output signal of the CPG is approximately proportional to $u_0$, $p_e$ and $p_f$. The output parameters of the CPGs (i.e. $p_e$ and $p_f$ of the oscillators at the knee and hip joints) can be determined from the amplitude of the desired walking motion. Table 2 specifies the designed values of the output parameters of the oscillators at the knee and hip joints of the robot.

         mass (kg)   length (m)   inertia (kg m^2)
Torso    12.00       0.625        1.33
Femur     6.80       0.40         0.47
Tibia     3.20       0.40         0.20

Table 1. The parameters of the robot.

        knee    hip
p_f     0.11    0.15
p_e     0.01    0.02

Table 2. The output parameters of the CPG.
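The behaviour of a single neural oscillator with the parameter values quoted above can be explored with a short simulation. The following sketch integrates (22) with forward Euler for one isolated oscillator, with the feedback and the coupling to other oscillators set to zero. It is not the authors' code: the function names, the step size, the initial state and the signs used in (23) and (24) are assumptions of this reconstruction, and the hip values of $p_e$ and $p_f$ from Table 2 are used as defaults.

```python
def simulate_oscillator(tau=0.13, tau_prime=0.2, beta=2.0, w_fe=-2.0,
                        u0=5.0, p_e=0.02, p_f=0.15, t_end=10.0, dt=1e-3):
    """Integrate one isolated neural oscillator (22) with forward Euler.

    Feedback and coupling to other oscillators are set to zero.
    Returns the list of outputs y_NO from (23), one per time step.
    """
    # A small asymmetry in the initial state starts the oscillation.
    u_e, u_f, v_e, v_f = 0.1, 0.0, 0.0, 0.0
    outputs = []
    for _ in range(int(round(t_end / dt))):
        y_e, y_f = max(0.0, u_e), max(0.0, u_f)
        du_e = (-u_e + w_fe * y_f - beta * v_e + u0) / tau
        du_f = (-u_f + w_fe * y_e - beta * v_f + u0) / tau
        dv_e = (-v_e + y_e) / tau_prime
        dv_f = (-v_f + y_f) / tau_prime
        u_e += dt * du_e
        u_f += dt * du_f
        v_e += dt * dv_e
        v_f += dt * dv_f
        outputs.append(-p_e * u_e + p_f * u_f)
    return outputs


def free_oscillation_ok(tau, tau_prime, beta, w_fe, u0):
    """Necessary conditions (24) for free oscillation."""
    return (beta > -w_fe - 1.0
            and w_fe < -(1.0 + tau / tau_prime)
            and u0 > 0.0)
```

With the chapter's values ($\tau = 0.13$, $\tau' = 0.2$, $\beta = 2$, $w_{fe} = -2$, $u_0 = 5$), `free_oscillation_ok` returns `True` and the simulated output keeps changing sign, which is the free oscillation the tuning rules are meant to guarantee.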

4. Feedback network

It is well known in biology that the CPG network, together with feedback signals from the body, can coordinate the members of the body, but there is not yet a suitable biological model of the feedback network. The control loop used in this section is shown in Fig. 2, where $\tilde{\theta} := (q_1, q_2, q_3, q_4)^T$ collects the actuated joint coordinates; there is no feedback signal from the unactuated DOF (i.e. $q_0$). The feedback network in this control loop provides autonomous adaptation of the CPG network. In other words, by using the feedback network, the CPG network (i.e. the higher level of the control system) can correct its outputs (i.e. the reference trajectories) under various conditions of the robot.

In animals, the stretch reflexes act as a feedback loop [44]. In this section, the feedback signals to the CPG neurons of the hip joints are the tonic stretch reflexes [22], [23]

$$\begin{aligned} Feed_{e,h} &= k_{tsr,h}\, (\theta_{hip} - \theta_{0,hip}) \\ Feed_{f,h} &= -k_{tsr,h}\, (\theta_{hip} - \theta_{0,hip}), \end{aligned} \tag{25}$$

where $k_{tsr,h}$ is a constant and $\theta_{0,hip}$ is the neutral point of this feedback loop at the hip joints. We tune $k_{tsr,h}$ and $\theta_{0,hip}$ to 1 and 0 rad, respectively.

One of the important factors in the control of walking is the coordination of the knee and hip joints in each leg. To tune the phase difference between the oscillators of the knee and hip joints in each leg, we propose the following feedback structure, which is applied only at the oscillators of the knee joints:

$$\begin{aligned} Feed_{e,k} &= k_{tsr,k}\, (\theta_{knee} - \theta_{0,knee})\, u(-\dot{\theta}_{hip}) + k_f\, (\theta_{hip} - \bar{\theta}_{0,hip})\, u(\dot{\theta}_{hip}) \\ Feed_{f,k} &= -k_{tsr,k}\, (\theta_{knee} - \theta_{0,knee})\, u(-\dot{\theta}_{hip}) - k_f\, (\theta_{hip} - \bar{\theta}_{0,hip})\, u(\dot{\theta}_{hip}), \end{aligned} \tag{26}$$

Figure 2. The control loop used for the biped walker.

where $k_{tsr,k}$ and $k_f$ are constants, $\theta_{0,knee}$ is the neutral point of the tonic stretch reflex signal at the knee joints and $u(\cdot)$ is the unit step function. The first terms of the feedback signals in (26) are the tonic stretch reflex terms. These terms are active in the stance phase (i.e. $\dot{\theta}_{hip} < 0$). With these terms, we force the mechanical system to hold the stance knee at a certain angular position (i.e. $\theta_{0,knee}$) during the single support phase, like the knee joints of a human being. We call $\theta_{0,knee}$ the bias of the stance knee. In this section, we tune $k_{tsr,k}$ and $\theta_{0,knee}$ to 10 and 0.1 rad, respectively. The second terms in (26) are active in the swing phase (i.e. $\dot{\theta}_{hip} > 0$). These terms force the knee oscillator to increase its output at the beginning of the swing phase (i.e. $\theta_{hip} < \bar{\theta}_{0,hip}$) and to decrease its output at the end of the swing phase (i.e. $\theta_{hip} > \bar{\theta}_{0,hip}$). We tune $k_f$ and $\bar{\theta}_{0,hip}$ to 4 and 0 rad, respectively.
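The feedback laws (25) and (26) are simple enough to write down directly. The sketch below uses the gains and neutral points quoted in this section as defaults. The function names are hypothetical, the signs follow the reconstruction of (25) and (26) given here, and the value of the unit step at zero, which the chapter does not specify, is taken as 0.

```python
def unit_step(s):
    """Unit step u(s); the value at s = 0 is taken as 0 (an assumption)."""
    return 1.0 if s > 0.0 else 0.0


def hip_feedback(theta_hip, k_tsr_h=1.0, theta0_hip=0.0):
    """Tonic stretch reflex (25); returns (Feed_e, Feed_f)."""
    feed = k_tsr_h * (theta_hip - theta0_hip)
    return feed, -feed


def knee_feedback(theta_knee, theta_hip, theta_hip_dot,
                  k_tsr_k=10.0, theta0_knee=0.1, k_f=4.0, theta0_hip_bar=0.0):
    """Knee feedback (26); returns (Feed_e, Feed_f)."""
    # Stretch reflex term, active in the stance phase (hip velocity < 0).
    stance = k_tsr_k * (theta_knee - theta0_knee) * unit_step(-theta_hip_dot)
    # Hip-dependent term, active in the swing phase (hip velocity > 0).
    swing = k_f * (theta_hip - theta0_hip_bar) * unit_step(theta_hip_dot)
    feed = stance + swing
    return feed, -feed
```

For instance, `knee_feedback(0.3, 0.5, -1.0)` evaluates only the stance term, since the hip velocity is negative, whereas `knee_feedback(0.3, -0.25, 1.0)` evaluates only the swing term.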

5. Tuning of the weights in the CPG network

In the control loop discussed above, the coordination and the phase differences among the links of the biped robot are set by the synaptic weights of the connections in the CPG network. There are two kinds of connections in the CPG network: connections among the flexor neurons and connections among the extensor neurons. The neural oscillators in the CPG network can be relabeled as shown in Fig. 3. According to this relabeling, NO1, NO2, NO3 and NO4 correspond to the right knee, the right hip, the left hip and the left knee neural oscillators, respectively.

Figure 3. The CPG network and the synaptic connections.

We denote the weight matrices among the flexor neurons and among the extensor neurons by $W_f$ and $W_e$, respectively. Owing to the symmetry between the right leg and the left leg, these matrices can be written in the following form:

$$W_{\{f,e\}} = \begin{bmatrix} 0 & w_{\{f,e\},(1,2)} & w_{\{f,e\},(1,3)} & w_{\{f,e\},(1,4)} \\ w_{\{f,e\},(2,1)} & 0 & w_{\{f,e\},(2,3)} & w_{\{f,e\},(2,4)} \\ w_{\{f,e\},(2,4)} & w_{\{f,e\},(2,3)} & 0 & w_{\{f,e\},(2,1)} \\ w_{\{f,e\},(1,4)} & w_{\{f,e\},(1,3)} & w_{\{f,e\},(1,2)} & 0 \end{bmatrix}. \tag{27}$$

This symmetry can be expressed by the following equations:

$$\begin{aligned} w_{\{f,e\},(i,j)} &= w_{\{f,e\},(5-i,5-j)}, & i,j &= 1, \ldots, 4 \\ w_{\{f,e\},(i,i)} &= 0, & i &= 1, \ldots, 4. \end{aligned} \tag{28}$$
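The structure of (27) follows mechanically from the symmetry rule (28), which the following sketch makes explicit. The function name and the use of nested lists are illustrative choices, not part of the chapter.

```python
def cpg_weight_matrix(w12, w13, w14, w21, w23, w24):
    """Build the 4x4 weight matrix (27) from the six free weights."""
    w = [[0.0] * 4 for _ in range(4)]
    w[0][1], w[0][2], w[0][3] = w12, w13, w14
    w[1][0], w[1][2], w[1][3] = w21, w23, w24
    # Left-right symmetry (28): w(i,j) = w(5-i,5-j) with 1-based indices,
    # which is w[i][j] = w[3-i][3-j] with 0-based indices.
    for i in (2, 3):
        for j in range(4):
            w[i][j] = w[3 - i][3 - j]
    return w
```

Only the first two rows are free parameters; the last two rows are the first two rows read in reverse order, which is why six weights are enough to describe the whole network.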

In this chapter, we assume $W_f = W_e$. With this assumption and the symmetry between the legs, there are six unknown weights to be determined (bold lines in Fig. 3). To tune the unknown weights of the CPG network, we need a notion of stability for biped robots. However, stability and stability margins for biped robots are difficult to define precisely, especially for underactuated biped robots with point feet. Since the robot discussed in this chapter has point feet, the ZMP heuristic is not applicable for trajectory generation or for verifying the dynamic feasibility of trajectories during execution. In addition, the magnitudes of the eigenvalues of the Poincaré return map may be sufficient for analyzing periodic bipedal walking, but they are not sufficient for analyzing nonperiodic motions, such as walking over discontinuous rough terrain. Large disruptions from a limit cycle, such as those caused by a push, cannot be analyzed with this technique either. Some researchers [45] have suggested that the angular momentum about the Center of Mass (CoM) should be minimized throughout a motion. As studied in [20], minimizing the angular momentum about the CoM is neither a necessary nor a sufficient condition for stable walking. For these reasons, to tune the weights of the CPG network, we define the control problem of underactuated biped walking as an optimization problem. The unknown weights are determined by finding the optimal solution of this problem. The total cost function of the optimization problem is defined as a weighted sum of sub cost functions:

$$J(X) := a_1 J_1(X) + a_2 J_2(X) + a_3 J_3(X), \tag{29}$$

where

$$X := (w_{(1,2)}, w_{(1,3)}, w_{(1,4)}, w_{(2,1)}, w_{(2,3)}, w_{(2,4)})^T \tag{30}$$

and $X \in [-0.5, 0.5]^6$. Also, $a_i$, $i = 1, 2, 3$, are positive weights. The first sub cost function in (29) measures the distance travelled by the robot in the sagittal plane relative to the desired distance:

$$J_1(X) := \frac{1}{D_m} \sum_{T_1 + \ldots + T_i \leq t_f} sl(T_i), \tag{31}$$

where $sl(T_i)$ is the step length of the $i$th step, $T_i$ is the time duration of the $i$th step, $D_m$ is an upper bound on the travelled distance and $t_f$ is the duration of the simulation. This sub cost function is a good criterion of stability.

The second sub cost function in (29) is defined as the least value of the normalized height of the CoM of the mechanical system during the simulation:

$$J_2(X) := \min_{t \in [0, t_f]} \frac{y_{cm}(t)}{y_{cm,max}}, \tag{32}$$

where $y_{cm,max}$ is the height of the CoM when the vector $q$ is equal to zero. Since the biped should maintain an erect posture during locomotion, this sub cost function serves as a criterion of erect body posture.

Regulating the rate of change of the angular momentum about the CoM is not a good indicator of whether a biped will fall, but the reserve in angular momentum that can be used to help recover from a push or another disturbance is important. We use the rate of change of the angular momentum about the CoM to define the third sub cost function.

Figure 4. The virtual inverted pendulum.

Setting $x_{cm} = l \sin\varphi$ and $y_{cm} = l \cos\varphi$ in equation (5), where $l$ is the distance from the stance leg end to the CoM and $\varphi$ is the angle of the line from the stance leg end to the CoM, measured from the vertical (see Fig. 4), equation (5) becomes

$$\begin{aligned} m l^2 \ddot{\varphi} + 2 m l\, \dot{l}\, \dot{\varphi} - m g l \sin\varphi &= -l F_{\perp} \\ m \ddot{l} - m l \dot{\varphi}^2 + m g \cos\varphi &= F_l, \end{aligned} \tag{33}$$

where $F_l := F_x^{ext,st} \sin\varphi + F_y^{ext,st} \cos\varphi$ and $F_{\perp} := -F_x^{ext,st} \cos\varphi + F_y^{ext,st} \sin\varphi$. The total angular momentum about the stance leg end consists of the angular momentum of the CoM rotating about the stance foot plus the angular momentum about the CoM:

$$H_{tot} = m l^2 \dot{\varphi} + H_{cm}, \tag{34}$$

where $H_{tot}$ and $H_{cm}$ are the angular momenta about the stance leg end and about the CoM, respectively. The net rate of change of the angular momentum is $\dot{H}_{tot} = m g x_{cm} = m g l \sin\varphi$ [3], [20]. Differentiating equation (34), setting $\dot{H}_{tot} = m g l \sin\varphi$ in it and comparing with equation (33), it can be shown that

$$\dot{H}_{cm} = l F_{\perp}. \tag{35}$$

Hence, the third sub cost function is defined as

$$J_3(X) := \frac{1}{1 + \int_0^{t_f} |\dot{H}_{cm}(t)|^2\, dt} = \frac{1}{1 + \int_0^{t_f} |l(t)\, F_{\perp}(t)|^2\, dt}. \tag{36}$$
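Given logged simulation data, the three sub cost functions and their weighted sum can be evaluated in a few lines. The sketch below is an assumption-laden illustration: the function name, the argument layout and the rectangle-rule approximation of the integral in (36) are choices made here, and the default weights are the values $a_1 = 4$, $a_2 = 1$, $a_3 = 1$ used in the chapter.

```python
def total_fitness(step_lengths, com_heights, hdot_cm, dt, d_max, y_cm_max,
                  a=(4.0, 1.0, 1.0)):
    """Evaluate J = a1*J1 + a2*J2 + a3*J3 from sampled simulation data.

    step_lengths: lengths sl(T_i) of the steps completed before t_f (m)
    com_heights:  samples of y_cm(t) on [0, t_f] (m)
    hdot_cm:      samples of the rate of change of H_cm, one every dt seconds
    """
    j1 = sum(step_lengths) / d_max                  # travelled distance, (31)
    j2 = min(com_heights) / y_cm_max                # lowest CoM height, (32)
    integral = sum(h * h for h in hdot_cm) * dt     # rectangle rule
    j3 = 1.0 / (1.0 + integral)                     # momentum term, (36)
    return a[0] * j1 + a[1] * j2 + a[2] * j3
```

All three terms grow as the gait improves (longer distance, higher CoM, smaller momentum rate), so a larger value of $J$ corresponds to a better candidate $X$.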

In this chapter, $a_1 = 4$, $a_2 = 1$ and $a_3 = 1$, and the control problem of biped walking is defined as finding the optimal solution of the following optimization problem:

$$\max_{X \in [-0.5,\, 0.5]^6} J(X). \tag{37}$$

The optimal solution is determined with a genetic algorithm, an evolutionary algorithm based on natural selection. In this section, each generation contains 400 chromosomes; at the end of each generation, 50% of the chromosomes are preserved and the others are discarded. The roulette strategy is employed for selection, and 100 selections are made with this strategy. By applying one-point crossover, 200 new chromosomes are produced. Mutation is applied to all chromosomes with a probability of 10%, except for the elite chromosome, which has the highest fitness. Each parameter is encoded in 8 bits.
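One generation of the scheme described above might look as follows. This is a sketch under stated assumptions rather than the authors' MATLAB implementation: all names are hypothetical, each of the 100 roulette selections is interpreted as drawing a pair of parents that yields two children, mutation is interpreted as flipping one random bit of a chromosome with probability 10%, the 8-bit quantization of $[-0.5, 0.5)$ is assumed, and the fitness function is assumed to be positive.

```python
import random

BITS, N_PARAMS = 8, 6          # 8 bits per weight, six unknown weights
POP_SIZE, N_KEEP = 400, 200    # 50% of each generation is preserved
P_MUTATION = 0.10


def decode(chromosome):
    """Map each 8-bit gene to a weight in [-0.5, 0.5) (assumed quantization)."""
    weights = []
    for k in range(N_PARAMS):
        gene = chromosome[k * BITS:(k + 1) * BITS]
        value = int("".join(map(str, gene)), 2)
        weights.append(-0.5 + value / 2 ** BITS)
    return weights


def roulette_pick(population, fitnesses, rng):
    """Pick one chromosome with probability proportional to its fitness."""
    return rng.choices(population, weights=fitnesses, k=1)[0]


def next_generation(population, fitness, rng):
    """Produce the next generation; index 0 of the result is the elite."""
    ranked = sorted(population, key=lambda c: fitness(decode(c)), reverse=True)
    survivors = [list(c) for c in ranked[:N_KEEP]]
    scores = [fitness(decode(c)) for c in survivors]
    children = []
    for _ in range(100):                            # 100 parent-pair selections
        mother = roulette_pick(survivors, scores, rng)
        father = roulette_pick(survivors, scores, rng)
        cut = rng.randrange(1, BITS * N_PARAMS)     # one-point crossover
        children.append(mother[:cut] + father[cut:])
        children.append(father[:cut] + mother[cut:])
    new_population = survivors + children
    for chromosome in new_population[1:]:           # the elite is not mutated
        if rng.random() < P_MUTATION:
            pos = rng.randrange(len(chromosome))
            chromosome[pos] ^= 1                    # flip one random bit
    return new_population
```

Because the elite chromosome is copied unchanged into the next generation, the best fitness in the population cannot decrease from one generation to the next. In the chapter's setting, `fitness` would run the closed-loop simulation for the decoded weights and return $J(X)$.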

6. Simulation results

In this section, a 5-link planar biped robot is simulated in the MATLAB environment. Table 1 specifies the lengths, masses and inertias of the links of the robot. This is the model of RABBIT [3]. RABBIT has 50:1 gear reducers between its motors and links. In this biped robot, the joint friction is modeled by viscous and static friction terms as $F(q,\dot{q}) := F_v \dot{q} + F_s\, \mathrm{sgn}(\dot{q})$. Joint PI controllers are used as servo controllers. Because of the abrupt changes caused by the impacts in the hybrid model, the servo controllers do not include derivative terms. We have chosen $P_H = 30$, $P_K = 30$, $I_H = 10$ and $I_K = 10$ for the servo controllers at the hip and knee joints. In the optimization problem, we set $D_m = 10$ m and $t_f = 10$ s. Using the genetic algorithm, the optimal solution of the optimization problem in (37) is found after 115 generations. It is equal to $X = (0.063, 0.429, 0.172, 0.141, 0.109, 0.016)^T$.
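A discrete-time version of the PI servo and of the friction model can be sketched as follows. The class and function names are hypothetical, the integral is approximated with the rectangle rule, and the sample time and friction coefficients in the usage note are illustrative values that are not given in the chapter.

```python
class PIServo:
    """Joint PI controller: u = P*e + I*integral(e), with e = target - angle."""

    def __init__(self, p_gain, i_gain):
        self.p_gain = p_gain
        self.i_gain = i_gain
        self.integral = 0.0

    def update(self, target, angle, dt):
        """Advance the integrator by dt and return the control signal."""
        error = target - angle
        self.integral += error * dt      # rectangle-rule integration
        return self.p_gain * error + self.i_gain * self.integral


def joint_friction(qd, f_v, f_s):
    """Viscous plus static friction F = F_v*qd + F_s*sgn(qd)."""
    sign = (qd > 0) - (qd < 0)
    return f_v * qd + f_s * sign
```

A hip servo with the gains quoted above would be created as `PIServo(30.0, 10.0)` and called once per simulation step with the CPG output as `target` and the measured joint angle as `angle`.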

Figure 5. The snapshots of one step for the biped robot with the best fitness.

The period of the neural oscillators in the biped robot with the best fitness is 1.10 (s). The time between consecutive impacts for this robot is $T = 0.55$ (s), and the step length (the distance between consecutive impacts) is $s_l = 0.33$ (m). The snapshots of one step of the best biped robot on its limit cycle are depicted in Fig. 5, where the left leg is taking a step forward. The swing leg performs a full swing and leaves sufficient ground clearance for the foot to be transferred to its new location. In Fig. 6, the CPG outputs and the joint angle positions of the leg joints during 10 (s) are shown with dashed lines and solid lines, respectively. Figure 7 depicts the phase plot and the limit cycle of joint angle vs. velocity at the unactuated joint ($q_0$-$\dot{q}_0$ plane) during 10 (s). Fig. 8 depicts the limit cycles in the phase plots of the leg joints during 10 (s).
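As a quick consistency check on the reported values (a derived illustration, not a result stated by the authors), the average forward speed follows from the step length and the time between impacts, and the oscillator period can be compared with the step time. The symbol $\bar{v}$ is introduced here only for this example.

```latex
\bar{v} = \frac{s_l}{T} = \frac{0.33\ \text{m}}{0.55\ \text{s}} = 0.6\ \text{m/s},
\qquad
\frac{1.10\ \text{s}}{T} = \frac{1.10}{0.55} = 2 .
```

The ratio of 2 suggests that one period of the neural oscillators spans two steps, that is, one full stride.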

Figure 6. The CPG outputs and the joint angle positions of the leg joints during 10 (s).

Figure 7. The phase plot of joint angle vs. velocity at the unactuated joint ($q_0$-$\dot{q}_0$ plane) during 10 (s).

Figure 8. The phase plots of joint angle vs. velocity at the leg joints during 10 (s).

The control signals of the servo controllers during 10 (s) are depicted in Fig. 9. The validity of the reduced single support phase model and of the impact model can be checked from the ground reaction forces plotted in Fig. 10.

Figure 9. The control signals of the servo controllers during 10 (s).

Figure 10. The ground reaction forces at the leg ends during 10 (s).

To evaluate the robustness of the limit cycle of the closed-loop system, an external force is applied to the body of the biped robot as a disturbance. We assume that the force is applied at the center of mass of the torso and is given by $F_d(t) := F_d\,\big(u(t - t_d) - u(t - t_d - \Delta t_d)\big)$, where $F_d$ is the disturbance amplitude, $t_d$ is the time at which the disturbance is applied, $\Delta t_d$ is the duration of the pulse and $u(\cdot)$ is the unit step function. Fig. 11 shows the stick figure of the robot for a pulse with amplitude $F_d = 25$ (N) and duration $\Delta t_d = 0.5$ (s) applied at $t_d = 3$ (s). This figure shows the robustness of the limit cycle to the disturbance. Fig. 12 shows the stable limit cycle at the unactuated joint. Figure 13 shows, as a function of pulse duration, the maximum amplitudes of the positive and negative pulses that do not cause the robot to fall.
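The rectangular disturbance pulse defined above can be written directly as a difference of two unit steps. In this minimal sketch the function names are hypothetical, and the convention $u(0) = 1$ is an assumption, since the text does not specify the value of the step at zero.

```python
def unit_step(t):
    """Unit step function; the value u(0) = 1 is an assumed convention."""
    return 1.0 if t >= 0.0 else 0.0


def disturbance_force(t, amplitude, t_start, duration):
    """Pulse F_d * (u(t - t_d) - u(t - t_d - dt_d)) applied to the torso CoM."""
    return amplitude * (unit_step(t - t_start)
                        - unit_step(t - t_start - duration))


# Pulse used for Fig. 11: 25 N for 0.5 s starting at t = 3 s.
samples = [disturbance_force(t, 25.0, 3.0, 0.5) for t in (2.9, 3.2, 3.6)]
```

For the pulse used in Fig. 11, the force is 25 (N) between 3 (s) and 3.5 (s) and zero elsewhere, which corresponds to an impulse of 25 × 0.5 = 12.5 (N·s).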

Figure 11. Stick figure of the robot.

Figure 12. The phase plot of joint angle vs. velocity at the unactuated joint.

Figure 13. Maximum amplitude of the pulse vs. pulse duration.

7. Conclusion

In this chapter, a hybrid model was used for the underactuated biped walker. This model consists of the single support phase and an instantaneous impact phase; the double support phase is also assumed to be instantaneous. To control the robot in underactuated walking, a CPG network and a new feedback network were used. It was shown that the period of the CPG is the most important factor influencing the stability of the biped walker. Biological experiments show that humans exploit the natural frequencies of their arms, swinging pendulums at comfortable frequencies equal to the natural frequencies. Extracting and using the natural frequencies of the links is therefore a desirable property of a robot controller. Accordingly, we match the endogenous frequency of each neural oscillator with the resonant frequency of the corresponding link. In this way, the swinging or supporting motion of each leg is closer to the free motion of a pendulum or an inverted pendulum, respectively, and the motion is more effective.

It is well known in biology that a CPG network with feedback signals from the body can coordinate the members of the body, but there is not yet a suitable biological model for the feedback network. In this chapter, we use the tonic stretch reflex model as the feedback signal at the hip joints of the biped walker, as studied before. However, one of the most important factors in the control of walking is the coordination, or phase difference, between the knee and hip joints of each leg. We address this difficulty by introducing a new feedback structure for the knee joint oscillators. This structure forces the mechanical system to hold the stance knee at a constant value during the single support phase. It also forces the swing knee oscillator to increase its output at the beginning of the swing phase and to decrease its output at the end of the swing phase.

The links of the biped robot are coordinated through the weights of the connections in the CPG network. To tune the synaptic weight matrix of the CPG network, we define the control problem of the biped walker as an optimization problem. The total cost function is a sum of sub-cost functions, each of which evaluates a different criterion of walking: the distance travelled by the biped robot in the sagittal plane, the height of the CoM and the regulation of the angular momentum about the CoM. This problem is solved with a genetic algorithm, which determines the synaptic weight matrix of the CPG network for the biped walker with the best fitness. Simulation results show that such a control loop can produce a stable and robust limit cycle in the walking of the biped walker. These results also show the ability of the proposed feedback network to correct the CPG outputs. Finally, this chapter shows that using the resonant frequencies of the links reduces the number of unknown parameters in the CPG network, which makes the genetic algorithm easier to apply.

References

1 - J. W. Grizzle, C. Moog, and C. Chevallereau, “Nonlinear control of mechanical systems with an unactuated cyclic variable,” IEEE Transactions on Automatic Control, vol. 30, no. 5, pp. 559-576, May 2005.
2 - Y. Hurmuzlu, F. Genot, and B. Brogliato, “Modeling, stability and control of biped robots-a general framework,” Automatica, vol. 40, pp. 1647-1664, 2004.
3 - C. Chevallereau, G. Abba, Y. Aoustin, F. Plestan, E. R. Westervelt, C. Canudas-de-Wit, and J. W. Grizzle, “RABBIT: A testbed for advanced control theory,” IEEE Control Systems Magazine, vol. 23, no. 5, pp. 57-79, October 2003.
4 - M. Vukobratovic and D. Juricic, “Contribution on the synthesis of biped gait,” IEEE Transactions on Biomedical Engineering, vol. 16, no. 1, pp. 1-6, 1969.
5 - A. Takanishi, M. Ishida, Y. Yamazaki, and I. Kato, “The realization of dynamic walking robot WL-10RD,” Int. Conf. Advanced Robotics, pp. 459-466, 1985.
6 - C. L. Shih, Y. Z. Li, S. Churng, T. T. Lee, and W. A. Gruver, “Trajectory synthesis and physical admissibility for a biped robot during the single support phase,” IEEE Int. Conf. Robotics and Automation, pp. 1646-1652, 1990.
7 - K. Hirai, M. Hirose, Y. Haikawa, and T. Takenaka, “The development of Honda humanoid robot,” IEEE Int. Conf. Robotics and Automation, pp. 1321-1326, 1998.
8 - A. Dasgupta and Y. Nakamura, “Making feasible walking motion of humanoid robots from human motion capture data,” IEEE Int. Conf. Robotics and Automation, pp. 1044-1049, 1999.
9 - A. Goswami, “Postural stability of biped robots and the foot rotation indicator (FRI) point,” International Journal of Robotic Research, vol. 18, no. 6, pp. 523-533, June 1999.
10 - F. Plestan, J. W. Grizzle, E. R. Westervelt, and G. Abba, “Stable walking of a 7-DOF biped robot,” IEEE Transactions on Robotics and Automation, vol. 19, no. 4, pp. 653-668, August 2003.
11 - T. McGeer, “Passive dynamic walking,” International Journal of Robotic Research, vol. 9, no. 2, pp. 62-82, 1990.
12 - E. R. Westervelt, J. W. Grizzle, and D. E. Koditschek, “Hybrid zero dynamics of planar biped walkers,” IEEE Transactions on Automatic Control, vol. 48, no. 1, pp. 42-56, January 2003.
13 - A. Isidori, Nonlinear Control Systems: An Introduction, 3rd ed. Berlin, Germany: Springer-Verlag, 1995.
14 - A. Isidori and C. Moog, “On the nonlinear equivalent of the notion of transmission zeros,” in Proc. IIASA Conf.: Modeling Adaptive Control, C. Byrnes and A. Kurzhanski, Eds., Berlin, Germany, pp. 146-157, 1988.
15 - J. Guckenheimer and P. Holmes, Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields, corrected second printing ed., ser. Applied Mathematical Sciences, vol. 42. New York: Springer-Verlag, 1996.
16 - B. Morris and J. W. Grizzle, “A restricted Poincaré map for determining exponentially stable periodic orbits in systems with impulse effects: Application to bipedal robots,” in Proc. of IEEE 2005 Conference on Decision and Control, 2005.
17 - S. Nersesov, V. Chellaboina, and W. Haddad, “A generalization of Poincaré’s theorem to hybrid and impulsive dynamical systems,” Int. J. Hybrid Systems, vol. 2, pp. 35-51, 2002.
18 - Y. Hurmuzlu, “Dynamics of bipedal gait-part II: stability analysis of a planar five-link biped,” Journal of Applied Mechanics, vol. 60, pp. 337-343, June 1993.
19 - J. H. Choi and J. W. Grizzle, “Feedback control of an underactuated planar bipedal robot with impulsive foot action,” Robotica, vol. 23, pp. 567-580, September 2005.
20 - J. E. Pratt and R. Tedrake, “Velocity based stability margins for fast bipedal walking,” http://www.ai.mit.edu/projects/leglab, 2007.
21 - S. Grillner, “Control of locomotion in bipeds, tetrapods and fish,” Handbook of Physiology II, American Physiol. Society, Bethesda, MD, pp. 1179-1236, 1981.
22 - G. Taga, Y. Yamaguchi, and H. Shimizu, “Self-organized control of bipedal locomotion by neural oscillators,” Biolog. Cybern., vol. 65, pp. 147-159, 1991.
23 - G. Taga, “A model of the neuro-musculo-skeletal system for human locomotion II: real-time adaptability under various constraints,” Biolog. Cybern., vol. 73, pp. 113-121, 1995.
24 - K. Matsuoka, “Mechanism of frequency and pattern control in the neural rhythm generators,” Biolog. Cybern., vol. 56, pp. 345-353, 1987.
25 - K. Matsuoka, “Sustained oscillations generated by mutually inhibiting neurons with adaptation,” Biolog. Cybern., vol. 52, pp. 367-376, 1985.
26 - M. M. Williamson, “Neural control of rhythmic arm movements,” Neural Networks, vol. 11, pp. 1379-1394, 1998.
27 - S. Miyakoshi, G. Taga, Y. Kuniyoshi, and A. Nagakubo, “Three dimensional bipedal stepping motion using neural oscillators-towards humanoid motion in the real world,” IROS98, pp. 84-89, 1998.
28 - H. Kimura, Y. Fukuoka, Y. Hada, and K. Takase, “Three-dimensional adaptive dynamic walking of a quadruped rolling motion feedback to CPGs controlling pitching motion,” IEEE International Conference on Robotics and Automation, pp. 2228-2233, 2002.
29 - H. Kimura and Y. Fukuoka, “Adaptive dynamic walking of the quadruped on irregular terrain-autonomous adaptation using neural system model,” IEEE International Conference on Robotics and Automation, pp. 436-443, 2000.
30 - H. Ye, A. N. Michel, and L. Hou, “Stability theory for hybrid dynamical systems,” IEEE Trans. Automatic Control, vol. 43, no. 4, pp. 461-474, April 1998.
31 - B. Brogliato, “Some perspectives on the analysis and control of complementarity systems,” IEEE Transactions on Automatic Control, vol. 48, no. 6, pp. 918-935, 2003.
32 - B. Brogliato, S. I. Niculescu, and M. Monteiro, “On the tracking control of a class of complementarity-slackness hybrid mechanical systems,” Systems and Control Letters, vol. 39, pp. 255-266, 2000.
33 - B. Brogliato, S. I. Niculescu, and P. Orhant, “On the control of finite dimensional mechanical systems with unilateral constraints,” IEEE Transactions on Automatic Control, vol. 42, no. 2, pp. 200-215, 1997.
34 - H. Goldstein, Classical Mechanics, 2nd ed. Reading, MA: Addison Wesley, 1980.
35 - M. W. Spong and M. Vidyasagar, Robot Dynamics and Control. New York: Wiley, 1991.
36 - E. Dombre and W. Khalil, Modeling, Identification and Control of Robots. Paris: Hermes Sciences, 2002.
37 - R. M. Murray, Z. Li, and S. Sastry, A Mathematical Introduction to Robotic Manipulation. Boca Raton, FL: CRC Press, 1993.
38 - E. R. Westervelt and J. W. Grizzle, “Design of asymptotically stable walking for a 5-link planar biped walker via optimization,” 2002 IEEE International Conference on Robotics and Automation, Washington D.C., pp. 3117-3122, 2002.
39 - Y. Hurmuzlu, “Dynamics of bipedal gait-part I: objective functions and the contact event of a planar five-link biped,” Journal of Applied Mechanics, vol. 60, pp. 331-336, June 1993.
40 - J. H. Choi and J. W. Grizzle, “Planar bipedal robot with impulsive foot action,” IEEE Conf. on Decision and Control, Paradise Island, Bahamas, pp. 296-302, December 2004.
41 - Y. Hurmuzlu and D. Marghitu, “Rigid body collisions of planar kinematic chains with multiple contact points,” International Journal of Robotics Research, vol. 13, no. 1, pp. 82-92, 1994.
42 - K. Schneider, R. F. Zernicke, R. A. Schmidt, and T. J. Hart, “Changes in limb dynamics during the practice of rapid arm movements,” Journal of Biomechanics, vol. 22, pp. 805-817, 1989.
43 - G. P. Bingham, R. C. Schmidt, and L. D. Rosenblum, “Hefting for a maximum distance throw: a smart perceptual mechanism,” Journal of Experimental Psychology: Human Perception and Performance, vol. 15, no. 3, pp. 507-528, 1989.
44 - H. Kimura, Y. Fukuoka, K. Konaga, Y. Hada, and K. Takase, “Towards 3D adaptive dynamic walking of a quadruped robot on irregular terrain by using neural system model,” IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 2312-2317, 2001.
45 - M. Abdullah and A. Goswami, “A biomechanically motivated two-phase strategy for biped upright balance control,” IEEE International Conference on Robotics and Automation, 2005.