Open access peer-reviewed chapter

Distributed Optimization of Multi-Robot Motion with Time-Energy Criterion

By Mohamad T. Shahab and Moustafa Elshafei

Submitted: October 30th 2018Reviewed: March 6th 2019Published: April 11th 2019

DOI: 10.5772/intechopen.85668

Downloaded: 230

Abstract

This paper is an application of a special case of distributed optimization problem. It is applied on optimizing the motion of multiple robot systems. The problem is decomposed into L subproblems with L being the number of robot systems. This decomposition reduces the problem to solving a single robot problem. The optimization problem is solved via a distributed algorithm, utilizing subgradient method. A global objective function is set as the sum of individual robot objectives in time and energy. Constraints are divided into two sets, namely, robot-individual constraints and robots’ interactions (collision) constraints. The approach is applied for the case of wheeled mobile robots: we are able to generate in parallel for each robot an optimized control input trajectory and then illustrate it in simulation examples.

Keywords

  • distributed algorithms
  • multi-robot systems
  • numerical optimal control
  • time-energy minimization

1. Introduction

Research in multi-robot systems is motivated by several notions; namely, some motivation can be put as [1]:

  • It is complex for one single robot system to fulfill complex tasks. Instead, more than one system would simplify the solution.

  • Tasks are generally distributed in nature.

  • Multiple limited-resource robot systems are more efficient to deal with than a single powerful robot system.

  • Speed of the task process increases through parallelism in multiple robot systems.

  • Robustness increases as redundancy is introduced in multiple systems.

Until recently, the number of real-life implementations of multi-robot systems is relatively small. The reason is the complexity associated with the field. Also, the related technologies are relatively new. Emergence of autonomous driving vehicle technology and market can push the boundaries in the field. As technology develops, new venues for application will open for mainstream use rather than only in research and development labs. Due to its promising applicability, autonomous cars and vehicles (or various intelligent transportation systems in general) sit at the forefront [2, 3]. To name a few, benefits include reducing congestions [4], increasing road safety [5], and, of course, self-driving cars [6]. Another application in civil environments is related to safety and security like rescue missions of searching for missing people in areas hard for humans to operate in [7] or searching for dangerous materials or bombs [8] in an evacuated building. Also, another area of application of multi-robot systems is in military area; research was done heavily in the fields of unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs) [9, 10].

Many approaches are developed to tackle the issue of multiple robot systems. Under the inspiration of biological systems and the need of technologies, many problems are defined as cooperative motions. Cooperative motion is discussed in [11, 12, 13, 14, 15, 16]. Optimization in both time and energy has been tackled in the literature [17, 18, 19, 20]. There is an opportunity to incorporate concept of time/energy optimization into the paradigm of multi-robot systems.

This paper investigates the solution of a time-energy optimal control problem for multiple mobile robots; namely, the paper is to study the problem as a nonlinear programming (NLP) problem. The main idea of the solution used here is to utilize distributed optimization techniques to solve the overall optimization problem. Solving for optimal time and energy of more than one robot system adds more burden on the problem; robot interaction with each other is added to the problem. This paper will focus more on the distributed aspect of the problem; more details about the numerical optimal control problem formulation can be found in [21]. In [21], the problem of controlling the motion of a single mobile robot is solved using the direct method of numerical optimal control (see [22]); this showed great flexibility in incorporating physical constraints and nonlinear dynamics of the system.

The rest of this section will define the global problem formulation. Discussion about distributed optimization and associated algorithm is presented in Section 2. Section 3 will apply the method on the multi-robot problem. Application to wheeled mobile robots and simulation examples are discussed in Section 4 followed by the conclusion.

1.1 Global problem formulation

We can present the discrete time global optimization (numerical optimal control) problem for L robots as follows:

minuitsik,iiHxiN+ziNs.t.zik+1=zik+tsik·Lxikuiktsikxik+1=fDxikuiktsikgxikuiktsik0Ωixikiuikitsiki0k,zi0=0,xi0=x0i,i=1,2,,LE1

with tbeing the time-independent variable, kbeing the time index in discrete domain, tsibeing the sampling period, and Nbeing the number of time discrete instants across the time horizon, i.e., k=0,1,,N. The sampling period corresponds to the length of time the system input uitis kept constant (zero-order hold): we assume the system input to be

uit=uik,fortkit<tk+1i,tk+1i=tki+tsik,t0i=0,i

System behavior is governed by the nonlinear dynamic system in fDxikuiktsikwith xikas the robot istates an initial condition of xi0=x0i. The above optimal control problem has a final state objective of HxiNwith the Lagrangian Lxikuiktsikinformation being embedded into a dummy state variable zik.

The above optimization problem can be viewed as having two sets of control variables; the first set resembles the discretized system inputs, uikk, and the other set consists of the variable sampling period, tsikk. Let us have the Lagrangian for the problem be

L=xTQx+uTRu+β,

with βbeing the scalar weight on time. The performance is restricted by a collection of inequality constraints of robot-specific constraints g.0and robot-interaction constraints of Ωi.0. The objective function is just the summation of individual objectives. In this paper, as it will be explained later, we consider only collision avoidance as robot-interaction requirement; however, the above formulation can also meet other considerations. It can be shown that the objective function in (1) corresponds to objective function of the form

mini=1LHxitf+t0tfxtTQxt+utTRut+βdt

2. Distributed optimization

Here in this section, the concept of distributed optimization is explored. This area tackles optimization problems with distributed nature. Consider the following optimization problem:

minuiii=1LJiuis.t.uiΩi,fori=1,,LE2

with Jiuias the objective function and Ωithe set of constraints. We can easily separate the problem into its corresponding sub problems. An ith subproblem is easily put as

minuiJiuis.t.uiΩiE3

Observing the global problem in (2), we can see that it is just equivalent to the combination of all the subproblems; it is easy to see that solving for each subproblem (3) individually will result in the solution for the whole global problem. Now, however, consider the following problem:

minuii=1LJiuis.t.giu1u2uL0,iE4

You see clearly that the above problem cannot be trivially separated into some subproblems due to the constraint giu1u2uL0. This can be called a complicating constraint or a coupling constraint. In the next subsection, we discuss an optimization method that will help us in solving this kind of problems.

Decomposition in mathematics is the concept of breaking a mathematical problem into smaller subproblems that can be solved independently while not violating the original problem. Primary works of [23, 24] discuss multiple aspects of optimization in general while exploring specific classes as well; these works are excellent resources for reading and understanding. Viewing the applications of distributed optimization will convey the impression that they, however different, are all mostly very similar theoretically. Terms of networked, distributed, decentralized, cooperative, and the like are becoming all corresponding to somewhat similar problems. Other works related to this area and the area of multi-agent systems can be found in [25, 26, 27, 28, 29].

2.1 Subgradient method

Before going further, we discuss a method used in solving distributed optimization problem which will help us in solving the problem of this paper. This method is called subgradient methods [30]. These methods are similar to the popular optimization algorithms using gradient descent. However, they are extended to escape function differentiation. The works [31, 32, 33] also explore the method in the perspective of multi-agent systems.

Consider the typical problem:

minuJuE5

This typical problem can be solved using any gradient descent method. At iteration mof an algorithm, a solver, or an optimizer, can be constructed as

um+1=umαmdmE6

with αmas a predefined step size. For a standard gradient method, the vector dmcontains the gradient information of the problem. The simplest definition is to have

dm=JumE7

However, for the subgradient method [33], we will have a definition of

dm=pmE8

with pm, called subgradient of Ju, being any vector that satisfies the following:

JxJumpmTxum,xE9

The subgradient method is a simple first-order algorithm to minimize a possibly nondifferentiable function. The above definition escapes the requirement of a differentiated objective function. It is defined as finding any vector that makes the optimization algorithm go to better value in a first-order optimality sense. Of course, when a gradient Jumexists, we can compute the subgradient as the gradient. As it is a first-order method, it could have a lower performance than other second-order approaches. However, the advantage here is that it does not require differentiation. Also, and perhaps more importantly, it gives us flexibility to solve problems in a distributed manner as will be seen later.

Now observe the following constrained optimization problem:

minuJus.t.gu0E10

Let gube a vector of Mconstraints. Then, we can define the dual problem. Let us define the dual function of λand uas

qλu=Ju+λTguE11

The vector λof size Mcorresponds to the multipliers associated with each constraint. The dual problem relaxes the constraints of the original primal problem in (10) and solves for λto maximize the dual function:

maxλqλus.t.λ0E12

The dual optimization problem is the pair of two optimization problems, namely, a maximization in λas in (12) and a minimization in u. The pair resembles a maximization-minimization problem. You can visualize the solution of the problem as attacking the effect of constraint violation while solving for the original minimization problem concurrently.

Now, an algorithm for solving the dual problem utilizing subgradient method is discussed. Let us define

uλ=argminuJu+λTguE13

The above definition is to clarify the minimum attained at any value of λ. So, with the above definition, at iteration m, with also denoting um=uλm, we can safely have

qλum=Jum+λTgum=minuJu+λTguE14

Now, it is obvious from (14) that at iteration m, a subgradient of the dual function in (14) as function of λcan be computed as pm=gum. At iteration mof the algorithm, an update for the multipliers is constructed as

λm+1=Pλ0λm+αmgumE15

The projection operator Pλ0.is to ensure that the value of the update λm+αmpmis positive or enforced to zero. Also, observe the ascent update with the “+” sign rather than a descent update as it is a maximization. We can assume an initial λ0=0or any other positive value. An optimal solution to the original problem in (10) will be attained as m, with the optimal solution value of um.

2.2 The distributed algorithm

Recall the problem of combination of Lsubproblems in (2). Now, let us have the following global problem:

minuii=1LJiuis.t. uiΩi,igiu1u2uL0,iE16

As mentioned, the constraints giu1u2uL0are the complicating (or coupling) constraints. We can formulate the dual pair problems to be

maxλiqλis.t. λi0,iE17

Put in mind that λi=λ1λ2λL. If we define the notations of

u=u1u2uLT,λ=λ1λ2λLT,

then we can apply the primal-dual update from (13) and (15) at an iteration mas

um+1*=argminuiΩii=1LJiui+λmiTgiuλm+1=Pλ0λm+αm·g1um*g2um*gLum*TE18

We can see that the above pair of updates can easily be distributed; after the relaxing of the constraints, the primal problem can be separated. The facility of subgradients lets us propose that any iteration mfor subproblem ihas

um+1i=φminJiumi+λmiTgium1um2umLλm+1i=Pλi0λmi+αgium1um2umLE19

The function φmin.is any algorithm minimizer for the primal problem constrained by uiΩi,i. Observe that the primal update for each subproblem needs only the latest values of the other subproblem updates. During the computations of an iteration, computation of um+1iand λm+1iis done independent of each other; the updates above can be computed in parallel for each subproblem. The only information shared after each iteration is um1,um2,,umLamong all.

3. Distributed algorithm for multi-robot system

3.1 Problem formulation

Now, in this section we can apply the previous discussion into the problem of optimizing the motion of multiple robots. Recall the global optimization problem of motion of Lmobile robots from (1)

minuitsik,ii=1LHxiN+ziNs.t.zik+1=zik+tsik·Lxikuiktsikxik+1=fDxikuiktsikgixikuiktsik0Ωixikiuikitsiki0k=0,1,,N,i=1,,L,zi0=0,xi0=x0iE20

You can see that the problem above is just the combination of Lsubproblems with superscript icorresponding to each robot. The objective function is just the summation of individual objectives. The coupling constraint of Ωixikiuikitsiki0,kis the only difference to the single robot problem. To simplify the notations, let us define

u¯i=uiktsikk=0N

The above definition is just to reduce the notation of robot input sequence. If we have, for example, two robot inputs ui=u1iu2iT(e.g., wheel torques), then we have a total of 3×L×Ncontrol variables of the optimization problem. We can condense notation of the global problem of multi-robot system without loss of generality to be

minu¯iii=1LJiu¯is.t.u¯iΞiΩiu¯1u¯2u¯L0,iE21

with objectives

Jiu¯i=HxiN+ziNE22

which are subject to the set of individual robot iconstraints of

Ξi:zik+1=zik+tsik·Lxikuiktsikxik+1=fDxikuiktsikgixikuiktsik0k,zi0=0,xi0=x0iE23

3.2 Distributed algorithm

Returning back to the primal-dual problem pair in Section 2, we can establish the algorithm updates according to the defined updates in (19). At each iteration m, we update each robot inputs and multipliers according to

u¯m+1i=φminJiu¯mi+λmiTΩmiλm+1i=Pλi0λmi+αΩmiE24

The minimizer update φmin.is responsible to solve the single robot optimization (primal) problem according to the objective defined in (22) and subject to constraints in (23). In this paper, the minimizer update φmin.is selected to be any state-of-the-art nonlinear programming (NLP) algorithm. Let us have the step size αfor the dual update to be constant. This is sufficient for converging to a solution of the original problem [33]. You can read the algorithm updates in (24) at iteration mas each robot independently optimizes its whole motion throughout the whole time horizon k=0,1,,Nwhile at the same time puts in mind the extra cost of cooperation/interaction with others introduced by the term λmiTΩmiand so on.

3.3 Algorithm convergence

In this brief section, elaboration is put forth about how to practically use the algorithm. The ultimate goal is to optimize primal problem with no collision violation, i.e., reaching optimal dual (maximum) solution. At each global iteration, we only need to improve the primal problem values for the updated extra cost of the interaction constraint, λiTΩi. In this paper, a perfect solution is to optimize while maintaining Ωik0. A logical property is to monitor the M-element vector of constraints for positive values, i.e., violations. So, a stopping criterion for the algorithm can be chosen to be some minimum change TolJin the primal problem value:

Jm+1JmTolJ withJm=i=1LJu¯miE25

We can also distribute the stopping decision to individual robots by observing the change in individual objective values.

With condition (25) on its own, we cannot always be satisfying the collision requirement. So, this condition can be accompanied by a condition on the collision constraint violation. For all robots, elements of the complete constraint vector Ωikshould be less than a relatively small positive value TolΩ. So, for each robot, an extra stopping criterion along with criteria in (25) is to have

maxΩmikTolΩE26

Specific values of TolΩand TolJdepend on the nonlinear programming algorithm and/or the global desired requirements. Note that the behavior of the two tolerance parameters could be competitive with each other.

4. Application to wheeled mobile robots

4.1 System description

Figure 1 shows the individual robot system considered here. Robot state includes xi=xyϕθRθLvRvLTwith both position xyand orientation ϕand θRθLvRvLas the right and left wheel angular positions and velocities, respectively. Robot input includes the respective wheel torques ui=τRτLT. You can have the details of the applied nonlinear dynamic model fDxikuiktsikbased on the system model developed in [34, 35]. Details of discretization and choice of parameters of the robot model can be found in [21]. As mentioned before, for choices of Q,Rin Section 1, the Lagrangian for the problem is chosen to include the cost for energy spent by the torques of the wheels, the cost for kinetic energy spent by robot body, and the weight on time. Individual robot constraints include final desired configuration tolerance, torque limits, and the ensurance of zero final velocities (see more details in [36]).

Figure 1.

Wheeled mobile robot.

4.2 Collision avoidance

Here, we will discuss the formulation and the structure of the coupling constraints. The robots can be designed to perform any cooperative strategy in their motion. Here, we only consider the global goal of optimizing the motion of each robot in time and energy while avoiding colliding with each other during the motion. Let us define the coupling constraint vector across the discrete time indexes as

Ωik=Ωixikiuikitsiki.

For the ith robot, it tries to avoid colliding with the rest of L1robots at each of its time indexes k. Let us label elements of the constraint vector as Ωijk. Each element is corresponding to a definition of constraint at time index kfor all other robots, ji. So, for each robot, the constraint vector Ωiis of size M=L1×N; of course, the multiplier vector λiin (24) is of the same size.

We define the collision avoidance by constraining motion of other robots to be outside a safety circle region around each irobot at the position xiyiin the 2D plane:

xikx̂jk2+yikŷjk2p2.

So, we can define each element of the constraint vector as

Ωijk=p2xikx̂jk2+yikŷjk2E27

The radius of the safety region is chosen as p. Because of the definition of the sampling period variable, at each of discrete time step k,the actual time variable does not necessarily imply tik=tjkfor all the other L1robots. That is why you see that the x- and y- coordinates in (23) of the jthrobot are noted as x̂jŷjwhich are chosen to be calculated as linear interpolation of positions according to available actual times of tik and tjk.

4.3 Simulation examples

You can follow the whole distributed algorithm for the time-energy optimization of multi-robot system with collision avoidance in the flowchart in Figure 2. View the flowchart as the process for each individual robot (subproblem). We implement the algorithm in Figure 2. For the primal minimizer update in (24), the nonlinear programming (NLP) function offminconin MATLAB is used. We solved the problem for motion of L=3mobile robots. Utilizing the parallel capability in MATLAB, the distributed steps are solved independently utilizing three parallel processors. We choose number of instants N=40; so, we are going to optimize 120 control variables for each robot. More details can be found in [36].

Figure 2.

Distributed algorithm flowchart to optimize multi-robot motion.

Example 1. In this example, exploration of the behavior of the algorithm is shown. The problem has the following desired values of initial and final positions and orientations for the three robots:

x01y01ϕ01=08π2,xf1yf1ϕf1=05π
x02y02ϕ02=1010,xf2yf2ϕf2=85π2
x03y03ϕ03=50π,xf3yf3ϕf3=81π2

Here, robot 1 has equal objective weights of 5 on both time and energy, robot 2 has weights of 10 on energy and 1 on time, and robot 3 has 10 on time and 1 on energy. The maximum number of internal NLP iterations (primal update) is set to only 10. The step size is set to α=0.1. Maximum global iterations are allowed for 100 iterations. The safety circle radius is chosen to be p=2.

Figure 3 contains the objective (time-energy) value evolution throughout iterations. You can see the stable convergence as the algorithm progresses. Figure 4 shows each of the robots’ safety clearances during the optimized motion. In Figure 5, snapshots of motion of the three robots at different time instants are depicted. This illustrates the collision avoidance attained throughout the optimized motion. Observe also how different are the speeds of each robot because of objective weights; note from Figure 4 that each robot has a different final time for their motion. The algorithm has shown good performance at eliminating collision constraint violations. Figures 4 and 5 show an instant (around t = 12) where robots 1 and 3 violate collision distance with very small value, but no collision occurs. This is because the maximum number of iterations of the algorithm is exhausted. This indicates the possibility for the motion to be optimized even more if collision constraints were relaxed or if more algorithm iterations were allowed. Initialization of the algorithm also plays a role in algorithm evolution.

Figure 3.

The time-energy objective values throughout global iterations (Example 1).

Figure 4.

Collision avoidance (Example 1).

Figure 5.

Snapshots of optimized motions at different instants (Example 1).

Example 2. This example illustrates more the satisfaction of the objectives. In this example the safety circle radius is put as p=3. Here we choose the following initial and final positions and orientations for the three robots:

x01y01ϕ01=80π2,xf1yf1ϕf1=80π2
x02y02ϕ02=080,xf2yf2ϕf2=080
x03y03ϕ03=80π2,xf3yf3ϕf3=80π2

After applying our approach, you can see the resulting optimized motions in Figure 6. In Figure 6, errors in x- and y-coordinates and orientation of each robot are shown with respect to time. It is clear that errors of zero are achieved. In Figure 7 for each robot, constraint evaluations, i.e., safety clearance, are displayed for the other two robots throughout time. You can see that robots come close to each other sometimes but without violating the safety distance. This result is attained maybe because of special structure of initial and final positions and orientations. That could have given flexibility for the algorithm.

Figure 6.

Optimized trajectories of the three robots: each row of plots shows x-coordinate error, y-coordinate error, and orientation error, respectively; each column of plots show robots 1, 2, and 3 errors, respectively (Example 2).

Figure 7.

Collision avoidance (Example 2).

5. Conclusion

The paper investigated the time-energy minimization onto the multi-robot case. A global objective function is formulated as the sum of individual robot objectives in time and energy. Constraints are divided into two sets, namely, robot-individual constraints and robots’ interaction constraints. The problem is decomposed into L subproblems with L being the number of robot systems. The subproblems are coupled with each other by the collision avoidance information. Applying a distributed algorithm solved the problem iteratively. The overall output gives optimized motions for all robots in time and energy while adhering and not colliding with each other. We applied our approach to the case of three wheeled mobile robots: we generated in parallel for each robot an optimized control input trajectory.

An extension to this study is to generate optimized motion trajectories and apply them experimentally. A possible area for experimentation is full-scale autonomous vehicles. Issues related to communication and distributing information during the parallel algorithm will need to be incorporated and investigated. Also, aspects of state estimation and localization of the robot system will come into the place which were not considered in this work. A possible other investigation is to distribute the problem further onto the time variable k; this will lead the problem to the domain of distributed model predictive control. This will, possibly, pave the way to faster deployment into autonomous vehicles.

Acknowledgments

The authors would like to thank King Fahd University of Petroleum and Minerals supporting this work.

How to cite and reference

Link to this chapter Copy to clipboard

Cite this chapter Copy to clipboard

Mohamad T. Shahab and Moustafa Elshafei (April 11th 2019). Distributed Optimization of Multi-Robot Motion with Time-Energy Criterion, Path Planning for Autonomous Vehicles - Ensuring Reliable Driverless Navigation and Control Maneuver, Umar Zakir Abdul Hamid, Volkan Sezer, Bin Li, Yanjun Huang and Muhammad Aizzat Zakaria, IntechOpen, DOI: 10.5772/intechopen.85668. Available from:

chapter statistics

230total chapter downloads

More statistics for editors and authors

Login to your personal dashboard for more detailed statistics on your publications.

Access personal reporting

Related Content

This Book

Next chapter

Introductory Chapter: Roles of Path Planning in Providing Reliable Navigation and Control for Autonomous Vehicles and Robots

By Umar Zakir Abdul Hamid, Volkan Sezer, Bin Li, Yanjun Huang and Muhammad Aizzat Zakaria

Related Book

First chapter

Cloud Robotics and Autonomous Vehicles

By Khuram Shahzad

We are IntechOpen, the world's leading publisher of Open Access books. Built by scientists, for scientists. Our readership spans scientists, professors, researchers, librarians, and students, as well as business professionals. We share our knowledge and peer-reveiwed research papers with libraries, scientific and engineering societies, and also work with corporate R&D departments and government entities.

More About Us