Grasp and Path Relinking to Solve the Problem of Selecting Efficient Work Teams

The process of selecting objects, activities, people, projects, resources, etc. is one of the activi‐ ties that is frequently realized by human beings with some objective, and based on one or more criteria: economical, space, emotional, political, etc. For example, as a daily experience people should select what means of transportation and routes to utilize to arrive at a deter‐ mined destination according to the price, duration of the trip, etc. In these cases, one must select the best subset of elements based on a large set of possibilities, the best in some sense, and in many cases there is an interest in the selected elements not appearing amongst them‐ selves, if not it is better that they have different characteristics so that they can represent the existing diversity in the collection of original possibilities. Of course at this level people make these decisions intuitively, but commonsense, generally, is not a good advisor with problems that require optimized decision-making, and simple procedures that apparently offer effective solutions lead to bad decisions, thus this can be avoided by applying mathe‐ matical models that can guarantee obtainable effective solutions. In other human activities the selection of this subset has economic implications that involve a selection of a more di‐ verse subset, a crucial decision, and difficult to obtain, which requires a correct process of optimization guided by a methodical form.


Introduction
The process of selecting objects, activities, people, projects, resources, etc. is one of the activities that is frequently realized by human beings with some objective, and based on one or more criteria: economical, space, emotional, political, etc. For example, as a daily experience people should select what means of transportation and routes to utilize to arrive at a determined destination according to the price, duration of the trip, etc. In these cases, one must select the best subset of elements based on a large set of possibilities, the best in some sense, and in many cases there is an interest in the selected elements not appearing amongst themselves, if not it is better that they have different characteristics so that they can represent the existing diversity in the collection of original possibilities. Of course at this level people make these decisions intuitively, but commonsense, generally, is not a good advisor with problems that require optimized decision-making, and simple procedures that apparently offer effective solutions lead to bad decisions, thus this can be avoided by applying mathematical models that can guarantee obtainable effective solutions. In other human activities the selection of this subset has economic implications that involve a selection of a more diverse subset, a crucial decision, and difficult to obtain, which requires a correct process of optimization guided by a methodical form.
In the Operations Research literature, the maximum diversity problem (MDP) can be formulated by the following manner: If V = {1, 2, ⋯ , n} is the original set, and M is the selected subset, M ⊂ V , the search for optimizing the objective is as follows: In the equation (1) the objective function div(M ) represents the measurement that has been made of the diversity in the subset selected. There are some existing models to achieve this goal, as well as a number of practical applications, as reported in [1,2,3,4,5]; in particular, we target the Max-Mean dispersion model in which the average distance between the selected elements is maximized, this way not only is there a search for the maximization of diversity, if not also the equitable selected set, also, the number of elements selected are as well a decision variable, as mentioned in [6].
Traditionally the MDP has permitted the resolution of concrete problems of great interest, for example: the localization of mutually competitive logistic facilities, for illustration see [3], composition of the panels of judges, [7], location of dangerous facilities, [1], new drugs design [8], formulation of immigration policies and admissions [9].
In the past, a great part of the public's interest in diversity was centered around themes such as justice and representation. On the other hand, lately there has been a growing interest in the exploitation of the benefits of diversity. Recently, in [6], it a potential case of the application of the selection of efficient work teams is mentioned. In practice, there are many examples when the diversity in a group enhances the group's ability to solve problems, and thus, leads to more efficient teams, firms, schools. For this reason, efforts have begun on behalf of the investigators to identify how to take advantage of the diversity in human organizations, beginning with the role played by the diversity in groups of people, for example in [10], Page et al. introduces a general work plan showing a model of the functionality of the problem solving done by diverse groups. In this scenario, it is determined that the experts in solving problems possess different forms of presenting the problem and their own algorithms that they utilize to find their solutions. This focus can be used to establish a relative result in the composition of an efficient team within a company. In the study it is determined that in the selection of a team to solve problems based in a population of intelligent agents, a team of selected agents at random surpasses a team composed by the best suited agents. This result is based on the intuition that when an initial group of problem solvers becomes larger, the agents of a greater capability will arrive to a similar conclusion, getting stuck in local optimum, and its greater individual capacity is more than uncompensated by the lack of diversity.
This chapter is organized in the following manner, beginning with the Section 2 study of concepts relating to diversity, and how it can be measured. Later on, in Section 4 we are introduced to the classic Maximum Diversity Problem, with differing variants, and the new problem Max-Mean, with which we attempted to resolve the first objective described by the equation (1), also revised are the formulations of the mathematical programming for these problems, and its properties are explored. In Section 5 an algorithm is developed based on GRASP with path relinking in which the local search is developed mainly with the methodology based on Variable neighborhood search, in Section 4 there is a documented extensive computerized experimentation.

Definitions
Similarities are understood to be a resemblance between people and things. Although it is common to accept that diversity is an opposite concept of similarities, both terms perform within different structures, since similarities are a local function for each pair of elements. In contrast, diversity is a characteristic associated to a set of elements, which is calculated with the function of the dissimilarities within all the possible pairings. Where dissimilarities are the exact opposite of the similarities.
To be even more specific, to measure the diversity in M , div(M ) , it is required to first have a clear definition of the connection, distance, or dissimilarity between each pair i, j ∈ M . The estimation of this distance depends on the concrete problem that is being analyzed, in particular in complex systems like social groups a fundamental operation is the assessment of the similarities between each individual pair. Many measurements of the similarities that are proposed in the literature, in many cases show similarities that are assessed as a distance in some space with adequate characteristics, generally in a metric space, as for example the Euclidian distance. In the majority of applications each element is supposed to able to be represented by a collection of attributes, and defining x ik as the value of the attribute k of the element i, then, for example, utilizing the Euclidian distance: Under this model, d, satisfies the axioms of a metric, although the empirical observation of attractions and differences between individuals forces abandoning these axioms, since they obligate an unnecessary rigid system with properties that can not adapt adequately the frame of work of this investigation: the measurements of similarities In the literature, one can find the different measurements of similarities that can be applied to groups of people. For example, in [11] it is established that "the measurements of similarities of the cosine is a popular measurement of the similarities". On the other hand, in [10] it is established that the measurement of dissimilarities to treat the problem of the relation between the diversity and the productivity of groups of people can be established to solve problems. These measurements are developed in section 1.2. In [6] a similar measurement is utilized to solve a real case.

Similarity measurements
Given two individuals i, j with the characteristics x jp ) is defined by the measurement of similarities of the Cosine like: On the other hand, in [10] the authors explain the problem with how diversity presents a group can increase the efficiency to solve problems, in particular in its investigation that authors use the following measurement of dissimilarities: Where: This measurement will take a negative value (in the case of similarities) and positives (in the case of dissimilarities). In general terms, we are referring to a d ij as the dissimilarities or the distance between i and j.

Equity, diversity, and dispersion
The growing interest in the treatment of diversity also has originated in an effort to study the management of fairness, that is to say that all the practices and processes utilized in the organizations to guarantee a just and fair treatment of individuals and institutions. Speaking in general terms, the fair treatment is that which has or has exhibited fairness, being terms that are synonyms: just, objective, or impartial. Many authors, like French, in [12] the argument is that equality has to do with justice, for example the distribution of resources or of installations or public service infrastructures, and in the same manner the achievement of equality in diversity has been identified within as a problem of selection and distribution. Synthesized, one can say that the equality represents an argument concerning the willingness for justice, understanding this as a complicated pattern of decisions, actions, and results in which each element engages as a member of the subset given.
The other sub problem that should be resolved is how to measure diversity. Given a set V = {1, 2, ⋯ , n}, and a measure of dissimilarity d ij defined between every pair of elements of V , and a subset M ⊂ V , different forms have been established as their measure of diversity.

The measure of dispersion of the sum
With this calculated measurement of diversity and a subset as the sum of the dissimilarities between all the pairs of their elements; this is to say, the diversity of a subset M is calculated with the equation (4):

The measurement of dispersion of the minimum distance
In this case of the diversity of a subset given the establishment of how the minimum of these types of dissimilarities between the pairs of elements of the set; this is to say, like in equation (5).
This type of measurement can be useful with contexts that can make very close undesirable elements, and thereby having a minimum distance that is great is important.

The measurement of the average dispersion
For a subset M , the average diversity is calculated by the expression of the equation (6) div Notice that this measurement of diversity is intimately associated with the measurement of the dispersion of the sum, that constitutes the numerator of the equation (6). In the literature lately some references have appeared in which the diversity is measured in this manner, for example in [13], in the context of systems Case-based reasoning, CBR, the authors defined the diversity of the subset of some cases, like the average dissimilarity between all the pairs of cases considered. So much so that in [6] diversity of a subset is defined by the equation (6) within the context of the models of the dispersion equation.

The maximum diversity problem
Once determined how to resolve the sub problem of estimating the existence of diversity in a set, the following is establishing the problem of optimizing what to look for the determined subset with maximum diversity. Such problem is named in the literature as The Maximum Diversity Problem.
The most studied model probably is the Problem in which it maximizes the sum of the distances or dissimilarities between the elements selected, this is to say the maximum measure of diversity of the sum established in the equation (4). In the literature there is also the problem also known with other denominations, as the Max-Sum problem [14], the Maximum Dispersion problem [15], Maximum Edge Weight Clique problem, [16], the Maximum edgeweighted subgraph problem, [18], or the Dense k-subgraph problem, [19].
the principal characteristics, that makes is different than the rest of the models of diversity, being that the number of elements selected also is a decision variable.

Formulations & mathematical programming models
Given a set V = {1, 2, ⋯ , n}, and the dissimilarity relation d ij , the problem is selecting a subset M ⊂ V , of cardinality m < n, of maximum diversity: The manner in which diversity is measured in the equation (7) permits constructing the formulations of the different maximum diversity problems.

The Max-Sum problem
The Max-Sum problem consists in selecting the subset that has the maximum diversity, measuring the agreement of the equation (4): Introducing the binary variables: Therefore, this problem can be formulated as a problem of quadratic binary programming:

The Max-Mean problem
This problem can be described as: Generically speaking, this problem deals with the maximization of the average diversity. A formulation of the mathematical programming with the binary variables is then: In this problem the objective function (11) is the average of the sum of the distances between the selected elements, the constraint (12) indicates that at least two elements should be selected. Just as presented in [20], this is a fractional binary optimization problem, but can be linearized utilizing new binary variables, this way the problem is formulated for the equations (14) to (19): Notice that the Max-Mean problem cannot be resolved applying a solution method for any of the other problems, unless applied repeatedly for all the possible values of m = |M |; m = 2, 3, … , n. Surprisingly, as seen in Section 4, to find the solution of the Max-Mean problem with exact methods through resolving (n -1) Max-Sum problems requires much less time that resolves directly the formulation (14)- (19).

Computational complexity
This is known as the Max-Sum problem it is strongly NP-hard, as demonstrated in [9]. Recently, it has also been demonstrated in [20] that the Max-Mean problem is strongly NPhard if the measurements of dissimilarities take a positive value and negative. Here the property 3 is demonstrated, this then indicates that if d ij satisfying the properties of a metric, then the diversity div(M ) for any M ⊂ V is always less than div(M ∪ {k }) for any k ∉ M , then, a solution with m < n elements cannot be optimal in the Max-Mean problem, from there the optimum of this case is selecting all the elements.
Property 1 [12] The Max-Sum Problem is Strongly NP-hard.

Property 2 [6]:
If the dissimilarity coefficients d ij does not have restrictions in the sign, then the Max-Mean problem is strongly NP-hard.

Property 3:
The Max-Mean problem has a trivial solution M = V , if the dissimilarity measure is a metric.

Proof:
The Max-Mean problem consists in selecting a subset M such that div(M ) is maximized.
Demonstrating that given the instance in which the dissimilarities are not negative, symmetrical, and satisfy the triangular inequality, the solution to the Max-Mean problem is selecting all the elements, that is to say: Adding over all the possible pairs of elements in M : But the right side of the last expression is equivalent to (|M | -1) times ∑ i∈M d ik , If representing with m = |M |, then: Divided by m on has: Adding the term ∑ i, j∈M i< j d ij on both sides of the last inequality: Finally dividing for (m + 1) : Recent Advances on Meta-Heuristics and Their Application to Real Scenarios 4. An efficient method to solve the Max-Mean problem

Exact solution for the MIP formulation
It is evident that an optimal solution can be obtained for the Max-Mean problem in an indirect manner if resolving the Max-Sum model for all the possible values of m; meaning, for m = 2, 3, … , n, and then dividing the remaining solutions for the corresponding value of m. Then, the best value of these (n -1) values is the optimal Max-Mean model. Therefore, if is the optimal value of the objective function of the Max-Sum problem with m selected elements, and Z Max-Mean * is the optimal value of the Max-Mean problem, then: This research takes into account two new types of test instances: • Type I: This set contains 60 matrices of sizes: n = 20, 25, 30, 35, 150 and 500 with random numbers in -1,1 generated from a uniform distribution.
These test instances are found as available in the web site of the project OPTSICOM, [21]. Figure 1 shows the result of the resolution of the Max-Mean problem in an indirect way, for the test instances of type I and type II, of size n = 30, solving in an exact manner in each example 29 Max-Sum problems, each one of the cuadratic binary formulation (8)- (10). In this investigation, the Max-Sum problems are solved by the method of dynamic search using Cplex 12.4.0, the professional solver for mixed integer linear programming problems. Progress in computer technology and in design of MIP efficient algorithms and their implementation in Cplex 12.4.0 together with mathematical advance lead in some cases to satisfactory solution times. Unfortunately the MIP formulation described above cannot be solved in reasonable times for medium or large problems.
Also, Figure 1 shows that the Max-Mean value of the Max-Sum solution increases as m increases from 2 to certain value, and then this value decreases in the rest of the range. We have observed the same pattern (approximately a concave function) in all the examples tested with positive and negative distances randomly generated. We will consider this pattern to design an efficient GRASP algorithm.  Table 1 shows, that for each method and for each size of a problem, the average value of the objective function (Value) in the optimal solution, the average number of elements that end up being selected in the optimal solution (m), and the average time in seconds (CPU ), ND signifies that the value is not available because the solution was not reached in 5 hours. Cplex 12.4.0 only permitted solving small problems in moderate times. In particular in the linear formulation (14)- (19) can only be resolved in test instances of n < 30, and for n = 30 the solution could not be obtained in a 5 hour process. Experiments with Cplex corroborate the difficulties that commercial branch-and-bound codes encounter when approaching the Max-Sum and Max-Mean problem with this manner. Surprisingly, the Max-Sum model applied (n -1) times permits resolving instances of a greater size in less time, and one could obtain the solution for n = 30 in 102.30 seconds on average, and for n = 35 in 719.51 seconds in the type I problems, in the type II problems this requires more time. Yet, in instances of size n = 50 in 5 hours cannot obtain the optimum solution for this strategy.

TYPE I TYPE II
It can be concluded that if one desires to resolve the Max-Mean problem in an exact manner it is preferable to use the strategy to solve (n -1) times the Max-Sum model since the it consistently worked in much less time in all the experiments. This could be due to the fact that the relaxation continues in the Max-Sum problem providing better levels than the relaxation provided by the continued Max-Mean problem.
Given that the problems of the maximum diversity are NP-hard, it is clear that is required to make a heuristic design to resolve problems of large and medium size. In [6] a algorithm is developed based in GRASP that exploits the characteristics of the Max-Mean problem, and that is hybridized with other successful techniques of intensification, like Path Relinking (PR), and Variable Neighborhood Search, (VNS). This algorithm has resulted as an efficient solution to the medium and large problems.

Solving the Max-Mean problem
In this section, we describe a heuristic developing in [6] to solve the Max-Mean problem. This heuristic consists of a phase of construction GRASP, with a local search phase based on the Variable Neighborhood Search methodology subsequently it is improved with incorporation of a phase of post processing, based on Path Relinking.

GRASP construction phase
From the results shown in Figure 1, we can design a new constructive method in which we add elements to the partial solution under construction as long as the Max-Mean value improves, and when this value starts to decrease, we stop the construction. In this way, the method selects by itself the value of m, which seems adequate to this problem.
In place of a typical GRASP construction for diversity in which, first, each candidate element is evaluated by a greedy function to construct the Restricted Candidate List (RCL) and then an element is selected at random from RCL we utilizing an alternative design, in accordance with the proposed in recent studies [22] in which we first apply the randomization and then the greediness can obtain improved outcomes. In particular, in our constructive method for the Max-mean problem, we first randomly choose candidates and then evaluate each candidate according to the greedy function, selecting the best candidate, permitting better results.
More so specifically, given a partial solution M k with k selected elements, the list of candidates CL is formed by the (n -k) unselected elements. The list of restricted candidates, RCL , contains a fraction α(0 < α < 1) of the elements of CL selected randomly, where α where is a parameter that should be selected adequately, generally by computational experiments. Then, for each element i ∈ RCL , the method computes its contribution, eval(i), if it is added to M k to obtain M k ∪ {i}: Where div(• ) is the mean diversity defined in the equation (6).
Afterwards, the method selects the best candidate i * in RCL if this improves the actual partial solution; this is to say, if eval ( i * ) > 0, and add it to the partial solution, Figure 2 show the pseudo-code of this phase of construction of the method that one calls heuristic GRASP.

11
candidate according to the greedy function, selecting the best candidate, permitting better results.
More so specifically, given a partial solution‫ܯ‬ with݇ selected elements, the list of candidates‫ܮܥ‬ is formed by theሺ݊ − ݇ሻ unselected elements. The list of restricted candidates, ‫,ܮܥܴ‬ contains a fraction ߙሺ0 < ߙ < 1ሻ of the elements of‫ܮܥ‬ selected randomly, where ߙ where is a parameter that should be selected adequately, generally by computational experiments.
Then, for each element݅ ∈ ‫ܮܥܴ‬ , the method computes its contribution, ݁‫݈ܽݒ‬ሺ݅ሻ, if it is added to ‫ܯ‬ to obtain ‫ܯ‬ ∪ ሼ݅ሽ: Afterwards, the method selects the best candidate݅ * inܴ‫ܮܥ‬ if this improves the actual partial solution; this is to say, if ݁‫݈ܽݒ‬ሺ݅ * ሻ > 0, and add it to the partial solution, ‫ܯ‬ ାଵ = ‫ܯ‬ ∪ ሼ݅ * ሽ; in a contrary case, if ݁‫݈ܽݒ‬ሺ݅ * ሻ ≤ 0, the method stop. Figure 2 show the pseudo code of this phase of construction of the method that one calls heuristic GRASP.

Local search in GRASP
The GRASP construction usually does not obtain a local optimum and it is customary in GRASP to apply a local search method to the solution constructed. As shown in [6], previous local search methods for diversity problems limit themselves to exchange a selected with an unselected element, keeping constant the number m of selected elements. Since we do not have this size constraint in the Max-Mean model and we admit solutions with any value of m, we can consider an extended neighborhood based on the Variable Neighborhood Descent (VND) methodology.
We consider the combination of three neighborhoods in our local search procedure: as the current solution (and come back to N 1 in the next iteration). Otherwise, since none of the neighborhoods contain a solution better that the current one, the method stops.
To accelerate the search in these neighborhoods, one would not make the exploration in a sequential manner over the elements of a specific neighborhood, if not one would evaluate the potential contribution to the partial solution of the following manner: Given a solution M m , one calculates the contribution of each element selected i, just like the potential contribution of each element unselected i like: Thus, when exploring N 1 one searches for the elements selected in the given order by d s , where the element with the smallest value is tested first. Similarly, when exploring N 2 proving the selected elements in the same order but the elements unselected in the inverse order, this is to say, first considering the elements not selected with a grand potential contribution to the partial solution.
Finally, when exploring N 3 the elements not selected, that are considered to be added in the actual solution, they are explored in the same manner than in N 2 , in which the element with the largest contribution is considered first. Figure 3 outlines the pseudo-code of this phase.  The path relinking procedure PR(x, y) starts with the first solution x, called the initiating solution, and gradually transforms it into the final one y called the guiding solution. At each iteration we consider to remove an elements in x not present in y, or to add an element in y not present in x. The method selects the best one among these candidates, creating the first intermediate solution, x (1). Then, we consider to remove an element in x(1) not present in y, or to add an element in y not present in x (1). The best of these candidates is the second in-termediate solution x (2). In this way we generate a path of intermediate solutions until we reach y. The output of the PR algorithm is the best solution, different from x and y, found in the path. We submit this best solution to the improvement method. Figure 4 shows a pseudo-code of the entire GRASP with Path Relinking algorithm in which we can see that we apply both PR(x, y) and PR(y, x) to all the pairs x, y in the elite set ES. ) and PR(x j ,x i ), let x be the best solution found 11. Apply the local search phase of GRASP to x x'.

Comparison with existing methods
We also propose a new adaptation of existing methods for several models of maximum diversity problem.
Prokopyev et al. in [20] introduced several models to deal with the equitable dispersion problem and the maximum diversity problem. The authors proposed a GRASP with local search for the Max-MinSum variant in which for each selected element (in M ), they compute the sum of the distances to the other selected elements (also in M ) and then calculate the minimum of these values. The objective of the Max-MinSum model is to maximize this minimum sum of distances. We can adapt the method above, originally proposed for the Max-MinSum, to the Max Mean model. We call this adapted method GRASP1.
Also, Duarte and Martí in [26] proposed different heuristics for the Max-Sum model. In particular the authors adapted the GRASP methodology to maximize the sum of the distances among the selected elements. We also adapt this algorithm to solve the Max-Mean Model, and we call the entire method (constructive phase + local search) GRASP2.
Adaptation details of these algorithms can be seen in [6] In the final experiment we target the 20 largest instances in our data set (n=500). Table 3 shows the average results on each type of instances of GRASP1, GRASP2 and our two meth- ods, GRASP and GRASP with Path Relinking described in this Section. Results in Table 3 are in line with the results obtained in the previous experiments. They confirm that GRASP consistently obtains better results than GRASP1 and GRASP2. As shown in the last column of Table 3, Path Relinking is able to improve the results of GRASP in all the instances.

Numeric experiments with test instances
This section contains the results of a large number of numerical experiments that is made to evaluate and calibrate the GRASP algorithm, which was implemented in Mathematica V.7 1 , the experiments are processed in an Intel Core 2 Laptop, 1.4 GHz and 2GB de RAM. The parameters of the algorithms were calibrated through extensive computational experiments.

GRASP heuristic performance on small problems
In this section a comparison is made of the performance of the heuristic GRASP and the exact optimal reported for small problems. The results are shown in Table 2.
Small instances of size n = 30 were used, the largest are for those that can be resolved with Cplex 12.4.0 in an exact manner in reasonable times. Since the optimal is known, a measurement of the precision of the methods is the difference in relative percentage with respect to the optimum (GAP). Table 2 shows the average of the objective function (Value), the average number of elements selected (m), the times that the optimum was reached (# of optimal times), the relative difference with the optimal (GAP) and the average time in seconds (CPU Time). Only applying the constructive phase of GRASP one can reach the exact optimum of the problems 90% of the times, for the test instances of type I, and the 80% of the times in the test instances of type II, and in a reduced amount of time (less than a second), also in instances in which the optimum is not found, the GAP is very small.

Solution to large problems
Being that is no longer possible to compare the optimal solution of these problems, in place of GAP it is reported that a percentage of deviation in respect to the best solutions found in the experiments, the represented value in the tables like deviation, and that it is equal to:  Table 3 shows that the Path Relinking phase permitted improvements to the results of the heuristic GRASP, GRASP1 (based in [20]) and GRASP2 (based in [26]) in all of the test instances of size n = 500 and for the two types of examples considered

Search profile in Variable Neighborhood Search (VNS) methodology by GRASP
Our local search in the heuristic GRASP utilizes three types of neighborhoods, generated according to the methodology VNS, these neighborhoods are represented by: N 1 (remove an element from the solution), N 2 (exchange a selected element with an unselected one), and N 3 (add an unselected element to the solution). This way an interesting study is measured by the contribution of each type of neighborhood to the quality of the final solution. Curiously, if one calculates the was average contribution to the improvement of the function of the objective that provides the exploration in each one of the types of neighborhoods, one can observe that the neighborhoods of type N 1 and N 3 provide greatest contribution on average compared with the visit to the neighborhood N 2 , as shown in Figure 6.

Solution of large problems using GRASP with Path Relinking
In this section the experiments made are described with the 20 test instances of size n = 500. Table 3 shows the summary of the results obtained in the large instances when applying the algorithms proposed, the values correspond to the achieved averages with each one of the test instances of this size. neighborhood of the best values found. The figure clearly shows the GRASP achieves good solutions quickly. The execution of GRASP+PR, the phase of relinking of trajectories is executed after the elite set, ‫,ܵܧ‬ has been populated, which occurs after approximately 450 seconds, on average. Then the phase of path relinking properly said, by applying the procedure to each pair of solutions of the elite set, the evolution of the best solution found show that this phase permits obtaining the best solutions quickly, surpassing the GRASP (without PR), that after a certain moment does no achieve improvements in the solutions in the same proportion that GRASP+PR, and therefore is seen surpassing due to this. Similar profiles are observed for Type II instances This way, in daily activities of organizations, companies, schools, sport teams, etc. it has been observed through evidence that diversity has an important role on the ability for groups of people to solve problems. Lately, literature investigations have shown formally that this empirical phenomenon is true, proportioning a theoretic justification for this fact, for example in [10]. A consequence of this is that, under certain circumstances, the groups of people that have conformed in a diverse manner can surpass the productivity of the groups conformed

Search profile
Finally, to complete the analysis of the comparison of the efficiency of the algorithms that are designed, graphs were made of the profile of search of the algorithms; this is to say, since these heuristics were improving the value of the objective function of the time of execution. In Figure 7 one can observe the amplified details of its profile for a search in the neighborhood of the best values found. The figure clearly shows the GRASP achieves good solutions quickly. The execution of GRASP+PR, the phase of relinking of trajectories is executed after the elite set, ES, has been populated, which occurs after approximately 450 seconds, on average. Then the phase of path relinking properly said, by applying the procedure to each pair of solutions of the elite set, the evolution of the best solution found show that this phase permits obtaining the best solutions quickly, surpassing the GRASP (without PR), that after a certain moment does no achieve improvements in the solutions in the same proportion that GRASP+PR, and therefore is seen surpassing due to this. Similar profiles are observed for Type II instances 6. A case of application for the Max-Mean problem 6.1. Teams that are more diverse are more efficient for problem solving than those less diverse This way, in daily activities of organizations, companies, schools, sport teams, etc. it has been observed through evidence that diversity has an important role on the ability for groups of people to solve problems. Lately, literature investigations have shown formally that this empirical phenomenon is true, proportioning a theoretic justification for this fact, for example in [10]. A consequence of this is that, under certain circumstances, the groups of people that have conformed in a diverse manner can surpass the productivity of the groups conformed by the people individually more capable to resolve these problems; meaning, in a certain way diversity triumphant over the ability.
From a practical point of view, this result implies that, for example, a company that wants to conform a team should not look for simply a selection of individuals with a greater qualification for it, probably the most efficient selection would be to choose a diverse group. In reality the ideal would be that the groups of work be conformed by people with great qualifications and diversity; yet, these two objectives tend to be opposing one another since the diversity of the team formed by the people more qualified tends to be smaller, as demonstrated in [24].
The idea in the background is that we have a population of capable people to realize any task; these people have different levels of ability or of productivity for resolve it, and if one must select the work teams of this population for realizing a task, one can consider two possible groups: in the first only individuals are chosen with high qualifications, and in the second "diverse" individuals are chosen in some sense It turns out that the first finish in some way arriving to the same solution, creating a more difficult and confusing work for each other, on the other hand the second group the diversity created more perspectives and thus more opportunity of avoiding a halt on the search for a solution of the problems, generating in some way the right environment to increase the individual productivity of each one, and therefore of all groups. From a formal point of view what happens in the first group, under certain hypothesis, the people that are highly qualified tend to convert into similar points of view and ways to solve problems from which the set of optimal locations that the group can reach is reduced. Although the second group of diverse members originates a set of optimal locations more widely, and thus has more opportunities to improve.

Diversity in identity and functional diversity, perspectives and heuristics
In terms of a population, understood as "diversity in identity", or simply "diversity," to the differences en its demographic characteristics, cultural, ethnics, academic formation, and work experience. On the other hand, "formal diversity" is known as the differences in how these people focus and treat problem solving. An important fact is that these two types of diversity are correlated, since it has been identified experimentally a strong correlation between two types of diversity, just as demonstrated in [25]. Given the connection, it can be deduced that diverse groups in identity are functionally diverse.
In the literature, the focus was employed on a person to resolve a problem is a representation or an encoding of the problem in its internal language, and it can be known as "perspective." Formally, a perspective P is a mapping of the set of solutions of a problem into the internal language of the person resolving a problem.
On the other hand, the way in which people attempt to resolve a problem, or how they look for solutions are known as "heuristic." Formally, a heuristic is a mapping H of the encoding of the solutions in an internal language of the person that will solve the problem into the solutions set. This way, given a particular solution, the subset generated by the mapping H is the set of the other solutions that the person considers. In this manner, the ability to resolve the problem on behalf of a person is represented by its couple of perspective-heuristic (P, H ). Two people can differ in one of these components or in both; meaning, they can have different perspectives or different heuristics, or differ on both. A solution would be the local optimum for a person if and only if when the person encodes the problem and applies the heuristic, neither of the other solutions that the person considers has the abilities, and thus will have a few optimal locales, causing the group to become stuck with one of the solutions.

How to select the most productive work team
From an intuitive point of view, the conclusion that diverse groups in identity can surpass groups that are not diverse (homogeneous) due to its grand functional diversity based on the affirmation, well reception, that if the agents inside of the groups have equal individual ability to solve problems, a functional diverse group surpasses a homogeneous group. In [24] it has demonstrated that groups with functional diversity tend to surpass the best individual agents being that the agents in the group have the same ability. This still leaves open an important question: Can a functionally diverse group, whose members have less individual ability, have a superior performance than the group of people that have more abilities individually? In [10] finally resolves in a affirmative manner this question, making a mathematical demonstration to this fact. Even though certain doubts still surge in a natural manner in respect to: How many members should this group have in such a way that the average diversity within the group be at its maximum?, and, can one detect which is the group more functionally diverse?
This way, if considering the actual situation in which an Institution desires to hire people to solve a problem. To realize a good selection the Institution usually gives a test to the applicants, around 500, to estimate their abilities individually to solve a problem. Supposing that all the applicants are individually capable to solve them, then they have the formation and experience necessary, but have different levels of ability. It is doubtful if the Institution should hire: i.
The person with the highest score obtained on the test; ii. The 10 people with the highest scores; iii. 10 people selected randomly from the group of applicants;

iv.
The 10 people most diverse in identity of the group of applicants;

v.
The group of people most diverse on average of the group of applicants.
Ignoring the possible problems of the communication within the groups, the existing literature suggests that (ii) is better than (i), [25],since most people will be looking in a wider space, having then more opportunities to obtain better solutions, in place of the action of the person graded best that will stay stuck in one of the optimal locations. Recently in [10] it has been demonstrated formally that (iii) is better than (ii).
In this manner, the institution fails based on the group of people with the highest scores, meaning the most prepared individually, go on to form the best work team, and thus the company should hire (ii), since it is demonstrated as under certain hypothesis that (iii) is a better decision, as seen in [10]. The authors have come to determine that a team of people selected randomly have more functional diversity and under certain conditions surpass the performance of (ii). since under the set of conditions identified by the authors, the functional diversity of a group of the people that are individually capable to resolve the problem necessarily becomes smaller, which in the end, the advantage of having best abilities individually is seen as more than compensated by the greater diversity of the randomly selected group. Notice that the authors in the proof do not even use the equipment with the maximum diversity, if not a randomly selected group, and even then are able to demonstrate that it is better, thanks to the greater diversity inherent in the random group next to the group with the most abilities individually. Here we prove in the corollary of the theorem 2, that if selecting the group with more diversity on average, this is to say hire the group formed according to (iv), this would result more productive than hiring than that formed randomly (iii), and, by transitivity, better than the group formed by the best scores (ii ) and lastly better than simply choosing the best scored (i).
On the other hand, the literature says little or nothing at all about (v), since classically in the problems of diversity have considered the number of elements chosen as a given value, yet in the practice applications it is not clear how to choose the number of elements to be selected, and the best option would be to leave the process itself of optimization the one that demonstrates its value. This way, the focus of our analysis is centered on the dispute between the importance of the abilities of the individuals of each person in the group, their functional diversity (trapped by the diversity of identity), and the size of the ideal group.
A conclusion to all this is that the diversity in the organizations should be encouraged, which implies new policies, organizational forms, and styles of administration. In the context of solving a problem, the value of a person depends on their ability to improve the collective decision, since the contribution of this person depends in great measure to the perspectives and heuristics of the other people that make up the teamwork. The diversity in the focus of the solution of the problem in respect to the other people is an important predictor of its value, and in the end can be more relevant than its individual ability to solve the problem on its own. This was, to estimate the potential contribution of a person in teamwork, it is more important to make an emphasis in measuring how this person thinks differently, before estimating the magnitude of the ability of the person from aptitude tests or intelligence tests.
Although one has to be more conscious of some aspects that have not been considered and that can have influence in the performance of a team of people. For illustration, the groups with diversity in identity can often have more conflicts, more problems of communication, less mutual respect and less trust amongst the members of a homogeneous groups, which can create a diminishment of performance in diverse groups. In (16) it is mentioned that the people with similar perspectives but with diverse heuristics can communicate with one another without any problem, but people with diverse perspectives can have problems when comprehending the solutions identified by the other members of the group, in this sense the best of the organizations would be to find people with similar perspectives but guarantee a diversity of heuristics, in this manner, the organizations can exploit better the benefits of the diversity while minimizing the costs of the lack of communication.

Basics hypothesis and relationship between ability and diversity
In this section it is stated in theorem 1, demonstrated in [10], that explains the logic behind the fact that a team of people chosen at random, from a database of applicants that are capable to solve problems, it is better than the team formed by the people more individually capable, from there a result is established, that is immediate, being that the team of people with the most diversity surpasses the team formed by the people with the most abilities for solving problems. To establish a theoretic result, consider the population from where the team will be selected, this is to say the applicants, represented with con Φ with to satisfy the following suppositions • The applicants are trained to solve the problem. Given the initial solution, the applicants can find a better solution, even if it is only a little better; • The problem is difficult, none of the applicants can find the optimal solution always; • The applicants are diverse, and therefore for any potential solution that is not the optimal, at least one applicant can find the best solution; • The best applicant is the only one.
If we consider a team of applicants chosen randomly from Φ to according to some distribution, the theorem establishes what, with probability 1, sample sizes N 1 and N exist, N 1 < N , just like in the collective performance of the team of the N 1 applicants chosen at random surpasses the collective performance of the N 1 best applicants.
To formulate the theorem 1 more precisely, consider X the solution set of the problem, a function that gives the value of each solution V : X → 0,1 , supposing as well that V it has the only maximum x * , and that V ( x * ) = 1. Each applicant ϕ beings from the initial solution x and uses the search rule to find the maximum, but is not always found, if not generally gets stuck in a local optimum, if ϕ(x) is the local optimum when the applicant ϕ starts his search in x. This way ϕ(X ) represents the local optimal set for the applicant ϕ.
Each applicant is characterized by the pair (ϕ, ν), ), and an estimation of the performance as the value expected of the search by treating the solving of the problem, represented by E(V ; ϕ, ν) ; this is to say that, The hypothesis should be satisfies by the applicants ϕ, with which the theorem is demonstrated through the following: HYPOTHESIS 2 (Difficulty): Hypothesis 1 indicates that given the initial solution the people always try to find better solutions, but never select the worst solution, and get stuck in the optimal locale. Hypothesis 2 implies that no one, individually, can reach the optimum always from any point. In hypothesis 3, it is established in a simple manner that the essence of diversity, when a person is stuck in an local optimum always has someone that can find the best due to a different focus. Hypothesis 4 establishes that within the set of applicants considering that a better unique performance exists. With these hypotheses, the theorem 1 is proved in [10]. The theorem shows that a randomly selected group works better than a group formed for the better, is an immediate extender of the results as presented in the following corollary, which is demonstrated here, in which it is established more directly in relation between the diversity and ability. Proof: The proof is immediate, since the theorem is based that the diversity of the set of people randomly selected is more diverse than the set of people with the most individual abilities. This way, if selecting the group of people most diverse, helps this surpass the performance of the group of people selected randomly, due to the major diversity of the first, and for theorem 1, this last group surpasses the performance of the group formed by the people with more abilities individually. It continues as transitivity the result that is shown in the corollary.

Resolution of a case study
Finally, we apply the method solving a real instance. In particular we apply them to obtain a diverse assembly of professors from a set of n=586 in the ESPOL University at Guayaquil (Ecuador). For each professor, we record 7 attributes (tenure position, gender, academic degree, research level, background, salary level, and department), and the similarity measure between each pair of them is computed with the modified difference measure described in the equation 3. The solution obtained with our GRASP+PR method in 127.1 seconds has 90 professors and a similarity value of 1.11. Table 4 it is shown that the results detailed and each one of the 10 trials.  Table 4. Average results about the 10 successive runs

Conclusions
The main result of this paper provides conditions under which, a diverse group of people will outperform a group of the best. Our result provides insights into the trade-off between diversity and ability. An ideal work team would contain high-ability problem solvers who are diverse.
According to our approach, the problem of designing the most efficient work team is equivalent to the maximum diversity problem, wich is a computationally difficult, In particular we study the solution of the Max-Mean model that arises in the context of equitable dispersion problems. It has served us well as test case for a few new search strategies that we are proposing. In particular, we tested a GRASP constructive algorithm based on a non-standard combination of greediness and randomization, a local search strategy based on the variable neighborhood descent methodology, which includes three different neighborhoods, and a path relinking post-processing.
We performed extensive computational experiments to first study the effect of changes in critical search elements and then to compare the efficiency of our proposal with previous solution procedures.
The principles of the proposed equity measure can be applied to solve the problem of selecting efficient work teams. Therefore, more research is necessary in this area, especially to solve the subproblem to measure diversity. The results from a comparative study carried out with the other algorithms favor the procedure that we proposed, also is able to solve large instances. The focus of our future research will be on the development of multi-objective optimization that attempts to balance efficiency or ability and diversity, namely a study on the selection of the best and most diverse, which gives a flexible and interactive way for decision makers to make the tradeoff between ability and diversity.