Bayesian Inference Application Bayesian Inference Application

In this chapter, we were introduced the concept of Bayesian inference and application to the real world problems such as game theory (Bayesian Game) etc. This chapter was organized as follows. In Sections 2 and 3, we present Model-based Bayesian inference and the components of Bayesian inference, respectively. The last section contains some applications of Bayesian inference. of Bayesian inference which it can be application to the Bayesian game and a general Bayesian abstract fuzzy economy game or a fuzzy game. For application to Bayesian game, we


Introduction
In statistical inference, there are two ways for interpretations of probability include Frequentist (or Classical) inference and Bayesian inference. It usually is unlike with each other in the classical nature of probability. Classical inference defines probability as the limit of an event's relative frequency for a large number of experiments and only in the sense of random experiments which are well defined. Other side, Bayesian inference can to impose probabilities to each statement when a random process is not associated. In the sense of Bayesian, probability is a way to show an individual's degree of believes in a statement. Bayesian inferences are different interpretations of probability, and also different approaches depend on those interpretations. Bayes' theorem presents the relativity about two conditional probabilities that are the reverse of anything other. The initials of the term Bayes' theorem is in honor of Reverend Thomas Bayes, and is referred to as Bayes' law (see [1]). This theorem shows the conditional probability or posterior probability of an event A after B is observed in terms of the prior probability of A, prior probability of B and the conditional probability of B given A. It is valid in all interpretations of probability. Bayes' formula is how to revise probability statements using data. The Bayes' law (or Bayes' rule) is The conditional probability definition is defined as follows For example, let a dice is thrown under a dice-box. From the standard model, all of outcomes have probability equal to 1/6. Now, the dice is lifted a bit and a random corner of the upper side is able to visible which it contains a dot. The new probability distribution of the outcomes shows as follows. Let A i is the outcome of the throw, for i = 1 , 2 , 3 , 4 , 5 , 6 and B is the randomly chosen corner contains a dot. So, we get P(A i ) = 1/6 and P(B) = 2/3. We get the following table: The simplest way to construct the fourth column is to multiply. For any A i , P(A i | B) and P(B| A i ), to sum these values and divide by this sum. This final term is said to be scaling and corresponds to the formula as An simpler argument is that P(A i | B) has to be a probability distribution, thus sum to unity. As the scaling operation is trivial, Bayes' rule is also shown as where P(A) the prior (distribution), P(B|A) is the likelihood and P(A|B) is the posterior (distribution).
The main result of Bayesians statistics is that statistical inference may depend on the simple device posterior ∝ prior * likelihood. By dice-throwing example is not of controversial. The disputations about the possibility of using Bay's rule as The second ingredient we need is data, plus a how the data associate to the truth which is nothing but the classical concept of specifying a random relationship for all associated values of Truth. Note that P(Data| Truth) is not applied as probability distribution for different data, but as the probability of the given data for different values of Truth. Various authors do apply P(Data| Truth) for likelihood to sheer this misconstrue. Now, noting that (replace Truth with T), probability of Data (P(Data)) can be written as that is as a function of P(T) and P(Data| T), it is obvious that the prior and likelihood enable, using 1 to construct a new probability statement about T given the data as follows P TruthjData ð Þ¼the posterior: The purpose of this chapter was to introduce the concept of Bayesian inference and application to the real world problem such as game theory (Bayesian Game). In this chapter was organized as follows. In Sections 2 and 3, we present Model-based Bayesian inference and the components of Bayesian inference, respectively. The last section contains some applications of Bayesian inference.

Model-based Bayesian inference
The basic of Bayesian inference is continued by Bayes' theorem. From (1), replacement B with observations y, A with the set of parameter Θ, and probabilities P with densities p, results as the following which p(y) is the marginal likelihood of y, p(Θ) is the set prior distributions of the set of parameter Θ before y is observed, p(y|Θ) is the likelihood of y underneath a model and p(Θ|y) is the joint posterior distribution of Θ that expresses uncertainty about parameter set Θ after taking both the prior and data into system. Because there are often multiple parameters, Θ presents a set of j parameters, we have The term determines the marginal likelihood (or the prior predictive distribution) of y which it was introduced by Jeffreys [2], and may be set to c where c is an unknown constant. This distribution shows what y should be similar to given the model, before y has been observed. Only the prior probabilities and the model's likelihood function are applied for p(y). The presence of p(y) normalizes the joint posterior distribution, p(Θ|y) guarantee it is a proper distribution and integrates to 1. From replacement p(y) with a constant of proportionality c, the Bayes' theorem becomes to We get when ∝ is proportional to.
This formulation (11) be shown as the unnormalized joint posterior being proportional to the likelihood multiply with the prior. Howsoever, the aim of this model is often not to concluding the non-normalized joint posterior distribution, however to concluding the marginal distributions of the parameters. The set of all Θ can partitioned as when the interest sub-vector denote by Φ and the complementary sub-vector of Θ denoted by Λ, usually called to as a vector of nuisance parameters. For a Bayesian scope, the presence of nuisance parameters does not pose any formal, theoretical problems. A nuisance parameter is a parameter that exists in the joint posterior distribution of a model, though it is not a interest parameter. The marginal posterior distribution of φ, the interest parameter, can be shown as In model-based Bayesian inference, Bayes' theorem is applied to approximate the nonnormalized joint posterior distribution, and lastly the user can evaluate and make inferences by the marginal posterior distributions.

The components of Bayesian inference
In this section, we presents about the components of Bayesian inference which contains the prior distributions, the likelihood or likelihood function and the joint posterior distribution as follows.
1. p(Θ) is the prior distributions for set of Θ, and uses probability as a methods of quantifying uncertainty about Θ before taking the data into system.

p(y|Θ)
is the function of likelihood which all variables are associated in a full probability model.

p(Θ|y)
is the joint posterior distribution that shows uncertainty about Θ after taking both the prior and the data into system. If Θ is partitioned into a single parameter of interest φ and the remaining parameters are considered nuisance parameters, then the marginal posterior distribution of φ denote by p(φ|y).

Prior distribution
The prior distribution is a main concept of Bayesian and shows the information about an uncertain Θ that is merged with the probability distribution of new data to yield the posterior distribution which in turn is applied for future inferences and decisions about Θ. The existence of a prior distribution for any problem can justified by some axioms of decision theory; which we now focus for how to set up a prior distribution for every given application. Generally, Θ will be a vector, but for easiness we will point as on p(Θ).
By well-identified and large sample sizes, suitable alternatives of p(Θ) will have minor effects on posterior inferences. This definition might look like to be circular, but in practice one can check the dependence on p(Θ) by a sensitivity analysis: comparing posterior inferences under different suitable alternatives of p(Θ).
If the sample size is small, or available data provide only indirect information about the parameters of interest, then p(Θ) becomes more important. In various cases, nevertheless, models can be set up hierarchically, such that clusters of parameters have shared p(Θ), which can themselves be approximated from data. Prior probability distributions have belonged to one of two kinds as informative and uninformative priors. In this section, four kinds of priors which include informative, weakly informative, least informative, and uninformative, are shown according to information and the aim in the use of the prior.

Informative prior
If prior information is obtainable about Θ, it should be included in p(Θ). If the current model is homologous to a previous model, and the current model is goal to be an adjusted version dependent on more current data, then the posterior distribution of Θ from the previous model maybe used as p(Θ) for the current model. Now, every version of a model is not start from scratch, based only on the current data, but the cumulative effects of all data, past and current, can be taken into system. To sure the current data do not dominate the prior, in 2000, Ibrahim and Chen [3] presented the power prior which it is a class of informative prior distribution that takes early data and results into system. If the current data is very homologous to the previous data, then the precision of the posterior distribution increases when including more information from previous models. If the current data differs tremendously, then the posterior distribution of Θ maybe in the tails of the prior distribution for Θ, therefore p(Θ) contributes less density in its tails.
Sometimes informative prior is not ready to be applied, for example when it resides in other person, as in an expert. For this way, their human personal beliefs of the probability for the event must be elicited into the form of a suitable probability density function which this process is said to be prior elicitation.

Weakly informative prior
Weakly informative prior (in the short term: WIP) use prior information for regularization and stabilization, providing sufficient prior information to prevent results that contradict our knowledge for example an algorithmic failure to explore the state space. Other aim is for WIPs to use less prior information than is really available. WIPs should provide some of the useful of prior information while avoiding some of the risk from using information which does not exist. WIPs are the most common priors in practice and are liked by subjective Bayesians.
Selecting WIPs may be cumbersome. WIPs distributions should change with the sample size, since a model should have sufficient prior information to learn from the data, but the prior information must also be weak sufficient to learn from the data.
In practice, this is an example of WIPs. It is favor, for well reasons, to center and scale all continuous predictors [4]. Though centering and scaling predictors is not talked about here, but it should be clear that the potential range of the posterior distribution of θ for a centered and scaled predictor should be small. A favor WIPs for a centered and scaled predictor may be θ $ N 0; 10; 000 ð Þwhere θ is normal distribution agreeable to a mean of 0 and a variance of 10,000. Here, the density for θ is nearly flat. Nonetheless, the fact that it is not perfectly at yields well properties for numerical estimation algorithms. In both Bayesian and Frequentist inference, it is possible for numerical estimation algorithms to become stuck in regions of at density which become more common as sample size decreases or model complexity increases. Numerical estimation algorithms in Frequentist inference function as though a at prior were used, thus numerical estimation algorithms in Frequentist inference become stuck more frequently than numerical estimation algorithms in Bayesian inference. Prior distributions that are not completely at allow sufficient information for the numerical estimation algorithm to continue to diagnose the goal density, the posterior distribution.
After updating a model in which WIPs exist, the user should be investigating the posterior. If the posterior contradicts knowledge, then the WIPs must be revised by including information that will make the posterior consistent with knowledge [4]. A favor purpose Bayesian criticism against WIPs is that there is no precise mathematical form to derive the optimal WIPs for a given model and data.

Vague priors
A vague prior, is said to be a diffuse prior which difficult to define, after considering WIPs. In 2005, Lambert, Sutton, Burton, Abrams and Jones introduce the first formal move from vague to WIPs. After conjugate priors were introduced by Raiffa and Schlaifer in 1961 which most applied Bayesian has applied vague priors, parameterized to estimate the idea of uninformative priors.
Normally, a vague prior is a conjugate prior together with a large size parameter. Howsoever, if the sample size is small then vague priors may be problems. All most problems about vague priors and small sample size are implicated with scale rather than location. The problem can be particularly acute in random-effects models which it is used rather loosely in this here to imply exchangeable, hierarchical and multilevel structures. A vague prior is defined as commonly being a conjugate prior that is intent to estimate an uninformative prior and without two goals of regularization and stabilization.

Least informative prior
Least informative priors (for short term LIP) is applied here to describe a class of prior in which the aim is to minimize the amount of subjective information content, and to apply a prior that is determined only by the model and observed data. The rationale for using LIPs is often called to let the data speak for themselves. LIPs are preferred by objective Bayesians. LIPs are contains Flat Priors [12], Hierarchical Prior [4], Jeffreys Prior [2], MAXENT [5] and Reference Priors [6][7][8] etc.

Uninformative prior
Traditionally, most of the above descriptions of prior distributions were classified as uninformative priors. However, uninformative priors do not really exist (see in [9]) and all priors are informative in some ways. Moreover, there have been various names associated with uninformative priors including diffuse, minimal, non-informative, objective, reference, uniform, vague, and perhaps weakly informative etc.

Proper and improper priors
It is important for the prior distribution to be proper. A prior distribution, p(θ), is improper when Before, an unbounded uniform prior distribution is an inappropriate prior distribution since p(θ) ∝ 1, for θ ∈ (À∞, ∞). An inappropriate prior distribution may be cause an inappropriate posterior distribution. If the posterior distribution is inappropriate, then inferences are invalid.
To determine the propriety of a joint posterior distribution, the marginal likelihood should be finite for any y. Again, the marginal likelihood is p(y) = Ð p(y|Θ)p(Θ)dΘ. Although inappropriate prior distributions can be applied, it is good practice to avoid them.

Likelihood
To completely for the definition of a Bayesian, both the prior distributions and the likelihood must be estimated or completely specified. The likelihood or p(y|Θ), contains the available information provided by the sample. The likelihood is p yjΘ ð Þ ¼ Q n i¼1 p y i jΘ À Á : The data y effect to the posterior distribution p(Θ| y) only through the likelihood p(Θ| y). In this way, Bayesian inference believes the likelihood principle which states that for a given sample of data, any two probability models p(Θ| y) that have the same likelihood yield the same inference for Θ.

Posterior distribution
Recent theoretical and applied overviews of Bayesian statistics, including many examples and uses of posterior distributions, see [10][11][12]. The posterior distributions for decision-making about home radon exposure are discussed in [13].
The posterior distribution summarizes the current state of knowledge about all the uncertain quantities in a Bayesian analysis. Analytically, the posterior density is the product of the prior density and the likelihood. In a complicated analysis, the joint posterior distribution can be summarized by a set of L simulation draws of the vector of uncertain quantities w 1 , w 2 , … , w J , as illustrated in the following matrix: The marginal posterior distribution for any unknown quantity w l can be summarized by its column of L simulation draws. In many examples it is not necessary to construct the entire table ahead of time; rather, one creates the L vectors of posterior simulations for the parameters of the model and then uses these to construct posterior simulations for other unknown quantities of interest, as necessary.

Application to games theory
In this section, we present the application of Bayesian inference to the real world problems such as Bayesian Game as follows.

The classical games
The basic contents of the n-person game was presented by John Forbes Nash [14] in 1950. Also, he first shows the existence of equilibrium for this model when the player's preferences are representable by continuous quasi-concave utilities and the sets of strategy are simplex. The definition of an n-person game can be written as below.

Definition 4.1
The normal form of an nÀ person game is X i ; r i ð Þ n i¼1 , where for each i ∈ {1, 2, … , n}, the set of individual strategies of player i denoted by X i which X i is a non-empty set and r i is the preference relation on X ≔ Q i ∈ I X i of player i.
The individual preferences r i are usually represented by utility functions, i.e. for each i ∈ {1, 2, … , n} there exist a real valued function u i : X ≔ Q i ∈ I X i !R such that: xr i y ⇔ u i ðxÞ ≥ u i ðyÞ, ∀x;y ∈ X: Then the normal form of nÀ person game is transformed to The solution of this game is called Nash equilibrium as below.

Definition 4.2
The Nash equilibrium for the game X i ; u i ð Þ n i¼1 is a point x * ∈ X which satisfies for each i ∈ {1, 2, The following theorem offers sufficient conditions for the existence of Nash equilibrium.

Theorem 4.3
Let Γ ¼ X i ; u i ð Þ n i¼1 be a n-person game and denoted by f the real-valued function on X Â X defined by f x; y ð Þ ¼ Σ n i¼1 u i x Ài ; y i À Á . Let us assume that 1. for each i ∈ {1, 2, … , n}, X i is a non-empty compact convex subset of a Hausdorff linear topological space; 2. for each i ∈ {1, 2, … , n}, u i (Á, x i ) is continuous on X Ài = Q i 6 ¼ j X j for each fixed x i ∈ X i ; 3. Σ n i¼1 u i is continuous on X; 4. f(x, Á) is quasi-concave on X, for each x ∈ X.
Then, Γ has an equilibrium.
Next, we present some examples of Nash equilibrium for two persons game as follows.

Example 4.4
The battle of the sexes game has two Nash equilibrium (MT, FT), (MS, FS) with (3, 2) and (2, 3), where "Male like playing tennis" denoted by MT, "Male like shopping" denoted by MS, "Female like playing tennis" denoted by FT and "Female like shopping" denoted by FS, see in Figure 1.

Example 4.5
The oligopoly behavior game is a unique Nash equilibrium (Aa, Ba) where "A coffee shop use a strategy for don't advertising" denoted by Ad, "A coffee shop use a strategy for advertising" denoted by Aa, "A coffee shop use a strategy for do not advertising" denoted by Bd, and "A coffee shop use a strategy for advertising" denoted by Ba, see in Figure 2.

The Bayesian games
For a long time, we have been supposed that everything in the game was normal knowledge for everyone playing. However, real players may have private information about their own payoffs, their type or preferences, etc. The way to modeling this situation of asymmetrical information is by recurring to the concept was defined by Harsanyi in 1967. The key is to introduce a move by the nature, which changes the uncertainty by converting an asymmetrical information problem into an imperfect information problem. The concept is the nature moves determining players' types, a concept that collects all the private information relevant them (i.e. payoffs, preferences, beliefs of another players, etc.).

Definition 4.6
The normal form of Bayesian games with incomplete information include: 1. the players i ∈ {1, 2, … , I}; 2. the set of finite action for each player a i ∈ A i ;  3. the finite type set for each player θ i ∈ Θ i ; 4. a probability distribution on types p(θ) It is important to discuss some parts of the definition. Players' types comprise all relevant information about some player's private characteristics. The type of θ i is only observed by player i who uses this information both to make decisions and to update itself beliefs about the likelihood of opponents' types.
Combining actions and types for each player it is possible to create the strategies. Strategies will be given by s i : A Bayesian Nash Equilibrium (for short term: BNE) is basically the same concept than a Nash Equilibrium with the addition that players need to take expectations over opponents' types as follows.

Definition 4.7
A Bayesian Nash Equilibrium is a Nash Equilibrium of a Bayesian Game, i.e. Eu i s i js À1 ; θ i ð Þ≥ Eu i s 0 i js Ài ; θ i À Á for all s 0 i ∈ S i and for all types θ i occurring with positive probability.
The following theorem for the existence of Bayesian Nash Equilibrium.

Theorem 4.8
Every finite Bayesian Games has a Bayesian Nash Equilibrium.

Example 4.9
Consider the Bayesian games as follows: 1. Nature decides that the payoffs are as in matrix I or II, with probabilities; 2. ROW is informed of the choice of nature but COL is not; 3. The choices of ROW include U or D and the choices of COL include L or R where these choices are made simultaneously; 4. Payoffs are as in the matrix chosen from nature.
For any of the Bayesian games, we will find all BNE. All equilibrium in mixed behavioral strategies can be written as.

Matrix I:
Matrix II:

Pure strategy BNE
First, we will deflate the case of incomplete information problem as a static extended game with all of possible strategies: b Γ. It can be presented follow Harsanyi, that the Nash Equilibrium of b Γ is the same equilibrium of the imperfect game presented. The idea is to deflate a game such that all the ways the game can follow is considered in the extended game b Γ.
The first step is to define the strategies for all player.
Since he does not know in which matrix the game is played, so, COL has only two strategies which contain L and R.
ROW knows in which Matrix the game occurs, and the strategies are UU, UD, DU and DD where UD is played U in Matrix I and D in Matrix II.
The probability knowledge, the nature locates the game in any matrix. The new extended game b Γ can be shown as: Remember that DU is a dominated strategy for ROW. After displacement that possibility, the game has 3 pure Nash Equilibrium as follows {(UU; L); (UD; R); (DD; R)}.

Mixed strategy BNE
Sequent to obtain the mixed strategies we will make another kind of analysis and try to repeat the three pure BNE obtained before. Suppose the probabilities of playing each action are as displayed in the matrices as below, where y is the probability COL plays L, if the game is in Matrix I then x is the probability ROW plays U and if the game is in Matrix II then z is the probability ROW plays U.
Next, we can check each the possibilities in order to find the Nash Equilibrium, such as those strategies stable for any players. Let us start by checking COL's strategies since there are less combinations.

Mixed equilibrium Case 1:
If y = 0, we have b. x ∈ [0, 1] and c. z = 0. Here, we want to check this is a equilibrium from COL's point of view. By g., we can see that if z = 0, then x < 2 which always hold and that y = 0. This Nash equilibrium supports two of the three pure BNE found before: (DD, R), which is the same as y = 0, x = 0 and z = 0 and (UD, R) which is the same as y = 0, x = 1 and z = 0.
There are many BNE in which column plays R and row plays xU + (1 À x)D, when x ∈ [0, 1] if Matrix I occurs and D if Matrix II occurs.
From f., we can see that when x = 1, then it should be the case that z ≥ 1 2 in order to be true that y = 1. Hence, these BNE are restricted to y = 1, z ∈ 1 2 ; 1 Â Ã and x = 1.
This BNE supports the third pure Nash Equilibrium found before: (UU, L), which is the same as y = 1, x = 1 and z = 1.
There are many BNE in which column plays L and row plays U if Matrix I occurs and zU

Case 3:
If y ∈ (0, 1), we have a. x = 1 and c. z = 0. By e., we can see that in order y belongs to [0, 1] it should be the case that x = (1 À z). However it is impossible this equality to hold if both z = 0 and x = 1.
Therefore, the case if y ∈ (0, 1) is not a Bayesian Nash equilibrium.

Abstract economy model
Later, the existence of social equilibrium was proved Debreu [15]. Also Arrow and Debreu [16] proved the existence of Walrasian equilibrium. The classical abstract economy game introduced by Shafer and Sonnenschein [17] or Borglin and Keiding [18] consists of a finite set of agents, each characterized by certain constraints and preferences, explained by correspondences. Following many previous authors ideas, they studied the existence of equilibrium for generalized games (see, for example, [19][20][21][22][23][24][25][26][27] and the references therein). Now, we show some definitions of an abstract economy model and equilibrium of this model as follows. Let the set of agents be the finite set {1, 2, … , n}. For each i ∈ {1, 2, … , n} let X i be a non-empty set.

Definition 4.10
An abstract economy Γ ¼ X i ; A i ; P i ð Þ n i¼1 is defined as a family of n ordered triplets (X i , A i , P i ), where for each i ∈ I:

Definition 4.11
An equilibrium for Γ is a point x * ∈ Q i ∈ I X i which satisfies for each i ∈ {1, 2, … , n}:

Theorem 4.12
Let Γ ¼ X i ; A i ; P i ð Þ n i¼1 be an abstract economy which satisfies, for each i ∈ {1, 2, … , n}: 1. X i is a non-empty compact convex subset in R l ;

A i is a continuous correspondence;
3. for each x ∈ X, A i (x) is non-empty compact and convex; 4. P i has an open graph in X Â X i and for each x ∈ X, P i (x) is convex; Then, Γ has an equilibrium.

Fuzzy games
The first concept of a fuzzy set was introduced by Zadeh [28] in 1965. Fuzzy set theory has been shown to be a gainful tool to describe situations in which the data are imprecise or vague. The theory of fuzzy sets has become a well framework for studying results concerning fuzzy equilibrium existence for abstract fuzzy economies. The first study of a fuzzy abstract economy (or a fuzzy game) has been studied by Kim and Lee in [29], they shown the existence of the equilibrium for 1-person fuzzy game. Also Kim and Lee [29] shown the existence of equilibrium for generalized games when the constraints or preferences are vague due to the agent's behavior. In 2009, Patriche [30] studied the Bayesian abstract economy game and proved the existence of equilibrium for an abstract economy game with differential information and a measure space of agents. However, the existence of random fuzzy equilibrium for fuzzy game has not been studied so far. In 2013, Patriche [31] defined the Bayesian abstract economy game and proved the existence of the Bayesian fuzzy equilibrium for this game. Also, Patriche [32] defined the new Bayesian abstract fuzzy economy game and proved the existence of the Bayesian fuzzy equilibrium for this game which it is characterized by a private information set, an action fuzzy mapping, a random fuzzy constraint one and a random fuzzy preference mapping. Recently, Patriche [33] defined the fuzzy games and applications to systems of generalized quasi-variational inequalities problem. The Bayesian fuzzy equilibrium concept is an extension of the deterministic equilibrium. She also generalized and extended the former deterministic models introduced by Debreu [15], Shafer and Sonnenschein [17] and Patriche [34]. Very recently, Saipara and Kumam [35] introduced the model of general Bayesian abstract fuzzy economy for product measurable spaces, and proved the existence for Bayesian fuzzy equilibrium of this model as follows.
For each i ∈ I, let Ω i ; Z i ð Þbe a measurable space, Ω; Z ð Þbe the product measurable space where Ω≔ Q i ∈ I Ω i , Z≔ ⊗ i ∈ I Z i and μ is a probability measure on Ω; Z ð Þ. Let Y denote the strategy or commodity space, where Y is a separable Banach space.
Let I be a non-empty finite set (the set of agents). For each i ∈ I, let X i : Ω i !F (Y) be a fuzzy mapping, and let z i ∈ (0, 1]. Let L X i = {x i ∈ S(X i (Á)) z i : x i is Σ i -measurable}. Denote by L X = Q i ∈ I L X i and by the set An element x i of L X i is called a strategy for agent i. The typical element of L X i is denoted by x i and that of (X i (ω i )) z i by x i (ω i ) (or x i ). We can define a general Bayesian abstract fuzzy economy model of product measurable spaces as follow.

Definition 4.13
A general Bayesian abstract fuzzy economy model of product measurable spaces is defined as follows: where I is non-empty finite set (the set of agents) and: a. X i : Ω i ! F (Y) is a action (strategy) fuzzy mapping of agent i; b. Σ i is a sub σ-algebra of Z ¼ ⊗ i ∈ I Z i , which denotes the private information of agent i; is the random fuzzy constraint mapping of agent i; is the random fuzzy preference mapping of agent i; e. a i : L X ! (0, 1] is a random fuzzy constraint function, and p i : L X ! (0, 1] is a random fuzzy preference function of agent i; The Bayesian fuzzy equilibrium for a general Bayesian abstract fuzzy economy model of product measurable spaces is defined as follows.

Definition 4.14
A Bayesian fuzzy equilibrium for Γ is a strategy profilex * ∈ L X such that for all i ∈ I,

Theorem 4.15
Let I be a non-empty finite set. Let the family i ∈ I be a general Bayesian abstract economy model of product spaces satisfy (a)-(j). Then, there exists a Bayesian fuzzy equilibrium for Γ.
For each i ∈ I, the following conditions are sastisfied: is a non-empty convex weakly compact-valued and integrably bounded correspondence; b. X i : c. For each ω i ;x ð Þ∈ Ω i Â L X , A i ω i ;x ð Þ ð Þ aixÞ ð is convex and has a non-empty interior in the relative norm topology of (X i (ω i )) z i ; i.e. for each ω i ∈ Ω i and for each y ∈ Y, the set ð P i ω i ;x ð Þ ð Þ p ix ð Þ Þ À1 ω i ; y ð Þ¼x ∈ L X : y ∈ f P i ω i ;x ð Þ ð Þ p ix ð Þ g is weakly open in L X ; j. For eachx i ∈ L Xi , for each ω i ∈ Ω i ,x i ∉ A i ω i ;x ð Þ ð Þ aix ð Þ ∩ P i ω i ;x ð Þ ð Þ p ix ð Þ .
Moreover, in 1960, Fichera and Stampacchia first introduced the variational inequalities problem, this issue has been widely studied. Next, the basic concept of variational inequalities for fuzzy mappings was first introduced by Chang and Zhu [36] in 1989. In the topic of variational inequalities problem, there are many mathematicians who studied this topic (see, for example, [37,38]). In 1993, the concept of a random variational inequality was introduced by Noor and Elsanousi [39]. Recently, Patriche [31] used the model of the Bayesian abstract fuzzy economy to prove the existence of solution for the two types of random quasi-variational inequalities with random fuzzy mappings.

Conclusion
The main objectives of this chapter was introduced the concept of Bayesian inference and application to some real world problems. In this chapter, we were presented about the basic concept of Bayesian inference which it can be application to the Bayesian game and a general Bayesian abstract fuzzy economy game or a fuzzy game. For application to Bayesian game, we were shown the solution of Bayesian Nash Equilibrium (BNE) for a Bayesian game with examples. Finally, we were shown the existence of Bayesian fuzzy equilibrium for a fuzzy game.