Bi matrix for two power plants by different energy sources.
In this chapter, we quantify an optimal level of subsidy for the sharing of hybrid-enabling technology innovation in an energy market while examining its Bertrand-Nash equilibrium. We formulate this as a Stochastic Differential Game (SDG) and analyze the stability of the Stuckenberg, Nash and cooperative equilibria via a feedback control strategy. We then adopt limit expectation and variance of the improvement degree to identify the influence of the external environment on the decision maker. We show that the game depends on its parameters and the equilibria chosen. Ultimately, our use of short-run price competition characterized by strategic supplies for renewable and fossil resources provides a more robust model than that presented by Bertrand-Edgworth with endogenous capacity. As a result, we highlight that R&D investments in hybrid-enabling technology can ensure immediate reliability and affordability within energy production and implementation of policy instruments.
- Bertrand duopoly game
- cooperative game
- hybrid-enabling technology
- Nash non-cooperative game
- Stackelberg game
- stochastic differential game
In recent years, many researchers have developed models to discuss the importance of lowering carbon emissions and its potential impact on society by examining economic growth, international trade, and health benefits. Khan et al.  examined the relationship between green logistics indices, economic, environmental, and social factors through the perspective of Asian emerging economies. By adopting a Fully Modified OLS (FMOLS) Model and Dynamic OLS (DOLS) they claimed that logistics operations, particularly the efficiency of customs clearance processes, quality of logistics services and trade and transport-related infrastructure are positively and significantly correlated with per capita income, manufacturing value added and trade openness, whereas greater logistics operations are negatively associated with social and environmental problems including, climate change, global warming, carbon emissions, and poisoning atmosphere. Khan et al.  examined the potential relationship between public health expenditures, logistics performance indices, renewable energy, and ecological sustainability in members of the Association of Southeast Asian Nations by applying the structural equation modeling approach. They showed that the use of renewable energy in logistics operations will improve environmental and economic performance to reduce emissions, whereas environmental performance is negatively correlated with public health expenditures, indicating that greater environmental sustainability can improve human health and economic growth. In , economic growth and environmental sustainability in the South Asian Association for Regional Cooperation using the data from the South Asian Association for Regional Cooperation (SAARC) member countries from 2005 to 2017 was examined. Adopting the panel autoregressive distributed lag technique to examine the hypotheses, they find that environmental sustainability is strongly and positively associated with national scale-level green practices, including renewable energy, regulatory pressure, eco-friendly policies, and the sustainable use of natural resources. In , the consumption of renewable energy with international trade and environmental quality in Nordic countries from 2001 to 2018 is investigated. Their findings concluded that renewable energy is strongly and positively associated with international trade in Nordic countries. Furthermore,  adopted multi-criteria-decision-making techniques to examine barriers in the sustainable supply chain management (SSCM) when firms are facing heavy pressure to adopt green practices in their supply chain (SC) operations to achieve better socio-environmental sustainability.
Around the world governments, businesses and individuals have committed to reducing carbon emission. As a result, the energy economy is highly exposed to these processes. As industries push for renewable energies, technology will need to step in to ensure reliability of the power supply. Therefore, there remains a need for exploiting the role of hybrid technology, its dynamics, limitations on the reduction of pollution levels and policy implementation within the wider carbon emissions debate. This is due to the vital role hybrid technology plays in energy production processes and the ability for the energy system to offer a better energy security. Development of such lower carbon emission policies has potential benefits to the environment and ecological sustainability to those economies. However, within many of these environmental policy models, technology is incorporated as an exogenous variable and limited attention is given to endogenous technology, other technological breakthroughs, potential government subsidies or collaborative innovations to integrate low carbon technology in environmental economics. Such interventions will promote the renewable energy sector to use natural resources and undertake public-private partnership investments to minimize dependence on fossil fuel derived energy.
To investigate the effects of hybrid-enabling technology when producing energy to meet consumption demand, we assume energy producing firms follow the Bertrand game paradigm. In the presence of government subsidy for the development and sustainability of renewable energy, tax on pollution created by energy producing firms will motivate them to undertake Research & Development (R&D) measures to improve hybrid enabling technologies to further reduce the level of carbon pollution. As a result, from an economic point of view it is an interesting question to examine the Bertrand-Nash equilibrium under such a dynamic environment. This chapter examines this concept via a Stochastic Differential game paradigm.
Many researchers have applied game theory to study carbon reduction behavior in electricity markets. In , the Cournot equilibria in an oligopolistic electricity market subject to a linear demand function is examined. In , the power suppliers bidding behavior is evaluated under the supply function equilibrium (SFE) paradigm, where the market power of an independent system operator (ISO) is modeled as a bi-level multi-objective problem. In , the equilibrium strategies in random-demand procurement auctions in the electricity market is obtained and presented a method for explicit calculation of the bid strategies is presented.  proposed a Nash bargaining game model to examine how governments can determine the taxes and subsidies in a competitive electricity market whilst achieving their environmental objectives. In [4, 6, 22, 29, 34], the role of government as a leading player who intervenes in competitive electricity markets to promote environmental protection is evaluated. In , the role of government, when managing environmental sustainability in a complete electricity market in a Stackelberg game paradigm is examined. In , a more robust trans-boundary industrial pollution reduction strategy for global emission collaborations is presented. The dynamics of each country’s quantity of pollution is modeled as a Brownian motion with Jumps to capture the systematic jumps caused by surprise effects arising from policy uncertainties within the economy. However, a crucial limitation within many of these environmental policy models, is that technological change is incorporated as an exogenous variable and does not consider the role of endogenous hybrid-enabling technology or other technological breakthroughs, hence limiting the dynamics of these models.
We quantify an optimal level of subsidy for the sharing of hybrid-enabling technology in an energy market under a Bertrand-duopoly game. We formulate a Stochastic Differential Game (SDG) to analyze the stability of the Stackelberg, Nash and cooperative equilibria via a feedback control strategy. We then adopt limit expectation and variance of the improvement degree to identify the influence of external environment limitations on the decision maker. We show that the game depends on its parameters and the equilibria chosen. We consider an electricity market composed of power plants I and II, with each one having the choice between fossil fuels (e.g., natural gas, petroleum or coal) and renewable sources (e.g., biomass, solar, wind, wave, geothermal or hydroelectric). Such hybrid power plants play a crucial ameliorating role in managing the long-standing problem of climate change and ensure immediate reliability and affordability of energy production, whilst reducing Greenhouse Gas (GHG) emissions.
In this model, we consider a Bertrand duopoly game for two power plants under endogenous hybrid-enabling technology. In the first stage the matrix of prices , (where is the price of energy , given that the opponent player’s energy ) is determined as a Nash equilibrium of the game where each player wants to optimize his/her demand. We then search for the Nash equilibria, and the optimal proportions that maximizes the subject to the source type of energy that has been used. Once all these parameters have been fixed, the game becomes dynamic due to the evolution of a hybrid-enabling technology level , prompted by Research and Developments (R&D) measures undertaken by each power plant. Hence, each player must fix a time-dependent effect level associated with this hybrid-enabling technology. In doing so, this study makes the following contributions to existing game theory/energy economics literature:
stochastic endogenous hybrid-enabling technology innovation is introduced into a two-player stochastic differential game with random interference factors, which capture uncertain external environment factors and the internal limitations within the shared hybrid-enabling technology decision process. In doing so, we provide a framework to quantify the impacts of market power on prices.
we show that both power plants invest in R&D measures and that the limit of expectation and variance of the improvement degree can be applied to identify the influence of random factors.
mathematically, we show that the issue of the game depends on the parameters of the game and the type of equilibrium one considers.
by applying the HJB equation we obtain the optimal effort level and the optimal level of subsidy for sharing hybrid-enabling technology via feedback equilibrium strategies whilst examining the Stackelberg equilibria, Nash equilibria and cooperative equilibria under Bertrand duopoly.
we reveal that for a given level of payoff distribution the Stackelberg equilibria under endogenous hybrid-technology innovation and the sharing paradigm dominate the Nash equilibria.
we show that in Stackelberg and Nash games, optimal hybrid-enabling technology innovation is proportional to the government subsidy, but the variance improvement degree of the Stackelberg game is different to the results of the Nash game.
our characterization of the short run price competition by strategic supplies for renewable and fossil resources, provides a more robust model than that presented by Bertrand-Edgworth, in which price competition with fixed (endogenous) capacities was used.
our model shows that robust cost-reducing R&D investments with effective hybrid-enabling technology innovation strengthens an innovator’s competitive position and the Stackelberg structure emerges as an equilibrium outcome, allowing each power plant to optimally use energy sources to produce electricity while maximizing their payoffs.
Therefore, under a Stochastic Differential Game (SDG) paradigm with uncertainty, each power plant can optimally use energy sources to produce electricity while maximizing their payoffs. Each power plant is capable of using fossil fuels and renewable sources to produce electricity at any time. To maintain the generality of the proposed model, this model is not limited to a specific energy source. Hence, the terms and , are used throughout the paper. On the other hand the government encourages power plants to conform to a maximum accepted level of carbon emissions through strategies such as the imposition of tariffs on polluters as well as incentives for those who choose to undertake R&D measures to reduce their emission levels in order to maintain environmental sustainability. R&D spending is costly, and the presumption is that R&D spending is somehow connected to increased innovation, revenue growth and profits.
In recent years, researchers have incorporated the theory of SDGs, originated from [11, 20, 21, 28, 34] to analyzed environmental issues. Especially  analyzed (two player) zero-sum stochastic differential games in a rigorous way, and proved that the upper and lower value functions of such games satisfy the dynamic programming principle whilst being the unique viscosity solutions of their associated Hamilton-Jacobi-Bellman-Isaacs equations.
In Section 2, the proposed model and elements of evolutionary game theory are presented. In Section 3 by implementing the Stackelberg game we examine feedback Stackelberg equilibria, optimal level of subsidy for the shared hybrid-enabling technology from its counterpart and the limit of expectation and variance. In Section 4 by implementing a Nash game we examine feedback Nash equilibria and the limit of expectation and variance under hybrid-enabling technology. In Section 5 by implementing a cooperative game we examine feedback equilibria and the limit of expectation and variance under hybrid-enabling technology. In Section 6, comparative analysis of equilibrium results are described. Section 7 concludes the study. Appendix at the end of the chapter contains proofs.
2. Model setup
We propose that the production process of electricity leads to emissions and is proportional to the power industry’s use of energy source. We assume that there are two power plants (Player I) and (Player II) in the energy market and each power plant is capable of using fossil fuels (F) and renewable sources (R) to generate power at any given time t. To reduce the level of Green House Gas (GHG)-emissions into the atmosphere (accordance with  Protocol), the government will set a maximum emission quantitative level, that is directly linked to the power industry’s use of energy source F, when producing electricity. Government encourages the power industry to undertake necessary hybrid-enabling technology to reduce their GHG-emission levels to the maximum accepted quantitative level, , and improve efficiency in renewables. We assume that the power plants change their strategies over time based on payoff comparisons based on hybrid-enabling technological advances. This contradicts with classical non cooperative game theory that analyzes how rational players will behave through static solution concepts such as the Nash equilibrium (NE) (i.e., a strategy choice for each player whereby no individual has a unilateral incentive to change his or her behavior).
Under the theory of evolutionary games, the production strategies in the absence of any superior hybrid-enabling technological advances, allows the power plants to play a symmetric two-person 2 × 2 bi-matrix game. Thus, for each power plant, we define the set Σ as its pure strategy given by the set of non-negative prices [0, ∞). According to the Bertrand game all firms setting the lowest price will split market demand equally (Hotelling type) and the profit can be calculated subject to the electricity prices and the associated cost functions.
Then each iteration of an evolutionary game, where two matched power plants in accordance with Bertrand paradigm compete with each market and play a one-shot non-zero-sum game, represents the benchmark game of the population. If is the matrix of prices of power plants, respectively, then via Proposition 1 (given below), it will allow us to derive Nash equilibria of prices for these two matched power plants. On the demand side we assume that the preferences are quadratic as in .
We define the continuous demand function , for each power plant as
where is the demand function for the power plants employing the energy source against the power plant which use the energy source is the tariff imposed by government subject to the power source i. For example government impose a tariff-rate quota (TRQs) , for fossil fuels (F) and a feed-in-tariff (FITs) , for renewable sources (R). is the electricity price of the power plant that uses the energy source i, versus the power plant that employs the energy source j. , is the constant market base for the power plant that employs the energy source i versus the one which use the energy source j. The parameters and , are independent constants that captures the demand sensitivity of a power plant subject to its own price and its rival’s price . Eq. (1), concludes that the goods in the market are gross substitutes and that the demand function , is increasing in the price of the rival firm .
The government’s tariff policy for the power plants with respect to their source of energy for long-time periods are transparent, and this information is available to the public. Therefore, it is assumed that the competitive power plants follow the government’s financial legislation, having the capability and technological skills to produce electricity from specific sources at any given time to meet energy demand. Then for each time-period the power plant will consider the tariff-rate quota or feed-in-tariff and adopt a pricing strategy for the selected energy source. Hence, we conclude that the production rate of the power plants is equal to the corresponding demand rates with a negligible internal consumption and waste rate.
To apply the Backward induction technique to investigate the equilibrium prices, demand, and profits, we define the profit function for each power plant as
where and is the unit production cost of the power plant when using energy source i. , for any additional R&D unit cost for undertaking hybrid-enabling technology, for a power plant that rely on an energy source i, (is the initial setup cost of the power plants when using the energy source i). We also assume that The firms’ technologies are represented by their reduced cost functions. This assumes that all factor markets are perfectly competitive and – both here and in the models of imperfect competition in the output market – are not influenced by any strategic behavior of the firms in other markets. We will make alternative assumptions about those technologies. In the first assumption, pollution is proportional to output and firms do not have any further abatement technologies.
Proposition 1. The equilibrium price for the power plants under, is given as
Proof. Via the first order conditions of the profit function (Eq. (2)), obtain
Defining and using and , rewrite the first order conditions as:
Simultaneously solving Eq. (4), obtain
such that and are obtained via and are the optimum prices if the profit functions are concave on and on . Then via the second order conditions, obtain the maximum point in the set as: Since , implies that the second derivative of the profit function in equilibrium is negative confirming that the profit function is concave at this point.
Proposition 2. At equilibrium prices the power plant’s demand and profit undercan be obtained as
Remark 1. The only outcome where neither power plant has an incentive to deviate is when , which will be the Nash or Bertrand equilibrium for the game. The intuition behind this result is that power plants will keep undercutting the price of its rival until price equals marginal cost. In the long run price changes with marginal cost and industry production increases with demand and falls with marginal cost. One way for a power plant to avoid the Bertrand paradox and earn economic profit in a Bertrand setting is to have a competitive cost advantage over its rival.
2.1 Production decisions of power plants with homogenous hybrid/enabling technology
Restricting ourselves to a two matched symmetric two-person bi- matrix game in random contest in a one-population evolutionary game, we define the payoff (utility) in Table 1.
|Power Plant II|
|Production Strategy||Fossil Fuel||Renewable Sources|
|Power Plant I||Fossil Fuel|
Note 1. Power and the payoff are measured on a utility scale consistent with the power plant’s preference ranking. Furthermore, [8, 35, 36] have applied symmetric two-person bi-matrix game in random contest to study evolutionary stable games.
Then via equation , in Proposition 1, and the payoff matrix of the power plant I is given by:
Obviously the bimatrix of the power plant II, is given by:
Proposition 3. The Nash equilibrium for the Bi-matrix game G, is given as
Proof. Suppose players I and II use mixed strategies (x,1-x) and (y,1-y), respectively, where
The probability that player I choosing row 1 is x and the probability that player I choosing row 2 is 1-x.
The probability that player II choosing row 1 is y and the probability that player II choosing row 2 is 1-y.
Then the value of the game for Player I is
and the value of the game for Player II is
Suppose (X, Y) yields a Nash equilibrium. Then for the given payoffs having implies that
Otherwise Player I can change x slightly and do better.
Similarly, for ,
Otherwise Player II can change y slightly and do better. It follows that the unique Nash equilibrium (x,y), has
Remark 1. Since the power plants plays a symmetric two person bimatrix game G, having two pure strategies , imply that G, has an evolutionary stable strategy. Then the Nash equilibrium is an outcome in which the strategy chosen by each player is the best reply to the strategy chosen by the other. This best reply strategy yields the highest payoff to the player choosing it, given the strategy chosen by the co-player, [24, 25].
2.2 Production decisions of power plants under endogenous hybrid/enabling technological advances
Both players will undertake R&D measures on hybrid-enabling technology to ensure immediate reliability and affordability in energy production whilst reducing GHG-emissions. We assume that the strategic effects implemented by power plant I (Player I), has improved hybrid-enabling technology to generate energy and utilize energy sources in a much efficient way. This gives a superior advantage to power plant I overpower plant II (Player II) and both power plants are rational to maximize their profits. Although Power plant II has heterogeneous resources to hybrid-enabling technology, from a practical point of view it is logical for power plant I to share this technology with power plant II, because the price competition is typically characterized by a second-mover advantage. Many researchers have investigated the effects of these commitments in Cournot, Bertrand and Stackelberg setups. See [1, 2, 5]. Due to the government incentives, tariff-rate quota, feed-in-tariff and R&D incentive measures, the power companies will be competitive to improve their efficiency. Let denotes the R&D effort level of technological improvements on renewable sources at time t, and denotes the R&D effort level of technological improvements on fossil fuel at time t, of Player I. denotes the R&D effort level of technological improvements on renewable sources at time t, and denotes the R&D effort level of technological improvements on fossil fuel at time t, of Player II. For, further consideration, the sharing cost of advanced hybrid-enabling technology (Player I) and inferior hybrid-enabling technology (Player II) is denoted as and which are the quadratic functions of the effect level of Player I and Player II at time t, respectively. Consider
where and lower the , more effective is the technological development.
Let denote the evolution of the hybrid-enabling technology at time t, due to R&D collaborative innovation system of Player I and Player II at time t. The dynamics of hybrid-technology is governed by the stochastic differential equation (SDE):
, is the attenuation coefficient of hybrid-enabling technology. Let and denote the influence of the effort level of hybrid-enabling technology sharing on collaboration innovation between Player I and Player II, at time t. is a standard Brownian motion and random interference factor on hybrid-enabling technology.
Let denotes the total profit under the hybrid-enabling technology system at time t. Let and denote the influence of the effort level hybrid-enabling technology on the total profit of Player I and player II, respectively, at time t, namely, the marginal return coefficient of hybrid-enabling technology. Total profit function can be expressed as:
is the influence of the hybrid-enabling technology innovation on total revenue ; is the total government subsidy coefficient of hybrid-enabling technology based on increments of advances in hybrid-enabling technology.
Proposition 4. At least one of the Power Plants has a second mover advantage.
Proof. Demand function , given by Eq. (4), is twice continuously differentiable and
The first inequality says that each demand is downward sloping in own price, and the second that goods are substitutes (each demand increases with the price of the other good).  shows that in case of symmetric firms, there is a second-mover (first-mover) advantage for both players when each profit function is strictly concave in own action and strictly increasing (decreasing) in rival’s action, and reaction curves are upward (downward) sloping.
Then a sufficient condition on the super-modularity of the profit function is obtained via the profit function , given by Eq. (4):
where is the expectations. The main implication of this is that it leads to reaction correspondences that are non-decreasing (in the sense that each selection is non-decreasing) but need not be single-valued or continuous. This has a very appealing and precise interpretation: The price elasticity of Power Plant i’s demand increases in the rival’s price, . This is a very intuitive and general condition, though clearly not a universal one. It is satisfied in particular if , if a higher price by a Power Plant’s rival does not lower the responsiveness of the Power plant’s demand to a change in own price.
We further assume that the total revenue is allocated between two players and is the payoff distribution coefficient of player I at time t and . Although Player II has heterogeneous resources of hybrid-enabling technology, Player I can produce electricity more efficiently with lower GHG-emission, ensure immediate reliability and affordability in energy production. Then Player II, can acquire practical outcomes of this hybrid-enabling technological advances. To promote the hybrid-enabling technology, Player II (leader) determine an optimal sharing effort level and an optimal subsidy. Then Player I (follower) choose their optimal sharing effort level according to the optimal sharing effort level and subsidy. This leads to a Stackelberg equilibrium. Let , denote the subsidy for hybrid-enabling technology, with Player II willing to pay to Payer I under collaboration. The objective functions of power plant I and power plant II satisfy the following partial differential equations
where and are the discount rates of Player I and Player II, respectively. In this feedback control strategy , L S R (t) ≥ 0, and are the control variables and . is the state variable. In feedback control process, it is assumed that players at every point in time have access to the current system and can make decisions accordingly to that state. Consequently, the players can respond to any disturbance in an optimal way. Hence, feedback strategies are robust for deviations and players can react to disturbances during the evolution of the game and adapt their actions accordingly, .
3. A Stackelberg game under heterogeneous technology
Theory of strong Stackelberg reasoning is an improved version of an earlier theory , which provides an explanation of coordination in all dyadic (two-player) common interest games. It provides an explanation of why players tend to choose strategies associated with a payoff-dominant Nash equilibrium. Its distinctive assumption is that players behave as though their co-players will anticipate any strategy choice and invariably choose a best reply to it. Stackelberg strategies resulting from this form of reasoning do not form Nash equilibria. The theory makes no predictions, because a non-equilibrium outcome is inherently unstable, leaving at least one player with a reason to choose differently and thereby achieve a better payoff. Strong Stackelberg reasoning is a simple theory, according to which players in dyadic games choose strategies that would maximize their own payoffs if their co-players could invariably anticipate their strategy choices and play counter-strategies that yield the maximum payoffs for themselves. The key assumption is relatively innocuous, first because game theory imposes no constraints on players’ beliefs, apart from consistency requirements, and second because the theory does not assume that players necessarily believe that their strategies will be anticipated, merely that they behave as though that is the case, as a heuristic aid to choosing the best strategy. Strong Stackelberg reasoning is, in fact, merely a generalization of the minorant and majorant models introduced by  and used to rationalize their solution of strictly competitive games.
To promote the sharing of hybrid-enabling technology, the Player II (the leader) determine an optimal sharing effort sharing level and an optimal subsidy scheme. Then the Player I (the follower) choose his/her optimal sharing level according to the optimal sharing effort level and subsidy. This leads to a Stackelberg equilibrium.
Proposition 5. If above conditions are satisfied, the feedback Stackelberg leader (Player II)-follower (Player I) and equilibria is given as:
where are the optimal effort level of hybrid-enabling technological improvements shared on renewable sources and fossil fuel at time t by Player I, respectively. are the optimal effort level of technological improvements shared on renewable sources and fossil fuel at time t by Player II, respectively.
The optimal level of subsidy for sharing hybrid-enabling on renewable sources is given by
Similarly, the optimal level of subsidy for sharing hybrid-enabling technology on fossil fuel is given by:
The optimal sharing payoff functions under hybrid-enabling technology on renewable sources and on fossil fuel for Player I and Player II are given below
where a1, a2, b1 and b2 are given in the proof.
Proof. We define the optimal revenue functions for Player I and Player II under hybrid-enabling technology as and , respectively, which are continuously differentiable. Applying HJB equation to , for Player I, we obtain
Via the first order conditions, we obtain the optimal values for Player I as:
where The optimal sharing revenue function, , for Player II and the associated HJB equation is
Via the first order conditions of (Eq. (40)), we obtain the optimal values for Player II as:
And the optimal value for
Hence, the solution of the HJB equation is an unary function with K (K as the independent variable), we define and , where are constants that need to be solved. Simplifying (Eq. (39)), obtain:
and simplifying (Eq. (40)), obtain:
This implies that,
Substituting the results of and into Eqs. (37), (38), (41) and (42), and simplifying, we obtain the optimal effort level of hybrid-enabling technological improvements. By substituting optimal values given in Eqs. (48)–(50) into Eqs. (46) and (47) obtain the optimal sharing payoff functions under hybrid-enabling technology on renewable sources and fossil fuel for Player I and Player II.
3.1 The limit of expectation and variance
The payoff of Player I and Player II, under the Stackelberg game paradigm is related to the improvement degree of hybrid-enabling technology via Proposition 4. To analyze the limit of expectations and variance under Stackelberg game equilibrium rewrite (Eq. (19)) as follows.
Proposition 6. The limit of expectation, and variancein the Stackelberg game feedback equilibrium must satisfy
Proof. Applying Itô’s lemma to (Eq. (51)), obtain:
Then and can be defined as:
Solving the above non-homogeneous linear differential equation, will obtain the results.
4. Nash non cooperative game
Under Nash-non-cooperative game setting, Player I and Player II simultaneously and independently choose their optimal efforts levels of heterogeneous hybrid-enabling technology sharing concept to maximize their profits.
Proposition 7. If above conditions are satisfied, the feedback non-cooperative game Nash equilibria will be:
where are the optimal level of hybrid-enabling technological advantage on renewable sources and on fossil fuel at time t for Player I, respectively. are the optimal level of hybrid-enabling technological advantage on fossil fuel and on renewable sources at time t for Player II, respectively.
The optimal sharing payoff functions under hybrid-enabling technology on renewable sources and on fossil fuel for Player I and Player II are given below
where and are given in the proof.
Proof. See Appendix A.
4.1 The limit of expectation and variance
Proposition 8. The limit of expectationand variancein the Nash non-cooperative game feedback equilibrium must satisfy
Poof of Proposition 8 is like the derivation of Proposition 6.
5. Cooperative game
Under cooperative game paradigm, Player I and Player II will choose to collaborate/share their hybrid-enabling technology development knowledge while sharing the payoff function in order to maximize their total payoffs. As a result, hybrid-enabling technology can be improved through this effort as well.
Proposition 9. If above conditions are satisfied, then the feedback cooperative equilibria are defined as
and the optimal cooperative payoff function under hybrid-enabling technology on renewable sources and on fossil fuel, respectively.
where is given in the proof.
Proof. The objective function (optimal sharing payoff function) satisfies the following equation.
Then the optimal revenue sharing function satisfies the following HJB equation
Via the first order conditions, now obtain the optimal values as:
Hence, the solution of the HJB equation is an unary function with K, where and are constant that need to be solved. This implies that
5.1 The limit of expectation and variance
Proposition 10. The limit of expectation and variance in cooperative game feedback equilibrium satisfy
Proof. Poof of Proposition 10 is like the derivation of Propositions 6 and 8.
6. Comparative analysis of equilibrium results
Proposition 11. The outcome of the game depends on the parameters of the game and the type of the equilibrium one considers.
Proof. (i) Player I, will participate in a Stackelberg game to share more hybrid-enabling technology under the condition that Player II pay much more extra cost for hybrid-enabling technology
(ii) Player I will prefer to participate in a cooperative game over a non-cooperative game with Player II under the condition such that
(iii) The total payoff for Player I under a Stackelberg game exceeds the total payoff of Nash non-cooperative game with Player II under the condition such that
(iv). Player II will prefer to participate in a cooperative game over a non-cooperative game with Player I under the condition such that
Proposition 12. For any K ≥ 0, under the condition that Player II pay an extra cost for sharing hybrid-enabling technology. Then the optimal sharing payoff of hybrid-enabling technology of Player I reaches higher than the optimal sharing payoff under the condition that player II does not provide extra cost. This implies that. Similarly, the optimal sharing payoff of hybrid-enabling technology of Player II reaches higher than the optimal sharing payoff under the condition that Player II do not provide extra cost, such that
Proof. When , establish that
7. Concluding remarks
In this chapter a complete study of an energy market by considering a Bertrand duopoly game with two power plants using endogenous hybrid-enabling technology was presented. Numerous game paradigms were articulated and defined including Stackelberg, Nash non-cooperative and cooperative games as well as their relevant equilibria via a feedback control strategy. Mathematically, the necessary conditions under which a power plant will move from taking part in a non-cooperative Nash game to participate as a leader in a Stackelberg game was derived. In doing so, this model allowed us to quantify the optimal level of subsidy for sharing the hybrid-enabling technology. We then adopted the concept of limit expectation and variance of the improvement degree to identify the influence of random factors of external environment and limitations of the decision maker. It is found that for a given level of payoff distribution the Stackelberg equilibria with technological enhancements, the knowledge sharing paradigm dominates the Nash equilibria. In both Stackelberg and Nash games, optimal technological enhancements for power plants were found to be proportional to the government subsidy, but the variance improvement degree of the Stackelberg game differed to the results of the Nash non-cooperative game due to the influence of random factors.
Furthermore, we have shown that due to optimal price reaction functions being upward sloping, the subsidy level plays a decisive role on the payoff function of power plant II as the leader in a Stackelberg game. This model shows that cost reducing R&D investments with efficient hybrid-enabling technology innovation/s strengthens one’s competitive bargaining position via the level of subsidy for Power Plant I to become a follower in the Stackelberg game. By analyzing this stochastic differential game model, we capture the government subsidy incentive as well as the subsidy that the leader (Power Plant II) pays the follower (Power Plant I) to share hybrid-enabling technology.
The proposed quantitative framework could assist policymakers when determining the appropriate R&D incentives for the development of hybrid-enabling technology within the energy market to achieve desired short and long-term environmental objectives with respect to budget limitations and environmental considerations.
Proof. The optimal profit function for power plant I satisfies the following HJB equation such that :
Via the first order conditions, first obtain the optimal values for power plant I as:
where The HJB for power plant (II), using then obtain
Hence, the solution of the HJB equation is an unary function with K, such that , . Hence, finally as:
Substituting into Eqs. (A.2), (A.3), (A.5) and (A.6) and simplifying, obtain the optima effort level of technological improvements. By substituting the above results into Eqs. (A.1) and (A.4) we obtain the optimal sharing payoff functions, for power plant (I) and power plant (II).