eISSN: 2574-8092

International Robotics & Automation Journal

Research Article Volume 5 Issue 5

Optimal decisions and expected values in two player zero sum games with diagonal game matrixes-explicit functions, general proofs and effects of parameter estimation errors

Peter Lohmander

Optimal Solutions in cooperation with Linnaeus University, Sweden

Correspondence: Peter Lohmander, Optimal Solutions in cooperation with Linnaeus University, Umeå, Sweden

Received: October 17, 2019 | Published: October 31, 2019

Citation: Lohmander P. Optimal decisions and expected values in two player zero sum games with diagonal game matrixes-explicit functions, general proofs and effects of parameter estimation errors. Int Rob Auto J. 2019;5(5):186–198. DOI: 10.15406/iratj.2019.05.00193


Abstract

In this paper, the two player zero sum games with diagonal game matrixes, TPZSGD, are analyzed. Many important applications of this particular class of games are found in military decision problems, in customs and immigration strategies and in police work. Explicit functions are derived that give the optimal frequencies of different decisions and the expected results of relevance to the different decision makers. Arbitrary numbers of decision alternatives are covered. It is proved that the derived optimal decision frequency formulas correspond to the unique optimization results of the two players. It is proved that the optimal solutions, for both players, always lead to a unique completely mixed strategy Nash equilibrium. For each player, the optimal frequency of a particular decision is strictly greater than 0 and strictly less than 1. With comparative statics analyses, the directions of the changes of optimal decision frequencies and expected game values, as functions of changes in different parameter values, are determined. The signs of the optimal changes of the decision frequencies of the different players are also determined as functions of risk in different parameter values. Furthermore, the directions of changes of the expected optimal value of the game are determined as functions of risk in the different parameter values. Finally, some of the derived formulas are used to confirm earlier game theory results presented in the literature. It is demonstrated that the new functions can be applied to solve common military problems.

Keywords: optimal decisions, completely mixed strategy Nash equilibrium, zero sum game theory, stochastic games

Introduction

What is the optimal strategy of a decision maker, BLUE, such as an individual or organization, when at least one more decision maker, RED, can influence the outcomes? This is a typical question in game theory.

Game theory is a field of research that contains large numbers of studies with different assumptions concerning the number of players, the kinds of decisions that can be taken by the different participants and the degree of information available to the different decision makers at different points in time.

Luce and Raiffa1 give a general description of most of the game theory literature. Some of the highly important and original publications in the field are Nash,2 von Neumann,3 and Dresher.4 Chiang5 covers two person zero sum games and most other methods and theories of general mathematical economics. Isaacs6 develops dynamic games with and without stochastic events in continuous and discrete time. In Braun7 we find a section where differential equations are used to model and describe the development of games of conflict, with several examples of real applications.

Lohmander8 contains a new approach to dynamic games of conflict with two players. It includes a stochastic dynamic programming, SDP, model with a linear programming, LP, or quadratic programming, QP, model as a subroutine. The LP or QP can be used to solve static game problems, such as two person zero sum games, TPZSGs, for each state and stage in the SDP model. The outcomes of the repeated games move the positions in state space (change the states to new states) with different transition probabilities, in the following periods, within the SDP model. The SDP model solves the complete dynamic and stochastic game over a time horizon with several periods.

During the history of game theory, the TPZSGs have always gained considerable theoretical and practical interest. A detailed treatment is given by Luce and Raiffa.1 Several kinds of TPZSGs with large numbers of military applications are well described by Washburn.9 This can serve as a good introduction to the analysis in this paper. A Nash equilibrium is the normal outcome of LP solutions to TPZSGs. It is however important to be aware that the Nash equilibrium cannot always be expected to be the result in real world games. If the strategies of the players are gradually adjusted based on the observations of the decision frequencies of the other players, mixed strategy probability orbits (constrained cycles) may develop. Convergence to the Nash equilibrium cannot always be expected. Lohmander10 has developed a dynamic model and described these possibilities. Herings et al.11 focus on stationary equilibria in stochastic games. They are interested in model structure, selection and computation. Babu et al.12 give a good historical introduction to the literature on stochastic games. They also develop some new results in the area of equilibrium strategies of dynamic games based on mixed strategy assumptions within static games.

In this paper, a particular class of TPZSGs will be analyzed, namely two player zero sum games with diagonal game matrixes where all diagonal elements are strictly positive. Let us denote them TPZSGDs. This may seem to be a highly particular, constrained and irrelevant class of games. However, this is not true. A large number of obvious and economically very important real world applications of this class of games exist, in particular in military applications, in customs problems and in police work. Lohmander13 defines, describes and solves four different types of military TPZSGD decision problems with this methodology. These problems include:

  1. The selection of roads for transport when enemy forces may prepare attacks along different roads with different expected outcomes,
  2. The selection of roads where attacks on enemy transports should be prepared,
  3. The positioning of guard squads and
  4. The positioning of intelligence, reconnaissance and sabotage groups.

Game theory literature usually focuses on very general classes of games, without giving special attention to the TPZSG, and the even more specific TPZSGD, classes.

In this paper, explicit functions of the optimal decision frequencies and the expected results of relevance to the different players are derived for situations with arbitrary numbers of decision alternatives.

In the earlier game theory literature, when general classes of games are analyzed, it has usually not been possible to derive explicit functions. Earlier studies are mostly focused on general principles, proofs of the existence of solutions and numerical algorithms to calculate solutions in particular numerically specified situations.

One of the general results derived and proved in this paper is that, for every game in the TPZSGD class, the optimal strategy, for both players, always leads to a unique and completely mixed strategy Nash equilibrium. This means that, for each player, the optimal frequency of every possible decision, is strictly greater than 0 and strictly less than 1.

This result is critical to analytical TPZSGD game theory. It makes it possible to instantly determine the equation system that should be used to calculate the optimal decision frequencies. Hence, the optimal decision frequencies become possible to analyze with general analytical methods. Explicit functions can be derived for arbitrary numbers of decision options and for all possible elements in the game matrix. In other words, we do not have to handle every particular case with numerical methods.

In the existing literature on game theory, such a proof is not easily found. This problem is usually avoided by intuitive arguments and reasonable assumptions. The book by Washburn9 is one such example. A similar case is found in Babu et al.12 They avoid showing that the Nash equilibrium, which they analyze, really is completely mixed. Babu et al.12 simply assume the existence of a particular probability vector. In this paper, the existence of such a probability vector will be proved for a diagonal game matrix where all diagonal elements are strictly positive. It will also be proved that all elements of the probability vector are strictly positive and strictly less than one. Furthermore, explicit functions will be derived for the optimal decision frequencies and for the value of the game.

Thanks to the derived functions, it is also possible to perform explicit sensitivity analyses and to determine the directions of changes of optimal decision frequencies and expected results if the direction of change of a particular parameter is known.

In this study, it has been possible to derive explicit results in an area that is highly relevant in real applications: How are the optimal decision frequencies of the different players changed if the level of risk of some parameter(s) change(s)? Related results have earlier been derived in stochastic dynamic "one player" problems by Lohmander.14 First, relevant functions of decisions and expected game values are determined. The first and second derivatives are determined and signed. Then, the Jensen inequality is used to determine the directions of change of the optimal decision frequencies and expected game values under the influence of increasing risk in the different parameter values.

Analysis

A TPZSGD will now be analyzed in the most general way. BLUE is the maximizer, who selects the row, i. RED is the minimizer, who selects the column, j. The decision of BLUE is not known by RED before RED takes a decision, and the decision of RED is not known by BLUE before BLUE takes the decision. The game matrix, A(i,j), is diagonal. All diagonal elements $c_{ij} = A(i,j)$ are strictly positive and represent the reward that BLUE obtains from RED in case i = j. (The reward that BLUE obtains is equal to the loss that RED gets.) In case $i \neq j$, the reward is zero. Equations (2.1) and (2.2) define these conditions.

$c_{ij}|_{i \neq j} = 0, \quad i=1,\ldots,n, \; j=1,\ldots,n$ (2.1)

$c_{ij}|_{i=j} = g_i > 0, \quad i=1,\ldots,n, \; j=1,\ldots,n$ (2.2)

A concrete example is the following: RED should move an army convoy from one city to another. One road, among the existing n available roads, should be selected. BLUE wants to destroy as many RED trucks as possible. RED sends the convoy via road j and BLUE moves the equipment and troops to road i and prepares an attack there. If i = j, BLUE attacks RED and destroys the number of RED trucks given by the diagonal element of the game matrix where i = j. If BLUE and RED select different roads, no attack takes place and no trucks are destroyed: $A(i,j) = 0$ for $i \neq j$.

Different roads usually have different properties with respect to slope, curvature, protection, options to hide close to the road and so on. As a consequence, the values of the diagonal elements of the game matrix, $A(i,j) > 0$ for $i = j$, are usually not the same for different values of i.
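To make the game structure concrete in computational form, the following short Python sketch (an illustration only; the payoff values are hypothetical and not from the paper) builds the diagonal game matrix A from a vector g, as defined in (2.1) and (2.2), and evaluates the expected reward of BLUE when both players use mixed strategies:

```python
import numpy as np

# Hypothetical diagonal payoffs g_i > 0: the number of RED trucks destroyed
# if BLUE and RED both select road i, as in (2.1)-(2.2).
g = np.array([2.0, 3.0, 5.0])
A = np.diag(g)  # A(i,j) = g_i if i = j, and 0 otherwise

def expected_reward(x, y, A):
    """Expected reward of BLUE (= expected loss of RED) when BLUE selects
    row i with probability x[i] and RED selects column j with probability y[j]."""
    return x @ A @ y

# Example: both players mix uniformly over the n roads.
n = len(g)
x = np.full(n, 1.0 / n)
y = np.full(n, 1.0 / n)
print(expected_reward(x, y, A))  # (2 + 3 + 5) / 9 = 1.111...
```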

The maximization problem of BLUE

The maximization problem of BLUE is defined here. The expected reward, $x_0$, is the objective function, which is found in (2.1.1). The number of possible decisions is n and the probability of a particular decision, i, is $x_i$. The total probability cannot exceed 1, which is shown in (2.1.2). $g_i$ is defined in (2.2). Since RED can select any decision, j, $x_0$ is constrained via (2.1.3). Furthermore, no probability can be negative, which is seen in (2.1.4).

$\max x_0$ (2.1.1)

s.t.

$\sum_{i=1}^{n} x_i \leq 1$ (2.1.2)

$x_0 \leq g_i x_i, \quad i=1,\ldots,n$ (2.1.3)

$x_i \geq 0, \quad i=1,\ldots,n$ (2.1.4)

Let $\lambda_i$ denote dual variables. The following Lagrange function is defined:

$L = x_0 + \lambda_0 \left( 1 - \sum_{i=1}^{n} x_i \right) + \sum_{i=1}^{n} \lambda_i (g_i x_i - x_0)$ (2.1.5)

The following derivatives will be needed in the subsequent analysis:

$\frac{dL}{d\lambda_0} = 1 - \sum_{i=1}^{n} x_i \geq 0$ (2.1.6)

$\frac{dL}{d\lambda_i} = g_i x_i - x_0 \geq 0, \quad i=1,\ldots,n$ (2.1.7)

$\frac{dL}{dx_0} = 1 - \sum_{i=1}^{n} \lambda_i \leq 0$ (2.1.8)

$\frac{dL}{dx_i} = \lambda_i g_i - \lambda_0 \leq 0, \quad i=1,\ldots,n$ (2.1.9)

Karush-Kuhn-Tucker conditions in general problems

In general problems, we may have different numbers of decision variables and constraints. Furthermore, the elements $c_{ij}|_{i \neq j}$ are not necessarily zero (Table 1).

$\lambda_i \geq 0 \quad \forall i$

$\frac{dL}{d\lambda_i} \geq 0 \quad \forall i$

$\lambda_i \frac{dL}{d\lambda_i} = 0 \quad \forall i$

$x_j \geq 0 \quad \forall j$

$\frac{dL}{dx_j} \leq 0 \quad \forall j$

$x_j \frac{dL}{dx_j} = 0 \quad \forall j$

Table 1 Karush-Kuhn-Tucker conditions in general maximization problems

Particular conditions in problems that satisfy (2.1) and (2.2)

Note that in these problems, i=j in all relevant constraints.

$\lambda_i \geq 0 \quad \forall i$ (2.1.10)

$\frac{dL}{d\lambda_i} \geq 0 \quad \forall i$ (2.1.11)

$\lambda_i \frac{dL}{d\lambda_i} = 0 \quad \forall i$ (2.1.12)

$x_i \geq 0 \quad \forall i$ (2.1.13)

$\frac{dL}{dx_i} \leq 0 \quad \forall i$ (2.1.14)

$x_i \frac{dL}{dx_i} = 0 \quad \forall i$ (2.1.15)

Proof 1: Proof that $x_0^* > 0$:

(2.1.2) and (2.1.4) make it feasible to let $x_i > 0, \; i=1,\ldots,n$.

(2.2) says that $g_i > 0, \; i=1,\ldots,n$.

When $g_i x_i > 0, \; i=1,\ldots,n$, (2.1.3) makes it feasible to let $x_0 > 0$.

(2.1.1) states that we want to maximize $x_0$. Let stars indicate optimal values.

Hence, when optimal decisions are taken, $x_0 = x_0^* > 0$.

Proof 2: Proof that $x_i^* > 0, \; i=1,\ldots,n$:

(2.1.7) says that $\frac{dL}{d\lambda_i} = g_i x_i - x_0 \geq 0, \; i=1,\ldots,n$.

Proof 1 states that $x_0 > 0$. (2.2) says that $g_i > 0, \; i=1,\ldots,n$.

$x_i \geq \frac{x_0}{g_i} > 0, \quad i=1,\ldots,n$.

Hence, $x_i = x_i^* > 0, \; i=0,\ldots,n$.

Proof 3: Proof that $\lambda_i^*, \; i=0,\ldots,n$, can be determined from a linear equation system.

$(x_i > 0, \; i=0,\ldots,n) \wedge (2.1.15) \Rightarrow \left\{ \frac{dL}{dx_0} = 0; \; \frac{dL}{dx_i} = 0, \; i=1,\ldots,n \right\} = \{(2.1.16) \wedge (2.1.17)\}$.

$\frac{dL}{dx_0} = 1 - \sum_{i=1}^{n} \lambda_i = 0$ (2.1.16)

$\frac{dL}{dx_i} = \lambda_i g_i - \lambda_0 = 0, \quad i=1,\ldots,n$ (2.1.17)

Proof 4: Proof that $\lambda_i^* > 0, \; i=0,\ldots,n$.

(2.1.16) $\Rightarrow \exists i \,|\, i > 0, \; \lambda_i > 0$.

Hence, at least for one strictly positive value of i, $\lambda_i$ is strictly greater than zero.

$(\exists i \,|\, i > 0, \; \lambda_i > 0) \wedge (g_i > 0, \; i=1,\ldots,n) \wedge (2.1.17) \Rightarrow \lambda_0 > 0$.

$\lambda_0 > 0$ (2.1.18)

$(2.1.17) \wedge (g_i > 0, \; i=1,\ldots,n) \wedge (2.1.18) \Rightarrow (\lambda_i > 0, \; i=1,\ldots,n)$

$\lambda_i > 0, \quad i=1,\ldots,n$ (2.1.19)

$(2.1.18) \wedge (2.1.19) \Rightarrow (\lambda_i > 0, \; i=0,\ldots,n)$

$\lambda_i^* > 0, \quad i=0,\ldots,n$ (2.1.20)

Proof 5: Proof that $x_i^*, \; i=1,\ldots,n$, can be determined from a linear equation system.

$(\lambda_i > 0, \; i=0,\ldots,n) \wedge (2.1.12) \Rightarrow \left\{ \frac{dL}{d\lambda_0} = 0; \; \frac{dL}{d\lambda_i} = 0, \; i=1,\ldots,n \right\} = \{(2.1.21) \wedge (2.1.22)\}$.

$\frac{dL}{d\lambda_0} = 1 - \sum_{i=1}^{n} x_i = 0$ (2.1.21)

$\frac{dL}{d\lambda_i} = g_i x_i - x_0 = 0, \quad i=1,\ldots,n$ (2.1.22)

Determination of explicit equations that give all values $x_i^*, \; i=0,\ldots,n$:

(2.1.22) $\Rightarrow$ (2.1.23).

$x_i = \frac{x_0}{g_i}, \quad i=1,\ldots,n$ (2.1.23)

(2.1.21) $\Rightarrow$ (2.1.24).

$\sum_{i=1}^{n} x_i = 1$ (2.1.24)

$\sum_{i=1}^{n} \frac{x_0}{g_i} = 1$ (2.1.25)

$\sum_{i=1}^{n} \frac{1}{g_i} = \frac{1}{x_0}$ (2.1.26)

$x_0 = \frac{1}{\sum_{i=1}^{n} \frac{1}{g_i}}$ (2.1.27)

$x_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.1.28)

$x_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (2.1.29)

Determination of explicit equations that give all values $\lambda_i^*, \; i=0,\ldots,n$:

(2.1.17) $\Rightarrow$ (2.1.30).

$\lambda_i = \frac{\lambda_0}{g_i}, \quad i=1,\ldots,n$ (2.1.30)

(2.1.16) $\Rightarrow$ (2.1.31).

$\sum_{i=1}^{n} \lambda_i = 1$ (2.1.31)

$\sum_{i=1}^{n} \frac{\lambda_0}{g_i} = 1$ (2.1.32)

$\sum_{i=1}^{n} \frac{1}{g_i} = \frac{1}{\lambda_0}$ (2.1.33)

$\lambda_0 = \frac{1}{\sum_{i=1}^{n} \frac{1}{g_i}}$ (2.1.34)

$\lambda_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.1.35)

$\lambda_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (2.1.36)

Observations:

$x_0^* = \lambda_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.1.37)

$x_i^* = \lambda_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (2.1.38)

The minimization problem of RED

We are interested in the solution to $\min y_0$. The objective function is formulated as $\max(-y_0)$. The frequencies of the different decisions, i, are $y_i$.

$\max(-y_0)$ (2.2.1)

s.t.

$\sum_{i=1}^{n} y_i \geq 1$ (2.2.2)

$y_0 \geq g_i y_i, \quad i=1,\ldots,n$ (2.2.3)

$y_i \geq 0, \quad i=1,\ldots,n$ (2.2.4)

Proof that $y_0^* > 0$:

(2.2.2) $\Rightarrow$ (2.2.5).

$\exists i \,|\, 1 \leq i \leq n, \; y_i > 0$ (2.2.5)

$g_i > 0, \quad i=1,\ldots,n$ (2.2.6)

(2.2.3) $\wedge$ (2.2.5) $\wedge$ (2.2.6) $\Rightarrow$ (2.2.7).

$y_0^* > 0$ (2.2.7)

Let $\mu_i$ denote dual variables. The following Lagrange function is defined for RED:

$L_2 = -y_0 + \mu_0 \left( \sum_{i=1}^{n} y_i - 1 \right) + \sum_{i=1}^{n} \mu_i (y_0 - g_i y_i)$ (2.2.8)

These derivatives will be needed in the analysis:

$\frac{dL_2}{d\mu_0} = \sum_{i=1}^{n} y_i - 1 \geq 0$ (2.2.9)

$\frac{dL_2}{d\mu_i} = y_0 - g_i y_i \geq 0, \quad i=1,\ldots,n$ (2.2.10)

$\frac{dL_2}{dy_0} = -1 + \sum_{i=1}^{n} \mu_i \leq 0$ (2.2.11)

$\frac{dL_2}{dy_i} = \mu_0 - \mu_i g_i \leq 0, \quad i=1,\ldots,n$ (2.2.12)

Proof that $y_i^* > 0, \; i=0,\ldots,n$:

According to (2.2.1), we want to maximize $-y_0$, which implies that we minimize $y_0$.

(2.2.2) $\Rightarrow \sum_{i=1}^{n} y_i \geq 1$

(2.2.4) $\Rightarrow y_i \geq 0, \; i=1,\ldots,n$

Let us start from an infeasible point, the origin, and move to a feasible point in the way that keeps $y_0$ as low as possible. Initially, let $(y_1,\ldots,y_n) = (0,\ldots,0)$. According to (2.2.2), this point is not feasible.

(2.2.3) $\Rightarrow \min y_0 \,|\, (y_i = 0, \; i=1,\ldots,n) = 0$.

Now, we have to move away from the infeasible point $(y_1,\ldots,y_n) = (0,\ldots,0)$. We have to reach a point that satisfies $\sum_{i=1}^{n} y_i \geq 1$ without increasing $y_0$ more than necessary. To find a point that satisfies (2.2.2), we have to increase the value of at least one of the $y_i \,|\, i \in \{1,\ldots,n\}$. Select one arbitrary index $k \,|\, 1 \leq k \leq n$. To simplify the exposition, we let k = 1. According to (2.2.3): If we increase $y_1$ by $dy_1$, then $y_0$ increases by $g_1 dy_1$, as long as $dy_i = 0, \; i=2,\ldots,n$. Hence, $dy_0 = g_1 dy_1$. Let $z = dy_0 = g_1 dy_1$.

However, when $dy_1 > 0$, we may also partly increase $y_i, \; i=2,\ldots,n$, without increasing $dy_0$ above z. This follows from (2.2.3) and (2.2.10). Since we want to satisfy (2.2.2), we want to increase $\sum_{i=1}^{n} y_i$ as much as possible, without increasing $dy_0$ above z. Hence, we select:

$g_i \, dy_i = z = g_1 \, dy_1, \quad i=2,\ldots,n$ (2.2.13)

$dy_i = \frac{g_1}{g_i} dy_1, \quad i=2,\ldots,n$ (2.2.14)

$(dy_1 > 0) \wedge (g_i > 0, \; i=1,\ldots,n) \Rightarrow dy_i > 0, \; i=2,\ldots,n$ (2.2.15)

Since we started at the origin, we have

$y_i = dy_i + 0 > 0, \quad i=1,\ldots,n$ (2.2.16)

We already know that $y_0^* > 0$. Hence:

$y_i^* > 0, \quad i=0,\ldots,n$ (2.2.17)

Observation: The following direct method can be used to solve the optimization problem of RED.

First, remember that $y_0^* = dy_0^* + 0 = z$. We may directly determine the optimal values $y_i^*, \; i=0,\ldots,n$, without using the Lagrange function and KKT conditions, in this way:

$\sum_{i=1}^{n} y_i = ((dy_1 + 0) + (dy_2 + 0) + \ldots + (dy_n + 0)) = 1$ (2.2.18)

$\sum_{i=1}^{n} y_i = (y_1 + y_2 + \ldots + y_n) = 1$ (2.2.19)

$\sum_{i=1}^{n} y_i = \left( \frac{z}{g_1} + \frac{g_1}{g_2} \frac{z}{g_1} + \ldots + \frac{g_1}{g_n} \frac{z}{g_1} \right) = 1$ (2.2.20)

$\sum_{i=1}^{n} y_i = \left( \frac{z}{g_1} + \frac{z}{g_2} + \ldots + \frac{z}{g_n} \right) = 1$ (2.2.21)

$\left( \frac{1}{g_1} + \frac{1}{g_2} + \ldots + \frac{1}{g_n} \right) = \frac{1}{z}$ (2.2.22)

$\sum_{i=1}^{n} g_i^{-1} = \frac{1}{z}$ (2.2.23)

$y_0^* = z = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.2.24)

$y_i^* = g_i^{-1} y_0^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (2.2.25)
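The constructive argument above can be followed step by step in code. This sketch (with an assumed payoff vector; the variable names are illustrative) raises $y_1$, raises the other $y_i$ at the rates $g_1 / g_i$ from (2.2.14), and scales the vector until the probabilities sum to one, reproducing (2.2.24) and (2.2.25):

```python
import numpy as np

g = np.array([2.0, 3.0])   # assumed diagonal payoffs g_i > 0

# Step 1: raise y_1 by some dy_1 > 0; y_0 rises to z = g_1 * dy_1.
dy1 = 0.1
z = g[0] * dy1

# Step 2: raise every other y_i as far as possible without pushing
# y_0 above z, i.e. dy_i = (g_1 / g_i) * dy_1, as in (2.2.14).
dy = (g[0] / g) * dy1

# Step 3: since z grows linearly with dy_1, scale the whole vector
# until the probabilities sum to one, as required by (2.2.18).
scale = 1.0 / dy.sum()
y = scale * dy
z = scale * z

print(y)                      # [0.6, 0.4] = y_i* in (2.2.25)
print(z)                      # 1.2 = y_0* in (2.2.24)
print(np.allclose(g * y, z))  # True: every g_i * y_i equals y_0*
```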

Proof that $\mu_i^*, \; i=0,\ldots,n$, can be solved via a linear equation system and that $\mu_i^* > 0, \; i=0,\ldots,n$.

Since $y_i^* > 0, \; i=0,\ldots,n$, we may determine $\mu_i^*, \; i=0,\ldots,n$, via a linear equation system.

$\left( y_i \frac{dL_2}{dy_i} = 0, \; i=0,\ldots,n \right) \wedge (y_i > 0, \; i=0,\ldots,n) \Rightarrow \left( \frac{dL_2}{dy_i} = 0, \; i=0,\ldots,n \right)$

$\frac{dL_2}{dy_0} = -1 + \sum_{q=1}^{n} \mu_q = 0$ (2.2.26)

$\frac{dL_2}{dy_i} = \mu_0 - \mu_i g_i = 0, \quad i=1,\ldots,n$ (2.2.27)

(2.2.26) $\Rightarrow \exists i \,|\, 1 \leq i \leq n, \; \mu_i > 0$ (2.2.28)

$(g_i > 0, \; i=1,\ldots,n) \wedge (2.2.27) \wedge (2.2.28) \Rightarrow \mu_0 > 0$ (2.2.29)

$(g_i > 0, \; i=1,\ldots,n) \wedge (2.2.27) \wedge (2.2.29) \Rightarrow (\mu_i > 0, \; i=1,\ldots,n)$ (2.2.30)

$(2.2.29) \wedge (2.2.30) \Rightarrow (\mu_i > 0, \; i=0,\ldots,n)$ (2.2.31)

Proof that $y_i^*, \; i=0,\ldots,n$, can be solved via a linear equation system and that $y_i^* > 0, \; i=0,\ldots,n$.

Since $\mu_i^* > 0, \; i=0,\ldots,n$, we may determine $y_i^*, \; i=0,\ldots,n$, via a linear equation system.

$\left( \mu_i \frac{dL_2}{d\mu_i} = 0, \; i=0,\ldots,n \right) \wedge (\mu_i > 0, \; i=0,\ldots,n) \Rightarrow \left( \frac{dL_2}{d\mu_i} = 0, \; i=0,\ldots,n \right)$

$\frac{dL_2}{d\mu_0} = \sum_{q=1}^{n} y_q - 1 = 0$ (2.2.32)

$\frac{dL_2}{d\mu_i} = y_0 - g_i y_i = 0, \quad i=1,\ldots,n$ (2.2.33)

(2.2.32) $\Rightarrow \exists i \,|\, 1 \leq i \leq n, \; y_i > 0$ (2.2.34)

$(g_i > 0, \; i=1,\ldots,n) \wedge (2.2.33) \wedge (2.2.34) \Rightarrow y_0 > 0$ (2.2.35)

$(g_i > 0, \; i=1,\ldots,n) \wedge (2.2.33) \wedge (2.2.35) \Rightarrow (y_i > 0, \; i=1,\ldots,n)$ (2.2.36)

$(2.2.35) \wedge (2.2.36) \Rightarrow (y_i > 0, \; i=0,\ldots,n)$ (2.2.37)

Determination of explicit equations that give all values $y_i^*, \; i=0,\ldots,n$:

(2.2.33) $\Rightarrow$ (2.2.38).

$y_i = \frac{y_0}{g_i}, \quad i=1,\ldots,n$ (2.2.38)

(2.2.32) $\Rightarrow$ (2.2.39).

$\sum_{i=1}^{n} y_i = 1$ (2.2.39)

$\sum_{i=1}^{n} \frac{y_0}{g_i} = 1$ (2.2.40)

$\sum_{i=1}^{n} \frac{1}{g_i} = \frac{1}{y_0}$ (2.2.41)

$y_0 = \frac{1}{\sum_{i=1}^{n} \frac{1}{g_i}}$ (2.2.42)

$y_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.2.43)

$y_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (2.2.44)

Determination of explicit equations that give all values $\mu_i^*, \; i=0,\ldots,n$:

(2.2.27) $\Rightarrow$ (2.2.45).

$\mu_i = \frac{\mu_0}{g_i}, \quad i=1,\ldots,n$ (2.2.45)

(2.2.26) $\Rightarrow$ (2.2.46).

$\sum_{i=1}^{n} \mu_i = 1$ (2.2.46)

$\sum_{i=1}^{n} \frac{\mu_0}{g_i} = 1$ (2.2.47)

$\sum_{i=1}^{n} \frac{1}{g_i} = \frac{1}{\mu_0}$ (2.2.48)

$\mu_0 = \frac{1}{\sum_{i=1}^{n} \frac{1}{g_i}}$ (2.2.49)

$\mu_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.2.50)

$\mu_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (2.2.51)

Observations: 

$y_0^* = \mu_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.2.52)

$y_i^* = \mu_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (2.2.53)

Generalized Observations:

$x_0^* = \lambda_0^* = y_0^* = \mu_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.2.54)

$x_i^* = \lambda_i^* = y_i^* = \mu_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (2.2.55)
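A minimal computational sketch of the closed-form results (2.2.54) and (2.2.55) in Python (the function name and the test vector are illustrative assumptions, not from the paper):

```python
import numpy as np

def tpzsgd_solution(g):
    """Closed-form TPZSGD solution following (2.2.54)-(2.2.55): both players
    use the same completely mixed strategy, and the value of the game is
    the inverse of the sum of the inverted diagonal elements."""
    g = np.asarray(g, dtype=float)
    value = 1.0 / np.sum(1.0 / g)   # x0* = y0* = (sum of g_i^-1)^-1
    freq = value / g                # x_i* = y_i* = g_i^-1 * value
    return value, freq

value, freq = tpzsgd_solution([2.0, 3.0, 5.0])  # hypothetical payoffs
print(value)       # 0.9677... = 30/31
print(freq)        # [15/31, 10/31, 6/31]: all frequencies strictly in (0, 1)
print(freq.sum())  # 1.0 (up to floating point): a completely mixed strategy
```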

Sensitivity analyses

First, the sensitivity analyses will concern these variables: $x_0^* = \lambda_0^* = y_0^* = \mu_0^*$. How do these variables change under the influence of changing elements in the game matrix?

Observation: $x_0^* = \lambda_0^* = y_0^* = \mu_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$

Proof that $\frac{dx_0^*}{dg_i} > 0$ and $\frac{d^2 x_0^*}{dg_i^2} < 0$:

$x_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.3.1)

$\frac{dx_0^*}{dg_i} = (-1) \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-2} (-g_i^{-2})$ (2.3.2)

$\frac{dx_0^*}{dg_i} = g_i^{-2} \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-2} > 0$ (2.3.3)

$\frac{d^2 x_0^*}{dg_i^2} = -2 g_i^{-3} \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-2} + g_i^{-2} (-2) \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-3} (-1) g_i^{-2}$ (2.3.4)

$\frac{d^2 x_0^*}{dg_i^2} = -2 g_i^{-3} \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-2} \left( 1 - g_i^{-1} \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1} \right)$ (2.3.5)

$\frac{d^2 x_0^*}{dg_i^2} = -2 g_i^{-1} (x_i^*)^2 (1 - x_i^*)$ (2.3.6)

$(0 < x_i^* < 1) \wedge (g_i > 0) \Rightarrow \frac{d^2 x_0^*}{dg_i^2} < 0$ (2.3.7)

Observation: $x_0^*$ is a strictly increasing and strictly concave function of each $g_i$. From the Jensen inequality, it follows that increasing risk in $g_i$ will reduce the expected value of $x_0^*$. Compare Figure 1.

Figure 1 In this graph, the horizontal axis represents $E(g_1)$, the expected value of $g_1$. Here, $g_1$ is a stochastic variable. There are two possible outcomes, namely $E(g_1) - 1$ and $E(g_1) + 1$, with probabilities ½ and ½ respectively. The vertical axis shows $x_0^*(E(g_1))$, the optimal objective function value as a function of the expected value of $g_1$, and $E(x_0^*(g_1))$, the expected value of the optimal objective function value $x_0^*$ as a function of the value of $g_1$. The graph also includes a linear approximation of $x_0^*(E(g_1))$ based on the values of $x_0^*(E(g_1))$ for $E(g_1) = 1$ and for $E(g_1) = 3$. This linear approximation is equal to $E(x_0^*(g_1))$ for $E(g_1) = 2$. According to the Jensen inequality, $E(x_0^*(g_1)) < x_0^*(E(g_1))$ when $x_0^*(E(g_1))$ is a strictly concave function and $g_1$ is a stochastic variable. This graph illustrates that the Jensen inequality is correct. The graph also illustrates the general conclusion that the expected optimal objective function value $E(x_0^*(g_1))$ is a strictly decreasing function of the level of risk in $g_1$.
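The Jensen effect described in Figure 1 is easy to check numerically. The following sketch (with assumed parameter values chosen to match the figure: $g_2 = 3$ fixed, $E(g_1) = 2$) compares $x_0^*(E(g_1))$ with $E(x_0^*(g_1))$ for the two-point distribution:

```python
def x0_star(g1, g2):
    """Optimal game value (2.2.54) for n = 2."""
    return 1.0 / (1.0 / g1 + 1.0 / g2)

g2 = 3.0                         # assumed fixed second diagonal element
Eg1 = 2.0                        # expected value of the stochastic g1
lo, hi = Eg1 - 1.0, Eg1 + 1.0    # two outcomes with probabilities 1/2 and 1/2

value_at_mean = x0_star(Eg1, g2)                               # x0*(E(g1)) = 1.2
mean_of_value = 0.5 * x0_star(lo, g2) + 0.5 * x0_star(hi, g2)  # E(x0*(g1)) = 1.125

print(mean_of_value < value_at_mean)  # True, as the Jensen inequality predicts
```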

Second, the sensitivity analyses will concern these variables: $x_i^* = \lambda_i^* = y_i^* = \mu_i^*, \; i=1,\ldots,n$. How do these variables change under the influence of changing elements in the game matrix?

Observation: $x_i^* = \lambda_i^* = y_i^* = \mu_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \; i=1,\ldots,n$

Proof that $\frac{dx_i^*}{dg_i} < 0$ and $\frac{d^2 x_i^*}{dg_i^2} > 0$, for $i \in \{1,\ldots,n\}$:

$x_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (2.3.8)

$\frac{dx_i^*}{dg_i} = -g_i^{-2} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1} + g_i^{-1} (-1) \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-2} (-g_i^{-2})$ (2.3.9)

$\frac{dx_i^*}{dg_i} = g_i^{-2} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1} \left( -1 + g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1} \right)$ (2.3.10)

$\frac{dx_i^*}{dg_i} = g_i^{-1} x_i^* (-1 + x_i^*)$ (2.3.11)

$(g_i > 0) \wedge (0 < x_i^* < 1) \Rightarrow \frac{dx_i^*}{dg_i} < 0$ (2.3.12)

$\frac{d^2 x_i^*}{dg_i^2} = -g_i^{-2} x_i^* (x_i^* - 1) + g_i^{-1} \left( g_i^{-1} x_i^* (x_i^* - 1) \right) (x_i^* - 1) + g_i^{-1} x_i^* g_i^{-1} x_i^* (x_i^* - 1)$ (2.3.13)

$\frac{d^2 x_i^*}{dg_i^2} = g_i^{-2} \left( -x_i^* (x_i^* - 1) + (x_i^* (x_i^* - 1))(x_i^* - 1) + x_i^* x_i^* (x_i^* - 1) \right)$ (2.3.14)

$\frac{d^2 x_i^*}{dg_i^2} = g_i^{-2} \left( -(x_i^*)^2 + x_i^* + x_i^* \left( (x_i^*)^2 - 2 x_i^* + 1 \right) + (x_i^*)^2 (x_i^* - 1) \right)$ (2.3.15)

$\frac{d^2 x_i^*}{dg_i^2} = g_i^{-2} \left( -(x_i^*)^2 + x_i^* + (x_i^*)^3 - 2 (x_i^*)^2 + x_i^* + (x_i^*)^3 - (x_i^*)^2 \right)$ (2.3.16)

$\frac{d^2 x_i^*}{dg_i^2} = g_i^{-2} \left( 2 (x_i^*)^3 - 4 (x_i^*)^2 + 2 x_i^* \right)$ (2.3.17)

$\frac{d^2 x_i^*}{dg_i^2} = 2 g_i^{-2} x_i^* \left( (x_i^*)^2 - 2 x_i^* + 1 \right)$ (2.3.18)

$\frac{d^2 x_i^*}{dg_i^2} = 2 g_i^{-2} x_i^* (x_i^* - 1)^2$ (2.3.19)

$(g_i > 0) \wedge (0 < x_i^* < 1) \Rightarrow \frac{d^2 x_i^*}{dg_i^2} > 0$ (2.3.20)
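As a quick numerical sanity check of (2.3.11) and (2.3.19), the analytic derivatives can be compared with central finite differences (a sketch with assumed parameter values and n = 2):

```python
def x1_star(g1, g2):
    """Optimal frequency x1* from (2.3.8) with n = 2."""
    return (1.0 / g1) / (1.0 / g1 + 1.0 / g2)

g1, g2, h = 2.0, 3.0, 1e-5
x1 = x1_star(g1, g2)

# Analytic first and second derivatives, (2.3.11) and (2.3.19):
d1_analytic = x1 * (x1 - 1.0) / g1                    # -0.12
d2_analytic = 2.0 * x1 * (x1 - 1.0) ** 2 / g1 ** 2    # 0.048

# Central finite differences in g1:
d1_numeric = (x1_star(g1 + h, g2) - x1_star(g1 - h, g2)) / (2.0 * h)
d2_numeric = (x1_star(g1 + h, g2) - 2.0 * x1 + x1_star(g1 - h, g2)) / h ** 2

print(d1_analytic, d1_numeric)  # both negative: x1* decreases in g1
print(d2_analytic, d2_numeric)  # both positive: x1* is convex in g1
```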

Observation: $x_i^*$ is a strictly decreasing and strictly convex function of $g_i$. From the Jensen inequality, it follows that increasing risk in $g_i$ will increase the expected value of $x_i^*$. Compare Figure 2.

Figure 2 In this graph, the horizontal axis represents $E(g_1)$, the expected value of $g_1$. Here, $g_1$ is a stochastic variable. There are two possible outcomes, namely $E(g_1) - 1$ and $E(g_1) + 1$, with probabilities ½ and ½ respectively. The vertical axis shows the optimal decision frequency $x_1^*(E(g_1))$ as a function of the expected value of $g_1$, and $E(x_1^*(g_1))$, the expected value of the optimal frequency $x_1^*$ as a function of the value of $g_1$. The graph also includes a linear approximation of $x_1^*(E(g_1))$ based on the values of $x_1^*(E(g_1))$ for $E(g_1) = 1$ and for $E(g_1) = 3$. This linear approximation is equal to $E(x_1^*(g_1))$ for $E(g_1) = 2$. According to the Jensen inequality, $E(x_1^*(g_1)) > x_1^*(E(g_1))$ when $x_1^*(E(g_1))$ is a strictly convex function and $g_1$ is a stochastic variable. This graph illustrates that the Jensen inequality is correct. The graph also illustrates the general conclusion that the expected optimal decision frequency $E(x_1^*(g_1))$ is a strictly increasing function of the level of risk in $g_1$.

Proof that $\frac{dx_k^*}{dg_i} > 0$ and $\frac{d^2 x_k^*}{dg_i^2} < 0$, for $i \in \{1,\ldots,n\}$, $k \in \{1,\ldots,n\}$, $i \neq k$:

$x_k^* = g_k^{-1} \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (2.3.21)

$\frac{dx_k^*}{dg_i} \Big|_{i \neq k} = g_k^{-1} (-1) \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-2} (-g_i^{-2})$ (2.3.22)

$\frac{dx_k^*}{dg_i} \Big|_{i \neq k} = g_k^{-1} g_i^{-2} \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-2}$ (2.3.23)

$(g_m > 0, \; m=1,\ldots,n) \Rightarrow \frac{dx_k^*}{dg_i} \Big|_{i \neq k} > 0$ (2.3.24)

$\frac{d^2 x_k^*}{dg_i^2} \Big|_{i \neq k} = g_k^{-1} \left( -2 g_i^{-3} \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-2} + g_i^{-2} (-2) \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-3} (-g_i^{-2}) \right)$ (2.3.25)

$\frac{d^2 x_k^*}{dg_i^2} \Big|_{i \neq k} = -2 g_k^{-1} g_i^{-3} \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-2} \left( 1 - g_i^{-1} \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1} \right)$ (2.3.26)

$\frac{d^2 x_k^*}{dg_i^2} \Big|_{i \neq k} = 2 g_k^{-1} g_i^{-1} (x_i^*)^2 (x_i^* - 1)$ (2.3.27)

$(g_m > 0, \; m=1,\ldots,n) \wedge (0 < x_i^* < 1) \Rightarrow \frac{d^2 x_k^*}{dg_i^2} \Big|_{i \neq k} < 0$ (2.3.28)

Observation: $x_k^*$ is a strictly increasing and strictly concave function of $g_i$. From the Jensen inequality, it follows that increasing risk in $g_i$ will decrease the expected value of $x_k^*$. Compare Figure 3.

Figure 3 In this graph, the horizontal axis represents $E(g_1)$, the expected value of $g_1$. Here, $g_1$ is a stochastic variable. There are two possible outcomes, namely $E(g_1) - 1$ and $E(g_1) + 1$, with probabilities ½ and ½ respectively. The vertical axis shows the optimal decision frequency $x_2^*(E(g_1))$ as a function of the expected value of $g_1$, and $E(x_2^*(g_1))$, the expected value of the optimal frequency $x_2^*$ as a function of the value of $g_1$. The graph also includes a linear approximation of $x_2^*(E(g_1))$ based on the values of $x_2^*(E(g_1))$ for $E(g_1) = 1$ and for $E(g_1) = 3$. This linear approximation is equal to $E(x_2^*(g_1))$ for $E(g_1) = 2$. According to the Jensen inequality, $E(x_2^*(g_1)) < x_2^*(E(g_1))$ when $x_2^*(E(g_1))$ is a strictly concave function and $g_1$ is a stochastic variable. This graph illustrates that the Jensen inequality is correct. The graph also illustrates the general conclusion that the expected optimal decision frequency $E(x_2^*(g_1))$ is a strictly decreasing function of the level of risk in $g_1$.
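The directions shown in Figures 2 and 3 can be verified in the same way. This sketch (again with the assumed two-point risk in $g_1$ and $g_2 = 3$) shows that risk raises the expected value of $x_1^*$ and lowers the expected value of $x_2^*$:

```python
def frequencies(g1, g2):
    """Optimal frequencies (x1*, x2*) from (2.2.55) with n = 2."""
    v = 1.0 / (1.0 / g1 + 1.0 / g2)
    return v / g1, v / g2

g2, Eg1 = 3.0, 2.0
lo, hi = Eg1 - 1.0, Eg1 + 1.0    # two-point risk in g1, probabilities 1/2 each

x1_mean, x2_mean = frequencies(Eg1, g2)   # (0.6, 0.4) at E(g1)
x1_lo, x2_lo = frequencies(lo, g2)
x1_hi, x2_hi = frequencies(hi, g2)

print(0.5 * (x1_lo + x1_hi), ">", x1_mean)  # 0.625 > 0.6 (convexity, Figure 2)
print(0.5 * (x2_lo + x2_hi), "<", x2_mean)  # 0.375 < 0.4 (concavity, Figure 3)
```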

Numerical illustration

The general definition of the following illustrative game is given in the preceding section. Let n = 2. A very detailed background and interpretation of this particular game, without the new functions and proofs, is given in Lohmander (2019).13

$A = \begin{bmatrix} g_1 & 0 \\ 0 & g_2 \end{bmatrix} = \begin{bmatrix} 2 & 0 \\ 0 & 3 \end{bmatrix}$ (3.1)

From (2.2.54) we know that:

$x_0^* = \lambda_0^* = y_0^* = \mu_0^* = \left( \sum_{i=1}^{n} g_i^{-1} \right)^{-1}$ (3.2)

$x_0^*$, the expected reward of BLUE, is equal to $y_0^*$, the expected loss of RED, in case both optimize the respective strategies. Using the numerical values of the elements in A, we get:

$x_0^* = \frac{1}{\frac{1}{2} + \frac{1}{3}} = \frac{6}{5} = 1.2$ (3.3)

Hence, the expected value of the game is 1.2. This value is also shown in Figures 4 and 5. The expected value of the game is a decreasing function of the level of risk of $g_1$, which is described in connection to, and illustrated in, Figure 1.

Figure 4 The objective function value $x_0^*$ as a function of the two parameters $(g_1, g_2)$. $x_0^*$ is a strictly increasing function of both parameters.

Figure 5 The optimal objective function value $x_0^*$ as a function of the parameter $g_1$ for alternative values of $g_2$. $x_0^*$ is a strictly increasing and strictly concave function of $g_1$. Furthermore, $x_0^*$ is an increasing function of $g_2$.

From (2.2.55) we know that:

$x_i^* = \lambda_i^* = y_i^* = \mu_i^* = g_i^{-1} \left( \sum_{q=1}^{n} g_q^{-1} \right)^{-1}, \quad i=1,\ldots,n$ (3.4)

For BLUE and RED, the optimal probabilities to select different roads are equal. For BLUE, the optimal probability to select road 1 is $x_1^*$. Via the elements in A, we get:

$x_1^* = y_1^* = \left( \frac{1}{2} \right) x_0^* = 0.6$ (3.5)

$x_2^* = y_2^* = \left( \frac{1}{3} \right) x_0^* = 0.4$ (3.6)

$x_1^*$ is shown in Figures 6 & 7. In Figure 8, the optimal value is illustrated. The expected value of $x_1^*$ is an increasing function of the level of risk in $g_1$, which is shown in Figure 2. For BLUE, the optimal probability to select road 2 is $x_2^*$. In Figure 9, we find that this value is 0.4. Figure 3 illustrates that the expected value of $x_2^*$ is a decreasing function of the level of risk in $g_1$.

Figure 6 The optimal decision frequency $x_1^*$ as a function of the two parameters $(g_1, g_2)$. $x_1^*$ is a strictly decreasing and strictly convex function of $g_1$. $x_1^*$ is a strictly increasing and strictly concave function of $g_2$.

Figure 7 The optimal decision frequency $x_1^*$ as a function of the two parameters $(g_1, g_2)$. $x_1^*$ is a strictly decreasing and strictly convex function of $g_1$. $x_1^*$ is a strictly increasing and strictly concave function of $g_2$. Compare Figure 6, which shows the function from another angle.

Figure 8 The optimal decision frequency $x_1^*$ as a function of the parameter $g_1$ for alternative values of $g_2$. $x_1^*$ is a strictly decreasing and strictly convex function of $g_1$. Furthermore, $x_1^*$ is an increasing function of $g_2$.

Figure 9 The optimal decision frequency $x_2^*$ as a function of the parameter $g_1$ for alternative values of $g_2$. $x_2^*$ is a strictly increasing and strictly concave function of $g_1$. Furthermore, $x_2^*$ is a decreasing function of $g_2$.

The particular results $(x_0^*, x_1^*, x_2^*)$ discussed in this section were also obtained by Lohmander (2019)13 via the traditional game theory approach of linear programming.
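That LP route can be reproduced with standard software. The following sketch (assuming scipy is available; these are not the original computations) solves BLUE's problem (2.1.1)-(2.1.4) for the matrix in (3.1) and recovers the closed-form results (3.3)-(3.6):

```python
import numpy as np
from scipy.optimize import linprog

# BLUE's LP for the game (3.1): max x0  s.t.  x1 + x2 <= 1,
# x0 <= 2*x1,  x0 <= 3*x2,  x1 >= 0,  x2 >= 0.
# Variables ordered as [x0, x1, x2]; linprog minimizes, so the objective is -x0.
c = np.array([-1.0, 0.0, 0.0])
A_ub = np.array([
    [0.0, 1.0, 1.0],    # x1 + x2 <= 1
    [1.0, -2.0, 0.0],   # x0 - g1*x1 <= 0
    [1.0, 0.0, -3.0],   # x0 - g2*x2 <= 0
])
b_ub = np.array([1.0, 0.0, 0.0])
bounds = [(None, None), (0.0, None), (0.0, None)]  # x0 free; x1, x2 >= 0

res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds)
x0, x1, x2 = res.x
print(round(x0, 6), round(x1, 6), round(x2, 6))  # 1.2 0.6 0.4
```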

Conclusion

In this paper, the two player zero sum games with diagonal game matrixes, TPZSGD, are analyzed. Many important applications of this particular class of games are found in military decision problems, in customs and immigration strategies and in police work. Explicit functions are derived that give the optimal frequencies of different decisions and the expected results of relevance to the different decision makers. Arbitrary numbers of decision alternatives are covered. It is proved that the derived optimal decision frequency formulas correspond to the unique optimization results of the two players. It is proved that the optimal solutions, for both players, always lead to a unique completely mixed strategy Nash equilibrium. For each player, the optimal frequency of a particular decision is strictly greater than 0 and strictly less than 1. With comparative statics analyses, the directions of the changes of optimal decision frequencies and expected game values, as functions of changes in different parameter values, are determined. Some of the derived formulas are used to confirm earlier game theory results presented in the literature. It is demonstrated that the new functions can be applied to solve a typical military decision problem and that the new functions make it possible to draw clear conclusions concerning issues that could not earlier be resolved via linear programming solutions. With the new approach developed here, it is possible to determine the directions of change of the expected value of the objective function and of the optimal frequencies of the different decision alternatives, under the influence of increasing risk in the game matrix elements. Such game matrix elements are never known with certainty in real applications. Hence, this new approach leads to more relevant results than those that can be obtained with earlier methods.

Funding

None.

Acknowledgments

None.

Conflicts of interest

The author declares that there are no conflicts of interest.

References

  1. Luce RD, Raiffa H. Games and decisions: Introduction and Critical Survey. USA: Dover; 1957.
  2. Nash JF. Equilibrium points in n–person games. Proceedings of the National Academy of Sciences of the United States of America. 1950;36(1):48–49.
  3. von Neumann J. A numerical method to determine optimum strategy. Naval Research Logistics Quarterly. 1954;1(2):109–115.
  4. Dresher M. Games of strategy, theory and application. USA: RAND Corporation; 1961.
  5. Chiang AC. Fundamental methods of mathematical economics. 2nd ed. Japan: McGraw-Hill; 1974.
  6. Isaacs R. Differential games; a mathematical theory with applications to warfare and pursuit, control and optimization. New York: Wiley; 1965.
  7. Braun M. Differential equations and their applications. New York: Springer; 1983.
  8. Lohmander P. Applications and Mathematical Modeling in Operations Research. In: Cao BY, editor. Fuzzy Information and Engineering and Decision. Switzerland: Springer International Publishing AG; 2018. p. 46–53.
  9. Washburn AR. Two–person zero–sum games. 3rd ed. INFORMS; 2003. 136 p.
  10. Lohmander P. The constrained probability orbit of mixed strategy games with marginal adjustment: General theory and timber market application. Systems Analysis Modelling Simulation (SAMS). 1997;29:27–55.
  11. Herings PJJ, Peeters RJAP. Stationary equilibria in stochastic games: Structure, selection and computation. Journal of Economic Theory. 2004;118:32–60.
  12. Babu S, Krishnamurthy N, Parthasarathy T. Stationarity, completely mixed and symmetric optimal and equilibrium strategies in stochastic games. International Journal of Game Theory. 2017;46(3):761–782.
  13. Lohmander P. Four central military decision problems, general methods and solutions. The Royal Swedish Academy of War Sciences Proceedings and Journal. 2019;(2):119–134.
  14. Lohmander P. Continuous extraction under risk. Systems Analysis Modelling Simulation (SAMS). 1988;5(2):131–151.
©2019 Lohmander. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and build upon your work non-commercially.