LI Shouyi,CHEN Mou,WU Qingxian,and WANG Yuhui
College of Automation Engineering,Nanjing University of Aeronautics and Astronautics,Nanjing 211100,China
Abstract: The threat sequencing of multiple unmanned combat air vehicles (UCAVs) is a multi-attribute decision-making (MADM)problem.In the threat sequencing process of multiple UCAVs,due to the strong confrontation and high dynamics of the air combat environment,the weight coefficients of the threat indicators are usually time-varying.Moreover,the air combat data is difficult to be obtained accurately.In this study,a threat sequencing method of multiple UCAVs is proposed based on game theory by considering the incomplete information.Firstly,a zero-sum game model of decision maker (D) and nature (N)with fuzzy payoffs is established to obtain the uncertain parameters which are the weight coefficient parameters of the threat indicators and the interval parameters of the threat matrix.Then,the established zero-sum game with fuzzy payoffs is transformed into a zero-sum game with crisp payoffs (matrix game) to solve.Moreover,a decision rule is addressed for the threat sequencing problem of multiple UCAVs based on the obtained uncertain parameters.Finally,numerical simulation results are presented to show the effectiveness of the proposed approach.
Keywords: threat sequencing,multiple unmanned combat air vehicles (UCAVs),multi-attribute decision-making (MADM),game theory,incomplete information.
With the rapid development of information technology and the gradual deepening of the world’s military revolution,the unmanned combat air vehicle (UCAV) has gradually become an important entity in modern air combat[1,2].Compared with manned aircraft,UCAV has many combat advantages,such as flexible use,rapid response,low cost,and zero casualties [3].Since multiple UCAVs coordinated operations have greater offensive advantages,they often participate in combat as a swarm [4,5].However,when the UCAVs are hired by the enemy,the largescale scattered UCAV targets will bring great challenges to our attack and defense.We know that a reasonable threat sequencing of multiple UCAVs is a prerequisite for the attack-defense decision-making problem.Therefore,it is crucial to study an efficient threat sequencing approach for multiple UCAVs.
In fact,the threat sequencing problem of multiple UCAVs means to convert several threat attribute values into the comprehensive ones,so as to sequence the threat targets based on the comprehensive threats [6,7].Essentially,the threat sequencing of multiple UCAVs is a multiattribute decision-making (MADM) problem.However,due to the high complexity and strong dynamics of the air combat environment,it is difficult to obtain the complete information needed for threat sequencing.In the threat sequencing problem,the incomplete information is mainly manifested in two aspects: (i) the time-varying weight coefficients of different threat indicators;(ii) the inaccurate air combat data.The incompleteness of information brings great challenges to the threat sequencing problem,which is seldom considered in the existing literature [8].Therefore,it is of far-reaching significance to develop an effective method for the threat sequencing problem of multiple UCAVs under incomplete information conditions [9,10].
In the existing literature,the methods to obtain the weights of an MADM problem can be classified into the subjective weighting methods,the objective weighting methods,and the combined weighting methods.The subjective weighting methods have a long history,but their subjective randomness is greater.Commonly used subjective weighting methods include the language measurement method [11],the Delphi method [12],and the analytic hierarchy process (AHP) [13].The objective weighting methods have a stronger mathematical theoretical basis than the subjective weighting methods,and their calculation process is relatively complicated.However,as entirely data-based approaches,they do not take into account the perception of the decision maker.Commonly used objective weighting methods include the entropy method [14],the dispersion maximization method [15],the principal component analysis method [16],and the multiple targets programming method [17].Compared with the previous two methods,the combined weighting method can both take into account the importance of the decision maker’s perception and the numerical rules contained in decision data.Essentially,the combination weighting method is the fusion of multiple individual weights obtained by different subjective and objective weightings [18].Although many methods have been proposed to determine the weights of an MADM problem with inaccurate data,their strong subjectivity and poor flexibility cannot be well applied to the actual threat sequencing of multiple UCAVs in the high dynamics and strong confrontation environment.
Game theory is an effective mathematical tool to study the strategic interaction of two or more decision makers.Since Von Neumann et al.[19] started the study of the game theory,it has been applied in the fields of economics,political science,international relations,engineering,computer science,military,biological evolution,etc [20–22].In [23],a matrix game model was established to generate maneuvering decisions for low-flying aircraft during one-on-one air combat over hilly terrain.The maneuvering decisions were made by comparing scores of two aircrafts ’ orientation,range,velocity,and terrain clearance.In [24],a two-person cooperative game was presented to achieve user cooperation diversity for time division multiple access (TDMA) based commercial cooperative communication networks.It was proved that the game is indeed a two-person bargaining problem,which had a unique Nash equilibrium solution.In [25],a stochastic differential game approach was proposed to model the duopolistic competition with sticky prices,and its feedback Nash equilibrium was analytically derived.In [26],a zero-sum game approach was proposed,which was an innovative approach for MADM problems.
In this study,borrowing the idea from [26],we construct a virtual person Nature (N) to determine the uncertain parameters (the weight coefficient parameters of the threat indicators and the interval parameters of the threat matrix),which acts as an “intelligent enemy” for decision maker (D).This is based on the following consideration: in a system with a lack of prior knowledge,D has no knowledge of the uncertain parameters at all,so these parameters are considered to be determined by N.Moreover,the threat sequencing process is modeled as a zerosum game of D and N ,where D chooses a probability distribution over the threat targets to maximize its payoff,and N chooses an uncertain parameter to minimize D’s payoff.Nash equilibrium,a widely adopted solution concept in the game theory,is used to represent the uncertain parameters and the decisions of D.
Note that in [26],it was assumed that the real values of different threat indicators are determined by the same interval parameter.However,in air combat,these threat indicators are independent of each other.Moreover,the uncertainty of threat data comes from the measurement errors of the sensors,electronic countermeasures,electromagnetic interference,etc.As a consequence,the interval parameters of these threat indicators have no correlation.On this account,we assume that the real values of different threat indicators are determined by different interval parameters,which is more in line with the actual air combat.Then,the established zero-sum game is transformed into a matrix game to solve.Finally,the threat sequencing result is given based on the solved Nash equilibrium.
The main contributions of this study are stated as follows:
(i) A threat sequencing method of multiple UCAVs that considers incomplete information (the time-varying weight coefficients of different threat indicators,and the inaccurate air combat data) is proposed.
(ii) A zero-sum game model of D and N with fuzzy payoffs is addressed for the threat sequencing problem of multiple UCAVs with incomplete information,whereD chooses a probability distribution over the threat targets to maximize its threat,and N chooses the incomplete information parameters to minimize D’s threat.
(iii) The established zero-sum game with fuzzy payoffs is solved by transforming it into a zero-sum game with crisp payoffs (matrix game).Therefore,the threat sequencing result of multiple UCAVs is obtained under incomplete information conditions.
The following sections of this study are organized as follows: In Section 2,the threat sequencing problem of multiple UCAVs is first described,then some basic concepts of interval numbers and zero-sum games are reviewed.In Section 3,a zero-sum game model with fuzzy payoffs is established for our threat sequencing problem,and a solution method is subsequently proposed for the established game.In Section 4,the effectiveness of the proposed method is verified by numerical simulations.Section 5 presents the conclusions.
Consider the following scenario: suppose that in the air combat,the Red hasmUCAVs denoted asR1,R2,···,Rm;the Blue hasnUCAVs denoted asB1,B2,···,Bn.For the Red,the threats of the Blue UCAVs can be characterized by five threat indicators: angle threat,speed threat,height threat,distance threat,and air combat capability threat.These threat indicators can be calculated by the approaches in [27–30],and the detailed expressions are omitted here to simplify the related descriptions.
We consider the threat sequencing problem ofB1,B2,···,Bnwith time-varying weight coefficients of different threat indicators and interval threat data.The real threat value of thejth attribute ofBiis considered to be a value in interval.As discussed above,the real values of different threat indicators are determined by different interval parameters.If we denote the interval parameter of attributejas λj(j=1,2,···,5),then the threat matrix has the following form:
where λ=(λ1,λ2,···,λ5),λj∈[0,1] (j=1,2,···,5).
Our goal is to give the threat sequencing ofB1,B2,···,Bnby obtaining the time-varying weight coefficient parameter ω and the interval parameter λ of the threat matrix.
To facilitate the following discussion,we first give the operation rules of interval numbers.If we denoteas two interval numbers,andgis a function,then the four arithmetic,the measure operatorm,and the function operations of interval numbers [31,32] are given as follows:
In this part,the basic concepts of two-person zero-sum games are introduced,then the solution algorithm and some properties of zero-sum games are reviewed [33,34].
As a class of non-cooperative games,two-person zerosum games (also called matrix games) are widely used in various decision-making scenarios [35].Usually,a twoperson zero-sum game of D and N [33] can be expressed by an×qmatrix:
wherebik(i=1,2,···,n;k=1,2,···,q) is D’s payoff when D chooses itsith pure strategy and N choose itskth pure strategy,and N’s payoff is -bikunder the same strategies choice.
and the expected payoff of N is defined as -U′(γ,π).In the game theory,Nash equilibrium is a widely adopted solution concept.Formally,a strategy vector [γ*,π*] is called a Nash equilibrium of the matrix game in (9),if the following inequalities hold for each D’s mixed strategy γ,and each N’s mixed strategy π:
It is well known that the matrix game in (9) can be solved by solving a pair of dual linear programming,and this fact is given by the following lemma:
Remark 1The above lemma requires that the elements of the payoff matrixBare positive.If there is a non-positive situation,we can add an enough large positive constant for each element ofBto guarantee the elements of the newly obtained matrixpositive.Then,the Nash equilibria ofBandis exactly the same [36].
In this section,a threat sequencing method is proposed for the threat sequencing problem of multiple UCAVs with incomplete information.First,a zero-sum game model of D and N with fuzzy payoffs is established to obtain the weight coefficient parameters of different indicators and the interval parameters of the threat matrix.Then,the established zero-sum game with fuzzy payoffs is transformed into a zero-game with crisp payoffs (the matrix game) to solve.Finally,a decision rule for the threat sequencing of multiple UCAVs is given.
In the threat sequencing problem of multiple UCAVs with time-varying weight coefficient parameters of different threat indicators and interval parameters of the threat matrix,we introduce a virtual person: N,which is regarded as an “intelligent enemy” of D.The goal ofN is to minimize D’s threat by selecting the weight coefficient parameter ω of different threat indicators,and the interval parameter λ of the threat matrix.Since D is uncertain about what parameters N will choose,one will choose a probability distribution over the UCAVsB1,B2,···,Bnto maximize the threat.
On the basis of [26],we establish a zero-sum gamefor the threat sequencing problem of multiple UCAVs,where
(i) D is the maximizing player.It chooses a probability distribution γ over the set of its threat targetsB1,B2,···,Bnto maximize its payoff.
(ii) N is the minimizing player.It chooses the weight coefficient parameter ω of different threat indicators and interval parameter λ of the threat matrix to minimize D’s payoff.
(iii) Γ is the strategy set of D.It is the set of all probability distributions over its threat targetsB1,B2,···,Bn.Γ can be written as follows:
is the set of all weight coefficients of the five threat indicators,and
is the set of all possible interval parameters of threat matrix.
(v)is the interval payoff matrix of D.The elements ofare all interval numbers,as expressed in (1).
(vi)Uis the expected payoff function of D .WhenD chooses a strategy γ ∈Γ ,and N chooses a strategythe expected payoffU(γ,ω,λ) of D is defined as
whereBλis the crisp payoff matrix determined by the parameter λ and the interval payoff matrix,as shown in (2).Since N is the “intelligent enemy” of D,the expected payoff of N under strategy vector (γ,ω,λ) is defined as -U(γ,ω,λ).
Remark 2The goal of D is to maximize its threat,and the goal of N is to minimize D’s threat,so the“threat” is treated as the “payoff” in the game model.
Remark 3The strategy set of N is not defined as Ω×Λ,but we impose a restriction on it: if ωj=0,then λj=0.This is for the convenience of proving the conclusion later.In fact,this restriction is reasonable,because if ωj=0,then N thinks that the indexjhas no value,so there is no need to materialize the data of indexj.Therefore,when ωj=0,setting λj=0 has no effect on the threat sequencing problem.
Remark 4In this paper,Nash equilibrium is used as the uncertainty parameters and the decision of D.In fact,for the considered threat sequencing problem,the Nash equilibrium solution is optimal for both D and N,which can be illustrated as follows: if (γ*,ω*,λ*) is a Nash equilibrium of the gameG,by definition,one can haveU(γ*,ω*,λ*)≥U(γ*,ω*,λ*). In other words,this approach can ensure that the payoff of D is not less thanU(γ*,ω*,λ*),regardless of whether the uncertainty parameter is (ω*,λ*) or not.
The zero-sum gameGestablished above is special,where the energy strategy of N consists of two parameters,ω and λ.Therefore,it cannot be solved by the traditional zero-sum game solving algorithm.In the following subsection,Gis transformed into a matrix gameG′to solve.
In this subsection,the matrix gameG′is constructed based onG,and the relationship between the Nash equilibria ofGandG′is established.Thus,Gcan be solved by solvingG′.
In the following,we construct a matrix gameG′=〈D,N,Γ,Π,,U′〉 based onG,where
(i) D is the maximizing player,as defined inG.
(ii) N is the minimizing player,as defined inG.
(iii) Γ is the strategy set of D.It is the same as D’s strategy set defined inG,as shown in (13).
(iv) Π is the strategy set of N.It is a set of probability distributions,given as follows:
(v)is the payoff matrix of D . It is ann×10 matrix composed ofBLandBU,which has the following form:
(vi)U′is the expected payoff function of D.The payoff of D under the strategy vector (γ,π) is given by
and the payoff of N under (γ,π) is defined as -U′(γ,π).
Obviously,G′can be solved by the traditional zerosum game solving algorithm.In the following,the relationship between the Nash equilibria ofGandG′is established,so as to solveGby solvingG′.
Firstly,we define a mappingffromto Π as follows:
The relationship of the Nash equilibria inGandG′is established by the following theorem:
Theorem 2If (γ*,ω*,λ*) is a Nash equilibrium ofG,then (γ*,f(ω*,λ*)) is a Nash equilibrium ofG′;conversely,if (γ*,π*) is a Nash equilibrium ofG′,then(γ*,f-1(π*)) is a Nash equilibrium ofG.
ProofIf (γ*,ω*,λ*) is a Nash equilibrium ofG,it holds that
Namely,we proved that (γ*,f-1(π*)) is a Nash equilibrium ofG.□
In the above discussion,the weight coefficient parameters ω*of different threat indicators and interval parameters λ*of the threat matrix are calculated by solving the established game model.In the following,a decision rule is given for the threat sequencing problem of multiple UCAVs based on the calculated parameters.
In this subsection,a decision rule is presented for the threat sequencing of multiple UCAVs.
Remark 5The threat ofBiis defined as the expected threat ofBiwhen D and N choose their Nash equilibrium strategies,respectively.
Based on the threat degree equation,all threat degrees of the threat targets can be obtained.It is obvious that the higher the threat degreeT(Bi),the greater the threat ofBi.As a consequence,the threat sequencing result is given according to the threat degrees of these targets.
The flowchart of the proposed threat sequencing method is shown in Fig.1,and the procedure of threat sequencing for air targetsB1,B1,···,Bnis given as follows:
Fig.1 Flowchart of threat sequencing of multiple UCAVs
Step 1Based on the interval air combat data obtained by various sensors,calculate the interval threat matrixB? in (1) according to the threat indicator calculation formulas in [27–29].
Step 2Construct the matrixbased on the threat matrix,as shown in (16).
Step 3Calculate the Nash equilibrium (γ*,π*) ofG′by Lemma 1.
Step 4According to Theorem 2,the Nash equilibrium (γ*,ω*,λ*)ofGis given as follows:
wheref-1is given in (29)–(31).Thus the weight coefficient parameter ω*and the interval parameter λ*are obtained.
Step 5Calculate the threat degreesT(Bi) of UCAVBi(i=1,2,···,n)according to (32).In consequence,the threat sequencing result ofB1,B2,···,Bnis obtained according to the threat degrees of these targets.
In this section,an example of one-to-four UCAVs air combat is given to illustrate the proposed method.Moreover,comparative analyses are conducted to show the effectiveness of the proposed method.
Suppose that for our UCAVR0,there are four enemy UCAVs,denoted asB1,B2,B3,B4,and their parameter values are presented in Table 1.
Table 1 Parameters of Bi and R0
Continued
The threat sequencing process ofB1,B2,B3,B4is given as follows:
Step 1The interval threat matrixcan be obtained by the threat indicators calculation formulas in [27-30],and interval number calculation rules in (3)-(8),which is given as
Step 2According to (16),which is also shown in Fig.2,construct matrixas foll ows:
Fig.2 Threat matrix of D in G′
Step 3Obviously,the elementsbijof matrixare all positive.According to Lemma 1,we first solve the following dual linear programming:
We can get that the followingandare the optimal solutions of (34) and (35),respectively:
Step 4According to Theorem 2,the weight coefficient parameter ω*and interval parameter λ*can be calculated by the formula (ω*,λ*)=f-1(π*),where the definition off-1is given in (29)-(31),thus we obtain
Step 5According to (32),the threat degreesT(Bi)(i=1,2,···,n)are calculated as
Therefore,the threat sequencing result is given as follows:
where ? means “is preferred to”.
For the threat sequencing problem of multiple UCAVs,the weight coefficient parameter of different threat indicators are usually changeable with time-varying,and the threat matrix is in the form of intervals.To calculate the weight coefficient parameter ω and interval parameter λ,a zero-sum game model of D and N with fuzzy payoffs is proposed in this study.In the proposed game,N is regarded as a spoiler,who minimizes D’s threat by choosing the weight coefficient parameter ω and interval parameter λ.In fact,in the traditional method,when the system is completely unknown,and there is no prior knowledge,the parameters ω and λ cannot be obtained by a convincing theory.However,in the proposed method,the parameters ω*and λ*are obtained via the game theory and is therefore reasonable.
In the above numerical example,for the calculated value of ω*and λ*in (38) and (39),we give 100 random values of γl(l=1,2,···,100).Under these parameters,the threat degreesU(γl,ω*,λ*) (l=1,2,···,100) are presented in Fig.3,which shows that the D’s choice ofγ*in (36) is the target with the greatest threat value.In fact,the meaning of threat sequencing is to obtain the threat degrees of the targets.It is reasonable to select the largest threat target.Our method is to perform threat sequencing on the basis of selecting the largest threat target,which shows the effectiveness of our method.
Fig.3 Threat degrees of different γl under ω* and λ*
In multi-attribute decision-making problems,the technique for order preference by similarity to an ideal solution (TOPSIS) method is often used to select the best solution from alternatives with multiple attributes.A threat sequencing method based on the TOPSIS method was given in [37].As a comparison of our method,in the following,the TOPSIS method is applied to conduct threat sequencing forB1,B2,B3,B4,and the calculation steps are as follows:
Step 1The fuzzy decision matrix is shown in (33).
Step 2Determine the positive negative ideal solution and the negative ideal solution as
Step 3Calculate the relative closeness coefficient of threat targetBias
Step 4Rank the threat targetB1,B2,B3,B4according to the relative closeness coefficients.Therefore,the threat sequencing result based on the TOPSIS method isB3?B2?B4?B1.
The result shows that our threat sequencing result isB2?B4?B3?B1,while the result of the TOPSIS method isB3?B2?B4?B1.This is because the TOPSIS method does not distinguish the importance of different threat indicators,and treats the weight coefficient of each indicator as the same.Hence,B3has the greatest threat among the four targets.In our method,N is endowed with wisdom.Under its motivation to obtain the most payoffs,the weight coefficients of the threat indicators are ω*=(0,0,0.85,0.085 3,0.064 6) and the uncertain parameters are λ*=(0,0,1,1,1).Note that in the threat sequencing problem of multiple UCAVs,the time-varying weight coefficients and interval parameters are difficult to effectively deal with by the traditional methods.It should be pointed out that the theory that we have established gives an effective theoretical support for this kind of problem,thereby ensuring the validity and rationality of our method.
In this study,for the threat sequencing problem of multiple UCAVs,a zero-sum game model of D and N with fuzzy payoffs has been proposed to obtain the weight coefficient parameters of the threat indicators and the interval parameters of the threat matrix.Moreover,a novel technique has been proposed to solve the established zero-sum game with fuzzy payoffs,whereby the final threat sequencing result has been provided.Finally,the effectiveness of the proposed method has been verified by numerical simulations.Future work will aim to extend the proposed method to more similar scenarios for MADM problems with incomplete information.
Journal of Systems Engineering and Electronics2022年4期