Perfect Bayesian equilibrium
In game theory, a Perfect Bayesian Equilibrium is an equilibrium concept relevant for dynamic games with incomplete information. It is a refinement of Bayesian Nash equilibrium. A PBE has two components - strategies and beliefs:
- The strategy of a player in given information-set determines how this player acts in that information-set. The action may depend on the history. This is similar to a sequential game.
- The belief of a player in a given information-set determines what node in that information-set the player believes that they are playing at. The belief may be a probability distribution over the nodes in the information-set. Formally, a belief system is an assignment of probabilities to every node in the game such that the sum of probabilities in any information set is 1.
- Sequential rationality: each strategy should be optimal in expectation, given the beliefs.
- Consistency: each belief should be updated according to the strategies and Bayes' rule, in every path of positive probability.
PBE in signaling games
A signaling game is the simplest kind of a dynamic Bayesian game. There are two players, one of them has only one possible type, and the other has several possible types. The sender plays first, then the receiver.To calculate a PBE in a signaling game, we consider two kinds of equilibria: a separating equilibrium and a pooling equilibrium. In a separating equilibrium each sender-type plays a different action, so the sender's action gives information to the receiver; in a pooling equilibrium, all sender-types play the same action, so the sender's action gives no information to the receiver.
Gift game 1
Consider the following game:- The sender has two possible types: either a "friend" or an "enemy". Each type has two strategies: either give a gift, or not give.
- The receiver has only one type, and two strategies: either accept the gift, or reject it.
- The sender's utility is 1 if their gift is accepted, -1 if their gift is rejected, and 0 if they do not give any gift.
- The receiver's utility depends on who gives the gift:
- * If the sender is a friend, then the receiver's utility is 1 or 0.
- * If the sender is an enemy, then the receiver's utility is -1 or 0.
- The sender's strategy is: a friend gives and an enemy does not give. The receiver's beliefs are updated accordingly: if they receive a gift, they know the sender is a friend; otherwise, they know the sender is an enemy. So receiver's strategy is: accept. This is NOT an equilibrium, since the sender's strategy is not optimal: an enemy sender can increase their payoff from 0 to 1 by sending a gift.
- The sender's strategy is: a friend does not give and an enemy gives. The receiver's beliefs are updated accordingly: if they receive a gift, they know the sender is an enemy; otherwise, they know the sender is a friend. The receiver's strategy is: reject. Again, this is NOT an equilibrium, since the sender's strategy is not optimal: an enemy sender can increase their payoff from -1 to 0 by not sending a gift.
Now, let's look at the following potential pooling equilibria:
- The sender's strategy is: always give. The receiver's beliefs are not updated: they still believe in the a-priori probability, that the sender is a friend with probability and an enemy with probability. Their payoff from accepting is, so they accept if-and-only-if. So this is a PBE if-and-only-if the apriori probability for being a friend satisfies.
- The sender's strategy is: never give. Here, the receiver's beliefs when receiving a gift can be arbitrary, since receiving a gift is an event with probability 0, so Bayes' rule does not apply. For example, suppose the receiver's beliefs when receiving a gift is that the sender is a friend with probability 0.2. The receiver's strategy is: reject. This is a PBE regardless of the apriori probability. Both the sender and the receiver get expected payoff 0, and neither of them can improve the expected payoff by deviating.
- If, then there are two PBEs: either the sender always gives and the receiver always accepts, or the sender always does not give and the receiver always rejects.
- If, then there is only one PBE: the sender always does not give and the receiver always rejects. This PBE is not Pareto efficient, but this is inevitable, since the sender cannot reliably signal their type.
Gift game 2
- If the sender is a friend, then the receiver's utility is 1 or 0.
- If the sender is an enemy, then the receiver's utility is 0 or -1.
Similarly to example 1, there is no separating equilibrium. Let's look at the following potential pooling equilibria:
- The sender's strategy is: always give. The receiver's beliefs are not updated: they still believe in the a-priori probability, that the sender is a friend with probability and an enemy with probability. Their payoff from accepting is always higher than from rejecting, so they accept. This is a PBE - it is a best-response for both sender and receiver.
- The sender's strategy is: never give. Suppose the receiver's beliefs when receiving a gift is that the sender is a friend with probability, where is any number in. Regardless of, the receiver's optimal strategy is: accept. This is NOT a PBE, since the sender can improve their payoff from 0 to 1 by giving a gift.
- The sender's strategy is: never give, and the receiver's strategy is: reject. This is NOT a PBE, since for any belief of the receiver, rejecting is not a best-response.
To summarize: in this variant of the gift game, there are two SPEs: either the sender always gives and the receiver always accepts, or the sender always does not give and the receiver always rejects. From these, only the first one is a PBE; the other is not a PBE since it cannot be supported by any belief-system.
More examples
For further examples, see signaling game#Examples. See also for more examples.PBE in multi-stage games
A multi-stage game is a sequence of simultaneous games played one after the other. These games may be identical or different.Repeated public-good game
The following game is a simple representation of the free-rider problem. There are two players, each of whom can either build a public good or not build. Each player gains 1 if the public good is built and 0 if not; in addition, if player builds the public good, they have to pay a cost of. The costs are private information - each player knows their own cost but not the other's cost. It is only known that each cost is drawn independently at random from some probability distribution. This makes this game a Bayesian game.In the one-stage game, each player builds if-and-only-if their cost is smaller than their expected gain from building. The expected gain from building is exactly 1 times the probability that the other player does NOT build. In equilibrium, for every player, there is a threshold cost, such that the player contributes if-and-only-if their cost is less than. This threshold cost can be calculated based on the probability distribution of the players' costs. For example, if the costs are distributed uniformly on, then there is a symmetric equilibrium in which the threshold cost of both players is 2/3. This means that a player whose cost is between 2/3 and 1 will not contribute, even though their cost is below the benefit, because of the possibility that the other player will contribute.
Now, suppose that this game is repeated two times. The two plays are independent, i.e., each day the players decide simultaneously whether to build a public good in that day, get a payoff of 1 if the good is built in that day, and pay their cost if they built in that day. The only connection between the games is that, by playing in the first day, the players may reveal some information about their costs, and this information might affect the play in the second day.
We are looking for a symmetric PBE. Denote by the threshold cost of both players in day 1. To calculate, we work backwards and analyze the players' actions in day 2. Their actions depend on the history, and there are three options:
- In day 1, no player built. So now both players know that their opponent's cost is above. They update their belief accordingly, and conclude that there is a smaller chance that their opponent will build in day 2. Therefore, they increase their threshold cost, and the threshold cost in day 2 is.
- In day 1, both players built. So now both players know that their opponent's cost is below. They update their belief accordingly, and conclude that there is a larger chance that their opponent will build in day 2. Therefore, they decrease their threshold cost, and the threshold cost in day 2 is.
- In day 1, exactly one player built; suppose it is player 1. So now, it is known that the cost of player 1 is below and the cost of player 2 is above. There is an equilibrium in which the actions in day 2 are identical to the actions in day 1 - player 1 builds and player 2 does not build.