《博弈论》(英文版) lect 16 Applications of Sequential and Perfect Bayesian Equilibrium

Eco514-Game Theory lecture 16: Applications of Sequential and Perfect Bayesian equilibrium Marciano siniscalchi November 16. 1999 Introduction The purpose of this lecture is to help you familiarize with the workings of sequential equi- librium and"sequential equilibrium lite", i.e. perfect Bayesian equilibrium The main focus is the "reputation"result of Kreps and Wilson(1982). You should refer to OR for details and definitions: I am following the textbook quite closely We have already mentioned the Entry Deterrence game. Now consider a K-fold repetition of the game in which an incumbent faces a sequence of K(potential) entrants. It is clear hat, in the subgame in which the last entrant gets a chance to play, the incumbent will Concede. But then, by backward induction, he will also Concede at any previous round This conclusion seems unpalatable. One would expect the chain-store to fight a few times early on so as to deter further entry; yet, this simply cannot occur in equilibrium The Chain-Store Game with Imperfect Information Kreps and Wilson suggest the following way out of the Chain Store Paradox. Assume that with some small probability e >0, the incumbent is"Tough: that is, she really enjoys beating up entrants Consider the behavior of a"Regular"incumbent; clearly, when she faces the last entrant she will Concede if the latter moves In. However, in previous rounds, she may threaten to Fight entry; in this case, if the current entrant moves In and is met with a fight, the following entrant does not learn whether the incumbent is Regular or Tough, because both Fight enty Hence, the posterior probability that the incumbent is Tough remains constant at E, and if this value is such that the following Entrant prefers not to Enter, the incumbent's threat is credible
Eco514—Game Theory Lecture 16: Applications of Sequential and Perfect Bayesian Equilibrium Marciano Siniscalchi November 16, 1999 Introduction The purpose of this lecture is to help you familiarize with the workings of sequential equilibrium and “sequential equilibrium lite”, i.e. perfect Bayesian equilibrium. The main focus is the “reputation” result of Kreps and Wilson (1982). You should refer to OR for details and definitions: I am following the textbook quite closely. We have already mentioned the Entry Deterrence game. Now consider a K-fold repetition of the game in which an incumbent faces a sequence of K (potential) entrants. It is clear that, in the subgame in which the last entrant gets a chance to play, the incumbent will Concede. But then, by backward induction, he will also Concede at any previous round. This conclusion seems unpalatable. One would expect the chain-store to fight a few times early on so as to deter further entry; yet, this simply cannot occur in equilibrium. The Chain-Store Game with Imperfect Information Kreps and Wilson suggest the following way out of the Chain Store Paradox. Assume that, with some small probability > 0, the incumbent is “Tough”: that is, she really enjoys beating up entrants. Consider the behavior of a “Regular” incumbent; clearly, when she faces the last entrant, she will Concede if the latter moves In. However, in previous rounds, she may threaten to Fight entry; in this case, if the current entrant moves In and is met with a fight, the following entrant does not learn whether the incumbent is Regular or Tough, because both Fight entry. Hence, the posterior probability that the incumbent is Tough remains constant at , and if this value is such that the following Entrant prefers not to Enter, the incumbent’s threat is credible. 1

hings are not quite so simple in actuality, but Kreps and wilson prove that something very close to this can be supported in a sequential equilibrium; indeed, while there exist other sequential equilibria in which no "reputation"is maintained, there exists a simple restriction on the entrants'beliefs which ensures that the reputation outcome obtains. OR analyze the situation using the notion of PBE, and do not investigate outcome uniqueness. I will follow their treatment, emphasizing certain key steps Model Payoffs in the stage game are as follows 1 In 2 C 0.a b-1, fe Figure 1:0=01=R, T), a>1>b>0, CR=fT=0, fR=CT=-1 To understand the equilibrium strategies, observe first that, if the entrant expect the incumbent to Fight with probability b, her expected payoff from entry equals b(1-b)+(6 )b=0; hence, she is indifferent between staying Out and going In Now, the Tough incumbent will always Fight in the Pbe under consideration. This implies, among other things, that as soon as an entrant observes C in any previous round, he will conclude that the incumbent is Regular, and therefore will enter; so will every subsequent entrant Consider the problem faced by the Regular incumbent who observes entry at some stage k< K. If she Concedes, she gets 0 in the current period, but faces entry thereafter-and hence, by best-responding, she can secure a payoff of no more than 0. Now suppose that she Fights, and that this leads the next entrant to revise her beliefs in such a way that he is indifferent between In and Out; suppose further that, this being the case, the next entrant chooses Out with probability Then, by Fighting the current entrant(and best-responding thereafter), the incumbent also gets a continuation payoff of 0 Thus, following entry, the incumbent is indifferent between F and C. Hence, he entrant will be such that he is indeed indifferent between going In and staying Out o randomize, and the probability of C can be chosen so that the posterior beliefs of th Notice that this argument relies heavily on the equilibrium assumption!
Things are not quite so simple in actuality, but Kreps and Wilson prove that something very close to this can be supported in a sequential equilibrium; indeed, while there exist other sequential equilibria in which no “reputation” is maintained, there exists a simple restriction on the entrants’ beliefs which ensures that the reputation outcome obtains. OR analyze the situation using the notion of PBE, and do not investigate outcome uniqueness. I will follow their treatment, emphasizing certain key steps. Model Payoffs in the stage game are as follows: b, cθ q 1 0,a Out In q 2 F C b − 1, fθ Figure 1: Θ = Θ1 = {R, T}, a > 1 > b > 0, cR = fT = 0, fR = cT = −1. To understand the equilibrium strategies, observe first that, if the entrant expect the incumbent to Fight with probability b, her expected payoff from entry equals b(1 − b) + (b − 1)b = 0; hence, she is indifferent between staying Out and going In. Now, the Tough incumbent will always Fight in the PBE under consideration. This implies, among other things, that as soon as an entrant observes C in any previous round, he will conclude that the incumbent is Regular, and therefore will enter; so will every subsequent entrant. Consider the problem faced by the Regular incumbent who observes entry at some stage k < K. If she Concedes, she gets 0 in the current period, but faces entry thereafter—and hence, by best-responding, she can secure a payoff of no more than 0. Now suppose that she Fights, and that this leads the next entrant to revise her beliefs in such a way that he is indifferent between In and Out; suppose further that, this being the case, the next entrant chooses Out with probability 1 a . Then, by Fighting the current entrant (and best-responding thereafter), the incumbent also gets a continuation payoff of 0. Thus, following entry, the incumbent is indifferent between F and C. Hence, he can randomize, and the probability of C can be chosen so that the posterior beliefs of the next entrant will be such that he is indeed indifferent between going In and staying Out. [Notice that this argument relies heavily on the equilibrium assumption!] 2

Equilibrium Refer to OR for definitions The basic idea is as follows The first k*-1 entrants will stay Out, where k*= mink: b-k+1>6. Hence, nothing will be learned about the incumbents type in equilibrium during the first k*-1 rounds: t(h)< h* implies u(h(T) If, however, one of the first k*-1 entrants does enter, the Regular incumbent Fights Note that this implies that, even in this case, nothing will be learned about the in- cumbent's type; hence, the continuation play following such a deviation resembles equilibrium play Entrant k" goes In if E=u(h)(r)<b-k+, and randomizes between In and Out in case of equality. In either case, if there is entry, the Incumbent reacts by randomizing between C and F. If there is no entry, beliefs are unchanged, so the next entrant tainly enters Beliefs are thus updated via Bayes'rule. Thus, as soon as the Incumbent plays C, the following Entrants all choose In. Also, whenever an entrant goes In and the incumbent Fights. the next entrant is indifferent between In and Out. and hence randomizes e In the last round. the Incumbent Concedes You should convince yourself that the equilibrium described in OR achieves precisely this. In particular, you should see that, whenever u(h)(T)=b-k+, the probability of a Fight in the following round is exactly b (so the next entrant is indeed willing to randomize Also, the posterior probability of T given F is bk-k, as per the equilibrium beliefs. Finally as claimed in the previous section, it is easy to see that, following entry at or after Stage k*, the continuation payoff to the incumbent if he plays F is indeed 0, as it must be to justify randomization Plausible beliefs The preceding arguments go a long way(but not all the way) towards proving that reputation can be maintained in a PBE. However, the original result due to Kreps and wilson is stronger first, they use the notion of Sequential Equilibrium; second, they show that any SE which satisfies a certain restriction on beliefs induces the reputation outcome. The restriction on beliefs has the following form: consider two histories h and h' such that h' differs from h only in that, at some subhistories following entry, the incumbent
Equilibrium [Refer to OR for definitions] The basic idea is as follows. • The first k ∗ − 1 entrants will stay Out, where k ∗ = min{k : b K − k + 1 ≥ }. Hence, nothing will be learned about the incumbent’s type in equilibrium during the first k ∗ − 1 rounds: t(h) < k∗ implies µ(h)(T) = . • If, however, one of the first k ∗ − 1 entrants does enter, the Regular incumbent Fights. Note that this implies that, even in this case, nothing will be learned about the incumbent’s type; hence, the continuation play following such a deviation resembles equilibrium play. • Entrant k ∗ goes In if = µ(h)(T) < bK−k ∗+1, and randomizes between In and Out in case of equality. In either case, if there is entry, the Incumbent reacts by randomizing between C and F. If there is no entry, beliefs are unchanged, so the next entrant certainly enters. • Beliefs are thus updated via Bayes’ rule. Thus, as soon as the Incumbent plays C, the following Entrants all choose In. Also, whenever an entrant goes In and the incumbent Fights, the next entrant is indifferent between In and Out, and hence randomizes. • In the last round, the Incumbent Concedes. You should convince yourself that the equilibrium described in OR achieves precisely this. In particular, you should see that, whenever µ(h)(T) = b K−k+1, the probability of a Fight in the following round is exactly b (so the next entrant is indeed willing to randomize). Also, the posterior probability of T given F is b K−k , as per the equilibrium beliefs. Finally, as claimed in the previous section, it is easy to see that, following entry at or after Stage k ∗ , the continuation payoff to the incumbent if he plays F is indeed 0, as it must be to justify randomization. Plausible Beliefs The preceding arguments go a long way (but not all the way) towards proving that reputation can be maintained in a PBE. However, the original result due to Kreps and Wilson is stronger: first, they use the notion of Sequential Equilibrium; second, they show that any SE which satisfies a certain restriction on beliefs induces the reputation outcome. The restriction on beliefs has the following form: consider two histories h and h 0 such that h 0 differs from h only in that, at some subhistories following entry, the incumbent 3

Concedes instead of Fighting. Then it must be the case that u(h)(r)>(h(R). This makes sense-and Kreps and Wilson deem beliefs which satisfy this condition plausible. The key point is that one could construct "strange"equilibria in which reacting to off- equilibrium entry with F induces subsequent entrants to believe that the incumbent is Reg- ular: this could engender further entry out of equilibrium. If this is the case, the incumbent might Concede early in the game, in order to avoid ( being identified as a regular type !) You can convince yourself that the beliefs used in the construction of our "reputational luilibrium"are indeed plausible Building vs. Maintaining a Reputation Note that, in the equilibrium we have constructed, essentially nothing happens in the first ke*-1 stages. Thus, there is no sense in which the incumbent "builds a reputation"for being tough. This can only be said of the last few stages-but note that the length of the randomization phase only depends on 6, and not on the overall length of the game. Thus, one may have a very protracted play phase in which no reputation is built, and a comparatively tiny one in which actual reputation building occurs Rather, this story has to do with maintaining a reputation for toughness. Also, it should be apparent that the result depends crucially on the equilibrium assumption: consider the delicate interlocking of randomizations
Concedes instead of Fighting. Then it must be the case that µ(h)(R) ≥ µ(h 0 )(R). This makes sense—and Kreps and Wilson deem beliefs which satisfy this condition plausible. The key point is that one could construct “strange” equilibria in which reacting to off- equilibrium entry with F induces subsequent entrants to believe that the incumbent is Regular: this could engender further entry out of equilibrium. If this is the case, the incumbent might Concede early in the game, in order to avoid (!) being identified as a regular type (!!!). You can convince yourself that the beliefs used in the construction of our “reputational equilibrium” are indeed plausible. Building vs. Maintaining a Reputation Note that, in the equilibrium we have constructed, essentially nothing happens in the first k ∗ − 1 stages. Thus, there is no sense in which the incumbent “builds a reputation” for being tough. This can only be said of the last few stages—but note that the length of the randomization phase only depends on , and not on the overall length of the game. Thus, one may have a very protracted play phase in which no reputation is built, and a comparatively tiny one in which actual reputation building occurs. Rather, this story has to do with maintaining a reputation for toughness. Also, it should be apparent that the result depends crucially on the equilibrium assumption: consider the delicate interlocking of randomizations. 4
按次数下载不扣除下载券;
注册用户24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
- 《博弈论》(英文版) lect 15 Sequential Equilibrium.pdf
- 《博弈论》(英文版) lect 14 General Extensive Games.pdf
- 《博弈论》(英文版) lect 13 Repeated Games.pdf
- 《博弈论》(英文版) lect 12 Repeated Games.pdf
- 《博弈论》(英文版) lect 11 Subgame perfection Marciano siniscalchi.pdf
- 《博弈论》(英文版) lect 10 Extensive Games with(Almost) Perfect Information.pdf
- 《博弈论》(英文版) lect 1 Game Theory Multiperson Decision Theory.pdf
- 《博弈论》(英文版) Marciano Siniscalchi Game Theory.doc
- 湖南商学院:《概率论》课程教学资源(PPT课件)第五章 大数定律和中心极限定理(5.2)中心极限定理.ppt
- 湖南商学院:《概率论》课程教学资源(PPT课件)第五章 大数定律和中心极限定理(5.1)大数定律.ppt
- 《数学分析》课程教学资源(考研辅导)第十六讲 数学分析补充大纲.doc
- 《数学分析》课程教学资源(考研辅导)第十五讲 阶的概念.doc
- 《数学分析》课程教学资源(考研辅导)第十四讲 广义积分的收敛性.doc
- 《数学分析》课程教学资源(考研辅导)第十三讲 微分方法的应用.doc
- 《数学分析》课程教学资源(考研辅导)第十二讲 凸函数及其应用.doc
- 《数学分析》课程教学资源(考研辅导)第十一讲 极限与连续.doc
- 《数学分析》课程教学资源(考研辅导)第十讲 级数的收敛性.doc
- 《数学分析》课程教学资源(考研辅导)第九讲 积分不等式.doc
- 《数学分析》课程教学资源(考研辅导)第八讲 数学分析补充大纲.doc
- 《数学分析》课程教学资源(考研辅导)第七讲 阶的概念.doc
- 《博弈论》(英文版) lect 2 (Iterated)Best Response Operators.pdf
- 《博弈论》(英文版) lect 3 Nash Equilibrium.pdf
- 《博弈论》(英文版) lect 4 Games with Payoff Uncertainty(1).pdf
- 《博弈论》(英文版) lect 5 Games with Payoff Uncertainty(2).pdf
- 《博弈论》(英文版) lect 6 Interactive Epistemology(1) Marciano Siniscalchi.pdf
- 《博弈论》(英文版) lect 7 Interactive Epistemology(2 Marciano siniscalchi.pdf
- 《博弈论》(英文版) lect 8 Applications(1)-Simultaneous Auctions Marciano siniscalchi.pdf
- 《博弈论》(英文版) lect 8.5 More on Auctions.pdf
- 《博弈论》(英文版) lect Forward Induction Marciano Siniscalchi January 10, 2000.pdf
- 《博弈论》(英文版) lect Game Theory Signaling Games Marciano Siniscalchi January.pdf
- 《博弈论》(英文版) lect TH.pdf
- 《博弈论》(英文版) ps1 Due Friday, September.pdf
- 《博弈论》(英文版) ps2 Due Thursday, October.pdf
- 《博弈论》(英文版) ps3 Due Thursday Octobe.pdf
- 《博弈论》(英文版) ps4 Due Tuesday, November.pdf
- 《博弈论》(英文版) ps5 Due Tuesday, November.pdf
- 广州大学:《数学分析》课程教学资源(PPT课件讲稿)第二十章(20.1)对弧长的曲线积分.ppt
- 广州大学:《数学分析》课程教学资源(PPT课件讲稿)第二十章(20.2)对坐标的曲线积分.ppt
- 广州大学:《数学分析》课程教学资源(PPT课件讲稿)第二十一章(21.1)重积分.ppt
- 广州大学:《数学分析》课程教学资源(PPT课件讲稿)第二十一章(21.2)直角坐标系下二重积分的计算.ppt