Reading Notes: Introduction to Game Theory - 12 - Static Games of Incomplete Information: Bayesian Games

阿新 • Published: 2018-01-20

Bayesian Games

These are study notes on Game Theory An Introduction (by Steven Tadelis).

Static Games of Incomplete Information

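In a game of incomplete information, some features of the game are not common knowledge among the players; specifically, a player may not know the opponents' action sets, outcome sets, or payoff functions.
The standard way to handle such games is due to Harsanyi.
He introduced two concepts to resolve the problem.
type space: the opponent's hidden information (action sets, outcome sets, payoff functions, and so on) is recast as a set of possible types, where everything about each individual type is known.

belief: because a player does not know which type an opponent actually is, a probability distribution describes how likely each of the opponent's types is.
Expected payoffs can then be computed by weighting payoffs with these probabilities.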
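
As a small illustration of how a belief turns hidden information into an expected payoff, here is a minimal sketch. The type names (`tough`, `weak`), the action names, and all the numbers are assumptions invented for this example; they are not taken from the book.

```python
# Hypothetical belief: probability that the opponent is of each type.
belief = {"tough": 0.3, "weak": 0.7}

# Hypothetical payoffs to player 1 for each of her actions,
# depending on the opponent's type (numbers invented for illustration).
payoff = {
    "enter": {"tough": -1.0, "weak": 2.0},
    "stay_out": {"tough": 0.0, "weak": 0.0},
}

def expected_payoff(action):
    """Average the payoff of `action` over the opponent's possible types."""
    return sum(belief[t] * payoff[action][t] for t in belief)

# The action with the highest belief-weighted payoff.
best_action = max(payoff, key=expected_payoff)
```

With these made-up numbers, `enter` yields 0.3 * (-1) + 0.7 * 2 = 1.1 in expectation, so it beats `stay_out`, which yields 0.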

Normal-form representation of a static Bayesian game of incomplete information
\[
\begin{aligned}
& \left\langle N, \{ A_i \}_{i=1}^n, \{ \Theta_i \}_{i=1}^n, \{ v_i(\cdot; \theta_i), \theta_i \in \Theta_i \}_{i=1}^n, \{ \phi_i \}_{i=1}^n \right\rangle \\
& \text{where} \\
& N = \{ 1, 2, \cdots, n \} \text{ : the set of players} \\
& A_i \text{ : the action set of player } i \\
& \Theta_i \text{ : the type space of player } i \\
& v_i : A \times \Theta_i \to \mathbb{R} \text{ : the type-dependent payoff function of player } i \\
& \phi_i \text{ : the belief of player } i \text{ with respect to the uncertainty over the other players' types} \\
& \phi_i(\theta_{-i} \mid \theta_i) \text{ : the posterior conditional distribution on } \theta_{-i} \text{ given } \theta_i
\end{aligned}
\]
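
One way to make the tuple concrete is to store its five components in a small data structure. The sketch below is only an illustration: the class name `BayesianGame`, the function signatures, and the two-player toy game (one type `t` for player 1, types `match` and `mismatch` for player 2) are all assumptions made up for these notes.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

@dataclass
class BayesianGame:
    players: List[int]                 # N
    actions: Dict[int, List[str]]      # A_i for each player i
    types: Dict[int, List[str]]        # Theta_i for each player i
    # v_i(a; theta_i): (player, action profile, own type) -> payoff
    payoff: Callable[[int, Tuple[str, ...], str], float]
    # phi_i(theta_{-i} | theta_i): (player, others' types, own type) -> probability
    belief: Callable[[int, Tuple[str, ...], str], float]

def toy_payoff(i, a, theta_i):
    # Player 1 and the 'match' type of player 2 want equal actions;
    # the 'mismatch' type of player 2 wants different actions.
    same = a[0] == a[1]
    if i == 2 and theta_i == "mismatch":
        return 0.0 if same else 1.0
    return 1.0 if same else 0.0

def toy_belief(i, theta_others, theta_i):
    # Player 1 thinks both types of player 2 are equally likely;
    # player 2 faces a single possible type of player 1.
    return 0.5 if i == 1 else 1.0

game = BayesianGame(
    players=[1, 2],
    actions={1: ["A", "B"], 2: ["A", "B"]},
    types={1: ["t"], 2: ["match", "mismatch"]},
    payoff=toy_payoff,
    belief=toy_belief,
)
```

Note that the payoff function takes only the player's own type, mirroring \(v_i : A \times \Theta_i \to \mathbb{R}\).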
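Timing of a static Bayesian game of incomplete information:

1. Nature chooses a profile of types \((\theta_1, \theta_2, \cdots, \theta_n)\).
2. Each player learns her own type \(\theta_i\) and uses the prior \(\phi_i\) to form posterior beliefs over the other players' types.
3. The players choose their actions simultaneously.
4. Given the action profile \(a = (a_1, a_2, \cdots, a_n)\), each player receives the payoff \(v_i(a; \theta_i)\).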
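
The four steps can be sketched as a single simulated play of a toy game. Everything here is hypothetical: the prior, the type and action names, and the fixed pure strategies are invented for illustration. Because the strategies are fixed in advance, the belief-forming part of step 2 is not modeled; the code only enforces that each player conditions on her own type.

```python
import random

# Hypothetical common prior over type profiles (theta_1, theta_2).
prior = {("t", "match"): 0.5, ("t", "mismatch"): 0.5}

# Hypothetical pure strategies: each maps a player's own type to an action.
strategies = {
    1: {"t": "A"},
    2: {"match": "A", "mismatch": "B"},
}

def payoff(i, a, theta_i):
    """v_i(a; theta_i) for the toy game: player 1 and the 'match' type of
    player 2 want equal actions; the 'mismatch' type wants different ones."""
    same = a[0] == a[1]
    if i == 2 and theta_i == "mismatch":
        return 0.0 if same else 1.0
    return 1.0 if same else 0.0

def play_once(rng):
    # Step 1: Nature draws a type profile from the prior.
    profiles = list(prior)
    theta = rng.choices(profiles, weights=[prior[p] for p in profiles])[0]
    # Steps 2-3: each player observes only her own type and acts on it.
    a = tuple(strategies[i][theta[i - 1]] for i in (1, 2))
    # Step 4: payoffs are realized.
    v = tuple(payoff(i, a, theta[i - 1]) for i in (1, 2))
    return theta, a, v

theta, a, v = play_once(random.Random(0))
```

Under these strategies player 2 always obtains 1, while player 1 obtains 1 only when Nature draws the `match` type.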

Conditional probability
The conditional probability that event H occurs, given that event S has occurred, is:
\[ \Pr\{H \mid S\} = \frac{\Pr\{S \land H\}}{\Pr\{S\}} \]
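
Applied to types, the same formula gives a player's posterior belief: take S to be "my type is \(\theta_1\)" and H to be "the opponent's type is \(\theta_2\)". The sketch below assumes a hypothetical joint prior over two players' types; the type names and numbers are invented.

```python
from fractions import Fraction

# Hypothetical joint prior over (theta_1, theta_2); numbers are invented.
prior = {
    ("high", "high"): Fraction(4, 10),
    ("high", "low"): Fraction(1, 10),
    ("low", "high"): Fraction(2, 10),
    ("low", "low"): Fraction(3, 10),
}

def posterior_for_player_1(theta_1):
    """Return phi_1(theta_2 | theta_1) = Pr{theta_1 and theta_2} / Pr{theta_1}."""
    marginal = sum(p for (t1, _), p in prior.items() if t1 == theta_1)
    return {t2: p / marginal for (t1, t2), p in prior.items() if t1 == theta_1}

belief_high = posterior_for_player_1("high")
belief_low = posterior_for_player_1("low")
```

Here a `high` player 1 assigns probability 4/5 to the opponent being `high`, whereas a `low` player 1 assigns only 2/5, so the two types of player 1 hold different beliefs because the types are correlated.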
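Static Bayesian games of incomplete information - pure strategies
\[ \left\langle N, \{ A_i \}_{i=1}^n, \{ \Theta_i \}_{i=1}^n, \{ v_i(\cdot; \theta_i), \theta_i \in \Theta_i \}_{i=1}^n, \{ \phi_i \}_{i=1}^n \right\rangle \]
A pure strategy for player i is a function \(s_i : \Theta_i \to A_i\) that assigns an action \(s_i(\theta_i) \in A_i\) to each of her types \(\theta_i\).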
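
Because a pure strategy picks one action for every type, a player with finitely many types and actions has \(|A_i|^{|\Theta_i|}\) pure strategies. A minimal sketch that enumerates them, with hypothetical type and action names:

```python
from itertools import product

types = ["match", "mismatch"]   # hypothetical Theta_i
actions = ["A", "B"]            # hypothetical A_i

# Each pure strategy is a dict mapping every type to one action.
pure_strategies = [
    dict(zip(types, choice)) for choice in product(actions, repeat=len(types))
]
```

With two types and two actions this produces 2 ** 2 = 4 strategies, for example "play A if `match`, play B if `mismatch`".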
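Static Bayesian games of incomplete information - mixed strategies
A mixed strategy for player i is a probability distribution over her pure strategies.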
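
To see what such a distribution implies for behavior, the sketch below takes a hypothetical mixed strategy (the probabilities are invented) and computes the distribution over actions that it induces for each type.

```python
from fractions import Fraction

types = ["match", "mismatch"]   # hypothetical Theta_i

# A pure strategy is written as a tuple:
# (action if type is 'match', action if type is 'mismatch').
# The mixed strategy assigns a probability to each pure strategy.
mixed = {
    ("A", "A"): Fraction(1, 2),
    ("A", "B"): Fraction(1, 4),
    ("B", "B"): Fraction(1, 4),
}

def action_distribution(theta):
    """Probability of each action when the player's type is `theta`."""
    k = types.index(theta)
    dist = {}
    for pure, prob in mixed.items():
        dist[pure[k]] = dist.get(pure[k], 0) + prob
    return dist
```

For this example, the `match` type plays A with probability 3/4, while the `mismatch` type plays A and B with probability 1/2 each.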
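Static Bayesian games of incomplete information - pure-strategy Bayesian Nash equilibrium
A strategy profile \(s^* = (s_1^*, \cdots, s_n^*)\) is a pure-strategy Bayesian Nash equilibrium if, for every player i, every type \(\theta_i \in \Theta_i\) of that player, and every action \(a_i \in A_i\):
\[
\begin{aligned}
& \sum_{\theta_{-i} \in \Theta_{-i}} \phi_i(\theta_{-i} \mid \theta_i) \, v_i(s_i^*(\theta_i), s_{-i}^*(\theta_{-i}); \theta_i) \geq \sum_{\theta_{-i} \in \Theta_{-i}} \phi_i(\theta_{-i} \mid \theta_i) \, v_i(a_i, s_{-i}^*(\theta_{-i}); \theta_i) \\
& \text{where} \\
& v_i(a_i, s_{-i}^*(\theta_{-i}); \theta_i) \text{ : player } i \text{'s payoff, which depends only on her own type } \theta_i
\end{aligned}
\]
Meaning: for every player and each of her types, the action \(s_i^*(\theta_i)\) maximizes her expected payoff, that is, her payoff weighted by her beliefs over the other players' types.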
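
For small finite games the inequality can be checked by brute force. The sketch below does this for a hypothetical two-player game invented for these notes: player 1 has a single type `t` and wants the actions to coincide; player 2 is of type `match` (probability 2/3) or `mismatch` (probability 1/3) and wants the actions to coincide or to differ, respectively. It is an illustration of the definition, not a method from the book.

```python
from fractions import Fraction
from itertools import product

actions = ["A", "B"]
types = {1: ["t"], 2: ["match", "mismatch"]}
# phi_i(theta_{-i} | theta_i); types are independent here, so the
# belief does not depend on the player's own type.
belief = {
    1: {"match": Fraction(2, 3), "mismatch": Fraction(1, 3)},
    2: {"t": Fraction(1)},
}

def payoff(i, a1, a2, theta_i):
    same = a1 == a2
    if i == 2 and theta_i == "mismatch":
        return int(not same)
    return int(same)

def expected(i, own_action, theta_i, other_strategy):
    """One side of the BNE inequality for player i of type theta_i."""
    total = 0
    for theta_other, prob in belief[i].items():
        other_action = other_strategy[theta_other]
        a1, a2 = (own_action, other_action) if i == 1 else (other_action, own_action)
        total += prob * payoff(i, a1, a2, theta_i)
    return total

def is_bne(s1, s2):
    strategies = {1: s1, 2: s2}
    for i, other in ((1, 2), (2, 1)):
        for theta_i in types[i]:
            on_path = expected(i, strategies[i][theta_i], theta_i, strategies[other])
            # A profitable deviation for any type breaks the equilibrium.
            if any(expected(i, a, theta_i, strategies[other]) > on_path for a in actions):
                return False
    return True

def all_strategies(i):
    return [dict(zip(types[i], c)) for c in product(actions, repeat=len(types[i]))]

equilibria = [
    (s1, s2)
    for s1 in all_strategies(1)
    for s2 in all_strategies(2)
    if is_bne(s1, s2)
]
```

In this toy game the search finds two pure-strategy Bayesian Nash equilibria: player 1 plays A while player 2 plays A if `match` and B if `mismatch`, and the mirror image with A and B swapped.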

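For this chapter (and indeed the whole book), what matters most is learning how to apply the theory. The book provides good worked examples, which are not covered here.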

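References

Game Theory An Introduction (by Steven Tadelis)
Reading Notes: Introduction to Game Theory - 01 - Single-Person Decision Problems
Reading Notes: Introduction to Game Theory - 02 - Introducing Uncertainty and Time
Reading Notes: Introduction to Game Theory - 03 - Preliminaries
Reading Notes: Introduction to Game Theory - 04 - Rationality and Common Knowledge
Reading Notes: Introduction to Game Theory - 05 - Nash Equilibrium
Reading Notes: Introduction to Game Theory - 06 - Mixed Strategies
Reading Notes: Introduction to Game Theory - 07 - Dynamic Games of Perfect Information: Preliminaries
Reading Notes: Introduction to Game Theory - 08 - Dynamic Games of Perfect Information: Credibility and Sequential Rationality
Reading Notes: Introduction to Game Theory - 09 - Dynamic Games of Perfect Information: Multistage Games
Reading Notes: Introduction to Game Theory - 10 - Dynamic Games of Perfect Information: Repeated Games
Reading Notes: Introduction to Game Theory - 11 - Dynamic Games of Perfect Information: Strategic Bargaining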
Nash bargaining solution
