Explainable Reinforcement Learning Through a Causal Lens

阿新 • • 發佈：2021-10-26

發表時間：2019（AAAI 2020）
文章要點：這篇文章通過構建一個圖結構，來解釋為啥agent要做/不做某個動作。具體來說就是先把某個問題給抽象成一個圖結構，定義狀態動作回報等關鍵資訊的節點和邊，然後在訓練RL的時候也順便用資料來訓練這個圖。訓練完了之後，就根據圖用深度優先搜尋去找，做某個動作或者不做某個動作最後導致的結果是啥，然後就說一定程度上對RL的策略做了解釋。
總結：這個文章也太晦澀了，不知道在說什麼，裡面太多心理學的詞彙，比如Causal Lens，minimally complete，structural equations，task prediction，5-point Likert Explanation Satisfaction Scale，其實方法和RL關係不大。
疑問：

只知道個大概意思，其實不是很懂怎麼去構造圖的，也不懂怎麼去訓練的。structural causal model需要人為構造嗎，那如果問題太複雜或者我們對問題並不完全瞭解，該怎麼去構造？structural equations具體指的是什麼，怎麼去學的？DAG是啥？
5-point Likert Explanation Satisfaction Scale是啥？文章還說如果圖太大，找不到complete的解釋，所以就去找minimal explanations，不知道這兩的定義是啥，也不知道具體咋找的。

Explainable Reinforcement Learning Through a Causal Lens

Explainable Reinforcement Learning Through a Causal Lens

論文閱讀筆記《Few-Shot Learning Through an Information Retrieval Lens》

Online and Offline Reinforcement Learning by Planning with a Learned Model

Model-based Reinforcement Learning: A Survey

Reinforcement Learning (DQN) 中經驗池詳細解釋

論文記載： Deep Reinforcement Learning for Traffic LightControl in Vehicular Networks

MFMARL(Mean Field Multi-Agent Reinforcement Learning)實現

強化學習論文研讀（四）——Deep Reinforcement Learning with Double Q-Learning

讀論文--Characterizing Attacks on Deep Reinforcement Learning

Evaluating the Performance of Reinforcement Learning Algorithms

Detecting Rewards Deterioration in Episodic Reinforcement Learning

Decoupling Value and Policy for Generalization in Reinforcement Learning

Game Theory and Multi-agent Reinforcement Learning筆記上

Offline Evaluation of Online Reinforcement Learning Algorithms

Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning

Improving Generalization in Reinforcement Learning with Mixture Regularization

REPAINT: Knowledge Transfer in Deep Reinforcement Learning

LEARNING INVARIANT REPRESENTATIONS FOR REINFORCEMENT LEARNING WITHOUT RECONSTRUCTION

論文解讀：COLING-2020(ccf-b)-Answer-driven Deep Question Generation based on Reinforcement Learning

Learning to Combat Compounding-Error in Model-Based Reinforcement Learning

Explainable Reinforcement Learning Through a Causal Lens

相關推薦