Probability Matching
Boltzmann Exploration
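- Probability of pulling arm $i$: $p(i)=\frac{\exp\left(\bar{R}(i)/\tau\right)}{\sum_{j=1}^N \exp\left(\bar{R}(j)/\tau\right)}$, where $\bar{R}(i)$ is the average reward observed for arm $i$, $N$ is the number of arms, and $\tau$ is the temperature #card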
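
The formula above can be sketched in a few lines of Python. This is only an illustration, not code from the original note: the function names `boltzmann_probabilities` and `choose_arm` are hypothetical, and subtracting the maximum reward before exponentiating is an added numerical-stability step that leaves the probabilities unchanged.

```python
import math
import random


def boltzmann_probabilities(mean_rewards, tau):
    """Return p(i) = exp(R(i)/tau) / sum_j exp(R(j)/tau) for every arm."""
    if tau <= 0:
        raise ValueError("tau (temperature) must be positive")
    # Shifting by the max does not change the ratios but avoids overflow.
    max_reward = max(mean_rewards)
    weights = [math.exp((r - max_reward) / tau) for r in mean_rewards]
    total = sum(weights)
    return [w / total for w in weights]


def choose_arm(mean_rewards, tau, rng=random):
    """Sample an arm index according to the Boltzmann probabilities."""
    probs = boltzmann_probabilities(mean_rewards, tau)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

A small $\tau$ concentrates almost all probability on the arm with the highest average reward (near-greedy), while a large $\tau$ pushes the distribution toward uniform (near-random exploration).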

Probability Matching
https://blog.xiang578.com/post/logseq/Probability Matching.html
