项目作者: niffler92
项目描述 :
Bandit algorithms
高级语言: Python
项目地址: git://github.com/niffler92/Bandit.git
Bandit algorithms
Contextual Bandit
Current implementations
simulation example:
Change Directory to example/scripts and type ./simulation.sh
in command line.
./simulation.sh
Experiment can be done by changing arguments and Envs in simulation.sh and example/envs/envs.py, example/envs/bandits.py
Original work reference bgalbraith
Study guide
Paper and thesis
- Lihong Li, Wei Chu, John Langford, Robert E. Schapire, A Contextual-Bandit Approach to Personalized News Article Recommendation
- Dai Shi, Exploring Bandit Algorithms for Automatic Content Selection
- Olivier Chapelle, Lihong Li, An Empirical Evaluation of Thompson Sampling
- Junpei Komiyama, Junya Honda, Hiroshi Nakagawa, Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays
- Piette JD, The potential impact of intelligent systems for mobile health self-management support: Monte Carlo simulations of text message support for medication adherence
Lecture notes
Blog posts
TODO