Abstract
We introduce Boundary Focused Thompson sampling (BFTS), a new Bayesian algorithm to solve the anytime m-top exploration problem, where the objective is to identify the m best arms in a multi-armed bandit. First, we consider a set of existing benchmark problems that consider sub-Gaussian reward distributions (i.e., Gaussian with fixed variance and categorical reward). Next, we introduce a new environment inspired by a real world decision problem concerning insect control for organic agriculture. This new environment encodes a Poisson rewards distribution. For all these benchmarks, we experimentally show that BFTS consistently outperforms AT-LUCB, the current state of the art algorithm.
| Original language | English |
|---|---|
| Title of host publication | 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI) |
| Pages | 1422-1428 |
| Number of pages | 7 |
| ISBN (Electronic) | 2375-0197 |
| DOIs | |
| Publication status | Published - Nov 2019 |
| Event | ICTAI - Portland, United States Duration: 4 Nov 2019 → 6 Nov 2019 |
Conference
| Conference | ICTAI |
|---|---|
| Country/Territory | United States |
| City | Portland |
| Period | 4/11/19 → 6/11/19 |
Fingerprint
Dive into the research topics of 'Bayesian Anytime m-top Exploration'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver