BelMan: bayesian bandits on the belief–reward manifold
Debabrota Basu, Pierre Senellart and Stéphane Bressan (2018), BelMan: bayesian bandits on the belief–reward manifold. arXiv preprint arXiv:1805.01627.
BelMan: bayesian bandits on the belief–reward manifold Read More »

