Finite-time Analysis of the Multiarmed Bandit Problem
2002
Machine Learning
5,589 citations