Keywords

RegretDilemmaReinforcement learningLogarithmMathematical optimizationComputer scienceMathematical economicsAction (physics)Bounded functionSimple (philosophy)MathematicsArtificial intelligenceMachine learning

Affiliated Institutions

Related Publications

Publication Info

Year
2002
Type
article
Volume
47
Issue
2-3
Pages
235-256
Citations
5589
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

5589
OpenAlex

Cite This

Peter Auer, Nicolò Cesa‐Bianchi, Paul Fischer (2002). Finite-time Analysis of the Multiarmed Bandit Problem. Machine Learning , 47 (2-3) , 235-256. https://doi.org/10.1023/a:1013689704352

Identifiers

DOI
10.1023/a:1013689704352