Competing in the dark: An efficient algorithm for bandit linear optimization

Abstract

We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O*(√T)regret. The setting is a natural generalization of the nonstochastic multiarmed bandit problem, and the existence of an efficient optimal algorithm has been posed as an open problem in a number of recent papers. We show how the difficulties encountered by previous approaches are overcome by the use of a self-concordant potential function. Our approach presents a novel connection between online learning and interior point methods.

Keywords

RegretGeneralizationComputer scienceMathematical optimizationLinear programmingOnline algorithmAlgorithmConnection (principal bundle)Optimization problemPoint (geometry)Function (biology)MathematicsMachine learning

Affiliated Institutions

Related Publications

ROBUST MODELING WITH ERRATIC DATA

Jon F. Claerbout , Francis Muir

An attractive alternative to least‐squares data modeling techniques is the use of absolute value error criteria. Unlike the least‐squares techniques the inclusion of some infini...

1973 Geophysics 807 citations

Handbook of Genetic Algorithms

Lawrence Davis

This book sets out to explain what genetic algorithms are and how they can be used to solve real-world problems. The first objective is tackled by the editor, Lawrence Davis. Th...

1991 7308 citations

Atomic Decomposition by Basis Pursuit

Scott Shaobing Chen , David L. Donoho , Michael A. Saunders

The time-frequency and time-scale communities have recently developed a large number of overcomplete waveform dictionaries --- stationary wavelets, wavelet packets, cosine packe...

1998 SIAM Journal on Scientific Computing 6879 citations

Performance evaluation of genetic algorithms for flowshop scheduling problems

Tadahiko Murata , Hisao Ishibuchi

The aim of this paper is to evaluate the performance of genetic algorithms for the flowshop scheduling problem with an objective of minimizing the makespan. First we examine var...

2002 230 citations

Active Learning with Statistical Models

David Cohn , Zoubin Ghahramani , Michael I. Jordan

For many types of machine learning algorithms, one can compute the statistically `optimal' way to select training data. In this paper, we review how optimal data selection techn...

1996 Journal of Artificial Intelligence Re... 1241 citations

Publication Info

Year: 2008
Type: article
Pages: 263-274
Citations: 192
Access: Closed

External Links

Citation Metrics

192

OpenAlex

Cite This

APA Style

                            
                                    Jacob Abernethy, 
                                
                                    Elad Hazan, 
                                
                                    Alexander Rakhlin
                                
                            (2008). 
                            Competing in the dark: An efficient algorithm for bandit linear optimization. 
                            ScholarlyCommons (University of Pennsylvania)
                            
                            , 263-274.