Fast and Optimal Online Portfolio Selection
Posed by Tim Van Erven et al. (2020)
§ Problem Statement
Setup
Online portfolio selection with assets proceeds for rounds as follows. On round , the learner chooses a portfolio vector , where . Then the market reveals a price-relative vector and the learner's wealth is multiplied by . Equivalently, define the per-round log-loss
The (static) regret after rounds is
Consider the follow-the-regularized-leader (FTRL) method with the log-barrier regularizer , optimizing over the simplex interior :
where may be a constant or a schedule chosen by the algorithm.
Unsolved Problem
Prove or refute that there exists a computationally feasible instantiation of this FTRL-with-log-barrier approach (i.e., a concrete choice of and tie-breaking if needed) whose regret matches the minimax-optimal worst-case regret rate for online portfolio selection (up to constant factors and/or lower-order terms as a function of and ), without imposing any a priori bounded-gradient (or equivalent) assumption beyond the protocol condition .
§ Discussion
§ Significance & Implications
The COLT 2020 note identifies a three-way gap among existing approaches: some have optimal regret guarantees but are computationally infeasible, others are efficient but lack optimal regret guarantees, and still others rely on bounded-gradient-type conditions that are not guaranteed by the protocol. Establishing (or disproving) minimax-optimal regret for a natural, computationally feasible FTRL algorithm based on the log barrier would directly close this gap for the standard online portfolio model and would require analysis techniques for FTRL that do not hinge on bounded gradients, with potential spillovers to other self-concordant-regularized online convex optimization problems.
§ Known Partial Results
Erven et al. (2020): The COLT 2020 open-problem note (Van Erven, Van der Hoeven, Kotlowski, Koolen; PMLR 125) states that state-of-the-art online portfolio methods each fail on at least one axis: computational feasibility, optimal regret guarantees, or avoidance of bounded-gradient assumptions.
Erven et al. (2020): The note proposes the log-barrier-regularized FTRL update as a natural approach and emphasizes that it is computationally feasible.
Erven et al. (2020): The central unresolved question posed by the note is whether this log-barrier FTRL approach attains the optimal (minimax) regret for online portfolio selection under the basic protocol (with ) and without bounded-gradient assumptions.
Erven et al. (2020): The note suggests that resolving the question will likely require new proof techniques for analyzing FTRL.
Erven et al. (2020): The note points to technical connections to self-concordance and mentions prior use of self-concordant ideas in bandit convex optimization.
§ References
Open Problem: Fast and Optimal Online Portfolio Selection
Tim Van Erven, Dirk Van der Hoeven, Wojciech Kotłowski, Wouter M. Koolen (2020)
Conference on Learning Theory (COLT), PMLR 125
📍 Open-problem note in COLT proceedings.
Open Problem: Fast and Optimal Online Portfolio Selection (PDF)
Tim Van Erven, Dirk Van der Hoeven, Wojciech Kotłowski, Wouter M. Koolen (2020)
Conference on Learning Theory (COLT), PMLR 125
📍 Proceedings PDF.