Constrained Online Convex Optimization without Slater's Condition
Abstract
We study constrained online convex optimization with adversarial losses and stochastic or adversarial constraints.
For stochastic constraints, existing algorithms that achieve nearly optimal regret and constraint violation bounds typically rely on regularity assumptions such as Slater's condition, while algorithms for adversarial constraints avoid these assumptions only by measuring regret against a restrictive comparator that must be feasible in every round.
We bridge this gap with an anytime primal-dual framework that incorporates an adaptive regularizer into the dual update.
The regularizer stabilizes the dual process without relying on the negative drift induced by Slater's condition.
For stochastic constraints and convex losses, our algorithm achieves $O(\sqrt{T})$ expected regret and $O(\sqrt{T}\log T)$ expected cumulative constraint violation.
The same algorithm also admits high-probability bounds of the same order on both regret and constraint violation.
For strongly convex losses, the regret bound improves to $O(\log T)$ with a violation bound of the same order.
With a minor modification, the framework also applies to adversarial constraints and provides guarantees for hard constraint violation.
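To make the idea of a regularized dual update concrete, the following is a minimal sketch of a generic primal-dual loop on a one-dimensional toy problem. It is not the algorithm from this paper: the step size `eta`, the regularization weight `delta`, the toy loss, the noisy constraint, and the function name `run_primal_dual` are all assumptions chosen for illustration. The sketch only shows how a shrinkage term in the dual step can keep the multiplier bounded without appealing to a strictly feasible point.

```python
import math
import random


def project(x, lo=-1.0, hi=1.0):
    """Euclidean projection onto the interval [lo, hi]."""
    return max(lo, min(hi, x))


def run_primal_dual(T=2000, seed=0):
    """Toy primal-dual loop with a quadratically regularized dual step.

    Assumed setup (hypothetical, for illustration only):
      loss        f_t(x) = (x - 1)^2          on the domain [-1, 1]
      constraint  g_t(x) = x - 0.5 + noise_t  <= 0, with zero-mean noise
    """
    rng = random.Random(seed)
    x, lam = 0.0, 0.0
    total_loss = 0.0
    signed_violation = 0.0
    duals = []
    for t in range(1, T + 1):
        eta = 1.0 / math.sqrt(t)    # anytime step size (assumption)
        delta = 1.0 / math.sqrt(t)  # regularization weight (assumption)

        noise = rng.gauss(0.0, 0.1)
        loss_grad = 2.0 * (x - 1.0)
        g_val = x - 0.5 + noise     # observed noisy constraint value
        g_grad = 1.0

        total_loss += (x - 1.0) ** 2
        signed_violation += x - 0.5  # expected constraint value at x

        # Primal step: projected gradient descent on the Lagrangian in x.
        x_next = project(x - eta * (loss_grad + lam * g_grad))

        # Dual step: gradient ascent on L(x, lam) - (delta / 2) * lam^2.
        # The factor (1 - eta * delta) shrinks lam toward zero each round.
        lam = max(0.0, (1.0 - eta * delta) * lam + eta * g_val)

        x = x_next
        duals.append(lam)

    # Regret against the best fixed feasible point x = 0.5 (loss 0.25 per round).
    regret = total_loss - 0.25 * T
    violation = max(0.0, signed_violation)
    return x, lam, regret, violation, duals


if __name__ == "__main__":
    x, lam, regret, violation, _ = run_primal_dual()
    print(f"x={x:.3f} lam={lam:.3f} regret={regret:.2f} violation={violation:.2f}")
```

In this sketch the shrinkage factor `(1 - eta * delta)` plays the stabilizing role that negative drift under Slater's condition would otherwise play. The schedules for `eta` and `delta` here are placeholders and are not claimed to reproduce the rates stated in the abstract.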