Constrained Online Convex Optimization without Slater's Condition

arXiv Math


CC BY

This outlet displays the full text directly under a public or free license.

Abstract

We study constrained online convex optimization with adversarial losses and stochastic or adversarial constraints.

For stochastic constraints, existing algorithms that achieve nearly optimal regret and constraint violation bounds typically rely on regularity assumptions such as Slater's condition, while adversarial-constraint algorithms avoid these assumptions by using a rather restrictive round-wise feasible comparator.

We bridge this gap with an anytime primal-dual framework that incorporates an adaptive regularizer into the dual update.

The regularizer stabilizes the dual process without relying on the negative drift induced by Slater's condition.

For stochastic constraints and convex losses, our algorithm achieves $O(\sqrt{T})$ expected regret and $O(\sqrt{T}\log T)$ expected cumulative constraint violation.
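For orientation, the two quantities being bounded are commonly defined as below. This is a sketch using standard notation from the constrained online convex optimization literature, not notation taken from the abstract: the decision set $\mathcal{X}$, losses $f_t$, constraint functions $g_t$, iterates $x_t$, and the choice of a comparator that is feasible in expectation are all assumptions, and the paper's exact definitions may differ.

```latex
% Assumed notation: x_t is the learner's decision in round t,
% f_t the adversarial loss, g_t the (stochastic) constraint function.
\[
  \mathrm{Regret}_T
    = \sum_{t=1}^{T} f_t(x_t)
      - \min_{x \in \mathcal{X} \,:\, \mathbb{E}[g_t(x)] \le 0}
        \sum_{t=1}^{T} f_t(x),
  \qquad
  \mathrm{Violation}_T
    = \sum_{t=1}^{T} g_t(x_t).
\]
% A "hard" violation measure, which does not let negative terms
% cancel positive ones, replaces g_t(x_t) by its positive part:
\[
  \mathrm{Violation}^{+}_T = \sum_{t=1}^{T} \bigl[ g_t(x_t) \bigr]_{+},
  \qquad [a]_{+} = \max\{a, 0\}.
\]
```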

We also show that the algorithm admits high-probability bounds of the same order on regret and constraint violation.

For strongly convex losses, the regret bound improves to $O(\log T)$ with a violation bound of the same order.

With a minor modification, the framework also applies to adversarial constraints and provides guarantees for hard constraint violation.
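The general idea of a primal-dual update with a regularized dual step can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's algorithm: the abstract does not give the update rules, so the step size `eta = 1/sqrt(t)`, the regularization weight `delta = 1/sqrt(t)`, the Euclidean-ball decision set, and all function names here are hypothetical choices in the style of classical regularized-Lagrangian methods.

```python
import numpy as np


def project_ball(x, radius=1.0):
    """Euclidean projection onto the ball of the given radius (assumed decision set)."""
    norm = np.linalg.norm(x)
    return x if norm <= radius else x * (radius / norm)


def regularized_primal_dual(loss_grads, constraints, dim, radius=1.0):
    """Run a primal-dual loop with a regularized dual update.

    loss_grads:  list of callables x -> gradient of f_t at x
    constraints: list of callables x -> (g_t(x), gradient of g_t at x)
    Returns the played iterates, the final dual variable, and the
    cumulative (soft) constraint violation sum_t g_t(x_t).
    """
    x = np.zeros(dim)
    lam = 0.0
    total_violation = 0.0
    iterates = []
    for t, (grad_f, g) in enumerate(zip(loss_grads, constraints), start=1):
        eta = 1.0 / np.sqrt(t)    # anytime step size (assumed schedule)
        delta = 1.0 / np.sqrt(t)  # dual regularization weight (assumed schedule)

        iterates.append(x.copy())
        g_val, g_grad = g(x)
        total_violation += g_val

        # Primal step: projected gradient descent on f_t + lam * g_t.
        x_next = project_ball(x - eta * (grad_f(x) + lam * g_grad), radius)

        # Dual step: projected gradient ascent on lam * g_t(x_t) - (delta / 2) * lam**2.
        # The -delta * lam term pulls lam back toward zero, so the dual
        # variable stays controlled without a strictly feasible point.
        lam = max(0.0, lam + eta * (g_val - delta * lam))

        x = x_next
    return iterates, lam, total_violation
```

Here the regularization term plays the stabilizing role that the negative drift from Slater's condition plays in unregularized analyses; whether the paper's adaptive regularizer takes this form is not stated in the abstract.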
