Generalized non-stationary multi-armed Bandits

03.02. bis 01.02.2021, 13:00
Forschungsseminar Wahrscheinlichkeitstheorie

Anne Manegueu (Magdeburg)

Recent decades have seen rapid advances in multi-armed bandits (MAB) algorithms, due to their ability to optimize with the presence of uncertainty. Classical MAB problems are studied based on the assumption that the data generating mechanism does not change over time. This assumption is often violated in real-world applications, since the distributions of rewards themselves may change over time, and more generally the environment might be non-stationary. A large part of the bandit literature has devoted to the question of finding models that capture the non-stationarity of the environment, giving hence birth to two classes of multiarmed bandits: the rested bandits and the restless Bandits. In our work, we proposed a generalization of the switching bandits, which is a subclass of the restless bandit-where the reward distribution is assumed to be piecewise stationary. To be specific, we consider a multi-armed bandit setting that unifies the following settings: (a) the switching bandit problem, (b) the MAB problem with locally polynomial mean reward, (c) the MAB problem with locally smooth mean rewards, and (d) the one, where the gaps of the arms have a bounded number of inflextion points and the highest arm's mean cannot vary too much in a short-range. We propose two algorithms in this general setting termed "the selectivebandits" and the "the prudentbandits", that solve in an efficient and unified way the four problems (a)-(d) mentioned.

The Zoom access data are available on the programm of the Research Seminar.

zu den Veranstaltungen

Generalized non-stationary multi-armed Bandits

03.02. bis 01.02.2021, 13:00
Forschungsseminar Wahrscheinlichkeitstheorie

Aktuelle Veranstaltungen

04.02.2026, 9:00 Uhr – Haus 9, Raum 2.22
Hochschulöffentlicher Vortrag

Stabilization by transport noise and enhanced dissipation in the Kraichnan model

04.02.2026, 10:10 Uhr – Haus 9, Raum 2.22
Hochschulöffentlicher Vortrag

Mathematical analysis of physically motivated kinetic models

05.02.2026, 16:15 – Raum 1.22, Haus 9
Forschungsseminar Differentialgeometrie

Perturbation theory of Dirac operators and homotopy groups of parameter spaces

09.02.2026, 9:00 Uhr – Haus 9, Raum 2.22
Hochschulöffentlicher Vortrag

Discrete Analysis and Its Applications

09.02.2026, 10:10 Uhr – Haus 9, Raum 2.22
Hochschulöffentlicher Vortrag

On the volume-renormalized mass

Links

Informationen für

Generalized non-stationary multi-armed Bandits

03.02. bis 01.02.2021, 13:00 Forschungsseminar Wahrscheinlichkeitstheorie

Aktuelle Veranstaltungen

Links

Informationen für

03.02. bis 01.02.2021, 13:00
Forschungsseminar Wahrscheinlichkeitstheorie