Bandit problems

24.05.2018, 16:15
Forschungsseminar Wahrscheinlichkeitstheorie

Pierre Ménard

First Talk

Title: An introduction to best arm identification in the bandit problems

Abstract: In this talk we will introduce the bandit setting and in particular the problem of best arm identification in the fixed confidence setting.
We will show how to characterize the complexity of these problems and present an algorithm that reaches asymptotically this complexity.
Most of the results that will be presented are extract from Optimal Best Arm Identification with Fixed Confidence, Garivier and Kaufmann, 2016.

Second Talk
Title: Threshold bandit for dose-ranging

Abstract: We will present a variant of the best arm identification in the bandit problem: the threshold bandit problem. It is a way to model the dose-ranging problem for phase I
of clinical trials. As in the fist talk we will give the complexity of these problems and present an algorithm to solve them. This talk is based on the paper: Thresholding Bandit for Dose-ranging: The Impact of Monotonicity, Garivier et al., 2017.

zu den Veranstaltungen