Mastering the Maze - Reinforcement Learning in Grid-World-Setup

06.01.2021, 13:00
Forschungsseminar Wahrscheinlichkeitstheorie

Peter Keller

Imagine, a robot is moving through a maze that consists of equal sized rooms. Adjacent rooms are connected by doors. The robot can now perform actions, choosing the next door with respect to a given policy (discrete probability distribution on the available doors). After a new room is entered, the next room is chosen only with respect to the current position. Penalizing the robot's movement with a negative reward (for example -1 for each room change), the robot can learn the optimal policy that leads him through the maze as quickly as possible. We give an introduction to the solution methods of this type of problem via Markov Reward- and Decision-Processes optimizing a discrete Bellman-type equation and show some basic simulations/implementations of Q-Learning.

The Zoom access data are available on the programm of the Research Seminar.

zu den Veranstaltungen

Mastering the Maze - Reinforcement Learning in Grid-World-Setup

06.01.2021, 13:00
Forschungsseminar Wahrscheinlichkeitstheorie

Aktuelle Veranstaltungen

04.02.2026, 9:00 Uhr – Haus 9, Raum 2.22
Hochschulöffentlicher Vortrag

Stabilization by transport noise and enhanced dissipation in the Kraichnan model

04.02.2026, 10:10 Uhr – Haus 9, Raum 2.22
Hochschulöffentlicher Vortrag

Mathematical analysis of physically motivated kinetic models

05.02.2026, 16:15 – Raum 1.22, Haus 9
Forschungsseminar Differentialgeometrie

Perturbation theory of Dirac operators and homotopy groups of parameter spaces

09.02.2026, 9:00 Uhr – Haus 9, Raum 2.22
Hochschulöffentlicher Vortrag

Discrete Analysis and Its Applications

09.02.2026, 10:10 Uhr – Haus 9, Raum 2.22
Hochschulöffentlicher Vortrag

On the volume-renormalized mass

Links

Informationen für

Mastering the Maze - Reinforcement Learning in Grid-World-Setup

06.01.2021, 13:00 Forschungsseminar Wahrscheinlichkeitstheorie

Aktuelle Veranstaltungen

Links

Informationen für

06.01.2021, 13:00
Forschungsseminar Wahrscheinlichkeitstheorie