Mastering the Maze - Reinforcement Learning in Grid-World-Setup

06.01.2021, 13:00
Forschungsseminar Wahrscheinlichkeitstheorie

Peter Keller

Imagine, a robot is moving through a maze that consists of equal sized rooms. Adjacent rooms are connected by doors. The robot can now perform actions, choosing the next door with respect to a given policy (discrete probability distribution on the available doors). After a new room is entered, the next room is chosen only with respect to the current position. Penalizing the robot's movement with a negative reward (for example -1 for each room change), the robot can learn the optimal policy that leads him through the maze as quickly as possible. We give an introduction to the solution methods of this type of problem via Markov Reward- and Decision-Processes optimizing a discrete Bellman-type equation and show some basic simulations/implementations of Q-Learning.

The Zoom access data are available on the programm of the Research Seminar.

zu den Veranstaltungen

Mastering the Maze - Reinforcement Learning in Grid-World-Setup

06.01.2021, 13:00
Forschungsseminar Wahrscheinlichkeitstheorie

Aktuelle Veranstaltungen

02.07.2025, 9:15 - 10:15 Uhr – 2.09.0.17
Forschungsseminar: Gruppen und Operatoralgebren

Surjunctive Groups

02.07.2025, 13:00 – Haus 9, Raum 0.17 und Zoom
Forschungsseminar Diskrete Spektraltheorie

A gradient flow that is none: Heat flow with Wentzell boundary conditions

03.07.2025, 16:15 – Raum 1.22
Forschungsseminar Differentialgeometrie

Heat and resolvent expansions in the noncommutative case

10.07.2025, 16:15 – Raum 1.22
Forschungsseminar Differentialgeometrie

Cancellation properties of exotic 4-dimensional positive scalar curvature metrics

16.07.2025, 14:30 – Haus 9, Raum 2.22 + Zoom
Institutsrat

Institutsratssitzung 16.07.2025

Links

Informationen für

Mastering the Maze - Reinforcement Learning in Grid-World-Setup

06.01.2021, 13:00 Forschungsseminar Wahrscheinlichkeitstheorie

Aktuelle Veranstaltungen

Links

Informationen für

06.01.2021, 13:00
Forschungsseminar Wahrscheinlichkeitstheorie