Reinforcement Learning
- Type: Lecture / Practice (VÜ)
- Chair: KIT-Fakultäten - KIT-Fakultät für Informatik - Institut für Anthropomatik und Robotik - IAR Neumann
- Semester: WS 24/25
- 
                    Time:
                    Thu 2024-10-24
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2024-10-25
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2024-10-31
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Thu 2024-11-07
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2024-11-08
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2024-11-14
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2024-11-15
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2024-11-21
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2024-11-22
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2024-11-28
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2024-11-29
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2024-12-05
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2024-12-06
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2024-12-12
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2024-12-13
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2024-12-19
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2024-12-20
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2025-01-09
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2025-01-10
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2025-01-16
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2025-01-17
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2025-01-23
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2025-01-24
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2025-01-30
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2025-01-31
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2025-02-06
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2025-02-07
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 Thu 2025-02-13
 09:45 - 11:15, weekly
 11.10 Engelbert-Arnold-Hörsaal (EAS)
 11.10 Elektrotechnisches Institut (ETI) (1. OG)
 Fri 2025-02-14
 11:30 - 13:00, weekly
 30.46 Neuer Hörsaal Chemie
 30.46 Chemie-Hörsaalgebäude (EG)
 
- 
                    Lecturer:
                    Prof. Dr. Gerhard Neumann
 TT-Prof. Dr. Rudolf Lioutikov
 Mevlüt Onur Celik
 Niklas Freymuth
 Hongyi Zhou
- SWS: 4
- ECTS: 6
- Lv-No.: 2400163
- Information: On-Site
| Content | Reinforcement Learning (RL) is a sub-field of machine learning in which an artificial agent has to interact with its environment and learn how to improve its behaviour by trial and error. For doing so, the agent is provided with an evaluative feedback signal, called reward, that he perceives for each action performed in its environment. RL is one of the hardest machine learning problems, as, in contrast to standard supervised learning, we do not know the targets (i.e. the optimal actions) for our inputs (i.e. the state of the environment) and we also need to consider the long-term effects of the agent’s actions on the state of the environment. Due to recent successes, RL has gained a lot of popularity with applications in robotics, automation, health care, trading and finance, natural language processing, autonomous driving and computer games. This lecture will introduce the concepts and theory of RL and review current state of the art methods with a particular focus on RL applications in robotics. An exemplary list of topics is given below: 
 Lernziele: - Students are able to understand the RL problem and challenges. - Students can differentiate between different RL algorithm and understand their underlying theory - Students will know the mathematical tools necessary to understand RL algorithms - Students can implement RL algorithms for various tasks - Students understand current research questions in RL Empfehlungen: 
 
 Erfolgskontrolle: Siehe Modulhandbuch! Arbeitsaufwand: 180h, aufgeteilt in: 
 ca 30h Prüfungsvorbereitung 
 | 
| Language of instruction | English | 
| Organisational issues | 6 ECTS Vorlesungs-und Übungsturnus: Siehe ILIAS | 
