Risk-Sensitive Reinforcement Learning

Kadlčík, Marek

CS SKLog in Log in (EduId)

Theses 9eea9a

Risk-Sensitive Reinforcement Learning – Bc. Marek Kadlčík

Zpět na vyhledávání

Bc. Marek Kadlčík

Bachelor's thesis

Risk-Sensitive Reinforcement Learning

Abstract:

Cílem standardních metod zpětnovazebního učení je maximalizovat očekávaný budoucí zisk. Ukazujeme motivaci pro zvážení risku při rozhodování, popisujeme zavedené definice risku a formulujeme odpovídající účelové fukce v kontextu zpětnovazebního učení. Nakonec poskytujememe rozsáhlý přehled existujících metod v literatuře pro jejich optimalizaci a uvádíme možné budoucích směry v této oblasti.

Abstract:

Standard reinforcement learning methods aim to maximize the average future returns. We show a motivation for consideration of risk in decision-making, describe established definitions of risk and formulate corresponding risk-constrained and risk-penalizing objectives in context of reinforcement learning. Finally, we provide an extensive overview of existing methods in the literature for their optimization …more

Keywords

reinforcement learning Markov decision process dynamic programming machine learning Monte Carlo Q-Learning REINFORCE Actor-critic expected utility mean-variance conditional value at risk CVaR risk of ruin chance-constraint

Language used: English

Date on which the thesis was submitted / produced: 25. 5. 2021

Identifier: https://is.muni.cz/th/efw88/

Thesis defence

Date of defence: 1. 7. 2021
Supervisor: doc. RNDr. Vojtěch Řehák, Ph.D.
Reader: RNDr. Petr Novotný, Ph.D.

Citation record

Cite this text

ISO 690-compliant citation record:

KADLČÍK, Marek. \textit{Risk-Sensitive Reinforcement Learning}. Online. Bachelor's thesis. Brno: Masaryk University, Faculty of Informatics. 2021. Available from: https://theses.cz/id/9eea9a/.

{{Citace kvalifikační práce
 | příjmení = Kadlčík
 | jméno = Marek
 | instituce = Masaryk University, Faculty of Informatics
 | titul = Risk-Sensitive Reinforcement Learning
 | url = https://theses.cz/id/9eea9a/
 | typ práce = Bachelor's thesis
 | vedoucí = doc. RNDr. Vojtěch Řehák, Ph.D.
 | rok = 2021
 | počet stran =
 | strany =
 | citace = 2024-06-01
 | poznámka =
 | jazyk = 
}}

Full text of thesis

Contents of on-line thesis archive

Published in Theses:

světu

Other ways of accessing the text

Institution archiving the thesis and making it accessible: Masarykova univerzita, Fakulta informatiky

Reference to the local database directory of the institution

Masaryk University

Faculty of Informatics

Bachelor programme / field:
Informatics / Artificial Intelligence and Natural Language Processing

Theses on a related topic

Score-driven Models for Value at Risk and Expected Shortfall
Kateřina Nováková
Value at Risk: Historická simulace, variančně kovarianční metoda a Monte Carlo simulace
Adam Felcman
Pareto Front Estimation in Risk-Constrained Markov Decision Processes
Martin Kurečka
Reinforcement Learning of Risk-Averse Policies in Markov Decision Processes
Jiří Vahala
Model Tuning with Reinforcement Learning from Human Advice
Thomas RIEDL
Reinforcement Learning for the Game of Battleship
Tomáš Kancko
Risk-Aversion in Algorithms for Poker
Martin Horáček