Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1258918

Deep reinforcement learning for market making with time- varying order arrival intensities


Gašperov, Bruno
Deep reinforcement learning for market making with time- varying order arrival intensities, 2022., doktorska disertacija, Fakultet elektrotehnike i računarstva, Zagreb


CROSBI ID: 1258918 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Deep reinforcement learning for market making with time- varying order arrival intensities

Autori
Gašperov, Bruno

Vrsta, podvrsta i kategorija rada
Ocjenski radovi, doktorska disertacija

Fakultet
Fakultet elektrotehnike i računarstva

Mjesto
Zagreb

Datum
19.10

Godina
2022

Stranica
112

Mentor
Kostanjčar, Zvonko

Ključne riječi
market making ; deep reinforcement learning ; stochastic optimal control ; machine learning ; high-frequency trading

Sažetak
Market making is a problem of the optimal placement of limit orders on both sides of the limit order book with the goal of maximizing the trader’s terminal wealth while minimizing the related risks. Such risks particularly include inventory, execution, latency, adverse selection, and model uncertainty risks. Especially salient is the inventory risk, arising from the fluctuations in the value of the asset held in the market maker’s inventory, which is typically non-zero, since it depends on when and whether the placed orders get executed. Consequently, effective market making requires dynamic adaptation to changes in the current inventory level and other relevant market and market maker-related variables. The underlying problem of stochastic optimal control can be naturally cast as a discrete Markov Decision Process (MDP). Existing analytical approaches to market making tend to be predicated upon a set of naïve assumptions and are ill- suited to market making on order-driven markets as they fail to consider the discreteness of the limit order book in general. Moreover, they do not factor in the market microstructure dynamics, especially the time variability of order arrival intensities. Promisingly, methods based on (deep) reinforcement learning are known to lend themselves well to solving problems formulated as MDPs and hence offer a potential alternative to tackling market making. Moreover, considering that the model of the market maker’s environment is typically unknown, model-free deep reinforcement learning methods, capable of learning directly from data without any explicit modeling of the underlying dynamics or prior knowledge, are of pivotal importance. Bearing this in mind, as well as the shortcomings of the current approaches, in this thesis novel model-free deep reinforcement learning methods for market-making on order-driven markets with time-varying order arrival intensities are proposed. The first method is based on two standalone supervised learning-based signal generating units and a deep reinforcement learning unit for market making that exploits the generated signals. Special attention is paid to demands on the sufficient granularity of the resulting market making policies and to the methods’ robustness to variations in the market microstructure dynamics. To this end, a procedure for training market making agents robust to such variations, based on adversarial reinforcement learning, is also proposed. Moreover, an evaluation framework for testing the proposed method with respect to the interpretability and the risk-adjusted return metrics is proposed. The second method is concerned with market making under a weakly consistent, multivariate Hawkes process-based LOB model. The experimental results are discussed, analyzed, and juxtaposed against the results of several market making benchmarks. It is found that the proposed methods outperform the benchmarks with respect to multiple risk- adjusted reward performance metrics.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo



POVEZANOST RADA


Projekti:
HRZZ-IP-2019-04-5241 - Algoritmi dubokog podržanog učenja za upravljanje rizicima (DREAM) (Kostanjčar, Zvonko, HRZZ ) ( CroRIS)
--KK.01.1.1.01.009 - Napredne metode i tehnologije u znanosti o podatcima i kooperativnim sustavima (DATACROSS) (Šmuc, Tomislav; Lončarić, Sven; Petrović, Ivan; Jokić, Andrej; Palunko, Ivana) ( CroRIS)

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb

Profili:

Avatar Url Zvonko Kostanjčar (mentor)

Avatar Url Bruno Gašperov (autor)


Citiraj ovu publikaciju:

Gašperov, Bruno
Deep reinforcement learning for market making with time- varying order arrival intensities, 2022., doktorska disertacija, Fakultet elektrotehnike i računarstva, Zagreb
Gašperov, B. (2022) 'Deep reinforcement learning for market making with time- varying order arrival intensities', doktorska disertacija, Fakultet elektrotehnike i računarstva, Zagreb.
@phdthesis{phdthesis, author = {Ga\v{s}perov, Bruno}, year = {2022}, pages = {112}, keywords = {market making, deep reinforcement learning, stochastic optimal control, machine learning, high-frequency trading}, title = {Deep reinforcement learning for market making with time- varying order arrival intensities}, keyword = {market making, deep reinforcement learning, stochastic optimal control, machine learning, high-frequency trading}, publisherplace = {Zagreb} }
@phdthesis{phdthesis, author = {Ga\v{s}perov, Bruno}, year = {2022}, pages = {112}, keywords = {market making, deep reinforcement learning, stochastic optimal control, machine learning, high-frequency trading}, title = {Deep reinforcement learning for market making with time- varying order arrival intensities}, keyword = {market making, deep reinforcement learning, stochastic optimal control, machine learning, high-frequency trading}, publisherplace = {Zagreb} }




Contrast
Increase Font
Decrease Font
Dyslexic Font