Deep reinforcement learning on a multi-asset environment for trading

Ali Hirsa, Joerg Osterrieder, Branka Hadji-Misheva, Jan-Alexander Posth

Research output: Working paperPreprintAcademic

65 Downloads (Pure)


Financial trading has been widely analyzed for decades with market participants and academics always looking for advanced methods to improve trading performance. Deep reinforcement learning (DRL), a recently reinvigorated method with significant success in multiple domains, still has to show its benefit in the financial markets. We use a deep Q-network (DQN) to design long-short trading strategies for futures contracts. The state space consists of volatility-normalized daily returns, with buying or selling being the reinforcement learning action and the total reward defined as the cumulative profits from our actions. Our trading strategy is trained and tested both on real and simulated price series and we compare the results with an index benchmark. We analyze how training based on a combination of artificial data and actual price series can be successfully deployed in real markets. The trained reinforcement learning agent is applied to trading the E-mini S&P 500 continuous futures contract. Our results in this study are preliminary and need further improvement.
Original languageEnglish
Publication statusPublished - 15 Jun 2021


  • q-fin.TR


Dive into the research topics of 'Deep reinforcement learning on a multi-asset environment for trading'. Together they form a unique fingerprint.

Cite this