Reinforcement Learning in Digital Finance

Register for Event
Register for Event
Register on Meetup
Date & Venue
Date: 03 Feb 2025, 08:30 – 07 Feb 2025, 17:00
Venue: University of Twente, Ravelijn, Hallenweg 17, 7522 NH Enschede, the Netherlands

Course

Date: 03 Feb 2025, 08:30 – 07 Feb 2025, 17:00
Venue: University of Twente, Ravelijn, Hallenweg 17, 7522 NH Enschede, the Netherlands

Reinforcement Learning in Digital Finance offers a comprehensive exploration of reinforcement learning (RL), which is a powerful framework to support sequential decision-making under uncertainty.

Candidates will delve into the theoretical foundations of reinforcement learning, state-of-the-art algorithms, and practical applications within the digital finance domain. RL can facilitate and enhance sequential decision-making processes and services in finance, such as algorithmic trading, portfolio optimization, risk management, and fraud detection. Doctoral candidates will gain a solid understanding of the theoretical underpinnings of RL as well as practical experience in implementing reinforcement learning algorithms to solve real-world financial problems. The applications will consider constraints and regulations that concern privacy, algorithmic bias, and explainability.
By the end of the course, candidates will be equipped with the knowledge and skills to leverage reinforcement learning for optimizing control, enhancing performance, and improving services in the dynamic landscape of digital finance.

Speakers

Wouter van Heeswijk

Wouter van Heeswijk is an assistant professor in operations research & financial engineering at the University of Twente. His research efforts focus primarily on reinforcement learning, with both methodological developments and applications across domains. He teaches a number of courses in financial engineering, with topics including reinforcement learning in finance, numerical valuation of derivatives, real option analysis, risk management and financial accounting. Within the DIGITAL network, he is primarily involved in the doctoral training programme.

Martijn Mes

Martijn Mes is a full professor of Transportation and Logistics Management (TLM) and chair of the Industrial Engineering and Business Information Systems (IEBIS) section within the High Tech Business and Entrepreneurship (HBE) department at the University of Twente (Enschede, The Netherlands). He holds a master’s degree in Applied Mathematics (2002) and did his PhD at the School of Management and Governance, University of Twente (2008). After finishing his PhD, Martijn did his postdoc at Princeton University, Department of Operations Research and Financial Engineering, where he researched the topics of Ranking and Selection (R&S), Bayesian Global Optimization (BGO), and Optimal Learning.

Joerg Osterrieder

Joerg Osterrieder is Associate Professor of Finance and Artificial Intelligence at the University of Twente in the Netherlands, Professor of Sustainable Finance at Bern Business School in Switzerland, and Advisor on Artificial Intelligence to the ING Group's Global Data Analytics Team. He has more than 15 years of experience in financial statistics, quantitative finance, algorithmic trading, and the digitization of the finance industry. Joerg is the Chair of the European COST Action 19130 Fintech and Artificial Intelligence in Finance, an interdisciplinary research network comprised of over 300 researchers from 51 countries globally.

Branka Hadji Misheva

Branka Hadji Misheva is a Professor in Applied Data Science and Finance at BFH, working on AI applications in finance, XAI methods, network models and fintech risk management. She holds a PhD in Economics and Management of Technology with a specific focus on network models as they apply to the operation and performance of P2P lending platforms, from the University of Pavia, Italy. She has furthermore participated in the acquisition of over 20 SNF, Innosuisse and EU projects and published a variety of papers related within the different research proposals Prof. Hadji Misheva is also research author of over 25 papers in the field of credit risk modeling, graph theory, predictive performance of scoring models, lead behavior in crypto markets and explainable AI models for credit risk management.

Stefano Penazzi

Stefano Penazzi is a key member of the Cardo AI team, serving as a Senior Data Scientist. Stefano contributes to the company's mission by leveraging his extensive expertise in data science in developing predictive models, which in turn is shaped by multiple years spent in research positions across distinguished European academic institutions (University of Bologna, University of Hull and ETH Zurich). Prior to Cardo AI, Stefano has also contributed to the development of a decision support system in Python, employed by the Australian Government, to assess and forecast the economic, environmental, and social impacts resulting from the introduction of biofuel.

Anne Zander

Anne Zander is an Assistant Professor in Applied Mathematics at the University of Twente, The Netherlands. She studied Mathematics and Physics and earned a Ph.D. in 2021 from the Karlsruhe Institute of Technology, Germany, working on Operations Research methods applied to Healthcare Logistics. Her theoretical research focuses on sequential decision problems, particularly high-dimensional, discrete decision spaces. To solve those problems, she aims to integrate lookahead planning via Stochastic Programming and learning from past experience via Reinforcement Learning. Her main field of application is Healthcare Logistics. Currently, Anne Zander participates in and leads several projects in close collaboration with healthcare providers related to capacity planning during infectious outbreaks or improving the patient flow from hospitals to follow-up care.

Warren B. Powell

Warren B. Powell is Professor Emeritus at Princeton University, where he taught for 39 years, and is currently the Chief Innovation Officer at Optimal Dynamics.   He was the founder and director of CASTLE Lab, which focused on stochastic optimization with applications to freight transportation, energy systems,  health, e-commerce, finance and the laboratory sciences, supported by over $50 million in funding from government and industry.  He has pioneered a new universal framework that can be used to model any sequential decision problem, including the identification of four classes of policies that spans every possible method for making decisions. This is documented in his latest book with John Wiley: Reinforcement Learning and Stochastic Optimization: A unified framework for sequential decisions.

Schedule

Monday - 3rd of February 2025

08:30 - 10:30

Course introduction and general introduction to Reinforcement Learning
🚩RA1247, Ravelijn building
👤Wouter van Heeswijk, Martijn Mes

11:00 - 12:00

Reinforcement Learning in Digital Finance
🚩RA1247, Ravelijn building
👤Jörg Osterrieder

13:00 - 14:00

Markov Decision processes and basics of temporal difference learning
🚩RA1247, Ravelijn building
👤Wouter van Heeswijk

14:00 - 15:00

Q-learning in Taxi Cab environment
🚩RA1247, Ravelijn building
👤Wouter van Heeswijk

15:15 - 16:00

Convergence proofs for Q-learning
🚩RA1247, Ravelijn building
👤Anne Zander

16:00 - 18:00

Group formation, topic selection and problem formulation
🚩RA1247, Ravelijn building
👤Wouter van Heeswijk, Jörg Osterrieder

Tuesday - 4th of February 2025

08:30 - 10:30

Deep Reinforcement Learning in Finance
🚩RA3411, Ravelijn building
👤Jörg Osterrieder

11:00 - 12:00

Deep Q-learning for automated stock trading
🚩RA1247, Ravelijn building
👤Jörg Osterrieder

13:00 - 15:30

Policy-based reinforcement learning
🚩RA 3411, Ravelijn building
👤Wouter van Heeswijk

15:45 - 17:00

Project: description and coding
🚩RA 3411, Ravelijn building
👤Wouter van Heeswijk,  Jörg Osterrieder

17:00 - 18:00

Parameterized policies in the finance industry
🚩RA 3411, Ravelijn building
👤Wouter van Heeswijk

Wednesday - 5th of February 2025

08:30 - 10:30

Explainable AI in Reinforcement Learning
🚩RA1315, Ravelijn building
👤Branka Hadji Misheva

11:00 - 12:00

Project: explainable components
🚩RA 1315, Ravelijn building
👤Branka Hadji Misheva

13:00 - 18:00

Research seminar on Reinforcement Learning
🚩RA 1315, Ravelijn building
👤Various guest speakers

Thursday - 6th of February 2025

08:30 - 10:30

Industry perspective on tabular transfer learning
🚩RA 3411, Ravelijn building
👤Stefano Penazzi

11:00 - 12:00

Discussion panel: prospects and barriers of Reinforcement Learning in Digital Finance
🚩RA 3411, Ravelijn building
👤Wouter van Heeswijk, Jörg Osterrieder, Stefano Penazzi

Friday - 7th of February 2025

08:30 - 10:30

Project: coding and presentation design
🚩RA 3411, Ravelijn building
👤Wouter van Heeswijk,  Jörg Osterrieder

11:00 - 12:30

Project progress presentation and discussions
🚩RA 3411, Ravelijn building
👤Wouter van Heeswijk,  Jörg Osterrieder

Friday - 30th of May 2025

09:00 - 17:00

Final group presentations
🚩Online

Part of DIGITAL

The Marie Sklowdoska-Curie Action (MSCA) Industrial Doctoral Network on Digital Finance

The MSCA Industrial Doctoral Network on Digital Finance (2024-2027), funded by Horizon Europe, brings together top universities, research centers, and companies to train PhD students in new financial technologies. Academic partners include the University of Twente, WU Vienna, Poznan University of Economics, Universitatea Babeș-Bolyai, Kaunas University of Technology, RPTU, University of Pavia, University of Naples Federico II, American University of Sharjah, and Bern University of Applied Sciences. Industry and research partners are Fraunhofer, Deutsche Bank, Deutsche Börse Group, Raiffeisen Bank, Swedbank, BIS, EIT Digital, Royalton Partners, Quoniam Asset Management, Cardo AI, and Athena Research. Together, they focus on practical projects to help students build skills and create new solutions for the finance sector.

Find Out More

Additional information about the event and training program can be found here:

Contact

In case of questions, feel free to reach out to Dr. van Heeswijk.