Financial market making using RLLib

Pavy · October 13, 2023, 7:17am

Hi,

This is a theoretical question and apologies if this is a wrong forum.

I am trying to model a trading algorithm with the following dynamics:

S = State of market
A = agents actions
R = reward (profit/loss)

In this lets call game, the agent receives a reward if his actions (trades) are profitable.

The transition probabilities are P(S(T) | S(T-1)) i.e. the agent’s actions have no effect on the next state of the market (too many market participants).
The profit or loss is not instant. It is accrued over a period as such there is a credit assignment problem.

My questions are:

Thanks,
Pavy

Topic		Replies	Views
Scaling rewards depending on action distribution RLlib	2	371	November 3, 2021
My RLlib implementation seems to compute random actions RLlib	4	918	February 15, 2022
Multi-Agent Transformer RLlib	5	1199	September 21, 2022
Decentralized multi agent reinforcement learning RLlib	4	121	November 2, 2024
Multi-agent Supply Chain Optimization with RLlib RLlib	1	299	June 19, 2024