FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies

Flow RL is a high-performance reinforcement learning library, combining modern deep RL algorithms with flow and diffusion models for advanced policy parameterization, planning ability or dynamics modeling. It features:

State-of-the-Art Algorithms and Efficiency: We provide JAX implementations of SOTA algorithms, such FQL, BDPO, DAC and etc;
Flexible Flow Architectures: We provide built-in support various types of flow and diffusion models, such as CNFs and DDPM;
Comprehensive Evaluations: We test the algorithms on commonly adopted benchmark and provide the results. Please check our arXiv paper for more details about the module design and benchmark results:

🚀 Installation & Usage

Currently FlowRL is hosted on PyPI and therefore can be installed via pip install flowrl. However, we recommend to clone and install the library using the following commands:

git clone https://github.com/typoverflow/flow-rl.git
cd flow-rl
pip install -e .

Alternatively, you can use our Docker image:

docker pull typoverflow/flow-rl
docker run --gpus all -it typoverflow/flow-rl bash

The entry files are presented in examples/. Please refer to the scripts in scripts/ for how to execute the algorithms.

📊 Supported Algorithms

Offline RL:

Algorithm	Location
IQL	`flowrl/agent/offline/iql.py`
IVR	`flowrl/agent/offline/ivr.py`
DQL	`flowrl/agent/offline/dql.py`
DTQL	`flowrl/agent/offline/dtql.py`
DAC	`flowrl/agent/offline/dac.py`
FQL	`flowrl/agent/offline/fql/fql.py`
BDPO	`flowrl/agent/offline/bdpo/bdpo.py`

Online RL (On-Policy):

Algorithm	Location
PPO	`flowrl/agent/online/ppo.py`
DPPO	`flowrl/agent/online/dppo.py`
FPO	`flowrl/agent/online/fpo.py`
FPOPP	`flowrl/agent/online/fpopp.py`
GenPO	`flowrl/agent/online/genpo.py`

Online RL (Off-Policy):

Algorithm	Location
SAC	`flowrl/agent/online/sac.py`
TD3	`flowrl/agent/online/td3.py`
TD7	`flowrl/agent/online/td7/td7.py`
QSM	`flowrl/agent/online/qsm.py`
QVPO	`flowrl/agent/online/qvpo.py`
DACER	`flowrl/agent/online/dacer.py`
SDAC	`flowrl/agent/online/sdac.py`
DPMD	`flowrl/agent/online/dpmd.py`
IDEM	`flowrl/agent/online/idem.py`

📝 Citing Flow RL

If you use Flow RL in your research, please cite:

@article{gao2026flowrl,
  title={FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies},
  author={Gao, Chenxiao and Chen, Edward and Chen, Tianyi and Dai, Bo},
  journal={arXiv preprint arXiv:2603.27450},
  year={2026}
}

💎 Acknowledgements

Inspired by foundational work from

Jax-CORL
DAC

Name		Name	Last commit message	Last commit date
Latest commit History 127 Commits
.github/workflows		.github/workflows
examples		examples
flowrl		flowrl
scripts		scripts
.dockerignore		.dockerignore
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies

🚀 Installation & Usage

📊 Supported Algorithms

📝 Citing Flow RL

💎 Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies

🚀 Installation & Usage

📊 Supported Algorithms

📝 Citing Flow RL

💎 Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages