rl-algorithms

Minimal RL training codebase in PyTorch for both off-policy and on-policy methods.

Algorithms

DQN (agent=dqn, trainer_name=off_policy)
PPO (agent=ppo, trainer_name=on_policy)

Environments

MiniGrid (e.g. MiniGrid-Empty-5x5-v0)
Atari ALE through Gymnasium (e.g. atari-ALE/Pong-v5)

Setup (uv)

uv sync

Run commands below with uv run ... so they use the project virtual environment.

Training

Unified entrypoint:

uv run python train.py <overrides...>

DQN (off-policy)

uv run python train.py agent=dqn trainer_name=off_policy task=MiniGrid-Empty-5x5-v0 device=cuda

Atari Pong:

uv run python train.py agent=dqn trainer_name=off_policy task=atari-ALE/Pong-v5 device=cuda

PPO (on-policy, vectorized envs)

uv run python train.py agent=ppo trainer_name=on_policy task=MiniGrid-Empty-5x5-v0 device=cuda num_envs=8 rollout_steps=128

Atari Pong:

uv run python train.py agent=ppo trainer_name=on_policy task=atari-ALE/Pong-v5 device=cuda num_envs=8 rollout_steps=128

Config and Structure

Main config: config.yaml
Trainer selection: trainer_name=off_policy|on_policy in config.yaml or CLI overrides
Agent configs: agent/dqn.yaml, agent/ppo.yaml

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
agent		agent
storage		storage
trainers		trainers
README.md		README.md
config.yaml		config.yaml
logger.py		logger.py
pyproject.toml		pyproject.toml
replay.py		replay.py
train.py		train.py
utils.py		utils.py
video.py		video.py
wrappers.py		wrappers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rl-algorithms

Algorithms

Environments

Setup (uv)

Training

DQN (off-policy)

PPO (on-policy, vectorized envs)

Config and Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

rl-algorithms

Algorithms

Environments

Setup (uv)

Training

DQN (off-policy)

PPO (on-policy, vectorized envs)

Config and Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages