Reinforcement Learning
stable-baselines3
Safetensors
enduro
atari
sb3
ppo
control
Eval Results (legacy)
Instructions to use lucasschott/Enduro-v5-PPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- stable-baselines3
How to use lucasschott/Enduro-v5-PPO with stable-baselines3:
from huggingface_sb3 import load_from_hub checkpoint = load_from_hub( repo_id="lucasschott/Enduro-v5-PPO", filename="{MODEL FILENAME}.zip", ) - Notebooks
- Google Colab
- Kaggle
metadata
license: mpl-2.0
tags:
- reinforcement-learning
- stable-baselines3
- enduro
- atari
- sb3
- ppo
- control
model-index:
- name: Enduro-v5-PPO
parameters: 2.2M
results:
- task:
type: reinforcement-learning
name: Reinforcement Learning
dataset:
name: Enduro-v5
type: gymnasium
metrics:
- type: mean_reward
value: 599.54 +/- 131.49
PPO Agent for Enduro-v5
This is a Proximal Policy Optimization (PPO) agent trained on the Enduro-v5 environment using Stable Baselines 3.
Hyperparameters
See config.json for details.
Requirements
- Python: 3.10
Dependencies
gymnasium==1.0.0
ale_py==0.10.1
gymnasium[atari]
torch==2.4.0
stable_baselines3==2.4.1
opencv-python==25.0.1
How to Load
from huggingface_hub import hf_hub_download
from stable_baselines3 import PPO
model_path = hf_hub_download(repo_id="lucasschott/Enduro-v5-PPO", filename="model.zip")
agent = PPO.load(model_path)