Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
1
Languages
Licenses
Other
Reset Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Safetensors
Transformers
PEFT
GGUF
TensorBoard
Diffusers
ONNX
stable-baselines3
sentence-transformers
MLX
ml-agents
TF-Keras
Keras
Joblib
Adapters
Transformers.js
timm
setfit
OpenVINO
sample-factory
Flair
Core ML
LiteRT
NeMo
fastai
ESPnet
BERTopic
spaCy
Rust
Scikit-learn
fastText
OpenCLIP
KerasHub
ExecuTorch
Asteroid
speechbrain
AllenNLP
llamafile
Fairseq
PaddlePaddle
PaddleOCR
Stanza
Habana
pyannote.audio
SpanMarker
Graphcore
paddlenlp
unity-sentis
DDUF
univa
Apply filters
Models
26,602
Base only
Inference Available
Inference
Edit filters
Sort: Trending
Active filters:
stable-baselines3
Clear all
Adilbai/stock-trading-rl-agent
Reinforcement Learning
•
Updated
Jan 8
•
167
•
159
sanju-1007/SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
2 days ago
•
65
•
1
Iqnemo/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
5 days ago
•
19
•
1
ThomasSimonini/demo-hf-CartPole-v1
Reinforcement Learning
•
Updated
May 3, 2023
ThomasSimonini/ppo-AntBulletEnv-v0
Reinforcement Learning
•
Updated
Apr 7, 2022
•
3
•
1
ThomasSimonini/ppo-BreakoutNoFrameskip-v4
Reinforcement Learning
•
Updated
Apr 7, 2022
•
5
•
3
ThomasSimonini/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Aug 28, 2023
•
10
•
14
ThomasSimonini/ppo-PongNoFrameskip-v4
Reinforcement Learning
•
Updated
Apr 7, 2022
•
1
•
1
ThomasSimonini/ppo-QbertNoFrameskip-v4
Reinforcement Learning
•
Updated
Apr 7, 2022
•
1
ThomasSimonini/ppo-SeaquestNoFrameskip-v4
Reinforcement Learning
•
Updated
Apr 7, 2022
ThomasSimonini/ppo-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Apr 7, 2022
•
2
•
3
ThomasSimonini/ppo-Walker2DBulletEnv-v0
Reinforcement Learning
•
Updated
Jul 15, 2022
carlosaguayo/Simonini-ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jan 22, 2022
•
1
mrm8488/a2c-Pong-v0
Reinforcement Learning
•
Updated
Feb 11, 2022
•
1
mrm8488/a2c-PongNoFrameskip-v0
Reinforcement Learning
•
Updated
Feb 12, 2022
osanseviero/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jul 5, 2022
•
1
•
1
sb3/demo-hf-CartPole-v1
Reinforcement Learning
•
Updated
Mar 11, 2024
•
5
•
2
TrabajoAprendizajeProfundo/Trabajo
Reinforcement Learning
•
Updated
Apr 11, 2022
•
1
osanseviero/TEST_COLAB_ppo-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Apr 19, 2022
osanseviero/test_sb3
Reinforcement Learning
•
Updated
May 4, 2022
•
7
sb3/ppo-Pendulum-v1
Reinforcement Learning
•
Updated
Oct 11, 2022
•
291
•
3
osanseviero/TEST2ppo-LunarLander-v3
Reinforcement Learning
•
Updated
May 10, 2022
•
5
SuperSecureHuman/Lunar-Landing-PPO
Reinforcement Learning
•
Updated
May 5, 2022
epsil/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
May 4, 2022
•
1
LidarRL/TEST2ppo-LunarLander-v2
Reinforcement Learning
•
Updated
May 4, 2022
•
1
DBusAI/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
May 4, 2022
•
1
Phaneo/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
May 4, 2022
NorbertRop/PPO-MlpPolicy-LunarLander-v2
Reinforcement Learning
•
Updated
May 4, 2022
CWhy/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
May 4, 2022
•
1
DarthVadar/TEST3ppo-LunarLander-v2
Reinforcement Learning
•
Updated
May 4, 2022
•
4
•
1
Previous
1
2
3
...
100
Next