stable-baselines3 by K-Dense-AI

Production-ready reinforcement learning algorithms (PPO, SAC, DQN, TD3, DDPG, A2C) with scikit-learn-like API. Use for standard RL experiments, quick prototyping, and well-documented algorithm implementations. Best for single-agent RL with Gymnasium environments. For high-performance parallel training, multi-agent systems, or custom vectorized environments, use pufferlib instead.

Coding

5.2K Stars

629 Forks

Updated Jan 9, 2026, 04:57 PM

Why Use This

This skill provides specialized capabilities for K-Dense-AI's codebase.

Use Cases

Developing new features in the K-Dense-AI repository
Refactoring existing code to follow K-Dense-AI standards
Understanding and working with K-Dense-AI's codebase structure

Install Guide

2 steps

1

Download Ananke

Skip this step if Ananke is already installed.
2

Install inside Ananke

Click Install Skill, paste the link below, then press Install.

https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/stable-baselines3

Skill Snapshot

Auto scan of skill assets. Informational only.

Valid SKILL.md

Checks against SKILL.md specification

Source & Community

Repository claude-scientific-skills

Skill Version

main

Community

5.2K 629

Updated At Jan 9, 2026, 04:57 PM

Skill Stats

SKILL.md 299 Lines

Total Files 1

Total Size 0 B

License MIT license

Source

GitHub Repository ↗ Commit main ↗ skill.extrachatgpt.com ↗