stable-baselines3 by K-Dense-AI

Production-ready reinforcement learning algorithms (PPO, SAC, DQN, TD3, DDPG, A2C) with a scikit-learn-like API. Use it for standard RL experiments, quick prototyping, and well-documented algorithm implementations. It is best suited to single-agent RL with Gymnasium environments. For high-performance parallel training, multi-agent systems, or custom vectorized environments, use pufferlib instead.
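
As a rough illustration of the scikit-learn-like API mentioned above, the following sketch trains PPO on a Gymnasium environment and evaluates it. The environment name (CartPole-v1), the timestep budget, the seed, and the helper name train_ppo_cartpole are assumptions chosen for the example and are not taken from the skill itself. It also assumes the stable-baselines3 and gymnasium packages are installed.

```python
def train_ppo_cartpole(total_timesteps: int = 10_000, seed: int = 0):
    """Train PPO on CartPole-v1 and return (model, mean_reward, std_reward).

    Hypothetical helper for illustration; the defaults are arbitrary.
    """
    # Imported lazily so this file can be loaded even when the
    # dependencies are not installed.
    import gymnasium as gym
    from stable_baselines3 import PPO
    from stable_baselines3.common.evaluation import evaluate_policy

    env = gym.make("CartPole-v1")

    # Construct the model, then call learn(): the construct-then-fit
    # pattern familiar from scikit-learn estimators.
    model = PPO("MlpPolicy", env, seed=seed, verbose=0)
    model.learn(total_timesteps=total_timesteps)

    # Evaluate on the model's own wrapped (vectorized, monitored) env.
    mean_reward, std_reward = evaluate_policy(
        model, model.get_env(), n_eval_episodes=10
    )
    env.close()
    return model, mean_reward, std_reward


if __name__ == "__main__":
    _, mean_reward, std_reward = train_ppo_cartpole()
    print(f"mean reward: {mean_reward:.1f} +/- {std_reward:.1f}")
```

Swapping PPO for another listed algorithm such as A2C or DQN should need only a change to the import and the class name, since CartPole-v1 has a discrete action space. SAC, TD3, and DDPG expect continuous action spaces and would need a different environment.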

Coding
5.2K Stars
629 Forks
Updated Jan 9, 2026, 04:57 PM

Why Use This

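This skill packages K-Dense-AI's guidance for working with stable-baselines3: standard RL experiments, quick prototyping, and single-agent training on Gymnasium environments.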

Use Cases

  • Developing new features in the K-Dense-AI repository
  • Refactoring existing code to follow K-Dense-AI standards
  • Understanding and working with K-Dense-AI's codebase structure

Skill Snapshot

Auto scan of skill assets. Informational only.

Valid SKILL.md

Checks against SKILL.md specification

Source & Community

Skill Version main
Community 5.2K Stars, 629 Forks
Updated At Jan 9, 2026, 04:57 PM

Skill Stats

SKILL.md 299 Lines
Total Files 1
Total Size 0 B
License MIT license