---
name: training-data-curation
description: Guidelines for creating high-quality datasets for LLM post-training (SFT/DPO/RLHF). Use when preparing data for fine-tuning, evaluating data quality, or designing data collection strategies.
---
# Training Data Curation Guidelines
Best practices for gathering and preparing training data for LLM fine-tuning.
## Data Quality Principles
**Quality over quantity.** Llama 2 used only 27,540 high-quality SFT examples and outperformed models trained on larger noisy datasets [[1]](#references). Focus on clean, diverse, well-formatted data.
**Garbage in, garbage out.** The model will learn patterns from your data—including errors, biases, and formatting issues. Inspect samples manually before training.
**Match the target distribution.** Training data should reflect the tasks and style you want the model to perform. If you want formal responses, don't train on casual chat data.
## Format Requirements
### Supervised Fine-Tuning (SFT)
Use the **messages format** (OpenAI/Anthropic/Tinker standard) [[5]](#references):
```
{"messages": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}]}
```
- Each sample is a complete conversation
- Multi-turn: alternate user/assistant messages
- System prompts optional: `{"role": "system", "content": "..."}`
- JSONL format, one sample per line
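The constraints above (alternating roles, optional leading system prompt, non-empty content) can be checked mechanically. A minimal sketch of such a validator; this is a hypothetical helper, not part of any library:

```python
import json

def validate_sft_line(line: str) -> list[str]:
    """Return a list of problems found in one messages-format JSONL line."""
    try:
        sample = json.loads(line)
    except json.JSONDecodeError:
        return ["not valid JSON"]
    messages = sample.get("messages")
    if not isinstance(messages, list) or not messages:
        return ["missing or empty 'messages' list"]
    if messages[0].get("role") == "system":  # optional system prompt comes first
        messages = messages[1:]
    problems = []
    expected = "user"  # turns must alternate user/assistant
    for msg in messages:
        if msg.get("role") != expected:
            problems.append(f"expected role '{expected}', got {msg.get('role')!r}")
            break
        if not msg.get("content"):
            problems.append(f"empty content in a {expected} turn")
        expected = "assistant" if expected == "user" else "user"
    if not messages or messages[-1].get("role") != "assistant":
        problems.append("conversation should end with an assistant turn")
    return problems
```

Running this over every line before training catches most format mistakes cheaply; an empty return value means the sample is structurally fine.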
### Preference Learning (DPO/ORPO/KTO)
Requires **paired comparisons** [[2]](#references):
```
{"prompt": "...", "chosen": "...", "rejected": "..."}
```
- `chosen` and `rejected` must respond to the same prompt
- Quality difference should be clear and consistent
- Annotator agreement >70% indicates usable samples [[1]](#references)
KTO doesn't require pairs, only binary labels on individual completions [[7]](#references):
```
{"prompt": "...", "completion": "...", "label": true}
```
- `label` is a boolean: `true` marks a desirable completion, `false` an undesirable one
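Because each DPO pair already contains one good and one bad completion, a pair dataset can be split mechanically into KTO's binary format. A sketch, assuming the field names shown in the snippets above:

```python
def dpo_pair_to_kto(pair: dict) -> list[dict]:
    """Split one {prompt, chosen, rejected} pair into two binary-labeled KTO samples."""
    return [
        {"prompt": pair["prompt"], "completion": pair["chosen"], "label": True},
        {"prompt": pair["prompt"], "completion": pair["rejected"], "label": False},
    ]
```

The reverse direction is not possible in general: unpaired binary labels don't tell you which completions share a prompt.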
### Reward Modeling (RLHF)
Needs **ranked responses** [[1]](#references):
```
{"prompt": "...", "responses": ["best", "second", "worst"]}
```
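A ranking of n responses can be expanded into n(n-1)/2 chosen/rejected pairs, which is a common way to reuse ranked data for pairwise training. A sketch, assuming `responses` is ordered best to worst as in the snippet above:

```python
from itertools import combinations

def ranking_to_pairs(sample: dict) -> list[dict]:
    """Expand a best-to-worst ranking into {prompt, chosen, rejected} pairs."""
    responses = sample["responses"]  # assumed ordered best to worst
    return [
        # combinations preserves order, so `better` always outranks `worse`
        {"prompt": sample["prompt"], "chosen": better, "rejected": worse}
        for better, worse in combinations(responses, 2)
    ]
```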
## Quality Checklist
Before training, verify:
- [ ] **No duplicates** — exact and near-duplicate removal [[3]](#references)
- [ ] **No empty fields** — all required fields populated
- [ ] **Consistent format** — schema matches throughout
- [ ] **Appropriate length** — not too short (noise) or too long (truncation)
- [ ] **Clean text** — proper encoding, no HTML/boilerplate artifacts [[8]](#references)
- [ ] **Manual inspection** — reviewed random sample of 50-100 examples
- [ ] **No PII/sensitive data** — unless intentionally included
- [ ] **License verified** — legal to use for training
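Several of these checks can be automated in one pass. A rough sketch for messages-format data; the thresholds are illustrative defaults, not values from the cited papers:

```python
import hashlib
import json
import random

def run_quality_checks(lines, min_chars=20, max_chars=8000):
    """Apply a few checklist items to an iterable of messages-format JSONL lines."""
    seen, kept, dropped = set(), [], []
    for line in lines:
        sample = json.loads(line)
        text = " ".join(m.get("content", "") for m in sample.get("messages", []))
        digest = hashlib.sha256(text.encode()).hexdigest()
        if digest in seen:  # exact-duplicate removal
            dropped.append(("duplicate", sample))
            continue
        seen.add(digest)
        if not all(m.get("content") for m in sample.get("messages", [])):
            dropped.append(("empty field", sample))
            continue
        if not min_chars <= len(text) <= max_chars:  # too short (noise) or too long
            dropped.append(("bad length", sample))
            continue
        kept.append(sample)
    # Manual inspection: surface a random handful for human review
    inspect = random.sample(kept, min(5, len(kept)))
    return kept, dropped, inspect
```

Near-duplicate detection (MinHash) and PII/license checks still need dedicated tooling; this only covers the mechanical items.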
## Common Quality Issues
| Issue | Detection | Fix | Source |
|-------|-----------|-----|--------|
| Duplicates | Hash-based dedup | Remove exact matches, MinHash for near-dupes | [[3]](#references) |
| Boilerplate | Keyword filter | Remove "subscribe", "cookie policy", etc. | [[8]](#references) |
| Repetitive text | N-gram analysis | Flag if <30% unique trigrams | [[4]](#references) |
| Low-quality text | Alpha ratio | Remove if <50% alphabetic characters | [[8]](#references) |
| Wrong language | Language detection | fastText classifier, filter to target | [[3]](#references) |
| Too short | Length check | Minimum 3-5 sentences, 100+ words for documents | [[8]](#references) |
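The trigram and alpha-ratio detections from the table translate directly into code. A sketch using the thresholds above (<30% unique trigrams, <50% alphabetic characters):

```python
def unique_trigram_ratio(text: str) -> float:
    """Share of distinct word trigrams; low values indicate repetitive text."""
    words = text.split()
    trigrams = [tuple(words[i:i + 3]) for i in range(len(words) - 2)]
    return len(set(trigrams)) / len(trigrams) if trigrams else 1.0

def alpha_ratio(text: str) -> float:
    """Share of alphabetic characters; low values indicate markup or number soup."""
    return sum(c.isalpha() for c in text) / len(text) if text else 0.0

def passes_filters(text: str) -> bool:
    # Thresholds from the table: flag <30% unique trigrams, <50% alphabetic
    return unique_trigram_ratio(text) >= 0.3 and alpha_ratio(text) >= 0.5
```

Duplicate and language detection are better handled by dedicated tools (MinHash, fastText), but these two heuristics are cheap enough to run on every sample.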
## Data Sources
**High quality:**
- Curated human annotations [[1]](#references)
- Expert-written examples
- Filtered high-quality web data [[3]](#references)
**Medium quality:**
- Synthetic data from stronger models (distillation)
- Community Q&A with voting signals
- Filtered user-generated content
**Use with caution:**
- Raw web scrapes
- Unfiltered synthetic data
- Data without clear provenance [[6]](#references)
## Sizing Guidelines
| Dataset Size | Use Case | Source |
|--------------|----------|--------|
| 100-1K | Quick experiments, specific behaviors | — |
| 1K-10K | Production SFT, domain adaptation | — |
| 10K-100K | Comprehensive instruction tuning | [[1]](#references) |
| 1M+ preference pairs | Large-scale RLHF | [[1]](#references) |
Llama 2 used ~27K SFT examples and 1M+ preference comparisons [[1]](#references).
## File Format
- **JSONL** — one JSON object per line, human-readable
- **Parquet** — efficient for large datasets, built-in compression [[3]](#references)
- **Sharding** — split files >500MB into chunks
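Sharding JSONL needs only the standard library: splitting at line boundaries keeps every shard valid JSONL on its own. A sketch using the 500 MB guideline as the default:

```python
import os

def shard_jsonl(path: str, max_bytes: int = 500 * 1024 * 1024) -> list[str]:
    """Split a JSONL file into shards of at most max_bytes, at line boundaries."""
    shard_paths, shard, size, idx = [], None, 0, 0
    base, ext = os.path.splitext(path)
    with open(path, "rb") as src:
        for line in src:
            # Start a new shard when the current one would exceed the limit
            if shard is None or size + len(line) > max_bytes:
                if shard:
                    shard.close()
                shard_path = f"{base}-{idx:05d}{ext}"
                shard = open(shard_path, "wb")
                shard_paths.append(shard_path)
                size, idx = 0, idx + 1
            shard.write(line)
            size += len(line)
    if shard:
        shard.close()
    return shard_paths
```

Reading and writing in binary mode preserves bytes exactly, so the concatenation of the shards is identical to the original file.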
## References
1. [Llama 2 Paper](https://arxiv.org/abs/2307.09288) — Touvron et al. (2023). SFT/RLHF data quality practices, 27K SFT examples, >70% annotator agreement threshold
2. [TRL Library](https://huggingface.co/docs/trl/) — HuggingFace trainer implementations for SFT, DPO, KTO, ORPO
3. [FineWeb Paper](https://arxiv.org/abs/2406.17557) — Penedo et al. (2024). Large-scale filtering: MinHash dedup, language detection, quality classifiers
4. [Data-Juicer](https://github.com/alibaba/data-juicer) — Alibaba's quality filtering toolkit with repetition filters, n-gram analysis
5. [Tinker API](https://tinker-docs.thinkingmachines.ai/) — Training API using messages format for SFT, DPO/RLHF support
6. [Data Provenance Initiative](https://arxiv.org/abs/2310.16787) — Longpre et al. (2023). Dataset licensing and attribution audit
7. [KTO Paper](https://arxiv.org/abs/2402.01306) — Ethayarajh et al. (2024). Binary preference learning without pairs
8. [C4/T5 Paper](https://arxiv.org/abs/1910.10683) — Raffel et al. (2020). Foundational filtering: terminal punctuation, min sentences, alpha ratio, boilerplate removal