AIWG training-complete framework β corpus-to-dataset pipeline with SKILL.md agentic surface and optional Python runtime backend. Marketplace plugin for AIWG.
-
Updated
Apr 16, 2026 - Python
AIWG training-complete framework β corpus-to-dataset pipeline with SKILL.md agentic surface and optional Python runtime backend. Marketplace plugin for AIWG.
Deepseek-Dataset-Generator creates conversational datasets for LLM fine-tuning via DeepSeek API. Supports various formats (ChatML, ShareGPT, Alpaca, JSON, CSV), easy configuration via YAML and detailed logs. Ideal for generating realistic and customized data quickly.
Add a description, image, and links to the alpaca-format topic page so that developers can more easily learn about it.
To associate your repository with the alpaca-format topic, visit your repo's landing page and select "manage topics."