Acerca de
Streamlines the process of training language models by providing comprehensive support for SFT, DPO, GRPO, and reward modeling workflows on Hugging Face Jobs. It automates training script generation using UV and PEP 723, handles hardware selection and cost estimation, integrates real-time monitoring via Trackio, and ensures model persistence by automatically pushing results to the Hub. This skill is ideal for developers who need to perform advanced model alignment and fine-tuning without the complexity of managing local GPU infrastructure.