GRPO Fine-Tuning Claude Code Skill | Vision-Language Models