01 Chain-of-thought reasoning prompts for transparent safety critiques
02 Automated self-critique and revision templates based on custom constitutions
03 Hardware-specific guidance for training 7B+ parameter models
04 AI-driven preference evaluation for scalable reward model training
05 3,983 GitHub stars
06 Two-phase alignment workflow covering SL self-critique and RLAIF
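
As a rough illustration of the self-critique and revision templates listed above, the sketch below runs one critique pass and one revision pass per constitutional principle. It is a minimal sketch under stated assumptions, not the project's implementation: the template wording, the `critique_and_revise` function, and the `generate` callable (a stand-in for any text-generation call) are all hypothetical names introduced here.

```python
from typing import Callable, List

# Hypothetical prompt templates; the wording is an assumption for illustration.
CRITIQUE_TEMPLATE = (
    "Response:\n{response}\n\n"
    "Principle: {principle}\n"
    "Think step by step and explain whether the response violates the principle."
)
REVISION_TEMPLATE = (
    "Response:\n{response}\n\n"
    "Critique:\n{critique}\n\n"
    "Rewrite the response so that it satisfies the principle: {principle}"
)


def critique_and_revise(
    response: str,
    constitution: List[str],
    generate: Callable[[str], str],
) -> str:
    """Apply one critique step and one revision step for each principle."""
    for principle in constitution:
        # Ask the model to critique the current response against the principle.
        critique = generate(
            CRITIQUE_TEMPLATE.format(response=response, principle=principle)
        )
        # Ask the model to rewrite the response in light of that critique.
        response = generate(
            REVISION_TEMPLATE.format(
                response=response, critique=critique, principle=principle
            )
        )
    return response


def placeholder_model(prompt: str) -> str:
    # Stand-in for a real model call; returns a tag derived from the prompt size.
    return f"[model output for a {len(prompt)}-character prompt]"


if __name__ == "__main__":
    custom_constitution = ["Avoid harmful advice.", "Be honest about uncertainty."]
    print(critique_and_revise("Initial draft.", custom_constitution, placeholder_model))
```

In a supervised-learning phase, the revised responses would typically be collected as fine-tuning targets; passing `generate` as a parameter keeps the loop independent of any particular model backend.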