Manually update task classifications and success outcomes
Override false-positive or false-negative correction detections
View detailed logs of recent task evaluations and session IDs
Recalculate task-specific confidence and autonomy scores in real time
Directly edit local JSONL evaluation records via terminal automation
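
As a rough illustration of what a manual override of a local JSONL evaluation record could look like, here is a minimal Python sketch. The record layout is not documented here, so the field names (`session_id`, `success`, `task_type`), the file name, and the helper `override_record` are all hypothetical stand-ins rather than the tool's actual schema or API.

```python
import json
from pathlib import Path


def override_record(path, session_id, **changes):
    """Apply `changes` to every JSONL record whose session_id matches.

    Hypothetical helper: the field name "session_id" is an assumption,
    not a documented schema. Returns the number of records updated.
    """
    path = Path(path)
    updated = 0
    out_lines = []
    for line in path.read_text(encoding="utf-8").splitlines():
        if not line.strip():
            continue  # skip blank lines
        record = json.loads(line)
        if record.get("session_id") == session_id:
            record.update(changes)
            updated += 1
        out_lines.append(json.dumps(record))
    # Write to a sibling temp file, then swap it in, so an interrupted
    # run does not leave a half-written evaluation log behind.
    tmp = path.with_name(path.name + ".tmp")
    tmp.write_text("".join(item + "\n" for item in out_lines), encoding="utf-8")
    tmp.replace(path)
    return updated
```

A call such as `override_record("evals.jsonl", "abc123", success=True)` would mark that session's task as successful. Any confidence or autonomy scores derived from the file would then need to be recalculated separately.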