01Identifies benchmark, dataset, and metric tokens for technical evaluation sections
0288 GitHub stars
03Captures explicit limitation and failure hooks to ensure balanced reporting
04Generates structured JSONL outputs organized by H3 subsection headers
05Extracts numeric data including percentages, scores, and counts from evidence sources
06Automates citation mapping against BibTeX references for factual accuracy