Conkurrence: Robust LLM Evaluation & AI Agreement Toolkit