Facilitates human-in-the-loop judgment of AI trace outputs and promotes high-quality results back into evaluation datasets.
This skill streamlines the post-evaluation workflow for AI models by providing a structured environment for judging flagged trace outputs. It bridges the gap between raw execution logs and high-quality datasets through an interactive Q&A protocol that supports binary, categorical, and continuous scoring. By allowing developers to perform deep error analysis, triage pending review queues, and record judgment rationale, this skill ensures that only validated outputs are promoted back into datasets for future testing, fine-tuning, or benchmarking.
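As a rough illustration of the three scoring modes mentioned above, the sketch below shows how a single judgment could be represented and validated. The names (Judgment, ScoreType, validate) are hypothetical and do not reflect the skill's actual API.

```python
# Minimal sketch of binary / categorical / continuous scoring for one trace.
# All names here are illustrative assumptions, not the skill's real interface.
from dataclasses import dataclass
from enum import Enum
from typing import Union


class ScoreType(Enum):
    BINARY = "binary"            # pass / fail
    CATEGORICAL = "categorical"  # e.g. "correct", "hallucination", "formatting"
    CONTINUOUS = "continuous"    # e.g. a 0.0 - 1.0 quality score


@dataclass
class Judgment:
    trace_id: str
    score_type: ScoreType
    score: Union[bool, str, float]
    rationale: str  # free-text note recorded alongside the score


def validate(judgment: Judgment) -> None:
    """Reject scores whose Python type does not match the declared score type."""
    expected = {
        ScoreType.BINARY: bool,
        ScoreType.CATEGORICAL: str,
        ScoreType.CONTINUOUS: float,
    }[judgment.score_type]
    if not isinstance(judgment.score, expected):
        raise TypeError(f"{judgment.score_type.value} score must be {expected.__name__}")


# Example: a continuous judgment with its rationale kept for the audit trail.
validate(Judgment("trace-42", ScoreType.CONTINUOUS, 0.8, "Accurate but verbose"))
```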
Key Features
01 Context-aware judgment options derived from evaluation metadata
02 Support for detailed audit trails with judgment notes and rationale
03 Automated promotion of reviewed traces to curated datasets (see the sketch after this list)
04 Multi-run queue triage and status-based grouping
05 Interactive Q&A protocol for structured human judgment
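The sketch below illustrates status-based grouping and promotion of accepted reviews into a JSONL dataset. The record fields and the promote_reviewed helper are assumptions for illustration, not the skill's real implementation.

```python
# Illustrative sketch: group reviews by status, then append accepted ones to a
# curated JSONL dataset while keeping the rationale as an audit trail.
import json
from collections import defaultdict
from pathlib import Path


def promote_reviewed(reviews: list[dict], dataset_path: Path) -> dict[str, int]:
    """Assumes each review dict carries: trace_id, status ("accepted" /
    "rejected" / "pending"), output, and a free-text rationale.
    Returns per-status counts, useful for triaging what remains in the queue."""
    by_status: dict[str, list[dict]] = defaultdict(list)
    for review in reviews:
        by_status[review["status"]].append(review)

    with dataset_path.open("a", encoding="utf-8") as out:
        for review in by_status.get("accepted", []):
            out.write(json.dumps({
                "trace_id": review["trace_id"],
                "output": review["output"],
                "rationale": review["rationale"],
            }) + "\n")

    return {status: len(items) for status, items in by_status.items()}
```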
Use Cases
01 Building gold-standard datasets from successful production traces
02 Conducting manual error analysis on flagged model evaluation runs
03 Triaging and scoring large review queues across multiple agent projects