Eval package format
Packages include evalkit.yml, README.md, tests, optional scorers, and optional datasets.
evalkit run @evalkit/pii-detection --model anthropic/claude-sonnet-4-6Edit on GitHub
Packages include evalkit.yml, README.md, tests, optional scorers, and optional datasets.
evalkit run @evalkit/pii-detection --model anthropic/claude-sonnet-4-6Edit on GitHub