Eval package format

Packages include evalkit.yml, README.md, tests, optional scorers, and optional datasets.

evalkit run @evalkit/pii-detection --model anthropic/claude-sonnet-4-6
Edit on GitHub
EvalKit | Open AI Evaluation Marketplace