feat: eval package scaffold with CLI, schemas, types, task loader
CI / check (pull_request) Successful in 1m42s

New package @united-workforce/eval (uwf-eval CLI):

- CLI skeleton: run/report/diff/list subcommands (stubs)
- 5 OCAS schemas: eval-run, judge-frontmatter, judge-upstream,
  judge-hallucination, judge-token-stats
- TaskManifest type + parser/validator for task.yaml
- JudgeOutput/JudgeInput types for judge contract
- EvalRunPayload/EvalRunConfig/EvalJudgeRecord storage types
- 19 unit tests: task loader validation + schema definitions

Refs #69
This commit is contained in:
2026-06-04 23:42:16 +00:00
parent b94234652a
commit 99619d85db
21 changed files with 675 additions and 1 deletions
+2 -1
View File
@@ -25,6 +25,7 @@
{ "path": "packages/agent-builtin" },
{ "path": "packages/agent-mock" },
{ "path": "packages/agent-claude-code" },
{ "path": "packages/cli" }
{ "path": "packages/cli" },
{ "path": "packages/eval" }
]
}