feat: eval package scaffold with CLI, schemas, types, task loader
CI / check (pull_request) Successful in 1m42s
CI / check (pull_request) Successful in 1m42s
New package @united-workforce/eval (uwf-eval CLI): - CLI skeleton: run/report/diff/list subcommands (stubs) - 5 OCAS schemas: eval-run, judge-frontmatter, judge-upstream, judge-hallucination, judge-token-stats - TaskManifest type + parser/validator for task.yaml - JudgeOutput/JudgeInput types for judge contract - EvalRunPayload/EvalRunConfig/EvalJudgeRecord storage types - 19 unit tests: task loader validation + schema definitions Refs #69
This commit is contained in:
+2
-1
@@ -25,6 +25,7 @@
|
||||
{ "path": "packages/agent-builtin" },
|
||||
{ "path": "packages/agent-mock" },
|
||||
{ "path": "packages/agent-claude-code" },
|
||||
{ "path": "packages/cli" }
|
||||
{ "path": "packages/cli" },
|
||||
{ "path": "packages/eval" }
|
||||
]
|
||||
}
|
||||
|
||||
Reference in New Issue
Block a user