united-workforce

Author	SHA1	Message	Date
xingyue	1abc3b4cf4	chore: fix all biome lint errors across monorepo - Fix import ordering (organizeImports) across multiple packages - Replace forEach with for...of loops (noForEach) - Replace non-null assertions with fallback values (noNonNullAssertion) - Add biome-ignore comments for justified noExplicitAny usages - Remove parameter properties, use explicit class properties (noParameterProperties) - Fix string concatenation to template literals (useTemplate) - Fix format issues (CSS, TypeScript) - Add tailwindDirectives CSS parser config in biome.json - Replace var with const (noVar) Result: 0 errors, 12 warnings (all cognitive complexity, acceptable)	2026-05-23 18:39:02 +08:00
xingyue	330db43b5f	feat(builtin-agent): persist ReAct loop turns as session JSONL Each turn (assistant response / tool result) is appended to a JSONL file at ~/.uncaged/workflow/sessions/<sessionId>.jsonl during the loop. On completion, the JSONL is read back, each turn is stored as a CAS node, and the detail payload references them as a flat turns[] array in chronological order. The session file is then deleted. Benefits: - Real-time observability: tail -f the JSONL to watch loop progress - Crash recovery: partial JSONL survives process death - Zero write contention: one file per session - Detail stays a flat array for easy consumption by CLI/dashboard Changes: - New session.ts: initSessionDir, appendSessionTurn, readSessionTurns, removeSession - loop.ts: append JSONL each turn instead of accumulating in-memory - detail.ts: reads session JSONL → persists turns to CAS → stores detail - agent.ts: passes storageRoot/sessionId to loop, cleans up session on completion - types.ts: remove index from TurnPayload (order is implicit in JSONL/array) - schemas.ts: sync with type changes Ref: #433	2026-05-23 18:27:28 +08:00
xiaoju	211f38bc8d	fix(claude-code): include edge prompt in agent prompt as Current Instruction buildClaudeCodePrompt was dropping ctx.edgePrompt entirely — the graph transition instruction (e.g. 'Implement the plan') never reached the agent. Now appended as '## Current Instruction' at the end of the prompt.	2026-05-23 09:46:17 +00:00
xingyue	080792a6c0	feat: builtin agent session resume via deterministic message reconstruction (#426 ) - StepRecord adds edgePrompt field (backward compat: defaults to "") - StepNode CAS schema includes edgePrompt - writeStepNode persists ctx.edgePrompt - buildHistory exposes edgePrompt in StepContext - buildBuiltinMessages reconstructs multi-turn moderator↔agent conversation: system = role prompt + output format (stable prefix) per prior visit: user (edgePrompt + inter-step summary) + assistant (output) current: user (edgePrompt + recent summary) - Zero extra persistence — pure function of CAS chain - Stable prefix for LLM prompt cache hits - 10 builtin tests pass, all other package tests pass	2026-05-23 17:34:49 +08:00
xiaomo	9f95956e19	Merge pull request 'fix(builtin): split prompt into system/user messages' (#425 ) from fix/builtin-agent-system-user-split into main	2026-05-23 09:17:13 +00:00
xingyue	44147da419	fix(builtin): split prompt into system/user messages System message = agent identity (role prompt + output format instruction) User message = moderator speech (task + edge prompt + history) This reflects the workflow's core model: moderator speaks to agent via the graph's edge prompt. Previously all content was in a single system message with no user message, causing Claude API 400 errors. - buildBuiltinPrompt now returns { system, user } instead of string - agent.ts sends system + user as separate messages - Tests updated accordingly	2026-05-23 17:15:23 +08:00
xiaoju	bc64f2613b	fix(thread): handle null stderr from execFileSync, increase maxBuffer to 50MB - err.stderr can be null (not just undefined) when child process fails - maxBuffer default (1MB) too small for stream-json verbose output	2026-05-23 08:58:05 +00:00
xiaoju	d16ce44bc3	feat(claude-code): enrich step details with per-turn breakdown Switch from --output-format json to stream-json --verbose to capture per-turn data. Detail now includes: - model name - usage (input/output/cache tokens) - stopReason - turns[] as individual CAS nodes with role, content, tool calls Also addresses PR #421 review fixes: - sessionId guard: skip cache write when sessionId is empty/undefined - silent catch: log resume failures with debug tag 5VKR8N3Q - atomic write: session cache uses temp+rename for crash safety Closes #422	2026-05-23 08:16:47 +00:00
xiaomo	45122bc458	Merge pull request 'fix: disable hermes resume, add claude-code resume support, debate workflow' (#421 ) from test/418-resume-e2e-repro into main	2026-05-23 07:59:53 +00:00
xingyue	cef4db9a87	refactor: remove workspace path sandbox and shell gate - Replace resolvePathInWorkspace with simple resolvePath (no boundary check) - Remove UWF_BUILTIN_ALLOW_SHELL env gate from run_command - Update tests accordingly Per review: sandbox was false security with shell=true, and path restrictions are unnecessary for a trusted agent environment.	2026-05-23 15:50:30 +08:00
xiaoju	1afaeacd57	feat: extract session cache to agent-kit, add resume to claude-code agent Move getCachedSessionId/setCachedSessionId from workflow-agent-hermes into workflow-agent-kit so all agent adapters can share the same session cache logic. Add cross-process session resume to workflow-agent-claude-code: on re-entry (isFirstVisit=false), look up the cached sessionId and use 'claude --resume' to continue with full conversation history. Cache file renamed from hermes-sessions.json to agent-sessions.json to reflect its shared nature. Refs #418	2026-05-23 07:44:02 +00:00
xingyue	deac2336b6	feat: add @uncaged/workflow-agent-builtin package Built-in role agent that uses workflow config models directly, with its own tool-calling run loop. No external agent dependency. - OpenAI-compatible chat completion client with tool_calls support - P0 toolkit: read_file, write_file, run_command - Integrates via createAgent factory from workflow-agent-kit - CAS detail recording for each turn - Path sandboxing and shell opt-in (UWF_BUILTIN_ALLOW_SHELL)	2026-05-23 15:29:55 +08:00
xiaoju	aad2792754	fix(hermes): disable ACP session/resume by default Hermes ACP _restore fails for custom providers — resolve_runtime_provider throws and base_url/api_mode are lost, causing resume to silently create a new session with no history. Prompt then returns empty text or refusal. Disable resume by default. Set UWF_HERMES_RESUME=1 to opt back in. Includes investigation notes in docs/investigations/. Refs #418	2026-05-23 07:23:14 +00:00
scottwei	10642fdc45	Merge pull request 'test: failing e2e test for session resume bug (#418 )' (#419 ) from test/418-resume-e2e-repro into main Reviewed-on: uncaged/workflow#419	2026-05-23 06:49:54 +00:00
xiaoju	3b6aa6525f	test: add failing e2e test for session resume bug (#418 ) Cross-process resume returns empty text on subsequent prompt. This test documents the bug — expected to fail until #418 is fixed.	2026-05-23 06:43:47 +00:00
xiaomo	54631c43c7	docs: update cli-reference with log commands, --count flag, edge prompt concept	2026-05-23 06:32:27 +00:00
xiaoju	7faa8184ae	feat: add uwf log subcommands (list, show, clean) - uwf log list: list log files with sizes - uwf log show --thread <id>: filter by thread ID - uwf log show --process <pid>: filter by process ID - uwf log clean --before <date>: delete old log files - Tests: 12 new tests covering all subcommands Implemented by solve-issue workflow, biome fixes applied manually. Closes #413 Refs #411, #410	2026-05-23 06:23:56 +00:00
xiaoju	816137315e	feat: add uwf log subcommands (list, show, clean) - cmdLogList: list log files with sizes, sorted by date descending - cmdLogShow: filter entries by thread, process, and/or date - cmdLogClean: delete log files older than given date - 12 tests covering all functions and edge cases Fixes #413	2026-05-23 06:21:06 +00:00
xiaoju	9a111d16c7	fix: invalid Crockford Base32 char 'L' in log tag PL_AGENT_DONE Fixes runtime crash on uwf thread step.	2026-05-23 06:13:29 +00:00
xiaoju	d0dc7b5a19	feat: add process-level debug logger (Phase 1) - New ProcessLogger in workflow-util: process-scoped JSONL logger - Entry schema: {ts, pid, tag, msg, thread, workflow} - Storage: ~/.uncaged/workflow/logs/YYYY-MM-DD.jsonl - Auto logs process init info (argv, node version, context) - cli-workflow thread commands fully instrumented: - thread start/step, moderator evaluate, agent spawn/done - thread archived, error paths Refs #411, #412, #410	2026-05-23 06:10:05 +00:00
xiaoju	4b45f4e6d1	feat: replace edgePrompt null check with isFirstVisit (Phase 2) - Add isFirstVisit: boolean to AgentContext - Compute from steps history: !steps.some(s => s.role === role) - hermes.ts: use isFirstVisit for first-entry vs re-entry logic - buildInitialPrompt: always append edgePrompt as Moderator Instruction - edgePrompt is never blanked — always the real moderator instruction - New tests for first-visit, re-entry, and fallback scenarios Refs #405, #407, #404	2026-05-23 04:54:11 +00:00
xiaoju	3d6399c0e3	feat: make edge prompt required (Phase 1) - Transition.prompt: string \| null → string - EvaluateResult.prompt: string \| null → string - AgentContext.edgePrompt: string \| null → string - CLI YAML validation rejects missing prompt - All tests updated Phase 2 will replace edgePrompt === null checks with findLastRoleIndex. Refs #405, #406, #404	2026-05-23 04:28:58 +00:00
xiaoju	638329a562	feat: edge prompt + session resume implementation (#402 ) - buildContinuationPrompt: incremental prompt for role re-entry - buildHermesPrompt: dual-mode (initial vs continuation) - session-cache: thread:role → hermes sessionId mapping - HermesAcpClient.resume(): session/resume JSON-RPC - Fallback: cache miss or resume fail → initial prompt - UWF_NO_RESUME env to skip cache - solve-issue.yaml: reviewer→developer edge prompt - Tests updated for EvaluateResult + continuation prompt Refs #402	2026-05-23 03:57:04 +00:00
xiaoju	1a06e014f5	feat(protocol): add edge prompt to Transition + EvaluateResult (#402 ) - Transition type gains prompt: string \| null - evaluate() returns EvaluateResult { role, prompt } instead of string - normalizeGraph coerces prompt: undefined → null - spawnAgent passes edge prompt via UWF_EDGE_PROMPT env - AgentContext gains edgePrompt field Refs #402	2026-05-23 03:49:15 +00:00
xiaoju	d5d05334f5	fix: ACP client permission handling and process cleanup Two bugs fixed: 1. request_permission messages (JSON-RPC requests with both id+method) were silently swallowed by the response handler, causing hermes to hang waiting for permission approval. Now properly distinguish responses (id only) from server requests (id+method). 2. uwf-hermes process never exited after completing because the hermes ACP subprocess was still alive. Now explicitly close the ACP client after agent completion so the subprocess terminates. 小橘 <xiaoju@shazhou.work>	2026-05-22 14:51:43 +00:00
xiaoju	844f5438fe	fix: replace @agentclientprotocol/sdk with readline-based JSON-RPC The official TS SDK's ndJsonStream hangs indefinitely on prompt() for sessions with 20+ messages (solve-issue planner). Root cause appears to be a stream backpressure issue in the SDK's ReadableStream adapter. Switch back to readline-based line parsing which reliably receives all JSON-RPC responses. Also handle session/request_permission inline (auto-approve, yolo mode equivalent). Ref #398	2026-05-22 14:34:27 +00:00
xiaoju	f90614a622	feat: collect structured turns from ACP session updates UwfAcpClient now tracks all session/update events: - agent_message_chunk → assistant message content - agent_thought_chunk → assistant reasoning - tool_call → pending tool invocation (name + rawInput) - tool_call_update (completed/failed) → assistant tool_call + tool result Messages are accumulated across prompts (same session) and stored via storeHermesSessionDetail, restoring the full structured detail (turns with tool calls, reasoning) that was lost in the initial ACP migration. Ref #398	2026-05-22 13:13:02 +00:00
xiaoju	68af555313	fix: share ACP client across run/continue for session continuity The client is now created once in createHermesAgent() and shared by runHermes and continueHermes closures. This preserves conversation context during frontmatter retry loops — continue() sends a follow-up prompt on the same ACP session instead of starting a new one. Client is cleaned up via process.on('exit'). Ref #398	2026-05-22 13:06:14 +00:00
xiaoju	025695dbe9	refactor: use @agentclientprotocol/sdk instead of hand-rolled JSON-RPC Replace 250-line custom ACP client with official TypeScript SDK. Uses ClientSideConnection + ndJsonStream for stdio transport. Same public API (connect/prompt/close), 115 lines, zero custom protocol code. Ref #398	2026-05-22 12:58:55 +00:00
xiaoju	96584e481f	refactor: replace spawnHermes with HermesAcpClient Remove spawnHermes, spawnHermesChat, spawnHermesResume, parseSessionId, and buildResultFromSession. runHermes and continueHermes now use HermesAcpClient for structured JSON-RPC communication. Session ID comes directly from ACP protocol, eliminating #380 race condition. Agent output collected via streaming chunks instead of session file loading. Phase 2 of RFC #398 Fixes #380	2026-05-22 12:18:14 +00:00
xiaoju	766ec7ddc2	feat: add HermesAcpClient for structured ACP communication Implements JSON-RPC client that communicates with `hermes acp` via stdin/stdout. Replaces fragile stdout/stderr parsing with structured protocol: initialize → session/new → session/prompt → collect chunks. Session ID comes directly from protocol response, eliminating the race condition in #380. Phase 1 of RFC #398	2026-05-22 12:15:09 +00:00
xiaomo	baa2edfa38	Merge pull request 'feat: workflow-agent-claude-code' (#393 ) from feat/391-workflow-agent-claude-code into main	2026-05-22 10:58:18 +00:00
xingyue	4dff320d5c	fix: throw on non-JSON Claude Code output instead of fallback	2026-05-22 18:57:07 +08:00
scottwei	c9fcb15384	Merge branch 'main' into jshang/workflow-dashboard	2026-05-22 10:52:53 +00:00
xiaoju	76fab22827	fix: explicitly forbid extra frontmatter fields in output format instruction buildOutputFormatInstruction now includes explicit language telling agents to output ONLY schema-defined fields and to focus on their role's deliverable. Fixes #394	2026-05-22 10:49:04 +00:00
xingyue	176844d7f5	fix: add sessionId to raw fallback, fix test meta→frontmatter+description	2026-05-22 18:42:27 +08:00
xingyue	31695e89a8	feat: add workflow-agent-claude-code package Claude Code CLI adapter for the workflow engine, mirroring workflow-agent-hermes architecture. Spawns `claude -p` with `--output-format json` for structured output parsing. Refs #391	2026-05-22 18:38:18 +08:00
xiaomo	669875fb46	Merge pull request 'feat: validate model connectivity during uwf setup' (#392 ) from feat/335-setup-validate-model into main	2026-05-22 10:32:01 +00:00
xiaoju	6d94be34a9	feat: validate model connectivity during uwf setup Send a test completion request after configuration to verify the model is reachable. If validation fails, warn the user and suggest trying a different model or checking their settings. Fixes #335	2026-05-22 10:30:39 +00:00
xiaomo	d95fe45a3d	Merge pull request 'feat: add --count/-c flag to uwf thread step' (#390 ) from feat/373-thread-step-count into main	2026-05-22 10:11:13 +00:00
xiaoju	4d47effd39	fix: generate frontmatter instruction dynamically from role schema Replace hardcoded 5-field example with schema-driven generation. Now shows actual enum values, types, and required markers for each role's frontmatter schema. Fixes #389 小橘 <xiaoju@shazhou.work>	2026-05-22 10:03:45 +00:00
xiaoju	67870392ab	fix: dynamic frontmatter field extraction from role schema Replace hardcoded 5-field candidate with schema-driven extraction. Now reads outputSchema properties and picks matching fields from parsed frontmatter, supporting role-specific fields like plan, approved, success. Falls back to standard 5 fields when schema has no properties. Fixes #388 小橘 <xiaoju@shazhou.work>	2026-05-22 09:57:30 +00:00
jiashuang	9316b843f6	init workflow service	2026-05-22 17:46:53 +08:00
xiaoju	487c48effa	fix: revert output protocol changes from #385 Agent CLI outputs plain CAS hash (not JSON), engine parses plain hash. StepOutput no longer carries sessionId — session info is already in CAS detail. Keeps the valuable parts of #385: sessionId in AgentRunResult (process-internal), continue support, and frontmatter retry loop.	2026-05-22 09:39:36 +00:00
xiaoju	f0f840e6e0	fix: StepOutput.sessionId → string \| null, legacy fallback → null	2026-05-22 09:16:13 +00:00
xiaoju	7ff90cef4f	feat: agent session protocol — sessionId in result, continue support, frontmatter retry Breaking changes: - AgentRunResult now requires sessionId field - AgentOptions now requires continue function - Agent CLI outputs JSON {stepHash, sessionId} instead of plain CAS hash - Engine parses JSON output (with legacy CAS hash fallback) New features: - Frontmatter validation retry: if agent output lacks valid frontmatter, engine calls agent.continue() up to 2 times with correction message - Session tracking: sessionId flows from agent → engine → StepOutput - Hermes agent: session parse failure is now a hard error (no raw text fallback) - Hermes agent: supports --resume for continue sessions Closes #384	2026-05-22 09:13:05 +00:00
xiaoju	e62d51d845	Merge remote-tracking branch 'origin/feat/remove-llm-extract' into feat/384-agent-session-protocol	2026-05-22 09:06:24 +00:00
xiaoju	99a2890be2	feat: remove LLM extract fallback, require YAML frontmatter Agent output must contain valid YAML frontmatter matching the role schema. If frontmatter parsing fails, the step fails immediately with a clear error instead of falling back to an LLM extraction that can fabricate values. The extract module remains as a public API export but is no longer used in the agent run loop. Breaking change: agents that relied on LLM extraction to produce valid output will now fail. They must output proper frontmatter.	2026-05-22 08:58:01 +00:00
xiaoju	3b7d0564bb	feat: uwf cas put-text for storing plain text in CAS - Register built-in text schema ({type: 'string'}) alongside workflow schemas - Add cmdCasPutText command: uwf cas put-text <text> - Update CLI reference in workflow-util - Update solve-issue.yaml procedure to use put-text Refs #380	2026-05-22 08:53:27 +00:00
xiaoju	45dacf540b	feat: thread step --count/-c <number> to run multiple steps Add --count/-c flag to 'uwf thread step' for running N steps in one invocation, stopping early if $END is reached. - cmdThreadStep now loops up to count times, delegates to cmdThreadStepOnce - CLI parses -c/--count, defaults to 1 (backward compatible single output) - Validation rejects 0, negative, and non-integer counts - 7 new tests covering CLI parsing and count validation Fixes #373 Co-authored-by: uwf-hermes (solve-issue workflow)	2026-05-22 08:06:26 +00:00

... 3 4 5 6 7 ...

666 Commits