0.1.97 • Published 2d agoCLI

cool-workflow

Licence

BSD-2-Clause

Version

0.1.97

Deps

Size

2.7 MB

Vulns

Weekly

Stars

Summary Dependency Versions

Cool Workflow — your AI report, already proven. Point your agent at any repo or folder and get a saved, cited report you can re-verify offline.

Get a saved, cited report from your AI agent — not a chat message you lose.

cw demo tamper — builds a signed ledger, forges it three ways, catches all three

Point an AI coding agent at a repo (or any folder of docs) and Cool Workflow turns the run into a durable, inspectable workflow: it plans the work, delegates each task to your agent, records and verifies every result, then writes a report where every claim is cited to a file.ts:42.

The model is fuel. CW is the black-box recorder, the dashboard, and the gearbox — never the engine. It never calls a model API, never holds your keys, and never uploads your code. Your agent spends the tokens; CW keeps the books — as plain JSON on your own disk.

Install

npm install -g cool-workflow

Or install with Homebrew

brew tap coo1white/cool-workflow https://github.com/coo1white/cool-workflow
brew install coo1white/cool-workflow/cool-workflow
cw version

Upgrade later with brew update && brew upgrade cool-workflow.

You need: Node.js v18+ and one agent CLI on your machine — claude, codex, gemini, or opencode (all auto-detected). No agent yet? cw demo still works — CW never runs a model itself.

Quick Start

1 · Prove it works — 30 seconds, no agent needed

cw demo tamper
# → builds a real signed ledger, forges it three ways, catches all three offline
# → VERDICT: tamper-evidence holds ✓

2 · Ask a question about your code — one command

cw -q "What are the main risks here?"

CW auto-detects the current repo and the first agent on your PATH. Pin a specific one with a flag (-claude, -codex, -gemini, -deepseek). Point it anywhere — no cd required — or review a remote repo by URL and CW clones, snapshots, and reviews the checkout:

cw -q "What are the security risks?" -dir /path/to/project
cw -q "What are the risks?" --link https://github.com/owner/repo

Not just code. Aim CW at a folder of docs, notes, or papers and it reads them as sources for the same saved, cited report:

cw quickstart research-synthesis --repo /path/to/papers \
  --question "What do these papers conclude?"

3 · Open the report

The command prints the path. Every finding carries a clickable file.ts:42 pointer:

cat .cw/runs/<run-id>/report.md

While it runs you get a calm, Claude-Code-style live view — a compact rolling window of tool calls that updates in place instead of an endless wall — and a clean findings table when it's done.

Why Cool Workflow

Most agent tooling runs a whole task as one long prompt, then hands you a chat message. When the work is long, parallel, or high-stakes, you can't tell what happened or trust the result. CW treats it as a runtime problem — and rests on four commitments:

Commitment	What it means for you
Model as fuel, not engine	CW never calls a model API. Execution is always delegated to your agent, so the backend is swappable and your credentials and code never leave your machine.
Evidence-gated decisions	Every adopted result records its basis, authority, rationale, and the alternative it beat. Missing evidence doesn't slip through — it fails closed to a visible `unexplained` state.
Deterministic, local replay	Every step is plain JSON under `.cw/runs/<id>/` — read it, diff it, resume it, replay it. No hidden database; the runtime never guesses success.
Vendor-neutral by design	One source-of-truth manifest generates every vendor adapter (Claude, Codex, …) over a shared CLI + MCP runtime, with a fail-closed drift check. No lock-in, no forked logic.

How It Works

CW is a small TypeScript tool with zero runtime dependencies. It drives your agent over a repo — or any folder — in saved, replayable stages, writing everything to disk as inspectable files. It never imports a model SDK or stores an API key.

The CW pipeline: ask, plan, dispatch (delegated to your agent), verify (evidence gate, fails closed), commit verified state, and a saved, cited, signed report — every step recorded as durable .cw/ JSON.

ask simple → run simple → verify simple → resume simple

What You Can Run

Workflow	What it produces
`architecture-review`	Map a repo, rank risks, and back every claim with `file:line` evidence
`pr-review-fix-ci`	Review a PR or branch, diagnose CI, and propose + verify fixes
`research-synthesis`	Answer a question over a local folder of files — your docs, notes, or papers
`release-cut`	Run a gated, reviewed release with dry-run evidence

cw app list            # see everything installed
cw doctor              # check your setup    →    cw fix   shows the fix commands

Multi-agent, when you need it. Fan work out across agents with built-in topologies, compose flows (a task can run a whole child workflow with subWorkflow, or a loop() phase can iterate until a predicate or token budget says stop), and re-run fast — cw run <app> --drive --incremental reuses every step whose inputs didn't change.

Built-in multi-agent topologies: map-reduce (fan out, fold in), debate (argue then draw a verdict), and judge-panel (N independent judges score one candidate).

Can You Trust the Report?

CW doesn't run the model — it keeps the books. Your agent signs its findings (ed25519), and cw report verify-bundle checks — offline, with nothing but the public key — that every signed finding is in the report unaltered. Edit a finding, in the report or in the agent's own result, and the check fails. CW holds no private key: the agent signs, CW only verifies.

cw demo tamper                                  # proves it in 30s — edits a signed result, watch it fail
cw -q "…" --bundle                              # seal a run into one portable file
cw report verify-bundle report.cwrun.json       # anyone can re-check it offline, with just the file

This attests the agent's signed findings — not that nothing else was added or that nothing was left out. For exactly what is and isn't proven, see the Trust Model.

Use It From Your Editor

CW exposes the same runtime over MCP — Claude Desktop, Cursor, and VS Code call CW as a tool, so your agent can plan a run, drive it, and verify a report without leaving the editor. CLI and MCP share one registry and are parity-checked. See the Wiki.

Troubleshooting

Problem	Fix
No agent found	`cw doctor` — shows which agents are on your machine
`status: blocked`	Set `CW_AGENT_COMMAND=builtin:claude` or pass `-claude`
`claude: command not found`	Install Claude Code and run again
Where is my report?	`<repo>/.cw/runs/<id>/report.md`

Docs & Wiki

New here? Start with the Wiki → Getting Started · Mental Model · Glossary · Trust & Audit

Building on CW? See the Getting Started doc, Project Index, and CLI MCP Parity.

CW dogfoods its own release process — every cut runs the release-cut workflow against this repo.

License

BSD-2-Clause. Built by COOLWHITE LLC.