GoodMemory
Language: English | 简体中文
GoodMemory is a memory layer for AI products and coding agents.
It gives chat apps, copilots, and agent hosts a durable user/project memory loop: write selected facts, retrieve the right context, inject it into the next turn, audit what happened, and delete it when it is wrong.
GoodMemory is not an LLM, agent framework, vector database, or generic RAG system. It is the product memory layer between your app or installed agent host and the model runtime.
What You Get
- Durable memory API:
remember,recall,buildContext,feedback,forget,exportMemory, anddeleteAllMemory. - Installed agent memory for Codex and Claude Code through
goodmemory setup, managed hooks, installed Codex pre-action,goodmemory status, read-only MCP, and opt-in writeback. - Public write customization with
GoodMemoryConfig.remember,RememberProfile,rememberRules,RememberInput.annotations, and named extractor ids. - Package exports for
goodmemory,goodmemory/ai-sdk,goodmemory/host, andgoodmemory/httpthrough compileddistartifacts and TypeScript declarations. - Local-first storage: Bun gets durable SQLite by default; explicit Postgres, injected adapters, and embedding providers can be added when needed.
- Evaluation and release evidence paths for deterministic tests, live evals, provider-backed evals, package smoke tests, and quality gates.
Benchmark Results
GoodMemory separates gate-verified public claims from internal research
evidence. A number may appear in the public-claims table only after
gate:public-benchmark-claim --strict passes for its committed declaration:
complete coverage, executionFailures: 0, a no-memory baseline, deterministic
scoring or an independent judge, verified dataset source and license, and a
reproducible run (commit + command + package version).
Public claims (gate-verified)
| Benchmark | Primary metric | GoodMemory result | Baseline / reference | Claim declaration |
|---|---|---|---|---|
| LongMemEval full 500 | judge-free deterministic-subset answer accuracy (same-model semantic judge excluded by construction) | 0.720 (360/500) with goodmemory-rules-only; +65.2pt over no-memory; evidence-session recall 0.9543 |
no-memory baseline 0.068 (34/500; 30 of the 34 are bare abstentions) | longmemeval.json |
| MemoryAgentBench (CR, TTL) | answer accuracy — deterministic, judge-free | CR 0.959, TTL 0.767 | no-memory ablation 0.000; published single-hop CR ceiling ~0.60 | memoryagentbench.json |
The LongMemEval claim is judge-free, replacing an earlier internal with-judge
number (0.908) that is superseded and not claimable. A case counts as correct
only when a deterministic method scores it (abstention / exact / contains /
expected_alternative / numeric_count); the eval pipeline's same-model semantic
judge (gpt-5.5 judging gpt-5.5) is excluded by construction — with it, the
diagnostic overall accuracy is 0.896, reported for transparency but not
claimed. The claimed 0.720 (360/500, executionFailures: 0, v0.3.5) uses the
embedding-free goodmemory-rules-only profile; abstention contributes only 28
of the 360 correct answers, while the no-memory baseline's 0.068 is mostly bare
abstention (30 of its 34 correct), so the +65.2-point lift is the memory
system's contribution. Judge-free refers to scoring — answers are still
generated by gpt-5.5. Full provenance is in the
claim declaration.
The MemoryAgentBench claim is GoodMemory's first public benchmark claim, and it
is deliberately scoped. Only Conflict Resolution (CR 0.959) and Test-Time
Learning (TTL 0.767) are claimed: a no-memory ablation scores both 0.000 (the
questions are unanswerable without GoodMemory's retrieved consolidated fact /
in-context demos), so these are genuine memory contributions, scored
deterministically with no LLM judge (executionFailures: 0, 259 questions).
Accurate Retrieval and Long-Range Understanding are EXCLUDED: the no-memory
ablation scores them higher (AR 0.926 vs 0.890; LRU 0.632 vs 0.518), so they
are multiple-choice leaks where the model answers from the candidates in the
question, not memory wins. CR/TTL measure answer-time current-value resolution
and in-context retrieval, not general retrieval recall.
Internal diagnostics (not public claims)
These rows are research and hardening evidence, not claims. Each is blocked
from public claim by its own committed declaration, which records the exact
blockers. The underlying run reports live under gitignored reports/ and are
reproducible from the run commands recorded in the declarations.
| Benchmark | Internal number | Why it is not claimable | Declaration |
|---|---|---|---|
| ImplicitMemBench Full-300 | overall 213.26 / 300 (0.7109) with goodmemory-distilled-feedback+controlled-priming vs 128 / 300 (0.4267) upstream-chat baseline |
same-model judge (gpt-5.5 judging gpt-5.5) on most scorer families; dataset source/license unverified | implicitmembench.json |
| BEAM (100K, rules-only retrieval diagnostic) | recall fitted 0.9621 (all narrow gates on; 355 evidence questions of 400) vs generalization 0.6822 (all 151 narrow gates disabled); measured answer checkpoint 278 / 400 (0.695) vs 224 / 400 (0.56) pre-evidence-pack, executionFailures: 0 |
same-model judge on the answer checkpoint; the 28-pt recall gap is scenario-fitted, not general retrieval (see ADR-005); generalization below the ≥0.75 target | beam.json |
| LoCoMo | representative conv-1 live run 0.020 (4/199); exact gold-turn recall ~0.07-0.08, zero-retrieval ~0.92 on the lexical/rules substrate | banked retrieval boundary — answers are recall-bound; semantic candidate-generation retrieval work is in progress | locomo.json |
Per ADR-005 BEAM recall is
a dual metric: a fitted figure (all narrow gates on) and a generalization
figure (all narrow gates disabled). Much of the fitted recall comes from
scenario-fitted query classifiers tuned to specific BEAM cases, which do not
fire on unrelated user data; treat the generalization figure as the floor for
out-of-distribution inputs. The BEAM answer checkpoint (0.695, general
answer-time evidence pack src/answer/evidencePack.ts) additionally rides on
that fitted recall and uses the answer model as its own judge, so it stays
internal while answer-gap hardening and independent-judge scoring mature. Use
task-board/00-README.txt for execution order and
docs/GoodMemory-Current-Status-and-Evidence.md
for claim boundaries.
Choose Your Integration Path
GoodMemory has three primary product entry points. They are not the only APIs:
lower-level surfaces such as goodmemory/host, custom stores, eval tooling, and
runtime helpers support these paths. They are the README-level ways to decide
how to start.
1. Build Memory Into An Agent, Chatbox, Or Copilot
Use this when you own the product server and the model call. Install
goodmemory in your Node/Bun service, create one memory instance, and pass a
stable scope such as userId, workspaceId, sessionId, and optionally
agentId.
The request flow is:
- Before the model call, run
recall()for the current scope and query. - Run
buildContext()to turn recall hits into a prompt fragment. - Call your model with that memory context.
- After the response, write selected signals with
memory.jobs.enqueueRemember()orremember(). - Use
feedback(), targetedreviseMemory(),forget(), andexportMemory()for correction, deletion, and user audit.
If your server already uses Vercel AI SDK, use goodmemory/ai-sdk to wrap
generateText() or streamText() instead of hand-wiring the whole loop. Start
with App Quickstart, then read
AI SDK Adapter if you use AI SDK.
2. Add Memory To Codex Or Claude Code
Use this when you want an installed coding agent to remember project and user
context without changing the agent itself. Install the global CLI and run
goodmemory setup.
The installed-host flow is:
session-startanduser-prompt-submithooks recall scoped memory.- GoodMemory injects a compact context block into Codex or Claude Code.
- Codex
pre-tool-usecan deny or redirect risky Bash throughgoodmemory codex actionon the same installed config and storage path. - Read-only MCP gives trace, context, stats, and artifact inspection.
- Optional writeback stays
offby default; useobserveto inspect candidates before moving toselectivedurable writes.
Start with Quickstart: Codex Or Claude Code Memory. Use Installed Host Writeback when you are ready to review or enable writes.
3. Deploy GoodMemory As A Backend Memory-Layer Service
Use this when another backend should call GoodMemory as a service, especially when the product backend is Python/FastAPI or when a product such as OneLife should keep memory server-side instead of bundling GoodMemory into a mobile or browser client.
Deploy the packaged goodmemory-http-bridge in a Node/Bun sidecar. Your backend
then calls:
/memory/recall-contextbefore its own model call/memory/rememberafter a user-confirmed or product-approved signal/memory/feedbackfor procedural corrections/memory/exportand/memory/forgetfor audit and deletion/memory/revisefor targeted correction by explicit memory id
Your service still owns auth, product policy, UI, and model orchestration. GoodMemory owns memory storage, recall, context assembly, write governance, and audit/export/delete behavior. Start with Python/FastAPI HTTP Bridge, then check Runtime And Storage for SQLite/Postgres choices.
During a model turn, GoodMemory does four jobs:
- Resolve memory for the current
scope. - Build a prompt-ready context fragment.
- Record selected post-response signals when your app or host allows it.
- Provide audit, correction, export, and deletion paths for user control.
Your app or installed agent still owns auth, UI, model calls, and product policy. GoodMemory owns the memory loop and storage boundary.
Install
GoodMemory 0.3.6 has two normal install paths.
Use the global CLI when you want memory enhancement inside installed coding agents:
npm install -g goodmemory@0.3.6
goodmemory setup
goodmemory status
Use the package dependency when you are building an application:
npm install goodmemory@0.3.6
If you want to type goodmemory directly, install the global CLI.
A project-local npm install goodmemory@0.3.6 does not put goodmemory on your shell PATH.
Use npx goodmemory, npm exec -- goodmemory, or ./node_modules/.bin/goodmemory
from that project instead.
npx goodmemory -V
Bun consumers can install it directly:
bun add goodmemory@0.3.6
Tarball verification for release rehearsal:
npm install ./goodmemory-0.3.6.tgz
The installed CLI is Bun-backed for non-version commands. The package bin is
Node-safe for goodmemory -V and goodmemory --version; other commands
delegate to Bun.
Quickstart: Codex Or Claude Code Memory
For most users, the first useful path is installed-host memory.
npm install -g goodmemory@0.3.6
goodmemory setup
goodmemory status
goodmemory setup detects Codex and Claude Code, installs managed host wiring,
and asks for:
- host:
codex,claude, or both detected hosts - activation: global, current workspace, or manual opt-in
- GoodMemory user id
- optional Postgres storage
- optional embedding provider
- optional LLM extraction provider
- writeback mode:
off,observe, orselective
Interactive setup defaults to global activation with workspace-derived
isolation and recommends observe for new host configs so users can review
writeback candidates before enabling durable writes. Existing host configs keep
their current writeback mode when the interactive prompt default is accepted.
Scripted installs stay safe with --json or --no-interactive.
Skipping provider setup is valid: GoodMemory still works with local SQLite and
rules-only extraction.
Useful commands:
goodmemory setup --host codex
goodmemory status codex --workspace-root .
goodmemory enable codex --workspace-root . --writeback observe
goodmemory enable codex --workspace-root . --writeback selective
goodmemory disable codex --workspace-root .
goodmemory uninstall codex
The installed host path has four pieces:
- Managed pre-action for Codex:
pre-tool-usecan deny or redirect risky Bash andgoodmemory codex actionexecutes the vetted first step on the same installed config, storage, provider, and scope path used by recall and writeback. - Recall injection:
session-startanduser-prompt-submithooks callrecall()plusbuildContext()and fail open if config, parsing, or storage is unavailable. - Deep inspection:
goodmemory mcp serve --host codexandgoodmemory-mcp --host codexexpose read-only context, trace, stats, and artifact tools. - Optional writeback:
session-stopand explicit writeback commands can turn selected after-response signals into durable memory.
Installed Host Writeback
Installed Host Writeback is opt-in. Runtime config defaults and new scripted
installs remain off unless the user explicitly chooses a writeback mode.
Existing configs keep their current writeback mode when no explicit override is
provided. New interactive installs recommend observe so candidates are visible
before durable writes are enabled.
Use observe before selective:
goodmemory enable codex --writeback observe
goodmemory codex writeback --json
goodmemory enable codex --writeback selective
goodmemory codex writeback --json
Writeback rules:
off: no after-response memory extraction.observe: store local bounded/redacted candidate previews for review without raw transcripts or durable memory writes.selective: write selected candidates through the publicremembersurface.- Raw transcripts are not persisted as memory.
- Assistant-originated durable memory is blocked unless the host confirms or verifies it and the active profile allows it.
remember: "never"masks annotated content before deterministic, custom, or assisted extraction.
Audit and undo:
goodmemory codex writeback inspect --json
goodmemory codex writeback forget --event-id <event-id> --review-outcome false_write
The audit ledger stores bounded redacted candidate previews, candidate keys,
typed linked record ids, status, reasons, host, mode, timestamps,
scope/session digests, and optional manual review metadata. It does not store
raw host payloads. forget --event-id deletes linked memory/evidence records
through public forget() before marking durable audit events forgotten; for
observe-only events it marks the candidate dismissed without calling
forget().
Claude Code has deterministic CLI parity for hook and writeback commands; Codex is the canonical live-evidence path.
Scripted Host Install
Use goodmemory install <host> when you want a fully non-interactive setup:
goodmemory install codex \
--user-id <user-id> \
--activation-mode global \
--writeback observe \
--storage-provider postgres \
--storage-url "postgres://user:pass@host:5432/goodmemory" \
--embedding-provider openai \
--embedding-model text-embedding-3-small \
--embedding-api-key <key> \
--llm-provider openai \
--llm-model gpt-4o-mini \
--llm-api-key <key> \
--no-interactive
Managed config lives under ~/.goodmemory/<host>.json. Re-running install with
provider flags updates the same config and keeps MCP/hook registration
idempotent. Package uninstall does not delete ~/.goodmemory, repo-local
.goodmemory, local SQLite files, or remote Postgres data. Use
goodmemory uninstall <host> to remove managed host wiring, and use
goodmemory forget ... or explicit storage deletion to remove memory data.
App Quickstart
Use the root package when you are building a chatbox, copilot, or product agent. The recommended Node service path is the same thin loop used by the Express and Fastify examples. A longer walkthrough lives in docs/GoodMemory-15-Minute-App-Integration.md.
import type { GoodMemoryTraceSpan } from "goodmemory";
import { createGoodMemory } from "goodmemory";
const traceSpans: GoodMemoryTraceSpan[] = [];
const memory = createGoodMemory({
observability: {
traceSink: {
emit(span) {
traceSpans.push(span);
},
},
},
});
const scope = {
userId: "u-1",
workspaceId: "workspace-a",
sessionId: "s-1",
};
const userMessage = "Remember that the migration rollout is blocked on QA signoff.";
// Call startSession once when the product opens a new session. For later turns
// with the same sessionId, append to the existing runtime state instead.
await memory.runtime.startSession({ scope });
await memory.runtime.appendMessage({
scope,
message: {
role: "user",
content: userMessage,
},
});
const recall = await memory.recall({
scope,
query: "What should the assistant know before replying?",
retrievalProfile: "general_chat",
});
const context = await memory.buildContext({
recall,
output: "system_prompt_fragment",
});
const assistantText = await callYourModel({
memoryContext: context.content,
userMessage,
});
await memory.runtime.appendMessage({
scope,
message: {
role: "assistant",
content: assistantText,
},
});
const writeJob = await memory.jobs.enqueueRemember({
scope,
messages: [
{
role: "user",
content: userMessage,
},
{
role: "assistant",
content: assistantText,
},
],
idempotencyKey: "turn-1",
reason: "post_response_memory_write",
});
const drained = await memory.jobs.drain({ maxJobs: 1 });
const committedJob =
drained.jobs.find((job) => job.jobId === writeJob.jobId) ?? writeJob;
console.log({
traceCount: traceSpans.length,
writeJobId: writeJob.jobId,
writeJobStatus: committedJob.status,
});
async function callYourModel(input: {
memoryContext: string;
userMessage: string;
}): Promise<string> {
void input.memoryContext;
return `Got it. I will keep that in mind: ${input.userMessage}`;
}
The core memory loop is intentionally small:
remember()writes selected user, app, or host signals.recall()retrieves scoped memory for a query.buildContext()turns recall hits into a prompt fragment or JSON payload.feedback()records explicit corrections and procedural preferences.forget()deletes wrong or obsolete memory.
For production app integrations, the recommended turn loop adds the governed runtime layer around that core:
memory.runtime.startSession()andmemory.runtime.appendMessage()track current-session state without making raw transcripts durable memory.memory.jobs.enqueueRemember()schedules after-response memory writes with idempotency and visible job status.memory.jobs.drain()commits queued writes in this in-memory scheduler. In a production service, run draining in your worker or request-adjacent job loop.GoodMemoryConfig.observability.traceSinkreceives redaction-safe traces for remember, recall, context, revise, forget, export, and job events.memory.reviseMemory({ target: { memoryId } })corrects a known memory by explicit id, not by fuzzy text selection.exportMemory()gives the user an audit/export path.
Runtime archive persistence is off by default. If you call
memory.runtime.endSession({ scope, archive: "off" }), session state is
cleared without writing an archive. If you opt into archive persistence, keep it
summary-only and never treat raw transcripts as the default memory source.
For server integrations, start with the thin examples:
examples/express-chat-server.ts or
examples/fastify-chat-server.ts.
For Python/FastAPI backends, use the packaged goodmemory-http-bridge path
described below.
Opt-In Recall Tuning: Multi-Hop, Offline Embedding, And Conversational Extraction
The knobs below are optional and conservative by design. Default recall is single-pass and rules-only, and default extraction is unchanged; nothing happens unless you opt in.
Opt-in multi-hop recall
recall() is single-pass by default. Pass multiHop: true for an opt-in
two-pass retrieval: GoodMemory runs the query, extracts bridge entities named in
the first-pass evidence, expands the query with them, and runs a second pass.
const recall = await memory.recall({
scope,
query: "Who manages the project Alice started?",
multiHop: true,
});
Use it when the answer needs an entity that only the first hop names (hop 1 finds "Alice started Project Atlas"; hop 2 needs "who manages Project Atlas").
- It is opt-in. Default recall stays single-pass; leaving
multiHopunset changes nothing. - It is not a general semantic retriever. It bridges named entities lexically; it does not rank by meaning.
- It can add noise when first-pass recall is weak: if hop 1 surfaces the
wrong evidence, the extracted bridge entities are wrong and the expanded query
dilutes recall. Measured on LoCoMo (where base retrieval is very low)
multiHophurt recall, so do not reach for it to fix conversational / phrasing-gap retrieval — that needs real semantic retrieval, not multi-hop bridging.
Offline local embedding adapter
createLocalEmbeddingAdapter() is a deterministic, offline, dependency-free
embedding adapter (hashed character-n-gram vectors). Inject it for
lexical/morphological tie-breaking without configuring an embedding provider:
import { createGoodMemory, createLocalEmbeddingAdapter } from "goodmemory";
const memory = createGoodMemory({
adapters: { embeddingAdapter: createLocalEmbeddingAdapter() },
});
- It is not neural semantic retrieval. The vectors are hashed lexical features, so they break ties between lexically similar candidates; they do not understand meaning.
- Do not use it to claim a semantic benchmark improvement. It cannot bridge a question-to-text phrasing gap that surface lexical overlap already misses.
- For real semantic ranking, configure a neural embedding provider via
GOODMEMORY_EMBEDDING_*instead.
Opt-in conversational fact extraction
By default, assisted extraction (when a providers.extraction model is
configured) pulls durable product memory — profiles, preferences, references,
and facts. Set providers.extraction.mode: "conversational" to instead
decompose dialogue into self-contained, coreference-resolved, entity- and
date-normalized atomic claims at write time, so later retrieval matches a
normalized fact instead of a raw conversational turn.
const memory = createGoodMemory({
providers: {
extraction: {
provider: "openai",
model: "gpt-5.5",
apiKey: process.env.GOODMEMORY_ASSISTED_EXTRACTOR_API_KEY!,
baseURL: process.env.GOODMEMORY_ASSISTED_EXTRACTOR_BASE_URL,
mode: "conversational",
},
},
});
Use it for chat/agent products where memory comes from multi-turn conversation and questions are phrased differently from how things were said ("Who is the user's manager?" vs. "yeah my boss Dana signed off").
- It is opt-in. Leaving
modeunset (or omittingproviders.extraction) keeps the default extraction behavior; the recall ranking path is untouched. - It is a write-time LLM pass: it uses your configured chat model, so it adds extraction latency and token cost, and like any LLM step it can drop or misphrase a claim. Raw turns remain the ground truth.
- It is not semantic retrieval. It normalizes the stored text so lexical retrieval has a better surface to match; it does not rank by meaning. It is the embedding-free lever for the conversational phrasing gap, not a replacement for a neural embedding provider.
- Do not quote a benchmark number from it without held-out validation, and do not tune the extraction prompt to a specific benchmark's phrasing.
Runtime And Storage
createGoodMemory({}) follows a local-first auto-storage contract:
- Explicit
storage.providerwins when supplied. - Without explicit storage, GoodMemory uses Postgres only when a configured target can bootstrap the GoodMemory backend.
- On Bun, zero-config durable storage is local SQLite at
./.goodmemory/memory.sqlite. - On Node runtimes without the built-in local SQLite adapter, zero-config storage falls back to in-memory.
- Unsupported explicit built-in
sqliteorpostgresselections are reported as unavailable rather than mislabeled durable. - Injected
documentStore,sessionStore, orvectorStoreadapters are reported as adapter-defined storage. - Without
GOODMEMORY_EMBEDDING_*, runtime behavior remainsrules-only. - Supported local runtimes can use
sqlite-vssfor SQLite semantic indexing; unsupported runtimes keep durable non-accelerated fallback behavior.
Inspect the resolved runtime instead of guessing:
import { createGoodMemory, inspectGoodMemoryRuntime } from "goodmemory";
const memory = createGoodMemory({});
const runtime = inspectGoodMemoryRuntime(memory);
console.log(runtime.storage);
SQLite vector controls:
GOODMEMORY_SQLITE_VECTOR_MODE=off|prefer|requireGOODMEMORY_SQLITE_CUSTOM_LIBRARY_PATHGOODMEMORY_SQLITE_VECTOR_EXTENSION_PATHGOODMEMORY_SQLITE_VECTOR_EXTENSION_ENTRYPOINTGOODMEMORY_SQLITE_VECTOR_SEARCH_FUNCTION
Public Remember Customization
Product integrations should customize writes through the public remember
surface. Do not use test-only extractor seams for product behavior.
import { createGoodMemory, rememberRules } from "goodmemory";
const memory = createGoodMemory({
remember: {
preset: "default",
profiles: [
{
id: "life-coach",
when: { agentId: "life-coach" },
rules: [
rememberRules.fact(/my top priority this quarter is (.+)/i, {
id: "life-goal-priority",
category: "goal",
tags: ["life_coach", "long_term_goal"],
attributes: { horizon: "quarter" },
content: ({ match }) => match[1] ?? "",
}),
rememberRules.preference(/please coach me with (.+)/i, {
id: "life-coaching-style",
category: "coaching_style",
value: ({ match }) => match[1] ?? "",
}),
],
assistantOutputs: { mode: "confirmed_or_verified_only" },
},
],
},
});
await memory.remember({
scope: { userId: "u-1", agentId: "life-coach" },
messages: [
{
role: "user",
content: "My top priority this quarter is rebuilding my sleep routine.",
},
],
annotations: [
{
messageIndex: 0,
remember: "always",
metadataPatch: { tags: ["confirmed_by_host"] },
},
],
});
Profile extractors can be raw MemoryExtractor objects or named
{ id, extractor } entries. Use named extractors for real integrations so
remember events and eval reports carry stable extractorIds even if profile
composition changes. Remember events also carry resolved profileId and
presetId metadata.
AI SDK Adapter
GoodMemory's Node-compatible AI SDK path is a plain Request -> Response
server handler built from createGoodMemory() and createGoodMemoryAISDK().
import { createGoodMemory } from "goodmemory";
import type { GoodMemoryStreamTextInput } from "goodmemory/ai-sdk";
import { createGoodMemoryAISDK } from "goodmemory/ai-sdk";
const memory = createGoodMemory({});
const aiSDK = createGoodMemoryAISDK({
memory,
});
type MemoryChatRequest = Pick<
GoodMemoryStreamTextInput,
"messages" | "query" | "scope" | "system"
>;
function isMemoryChatRequest(value: unknown): value is MemoryChatRequest {
if (!value || typeof value !== "object" || Array.isArray(value)) {
return false;
}
const candidate = value as Record<string, unknown>;
const scope = candidate.scope;
return Array.isArray(candidate.messages)
&& !!scope
&& typeof scope === "object"
&& !Array.isArray(scope)
&& typeof (scope as { userId?: unknown }).userId === "string"
&& (scope as { userId: string }).userId.trim().length > 0;
}
export async function handleMemoryChat(request: Request): Promise<Response> {
const body: unknown = await request.json();
if (!isMemoryChatRequest(body)) {
return new Response(
JSON.stringify({
error: "Expected a request body with a messages array and scope.userId.",
}),
{
headers: { "content-type": "application/json; charset=utf-8" },
status: 400,
},
);
}
const result = aiSDK.streamText({
messages: body.messages,
query: body.query,
scope: body.scope,
system: body.system,
model: {} as never,
});
return result.toTextStreamResponse();
}
Notes:
- The canonical server example is examples/plain-ai-sdk-server.ts.
- Thin Express and Fastify examples are examples/express-chat-server.ts and examples/fastify-chat-server.ts.
examples/vercel-ai-chat.tsremains a lower-level wrapper/API example.- Next.js App Router can map
export async function POST(request: Request)to the same handler body. - The first public server path is
ModelMessage-first. - The wrapper augments
systemthroughrecall()andbuildContext()and soft-fails if the memory layer errors.
Python/FastAPI HTTP Bridge
Use the packaged HTTP bridge when a Python backend should call GoodMemory as a server-side memory service.
GOODMEMORY_HTTP_BRIDGE_TOKEN="replace-with-service-token" \
GOODMEMORY_STORAGE_PROVIDER=postgres \
GOODMEMORY_STORAGE_URL="postgres://user:pass@host:5432/goodmemory" \
./node_modules/.bin/goodmemory-http-bridge --profile life-coach
Python callers send Authorization: Bearer <token> plus the x-goodmemory-*
scope headers to POST /memory/recall-context, /memory/remember,
/memory/feedback, /memory/export, /memory/forget, and targeted
/memory/revise. The TypeScript bridge API is available from goodmemory/http.
Host Adapter API
Use goodmemory/host when an external host wants artifacts or host-specific
contracts without importing internals.
import { createGoodMemory } from "goodmemory";
import { createHostAdapter } from "goodmemory/host";
const memory = createGoodMemory({});
const adapter = createHostAdapter({
id: "codex-handoff",
hostKind: "codex",
memory,
readableArtifactTypes: ["session_memory"],
});
const result = await adapter.readArtifacts({
scope: {
userId: "u-1",
workspaceId: "workspace-a",
sessionId: "s-1",
},
includeRuntime: true,
});
Modes:
file-assisted: read compiled artifacts such asMEMORY.md,user.md,session-memory/<sessionId>.md, andplaybooks/*.mdwithout treating files as canonical storage.file-authoritative: available for the minimal writable subset. Today that subset is the canonicalplaybooks/*.mdfile shape, writing structured deltas back into active validated-pattern feedback records.
Writable guardrails:
- Prompt and skill snippet files remain derived read-only outputs.
- Risky guidance edits require explicit
verifyWriteapproval. - Low-risk metadata edits such as
appliesToandWhycan write back without the extra approval step. - Failed writable operations return diagnostics with rollback guidance.
Current Claude/Codex examples stay in file-assisted mode by default.
CLI Reference
The goodmemory command on your shell PATH is the global CLI installed with
npm install -g goodmemory@0.3.6. In a local dependency install, invoke the
package bin as npx goodmemory, npm exec -- goodmemory, or
./node_modules/.bin/goodmemory. The repo-local bun run goodmemory script is
for development only.
Memory-first commands:
./node_modules/.bin/goodmemory inspect --user-id <user-id> --workspace-id <workspace-id>
./node_modules/.bin/goodmemory trace --user-id <user-id> --workspace-id <workspace-id> --query "Which runbook is the source of truth?"
./node_modules/.bin/goodmemory export-memory --user-id <user-id> --workspace-id <workspace-id> --output ./tmp/export
./node_modules/.bin/goodmemory stats --user-id <user-id> --workspace-id <workspace-id>
./node_modules/.bin/goodmemory remember --user-id <user-id> --workspace-id <workspace-id> --session-id <session-id> --message "Remember that the deploy is blocked on smoke verification."
./node_modules/.bin/goodmemory feedback --host codex --workspace-root . --session-id <session-id> --signal "Keep coding summaries short and list explicit next steps."
./node_modules/.bin/goodmemory forget --host codex --workspace-root . --session-id <session-id> --memory-id <memory-id>
Installed-host commands:
goodmemory -V
goodmemory --version
goodmemory setup --host codex
goodmemory status codex --workspace-root .
goodmemory install codex --activation-mode global --writeback observe --user-id <user-id>
goodmemory enable codex --workspace-root . --writeback selective
goodmemory mcp serve --host codex
goodmemory-mcp --host codex
goodmemory codex bootstrap --user-id <user-id> --workspace-id <workspace-id>
goodmemory claude bootstrap --user-id <user-id> --workspace-id <workspace-id>
Hook and writeback examples:
printf '%s' '{"cwd":".","session_id":"s-1","hook_event_name":"SessionStart","source":"startup"}' \
| goodmemory codex hook session-start
printf '%s' '{"cwd":".","session_id":"s-1","tool_name":"Bash","tool_input":{"command":"./tools/DeepAnalyzer --detailed"}}' \
| goodmemory codex hook pre-tool-use
goodmemory codex action -- ./tools/DeepAnalyzer --detailed
printf '%s' '{"cwd":".","session_id":"s-1","messages":[{"role":"user","content":"Next step is to finish the release smoke."}]}' \
| goodmemory codex writeback --json
printf '%s' '{"cwd":".","session_id":"s-1","event_id":"stop-1","summary":"Keep coding summaries short."}' \
| goodmemory codex hook session-stop
Eval artifact inspection:
./node_modules/.bin/goodmemory eval inspect --run-dir reports/eval/live/<run-id> --case-id <case-id>
./node_modules/.bin/goodmemory eval trace --run-dir reports/eval/live/<run-id> --case-id <case-id>
./node_modules/.bin/goodmemory eval export-case --run-dir reports/eval/live/<run-id> --case-id <case-id> --output /tmp/case.json
CLI surface:
goodmemory -Vgoodmemory --versiongoodmemory setupgoodmemory statusgoodmemory installgoodmemory uninstallgoodmemory enablegoodmemory disablegoodmemory inspectgoodmemory tracegoodmemory export-memorygoodmemory statsgoodmemory remembergoodmemory feedbackgoodmemory forgetgoodmemory mcp servegoodmemory-mcpgoodmemory codex hookgoodmemory codex writebackgoodmemory claude hookgoodmemory claude writebackgoodmemory codex bootstrapgoodmemory claude bootstrapgoodmemory eval inspectgoodmemory eval tracegoodmemory eval export-case
Examples
Installed-package guides:
- 15-minute app integration guide: docs/GoodMemory-15-Minute-App-Integration.md
- Reference integration guide: docs/GoodMemory-Reference-Integration-Guide.md
- Codex handoff setup guide: docs/GoodMemory-Codex-Handoff-Setup-Guide.md
- Claude Code setup guide: docs/GoodMemory-Claude-Code-Setup-Guide.md
Repo-local examples:
- Basic chat integration: examples/basic-chat.ts
- Coding-agent flavored integration: examples/coding-agent.ts
- Plain AI SDK server integration: examples/plain-ai-sdk-server.ts
- Express chat server integration: examples/express-chat-server.ts
- Fastify chat server integration: examples/fastify-chat-server.ts
- AI SDK wrapper integration: examples/vercel-ai-chat.ts
- Life-coach public remember profile: examples/life-coach-remember-profile.ts
- Claude-style host artifact consumption: examples/host-claude-artifacts.ts
- Codex-style session handoff consumption: examples/host-codex-handoff.ts
Run examples from this repo:
bun run example:chat
bun run example:coding-agent
bun run example:ai-sdk-server
bun run example:express-chat
bun run example:fastify-chat
bun run example:vercel-ai
bun run example:life-coach-profile
bun run example:host-claude
bun run example:host-codex
Testing And Eval
Default local gates:
bun test
bun run typecheck
bun run test:coverage
Use bun run test:all only when you intentionally want the broader sweep
through vendored or third-party test trees.
Eval commands:
bun run eval:smoke
bun run eval:fallback
bun run eval:live
bun run eval:live-memory
bun run eval:live-auto-memory
bun run eval:live-provider-memory
bun run eval:summary
Meanings:
eval:smoke: harness self-check.eval:fallback: deterministic validation without live model calls.eval:live: live generator plus live judge with an in-memory backend.eval:live-memory: live generator plus live judge using auto-storage semantics; default storage is local SQLite unless provider storage resolves.eval:live-auto-memory: alias foreval:live-memorywhen scripts need to make auto-storage explicit.eval:live-provider-memory: provider-backed evidence path requiring Postgres, embeddings, and assisted extraction; it does not silently fall back to SQLite.eval:summary: summarize existing eval output directories.
Live eval environment:
GOODMEMORY_EVAL_PROVIDERGOODMEMORY_EVAL_BASE_URLfor OpenAI-compatible gatewaysGOODMEMORY_EVAL_MODELGOODMEMORY_EVAL_API_KEYGOODMEMORY_EVAL_MAX_CONCURRENCYoptional parallelism capGOODMEMORY_JUDGE_PROVIDERGOODMEMORY_JUDGE_BASE_URLfor OpenAI-compatible gatewaysGOODMEMORY_JUDGE_MODELGOODMEMORY_JUDGE_API_KEY
eval:live-memory and eval:live-auto-memory also need embedding and
assisted extractor configuration:
GOODMEMORY_EMBEDDING_PROVIDERGOODMEMORY_EMBEDDING_BASE_URLfor OpenAI-compatible gatewaysGOODMEMORY_EMBEDDING_MODELGOODMEMORY_EMBEDDING_API_KEYGOODMEMORY_ASSISTED_EXTRACTOR_PROVIDERGOODMEMORY_ASSISTED_EXTRACTOR_BASE_URLfor OpenAI-compatible gatewaysGOODMEMORY_ASSISTED_EXTRACTOR_MODELGOODMEMORY_ASSISTED_EXTRACTOR_API_KEY
eval:live-provider-memory additionally requires:
GOODMEMORY_TEST_POSTGRES_URL
Output directories:
- live runs:
reports/eval/live/run-* - auto-storage live memory runs:
reports/eval/live-memory/run-* - provider-backed live memory runs:
reports/eval/live-provider-memory/run-* - fallback runs:
reports/eval/fallback/run-*
Strategy Rollout
GoodMemory keeps rules-only as the supported baseline. New retrieval behavior
moves through observe -> assist -> promote.
Operator guidance:
observe: collect isolated shadow evidence without changing the executed path.assist: allow candidate execution in controlled eval runs.promote: requirestrategy-promotion-gate.json, a cleanregression-dashboard.json, andstrategy-promotion-authorization.json.- Stay
rules-onlywhen eval evidence is incomplete, provider-backed dependencies are unavailable, or rollback conditions are present.
Current Status
Current stable public surface:
- root memory API through
goodmemory - AI SDK adapter through
goodmemory/ai-sdk - host adapter and host contracts through
goodmemory/host - HTTP bridge API through
goodmemory/httpand packagedgoodmemory-http-bridge - installed CLI and managed host setup through
goodmemory setup - Codex and Claude Code hooks for recall
- read-only MCP for inspection and debugging
- opt-in installed-host writeback with audit and undo
- local SQLite durable fallback on Bun
- Postgres, embeddings, assisted extraction, and provider-backed evals when configured
Still outside the accepted public claim:
- default-on automatic writeback
- raw transcript archive
- dashboard or managed cloud
- treating exported artifact files as canonical storage
- broadening root exports with internal proposal or promotion internals
For the detailed current-state and evidence map, use docs/GoodMemory-Current-Status-and-Evidence.md.
Documentation
- Documentation map and archive policy: docs/README.md
- Current status and evidence: docs/GoodMemory-Current-Status-and-Evidence.md
- Canonical design: docs/GoodMemory-First-Principles-and-Reference-Architecture.md
- v1 implementation architecture: docs/GoodMemory-OSS-Architecture-v1.md
- PRD: docs/GoodMemory-PRD.md
- TDD and evaluation strategy: docs/GoodMemory-TDD-and-Evaluation-Strategy.md
- Strategy rollout guide: docs/GoodMemory-Strategy-Rollout-Guide.md
- Release checklist: docs/GoodMemory-v1-Release-Checklist.md
- Historical quality-gate archive: docs/archive/quality-gates/README.md
- Historical v1 snapshot: docs/GoodMemory-v1-Quality-Gate.md
Use task-board/00-README.txt for execution order,
open follow-up work, and phase-specific acceptance boundaries. Archived design
inputs are not current truth and are routed through docs/README.md.