Integration¶

Kibitzer coordinates three existing tools, integrates with the superpowers plugin, and participates in the umwelt policy framework. None of these are required — kibitzer degrades gracefully.

What works without anything installed¶

Component	Requires
Path guard	nothing — just config.toml and state.json
Mode controller	nothing
Coach (basic patterns)	nothing
Coach (tool-aware suggestions)	`.mcp.json` in project root
Coach (fledgling queries)	fledgling Python package or CLI
MCP tools	nothing
BlqInterceptor	blq on PATH
JetsamInterceptor	jetsam on PATH
FledglingInterceptor	fledgling on PATH
Coach (semantic underuse)	fledgling available to the agent
Doc context pipeline (ILIKE)	pluckit Python package
Doc context pipeline (BM25)	pluckit + fledgling + DuckDB markdown extension
Doc context (Context7)	nothing — uses stdlib urllib (network access)
Policy from umwelt	umwelt Python package + compiled `.kibitzer/policy.db`

blq¶

blq captures structured build/test output. Kibitzer's BlqInterceptor suggests blq run test when the agent runs pytest (or npm test, cargo test, etc.) through Bash.

What kibitzer uses from blq: - blq binary on PATH (for interceptor availability check)

What kibitzer does NOT do: - Does not call blq directly - Does not enforce blq's sandbox specs (that's blq's domain) - Does not parse blq's output

Future: The coach could query blq's event stream for richer pattern detection (e.g., "you've introduced 3 new test failures since your last edit").

jetsam¶

jetsam manages git workflow — atomic saves, syncs, plans, mode state. Kibitzer's JetsamInterceptor suggests jetsam commands when the agent uses raw git through Bash.

What kibitzer uses from jetsam: - jetsam binary on PATH (for interceptor availability check)

What kibitzer does NOT do: - Does not read jetsam's state files (uses its own .kibitzer/state.json) - Does not call jetsam mode — mode switching is via kibitzer's own ChangeToolMode MCP tool - Does not duplicate jetsam's save/sync workflow

Design note: Kibitzer and jetsam both have a concept of "mode." They're independent — jetsam's mode controls its own workflow, kibitzer's mode controls path protection and coaching. An agent could use both. In a future version, the two could share mode state, but for now they're separate to avoid coupling.

Fledgling¶

Fledgling provides read-only code intelligence via DuckDB — AST queries, definition lookup, caller tracing, conversation analytics.

What kibitzer uses from fledgling: - fledgling binary on PATH or Python package importable (for interceptor availability check) - Semantic tool names in counter tracking (FindDefinitions, CodeStructure, etc.) - Conversation analytics queries for richer coaching: - tool_calls() — detect repeated search patterns (same grep 3+ times) - bash_commands() — find bash commands with structured alternatives (replaceable_by field)

How queries work:

Kibitzer prefers fledgling's Python API when importable (fledgling.connect()), falling back to CLI subprocess calls (fledgling -f json query "SQL"). Install with pip install kibitzer[fledgling] for the Python API path.

# Python API (preferred — in-process, fast)
import fledgling
con = fledgling.connect()
rows = con.sql("SELECT * FROM tool_calls() WHERE ...").df().to_dict(orient="records")

# CLI fallback (subprocess, slower but always works if CLI installed)
fledgling -f json query "SELECT * FROM tool_calls() WHERE ..."

All queries have a 5-second timeout. If fledgling is unavailable or a query fails, the coach falls back to state-only patterns — no degradation of existing behavior.

What kibitzer does NOT do: - Does not manage fledgling kits (that's the quartermaster's future job) - Does not write to fledgling's database (read-only, level 0)

Future: Kit effectiveness tracking (which tools were used vs. available in the current kit).

Pluckit¶

Pluckit provides structured access to markdown documentation — glob-based doc discovery, BM25 search via named FTS collections, and section extraction.

What kibitzer uses from pluckit: - Plucker.connection — access to the underlying DuckDB connection for read_markdown_sections - Plucker.fts_collection(name) — named BM25 collections with independent IDF statistics - Plucker.docs() — ILIKE fallback when fledgling is unavailable

How it's used:

Kibitzer's get_doc_context() pipeline uses pluckit for doc retrieval. Tool documentation paths are registered via register_docs() (typically from lackpy's tool catalog docs_index). On first retrieval, kibitzer calls read_markdown_sections with content_mode='full' (each section includes all descendant content), builds a named FTS collection, and uses BM25 ranking for search. Doc refs can include a #section_id anchor to scope a tool's docs to a specific section subtree. Falls back to ILIKE substring matching when fledgling is unavailable.

session.register_docs(
    doc_refs={
        "Read": "tools/read.md",
        "Edit": "tools/edit.md#permissions",  # scoped to ## Permissions subtree
    },
    docs_root="/path/to/docs",
)
result = session.get_doc_context("permission denied", tool="Edit")

Install with pip install kibitzer[pluckit] for the doc context pipeline. Without pluckit, get_doc_context() returns an empty DocResult — no degradation of existing behavior. With pluckit + fledgling, retrieval uses BM25 ranking with content_mode='full'; with pluckit alone, retrieval falls back to ILIKE.

What kibitzer does NOT do: - Does not decide which sections matter — that's the consumer's select callback

Umwelt¶

Umwelt is a policy framework — stylesheets for runtime behavior. Kibitzer registers vocabulary (properties on state.mode) and consumes compiled policy databases to override its default config.toml values.

What kibitzer uses from umwelt: - umwelt.registry.register_property() — declares 5 properties on the state.mode entity - umwelt.policy.PolicyEngine.from_db() — loads compiled .kibitzer/policy.db - engine.resolve(type="mode", id=mode_name) — resolves mode properties through the cascade

Vocabulary registration:

Kibitzer adds properties to umwelt's existing state.mode entity rather than creating its own taxon. These properties map to kibitzer's three roles:

Role	Property	Type	Comparison	Purpose
Restrictor	`writable`	list	pattern-in	Path prefixes writable in this mode
Expander	`strategy`	str	exact	Coaching strategy text for the agent
Expander	`coaching-frequency`	int	<= (min)	Coach fires every N tool calls
Interactor	`max-consecutive-failures`	int	<= (min)	Auto-transition threshold
Interactor	`max-turns`	int	<= (min)	Max turns before suggesting a switch

How policies flow:

.umw stylesheets          umwelt compile          .kibitzer/policy.db
(human-authored)    →     (build step)      →     (SQLite, checked in)
                                                        │
                                                  PolicyConsumer
                                                  .from_db(path)
                                                        │
                                                  .to_config() → dict
                                                        │
                                              merged into load_config()
                                              as tier 4 (highest priority)

Install with pip install kibitzer[umwelt]. Without umwelt, kibitzer uses its built-in config.toml defaults — no degradation.

What kibitzer does NOT do: - Does not compile policy databases (that's umwelt compile) - Does not define its own taxon — uses existing state.mode - Does not write to the policy database (read-only consumer) - Does not require umwelt at runtime — all imports are guarded

Design note: Kibitzer fills three umwelt roles simultaneously. As a restrictor, the writable property constrains which paths the path guard allows. As an expander, strategy and coaching-frequency control coaching behavior. As an interactor, max-consecutive-failures and max-turns govern mode controller thresholds. Policy authors can tune all of these per-mode in .umw stylesheets without touching kibitzer's code.

Context7¶

Context7 provides up-to-date documentation for external libraries via a public REST API. Kibitzer queries it as a fallback when local docs (pluckit) have no results for a tool failure.

What kibitzer uses from Context7: - GET /v2/libs/search?query=... — resolve a library name to a Context7 library ID - GET /v2/context?libraryId=...&query=...&type=json — fetch documentation sections

How it's used:

When a tool fails and no local docs match (or none are registered), kibitzer extracts a library name from the error message and queries Context7. Results are returned as DocSection objects, identical to pluckit results — downstream code doesn't know the source.

tool failure → extract error text → try local docs (pluckit)
                                         │
                                    no results?
                                         │
                              extract library name from error
                              (last word first, skip uppercase)
                                         │
                              Context7: search → fetch → DocSections
                                         │
                              inject via PostToolUse additionalContext

Enabled by default. Disable with docs.context7 = false in .kibitzer/config.toml. All network calls have a 5-second timeout. No authentication required for read-only queries.

What kibitzer does NOT do: - Does not cache Context7 results across calls (each query is fresh) - Does not write to Context7 (read-only public API) - Does not require network access — falls back silently on timeout or error

Relationship to the Context7 MCP plugin: The agent may also have Context7 available as an MCP tool (resolve-library-id, query-docs). Kibitzer's integration is independent — it calls the REST API directly from Python, not through MCP. Both can coexist.

Superpowers¶

The superpowers plugin manages workflow phases (brainstorm → plan → implement → review) through skill invocations. Kibitzer manages tool constraints (what can be written where).

These are complementary, not competing.

Concern	Superpowers owns	Kibitzer's role
Workflow phases	Skills define the progression	Observe active skill, suggest matching mode
Task tracking	TodoWrite / TaskCreate	Don't duplicate — read task state for coaching context
TDD discipline	test-driven-development skill	Path guard enforces mechanically (can't edit src/ in test_dev)
Code review	requesting-code-review skill	Coach can suggest invoking the skill
Git worktrees	using-git-worktrees skill	Detect worktree context, apply config per-worktree
Plan files	`docs/superpowers/plans/`	Read for coaching context ("you're on step 3 of 7")
Verification	verification-before-completion	Coach can remind after edit streaks

What kibitzer does NOT duplicate: - Superpowers' "design before code" gate - Superpowers' "verify before claiming done" gate - Superpowers' task/todo tracking - Superpowers' subagent dispatch

Hook coexistence: Superpowers uses SessionStart hooks. Kibitzer uses PreToolUse and PostToolUse. No collision. kibitzer init detects existing hooks and merges without clobbering.

Integration architecture¶

                          ┌───────────┐
                          │  umwelt   │
                          │ (level 0) │
                          │           │
                          │  policy   │
                          │ framework │
                          └─────┬─────┘
                                │ policy.db
┌───────────────────────────────┴──────────────────────────────────┐
│                       KIBITZER (level 1)                         │
│  Hooks: path guard, interceptors, mode controller, coach         │
│  MCP: ChangeToolMode, GetFeedback, GetDocContext                 │
│  API: register_docs, get_doc_context, get_correction_hints, ...  │
└──────┬──────────────┬──────────────┬──────────────┬──────────────┘
       │              │              │              │
 ┌─────┴─────┐ ┌─────┴─────┐ ┌─────┴──────┐ ┌────┴──────┐ ┌───────────┐
 │    blq    │ │  jetsam   │ │  fledgling  │ │  pluckit  │ │ context7  │
 │ (level 1) │ │ (level 1) │ │  (level 0)  │ │ (level 0) │ │ (REST API)│
 │           │ │           │ │             │ │           │ │           │
 │ test      │ │ git       │ │ code        │ │ local doc │ │ external  │
 │ capture   │ │ workflow  │ │ intelligence│ │ retrieval │ │ lib docs  │
 └───────────┘ └───────────┘ └─────────────┘ └───────────┘ └───────────┘

Umwelt sits above kibitzer — it provides the policy that kibitzer consumes. Each tool below is independent. Kibitzer suggests alternatives but never wraps or calls them. The agent decides whether to use the suggestion. Pluckit and Context7 are exceptions — kibitzer calls pluckit directly for local doc retrieval, and Context7's REST API for external library docs. Both are fallback-safe.