references/kit - kit - Gitea: Git with a cup of tea

mirror of https://github.com/mark3labs/kit.git synced 2026-06-13 19:20:06 +00:00

Author	SHA1	Message	Date
Ed Zynda	fd960921ca	refactor: address code audit findings across SDK, cmd, and internals - Remove deprecated GenerateWithLoopAndStreaming and TreeManager AppendFantasyMessage / AddFantasyMessages / GetFantasyMessages to close the SDK leakage caused by the kit.TreeManager type alias - Switch extensionAPI method signatures to local Extension* aliases so pkg.go.dev signatures no longer expose internal package names - Bundle runNormalMode dependencies into a runModeDeps struct, shrinking the runNonInteractive and runInteractive call sites from 40+ positional args to (ctx, deps) - Add generic subscribeTyped[E Event] helper and collapse ~30 typed OnXxx wrappers in pkg/kit/events.go onto it (public signatures unchanged) - Extract setupBashPipes / interpretBashExit in internal/core/bash.go to deduplicate the buffered and streaming execution paths - Extract resolveAutoRouteAPIKey and wrapProviderErr helpers in internal/models/providers.go and uniformly apply them across every createXxxProvider site - Reimplement internal/extensions/watcher.go as a thin wrapper over the general-purpose internal/watcher.ContentWatcher, eliminating ~130 LOC of duplicated fsnotify logic while preserving the existing test API - Add ctx.Err() pre-flight checks in executeRead / Write / Edit / Ls so cancellation actually short-circuits pure file-IO tools	2026-06-06 19:22:05 +03:00
Ed Zynda	ab2a77c95e	feat(sdk): runtime skills and context-file management (#36 ) (#37 ) * feat(sdk): runtime skills and context-file management (#36) Let SDK consumers add, remove, and replace skills and AGENTS.md-style context files after Kit construction. Every mutation recomposes the system prompt and applies it to the agent so the next turn picks up the new instructions without restarting Kit. - AddSkill / LoadAndAddSkill / RemoveSkill / SetSkills on kit.Kit - AddContextFile / AddContextFileContent / LoadAndAddContextFile / RemoveContextFile / SetContextFiles on kit.Kit - RefreshSystemPrompt to force a manual recomposition - agent.SetSystemPrompt / GetSystemPrompt on the internal agent so the composed prompt rebuilds the fantasy agent on the next call - Per-instance runtimeMu guards skills/contextFiles; GetSkills and GetContextFiles return defensive snapshots safe for concurrent use - Capture the resolved basePrompt during New so recomposition keeps per-model overrides and --system-prompt file resolution intact - Skills dedupe by Name; context files dedupe by Path (opaque ID, not required to be a real filesystem path) Tests cover add/remove/set/replace semantics, validation errors, disk loading round-trips, prompt composition, and an 8-goroutine race-stress sweep (go test -race clean). Docs: pkg/kit/README, root README Go SDK section, www sdk/overview "Runtime skills and context files" section, www sdk/options callout cross-referencing the new API. Fixes #36 * fix(agent): synchronize SetSystemPrompt against concurrent rebuilds - add promptMu to Agent guarding systemPrompt writes and the fantasy agent rebuild, fixing a data race when Kit.applyComposedSystemPrompt is invoked concurrently - read systemPrompt under the same lock in GetSystemPrompt - update the thread-safety stress test to use a non-nil agent so the SetSystemPrompt path is actually exercised under -race	2026-05-29 18:44:12 +03:00
Ed Zynda	bd24f3315c	fix(agent): track tool call args per ToolCallID for parallel calls (#33 ) (#34 ) Previously GenerateWithCallbacks stored the most recent tool call's args in a single shared variable, which got clobbered when a provider emitted multiple tool_use blocks in a single step. Every OnToolResult callback then received the args of the last OnToolCall, regardless of which call it was actually resolving — breaking any downstream UI, log, or trace that derived its description from the toolArgs parameter. - Replace the shared currentToolArgs with a map keyed by ToolCallID, guarded by a sync.Mutex in case the streaming layer dispatches callbacks from multiple goroutines. - Delete each entry in OnToolResult so the map cannot accumulate across steps. - Add a regression test driving the streaming wrapper with a fake fantasy.Agent that emits two parallel tool calls before either result, asserting each callback sees its own args. Fixes #33	2026-05-20 10:37:46 +03:00
Ed Zynda	975c30a773	fix(mcp): surface MCP tool failures as soft errors, not critical aborts (#31 ) The MCP adapter previously wrapped any error returned by MCPToolManager.ExecuteTool into a Go error returned from the fantasy.AgentTool.Run interface. The fantasy agent loop treats those as critical errors and aborts the entire turn — discarding all prior reasoning, tool calls, and results. In practice that meant a single misbehaved MCP server returning a JSON-RPC "-32602 Invalid params" (e.g. a Zod schema mismatch on the server's input validation) would kill an in-progress turn after the model had already done dozens of seconds of useful work, with no way for the model to see the validation message and self-correct. This mismatched the contract that native Kit tools follow: native tools return errors via kit.ErrorResult(...), which become soft tool-result errors that the model reads and can act on (retry with corrected args, try a different tool, give up gracefully). Make the MCP path behave the same way: - JSON-RPC protocol errors, transport failures, and server-side schema rejections are now returned as fantasy.NewTextErrorResponse(...) with err == nil, so the agent loop continues and the model sees the failure in-band as a tool result it can reason about. - Context cancellation (ctx.Err() != nil) remains a critical error so callers can abort turns deterministically. This is the only case where bubbling up is correct — the caller intentionally tore the turn down and the agent must not keep spinning. - Server-side soft errors (CallToolResult{ isError: true }) and the happy path are unchanged. The agent loop's MaxSteps cap already bounds the worst case for a permanently broken MCP server, so there is no risk of unbounded retries. Side effect: extracted a tiny mcpExecutor interface for the one method the adapter uses (ExecuteTool), purely so the adapter is unit-testable in isolation without standing up a full MCPToolManager + connection pool. Behavior change note for downstream consumers: code that relied on host.PromptResult / Stream returning a Go error containing "mcp tool execution failed" will no longer see those errors — the failure information is now in the assistant's final response (or in the OnAfterToolResult / OnToolResult hooks, where IsError will be true). Context cancellation continues to surface as an error from those calls as before. Co-authored-by: space_cowboy <space_cowboy@mark3labs.com>	2026-05-13 20:12:31 +03:00
Ed Zynda	e6084b7bd0	feat(mcp): add MCP Tasks support at the SDK level (#21 ) Implement Phase 1 of the MCP Tasks spec so long-running tools/call requests can run asynchronously, survive proxy timeouts, and be cancelled mid-flight. - connection pool now advertises mcp.NewTasksCapability() during initialize and captures the InitializeResult so callers can detect per-server task support - new MCPServerConfig.TasksMode (auto\|never\|always, default auto) parsed from both new and legacy mcp.json shapes - ExecuteTool augments tools/call with TaskParams when policy and capability allow, polls tasks/get / tasks/result until terminal, and best-effort tasks/cancel on context cancellation - new MCPToolManager methods: SetTaskConfig, ListServerTasks, GetServerTask, CancelServerTask - public SDK surface in pkg/kit: MCPTask, MCPTaskStatus, MCPTaskMode, MCPTaskProgress, MCPTaskProgressHandler, plus Options fields (MCPTaskMode, MCPTaskTimeout, MCPTaskTTL, MCPTaskPollInterval, MCPTaskMaxPollInterval, MCPTaskProgress) and Kit.{List,Get,Cancel} MCPTask methods - works around two upstream mcp-go v0.51.0 parser bugs (ParseCallToolResult rejects task responses; ParseTaskResultResult looks for content under a non-existent nested key) by decoding the wire shape directly via the transport - defaults to MCPTaskModeAuto so servers that don't advertise task support behave exactly as before Fixes #21	2026-05-04 16:51:09 +03:00
Ed Zynda	558fb5214f	feat(sdk): expose remaining Fantasy lifecycle callbacks as events and hooks Closes #19. SDK events (pkg/kit): - Add 10 new event types: StepStart, StepFinish, TextStart, TextEnd, ReasoningStart, Warnings, Source, StreamFinish, Error, Retry - Add typed convenience subscribers for all 31 event types (20 previously required raw Subscribe + type assertion) - Add OnPrepareStep hook for intercepting/replacing messages between steps within a multi-step turn (composes with existing steering) - Rename OnStreaming to OnMessageUpdate (deprecated alias kept) Agent internals (internal/agent): - Add GenerateCallbacks struct replacing 16 positional callback params - Add GenerateWithCallbacks method; deprecate GenerateWithLoopAndStreaming - Wire all Fantasy stream callbacks: OnStepStart, OnTextStart/End, OnReasoningStart, OnWarnings, OnSource, OnStreamFinish, OnError, OnRetry, OnStepFinish (unified step event) - Compose PrepareStep with steering channel + consumer hook Extension system (internal/extensions): - Add 8 new extension events: StepStart, StepFinish, ReasoningStart, Warnings, Source, Error, Retry, PrepareStep - Bridge SDK events to extension runner with Yaegi-safe types (string errors, plain int64 token fields, ContextMessage for PrepareStep) Docs: update README, SDK skill, www/sdk/callbacks, www/sdk/overview	2026-04-22 20:25:06 +03:00
Ed Zynda	4ba9d6fab3	feat(events): mirror Fantasy tool input streaming callbacks as Kit events - Add ToolCallStartEvent, ToolCallDeltaEvent, ToolCallEndEvent to SDK - Wire Fantasy OnToolInputStart/Delta/End through agent to EventBus - Add typed convenience subscribers: OnToolCallStart/Delta/End on Kit - Bridge new events to TUI via ToolCallInputStart/Delta/End app events - Extend extension system with OnToolCallInputStart/Delta/End handlers - Add extension event types, API methods, loader wiring, Yaegi symbols - Update docs: README, SDK skill, extensions skill, www/sdk, www/extensions Closes #16	2026-04-21 23:28:13 +03:00
Ed Zynda	3bb20f5283	feat(models): surface and prevent silent max-tokens truncation - Raise --max-tokens default from 4096 to 8192. - Auto-raise MaxTokens toward the model's catalog Limit.Output (capped at 32768) when the user hasn't set --max-tokens explicitly and no per-model modelSettings override applied. Prevents silent 4k/8k truncation on models that support 32k-262k output. - Surface FinishReasonLength at turn end: the app now subscribes to TurnEndEvent and renders a system-message banner explaining the current cap, the model's known ceiling, and how to raise it. Previously the TUI swallowed 'length' stops, producing 'ghost' truncations. - Export FinishReason* constants on pkg/kit (Stop, Length, ToolCalls, ContentFilter, Error, Other, Unknown) and fix stale comments that used Anthropic-style strings. - Add Kit.MaxTokens() and Kit.MaxOutputLimit() SDK accessors, backed by Agent.GetMaxTokens() which correctly returns 0 for providers that suppress the param (e.g. Codex OAuth). - Tests: rightSizeMaxTokens covers 7 paths (cap, raise, preserve, explicit flag, nil info, zero limit); handleTurnEnd covers length/ non-length/nil-sendFn and the fallback message formatter. - Docs: update configuration.md, cli/flags.md, and kit-extensions skill to reflect the new default and behavior.	2026-04-16 23:12:10 +03:00
Ed Zynda	71301a9035	feat: add interactive sudo password prompt for bash tool Add core TUI support for handling sudo password prompts when executing bash commands that require elevated privileges. - Detect sudo commands and check if credentials are cached (sudo -n) - Show modal password prompt with masked input (• characters) when needed - Pipe password via stdin using sudo -S -p '' (no password in command string) - Password flows through context callbacks, never stored in session history - Add PasswordPromptHandler to agent and SDK event system - Add password prompt overlay to TUI with 🔐 icon and hidden input - Include tests for sudo command detection and rewriting The password is never persisted to disk - it only exists in memory during execution and is piped directly to sudo via stdin.	2026-04-15 17:33:03 +03:00
Ed Zynda	25410af440	feat: add smart @ attachments with MIME detection and MCP resource support Phase 1: Smart @ for local files - ProcessFileAttachments now returns FileAttachmentResult with separate ProcessedText and FileParts fields instead of a plain string - Binary files (images, audio, video, PDFs, etc.) detected via MIME type are extracted as multimodal FileParts instead of XML-wrapped text garbage - detectMediaType() uses extension-based lookup then content sniffing - isBinaryMediaType() classifies image/, audio/, video/*, and specific application types as binary - @mcp:server:uri token format for referencing MCP resources in text - All 4 submission paths (TUI submit, TUI steer, MCP prompt, CLI) updated - App.RunOnceWithFiles/RunOnceResultWithFiles/RunOnceWithDisplayAndFiles added for non-interactive multimodal submission Phase 2: MCP resources in @ autocomplete - MCPToolManager gains loadServerResources(), GetResources(), ReadResource(), SubscribeResource(), UnsubscribeResource(), RefreshServerResources() - MCPResource and MCPResourceContent types for resource metadata/content - FileSuggestion extended with IsMCPResource, MCPServerName, MCPResourceURI - InputComponent.SetMCPResourceProvider() wires resource suggestions into the @ popup alongside local files - @ popup merges local file suggestions with MCP resource suggestions, sorted by fuzzy match score - MCP resources display 'mcp:servername' in the popup description - Selecting an MCP resource inserts @mcp:server:uri format - ProcessFileAttachments resolves @mcp: tokens via MCPResourceReader callback - Text resources are XML-wrapped as <resource>; binary resources become FileParts for multimodal submission - Agent, Kit SDK, and cmd/root.go wired end-to-end Phase 3: Resource subscriptions (foundation) - SubscribeResource/UnsubscribeResource on MCPToolManager - onResourcesChanged callback for live refresh (wired but not yet triggering UI refresh automatically) - RefreshServerResources for manual resource list refresh	2026-04-15 13:01:36 +03:00
Ed Zynda	e068487ff7	style(ui): fix gofmt alignment in MCPPromptInfo struct	2026-04-15 11:50:33 +03:00
Ed Zynda	0ffb0ba788	refactor(tools): remove fantasy dependency from internal/tools - Replace fantasy.AgentTool with plain MCPTool struct in MCPToolManager - Move fantasy adapter from internal/tools to internal/agent as mcpAgentTool - Add MCPToolManager.ExecuteTool() for framework-agnostic tool execution - Remove dead fantasy.LanguageModel field from MCPConnectionPool - Remove MCPToolManager.SetModel() (was only feeding the dead field) internal/tools is now a pure MCP client library with no LLM framework dependency. The fantasy-to-MCP bridging is confined to the agent layer where it belongs.	2026-04-15 11:27:47 +03:00
Ed Zynda	7eaeafff8c	fix(mcp): propagate OAuth config for runtime-added servers - Store authHandler and tokenStoreFactory on Agent struct so AddMCPServer() can propagate them to new MCPToolManagers (#3) - Add OAuthClientID, OAuthClientSecret, OAuthScopes fields to MCPServerConfig for servers without dynamic registration (#4) - Pass OAuth fields from server config to transport OAuthConfig in both SSE and Streamable HTTP client creation paths - Add GetAuthHandler() accessor to MCPToolManager - Add tests for auth handler propagation and OAuth config parsing Closes #3, closes #4	2026-04-11 15:24:47 +03:00
Ed Zynda	e07c94f49d	feat(mcp): add dynamic MCP server loading and unloading - Add AddServer/RemoveServer to MCPToolManager for runtime server management - Add RemoveConnection to MCPConnectionPool for per-server teardown - Add AddMCPServer/RemoveMCPServer/ListMCPServers to Agent and SDK Kit - Lazily create connection pool so AddServer works without prior LoadTools - Wire onToolsChanged callback to trigger agent tool list rebuild - Make MCPToolManager.Close nil-safe when pool was never initialized Tests: - Integration tests with real stdio MCP server (Python echo server) - Agent-level tests using mock LLM model (no API key needed) - Unit tests for error paths, callbacks, idempotency, nil safety - SDK type surface tests	2026-04-09 13:54:11 +03:00
Ed Zynda	b87146a284	feat(sdk): add MCPTokenStoreFactory for custom OAuth token storage - Add MCPTokenStoreFactory option to kit.Options allowing SDK consumers to provide custom token storage backends for remote MCP servers - Thread TokenStoreFactory through the full chain: kit.Options → kitsetup → agent → MCPToolManager → MCPConnectionPool - Add createTokenStore() helper on connection pool that delegates to the factory or falls back to the default FileTokenStore - Export MCPTokenStore, MCPToken, MCPTokenStoreFactory, and ErrMCPNoToken in pkg/kit/types.go following SDK naming conventions - Default behavior (file-based storage) is preserved when factory is nil	2026-04-09 13:27:40 +03:00
Ed Zynda	3a8ffc2104	feat(models): add per-model system prompt support - Add systemPrompt field to GenerationParams and config structs - On init, replace default system prompt with per-model prompt when user hasn't explicitly set one (via flag, config, or SDK option) - On model switch, detect per-model prompt and compose it with AGENTS.md, skills, and date/cwd context - Fix viper.IsSet bug: BindPFlag causes IsSet to return true for unset flags, so compare against defaultSystemPrompt instead - Agent.SetModel now updates stored system prompt from config - Export LoadModelSettingsFromConfig, LoadSystemPromptValue, and LookupModelForSettings for use by Kit.SetModel - Add tests for prompt apply, precedence, file path, and modelSettings override	2026-04-09 12:35:00 +03:00
Ed Zynda	30ad7c1d0b	feat(sdk): persist session messages incrementally per agent step - Add StepMessagesHandler callback to agent's GenerateWithLoopAndStreaming so callers can persist messages as each step completes - Wire onStepMessages in Kit.generate() to call session.AppendMessage for each step's messages immediately on completion - Track PersistedMessageCount on GenerateWithLoopResult so runTurn skips already-persisted messages in post-generation cleanup - Tool calls are always persisted as assistant+tool pairs (never orphaned) - Document concurrency and incremental persistence requirements on the SessionManager interface for custom implementations	2026-04-08 14:15:05 +03:00
Ed Zynda	f65b6737f2	feat(sdk): add SkipConfig and DisableCoreTools options Add two new Options fields for programmatic SDK usage: - SkipConfig: Skip .kit.yml file loading while still using viper defaults and environment variables. Useful for fully programmatic configuration. - DisableCoreTools: Allow creating agents with 0 tools (chat-only mode) or with only custom tools. When true and Tools is empty, no tools are loaded. When combined with custom Tools, only those tools are loaded. Updates documentation in README, pkg/kit/README, skills/kit-sdk/SKILL, and www/pages/sdk/options.	2026-04-07 17:10:58 +03:00
Ed Zynda	debb39f56c	fix(ui): show MCP tools in /tools and status bar after async loading Background MCP tool loading (added in `7e54710`) caused tools to not appear in the UI because tool names and counts were captured at startup before loading completed. This adds: - MCPToolsReadyEvent and MCPServerLoadedEvent for progress notifications - Dynamic GetToolNames/GetMCPToolCount callbacks for live updates - Per-server status messages as each MCP server finishes loading - Refresh handlers to update /tools output and status bar when ready	2026-04-07 16:29:09 +03:00
Ed Zynda	201d14804e	fix(ui): prevent double-rendered messages after reasoning-only responses - Always fire onResponse callback even when response text is empty so ResponseCompleteEvent reaches the TUI and resets the StreamComponent - Check for existing StreamingMessageItem in flushStreamAndPendingUserMessages before creating a new StyledMessageItem to avoid duplicate content - Mark trailing StreamingMessageItem complete on StepComplete, StepCancelled, and StepError to freeze live timers and prevent dangling streaming state	2026-04-07 13:52:30 +03:00
Ed Zynda	7e54710d4a	perf(agent): load MCP tools asynchronously to speed up startup Load MCP server tools in the background so the UI appears immediately instead of blocking until all servers connect. The first LLM call automatically waits for tools to be ready before proceeding. Key changes: - NewAgent() starts MCP loading in a background goroutine and returns immediately with core/extension tools only - GenerateWithLoop() calls ensureMCPTools() to lazily wait and rebuild the fantasy agent with full tool set before first LLM call - Parallelize LoadTools() across all configured MCP servers - Add WaitForMCPTools() and MCPToolsReady() for status checking - Refactor SetModel/SetExtraTools to use shared rebuildFantasyAgent() - Expose async MCP status methods in public SDK	2026-04-07 13:36:10 +03:00
Ed Zynda	88870be4d2	feat: add frequency-penalty and presence-penalty parameters - Add --frequency-penalty and --presence-penalty CLI flags (0.0-2.0) - Wire through config, viper, ProviderConfig, and fantasy agent options - Support in config file, env vars (KIT_FREQUENCY_PENALTY), and SDK - Pass to Ollama via options map (frequency_penalty, presence_penalty) - Apply on both initial agent creation and runtime model swap	2026-04-06 10:52:33 +03:00
Ed Zynda	aecce001ee	feat(mcp): add OAuth support for remote MCP servers - Add MCPAuthHandler interface at SDK level (pkg/kit/) so all consumers (CLI, TUI, SDK embedders) control the OAuth UX through one interface - Default handler opens system browser + local callback server with PKCE - CLIMCPAuthHandler wraps default with status messages (stderr pre-TUI, system messages via TUI event system once running) - Always enable OAuth on remote transports (streamable HTTP, SSE) when handler is configured; harmless for servers that don't need it - Dynamic client registration when no client ID is pre-configured - File-based TokenStore persists tokens to ~/.config/.kit/mcp_tokens.json keyed by server URL so users don't re-auth on restart - Catch OAuthAuthorizationRequiredError at connection init (startup) and tool execution (mid-session token expiry), run auth flow, retry once - Fix error wrapping (%v -> %w) in connection pool so errors.As can unwrap through the chain to find OAuth errors - Thread AuthHandler through MCPToolManager -> AgentConfig -> AgentCreationOptions -> AgentSetupOptions -> kit.Options	2026-04-04 17:41:57 +03:00
Ed Zynda	997c7d15ff	fix: include pasted images in steering messages Steering messages (Ctrl+S during agent work) now carry file attachments just like queued messages do. Previously, pasted images were silently dropped when steering. Changes: - Add SteerMessage struct with Text and Files fields - Update steer channel from chan string to chan SteerMessage - Add SteerWithFiles methods through the stack (UI, app, SDK) - Update PrepareStep to include files in injected user messages	2026-04-02 17:19:34 +03:00
Ed Zynda	685aaf207f	feat(extensions): add hot-reload with file watching and /reload-ext command - Add fsnotify-based file watcher that auto-reloads extensions on .go file changes in autoloaded dirs with 300ms debounce - Add /reload-ext built-in command (alias /re) for manual reload - Add Agent.SetExtraTools() so extension tools update on reload instead of being baked in at agent creation time - Run reload async via tea.Cmd to avoid prog.Send() deadlock when extension handlers call ctx.Print() during SessionStart/Shutdown - Wire watcher lifecycle into cmd/root.go with graceful shutdown	2026-04-02 15:41:54 +03:00
Ed Zynda	7d2f078111	fix(ui): freeze reasoning counter when last token is processed - Wire fantasy's OnReasoningEnd callback through the full event chain: agent → SDK (ReasoningCompleteEvent) → app → TUI - Freeze reasoning duration in both StreamComponent and StreamingMessageItem as soon as reasoning ends, not when the next assistant text chunk arrives - Fix accent color on duration label in render.ReasoningBlock to match the live streaming style (VeryMuted prefix + Accent duration)	2026-04-02 14:18:42 +03:00
Ed Zynda	b0802a5c32	fix: properly count existing cache blocks to stay under 4-block limit The issue was that cache control persisted across turns in conversation history, causing accumulation beyond Anthropic's 4-block limit. Changes: - Count existing cache blocks in message history before adding new ones - Only add new cache blocks up to the 4-block limit - Remove tool caching (was adding 1 block per turn) - Skip messages that already have cache control set Tested with 5 sequential messages - no errors, proper cache metrics.	2026-03-29 14:48:08 +03:00
Ed Zynda	dfe65ca227	chore: remove all Crush references from comments Remove mentions of Crush from: - cache_control.go - agent.go (2 references) - content.go - tool_renderers.go - lsp-diagnostics.go (2 references)	2026-03-29 14:43:51 +03:00
Ed Zynda	d4ec756ce5	fix: match Crush's cache_control strategy exactly Crush's proven 4-block strategy: 1. Last system message (if present) 2. Last 2 conversation messages 3. Last tool definition This stays exactly at Anthropic's 4-block limit without exceeding it. Previous implementation could exceed the limit in certain edge cases. Now matches Crush's battle-tested approach.	2026-03-29 14:42:29 +03:00
Ed Zynda	2971e73ee8	fix: limit Anthropic cache_control blocks to maximum 4 Anthropic API enforces a maximum of 4 blocks with cache_control per request. The previous implementation could exceed this limit when combining: - System message caching - Recent message caching - Tool definition caching Changes: - Add explicit cache block counting (max 4) - Remove tool cache control to stay under limit - Prioritize: system message first, then recent messages - Work backwards from end to cache most recent context first Fixes: bad request error 'A maximum of 4 blocks with cache_control may be provided'	2026-03-29 14:40:44 +03:00
Ed Zynda	5aa6c9e116	chore: fix all golangci-lint v2 issues - Fix gofmt formatting issues in 7 files - Replace atomic.AddUint64 with atomic.Uint64 type (modernize) - Replace for i := 0; i < count; i++ with for i := range count (modernize) - Replace strings.Split with strings.SplitSeq (modernize) - Replace deprecated GetFantasyProviders with GetLLMProviders - Replace deprecated GetFantasyMessages with GetLLMMessages - Replace deprecated ConvertFromFantasyMessage with ConvertFromLLMMessage - Replace deprecated FromFantasyMessage with FromLLMMessage - Replace deprecated ToFantasyMessages with ToLLMMessages - Remove 2 unused formatToolArgs functions	2026-03-29 14:36:03 +03:00
Ed Zynda	bca08476de	chore: fix remaining linting issues in caching code - Use max() built-in instead of if statement (modernize) - Remove unused buildAnthropicCacheOptions function - Remove unused anthropic import	2026-03-29 14:32:28 +03:00
Ed Zynda	b295a25946	feat: automatic prompt caching for cost reduction Implements automatic prompt caching to reduce API costs by 60-90% for repeated prompts with the same context. Architecture: - Provider-level caching for OpenAI (PromptCacheKey) - Message-level caching for Anthropic (avoids type conflicts) - Model family detection enables caching regardless of provider Key Changes: - Add ModelInfo.Family with SupportsCaching() and CacheType() methods - Add ProviderConfig.DisableCaching for opt-out - Implement message-level cache control in agent (like Crush) - Last system message gets cache control - Last 2 messages get cache control - Last tool gets cache control - Auto-disable caching when thinking is enabled (type conflict avoidance) - Add KIT_DISABLE_CACHE environment variable for global opt-out Tested with opencode/claude-sonnet-4-6 showing cacheRead/cacheWrite tokens in debug output, confirming 60-90% cost savings. Closes cost optimization for multi-turn conversations.	2026-03-29 14:24:07 +03:00
Ed Zynda	f0e4e2f757	refactor: remove Fantasy dependency name leakage from public SDK and docs Rename public SDK symbols to use generic LLM terminology instead of exposing the internal dependency name (charm.land/fantasy): Public API renames (with deprecated wrappers for backward compat): - ConvertToFantasyMessages() → ConvertToLLMMessages() - ConvertFromFantasyMessage() → ConvertFromLLMMessage() - GetFantasyProviders() → GetLLMProviders() New type alias: - LLMFilePart = fantasy.FilePart (eliminates need for direct fantasy import) - PromptResultWithFiles() signature now uses LLMFilePart Internal renames (with deprecated wrappers): - ModelsRegistry.GetFantasyProviders() → GetLLMProviders() - TreeManager.GetFantasyMessages() → GetLLMMessages() - TreeManager.AppendFantasyMessage() → AppendLLMMessage() - TreeManager.AddFantasyMessages() → AddLLMMessages() - Message.ToFantasyMessages() → ToLLMMessages() - FromFantasyMessage() → FromLLMMessage() - npmToFantasyProvider → npmToLLMProvider - isProviderFantasySupported() → isProviderLLMSupported() All internal callers migrated to new names. ~30 comments updated to remove Fantasy references across pkg/kit/, internal/agent/, internal/models/, internal/message/, internal/session/. Documentation updates: - AGENTS.md: added Public SDK rules section (no dependency leakage, naming conventions, deprecation pattern) - README.md: removed Fantasy references - pkg/kit/README.md: full rewrite with current API surface - skills/kit-sdk/SKILL.md: updated examples and type references - www/pages/providers.md, www/pages/cli/commands.md: updated	2026-03-29 14:01:57 +03:00
Ed Zynda	727b42acfe	cleanup: remove unused variable, duplicate condition, and reimplemented stdlib helper - agent: remove unused currentToolName variable and its compiler-suppressor '_ = currentToolName'; currentToolArgs is the field actually used by OnToolResult callbacks - tools/connection_pool: collapse double-nested identical if guard into a single check (copy-paste artifact) - tools/mcp_test: replace hand-rolled contains() helper with strings.Contains; add 'strings' import and delete the redundant function	2026-03-29 00:00:33 +03:00
Ed Zynda	31966c469f	Skip max_output_tokens for OpenAI Codex OAuth provider The Codex API doesn't support the max_output_tokens parameter, which was causing "Unsupported parameter: max_output_tokens" errors. - Add SkipMaxOutputTokens flag to ProviderResult - Set flag when creating Codex OAuth provider - Check flag in agent setup to skip WithMaxOutputTokens option - This matches pi's behavior of not sending max_tokens to Codex API	2026-03-26 15:04:16 +03:00
Ed Zynda	be55bc03f1	Add mid-turn steering with Ctrl+S	2026-03-26 12:10:14 +03:00
Ed Zynda	09919b6307	feat: update token usage after each step in multi-step turns Previously, token usage and costs were only updated at the end of a complete turn. For long-running multi-step tool-calling conversations, this meant the status bar showed stale (or zero) costs during the entire interaction. Now, after each complete step (tool call + result), the usage tracker is updated with the actual token counts from that step. This provides real-time cost accumulation visible in the status bar. Changes: - Add StepUsageHandler type and onStepUsage parameter to agent - Emit StepUsageEvent from kit layer after each step completes - Handle StepUsageEvent in app layer to update UsageTracker - Add EventStepUsage constant and StepUsageEvent struct to events The step usage is additive - each step's tokens are added to the running session totals, just like the final turn usage was before.	2026-03-25 18:17:48 +03:00
Ed Zynda	3fc0ad906e	feat(ui): streaming bash output in TUI Display streaming bash output in the TUI stream region as it arrives. Changes: - Add streaming bash output rendering to renderStream() - Style stdout with CodeBg, stderr with Error color - Add streamingMu mutex for thread-safe buffer access - Clear buffers on ToolResultEvent - Add ToolOutputEvent to event system (pkg/kit, internal/app) - Add ToolOutputHandler callback in agent - Implement streaming mode in bash tool with pipes - Add tests for accumulation and clearing The streaming output appears in real-time below the LLM streaming text while bash commands are executing, with proper synchronization to prevent race conditions between Update and Render methods.	2026-03-22 20:23:19 +03:00
Ed Zynda	c0880e1ef6	fix: preserve completed tool calls when cancelling with ESC When pressing ESC twice to cancel an agent turn, completed tool calls and their results were being discarded along with the in-progress text. Only the streaming text should be discarded. The root cause was a chain of two issues: 1. Agent layer (internal/agent/agent.go): Fantasy's Stream() returns nil on error, discarding all accumulated step data. Fixed by tracking completed step messages via the OnStepFinish callback and returning a partial GenerateWithLoopResult alongside the error. 2. App layer (internal/app/app.go): The in-memory message store was never synced from the tree session after cancellation. Fixed by reloading the store from the tree session (which the SDK's runTurn already persists partial progress to). The existing partial-persistence code in pkg/kit/kit.go runTurn() was correct but was dead code because the agent layer always returned nil on error. It now receives the partial result and persists completed step messages to the tree session as intended.	2026-03-21 18:32:28 +03:00
Ed Zynda	c94edc929b	feat: add rich tool metadata to SDK and extension events (Gaps 1-8) Thread ToolCallID, ToolKind, ParsedArgs, FileDiff metadata, StopReason, SessionID, and StructuredMessages across the SDK event bus, extension wrapper, app bridge, hooks, and ACP server layers. - Gap 1: ToolCallID from Fantasy's ToolCallContent threaded end-to-end - Gap 2: ToolKind via static lookup (execute/edit/read/search/agent) - Gap 3+4: FileDiffInfo with DiffBlocks via fantasy.ToolResponse.Metadata - Gap 5: StopReason from Fantasy FinishReason on TurnEndEvent/TurnResult - Gap 6: Subagent sessions now opt-out (NoSession); SessionID in JSON output - Gap 7: GetStructuredMessages() returns typed ContentParts - Gap 8: ParsedArgs map[string]any on tool events for convenience Edit/write tools attach structured diff metadata. ACP server uses real ToolCallIDs. Extension and SDK events kept in sync with matching fields.	2026-03-16 11:10:05 +03:00
Ed Zynda	4c126ca41b	feat(ui): show clean summary for subagent results instead of raw output - Add custom renderer for spawn_subagent tool showing status + 3-line preview - Pass toolArgs through ToolExecutionEvent to show task in spinner - Display 'Subagent: <task>' during execution instead of generic message - Compact mode shows concise one-line status summary	2026-03-14 17:04:50 +03:00
Ed Zynda	c46687fc44	fix: pass image file parts through Fantasy agent's Files field splitPromptAndHistory was extracting only text from the last user message, discarding FilePart data (clipboard images). The fix extracts both text and file parts, passing files via AgentStreamCall.Files and AgentCall.Files so Fantasy includes them in the API request. Also preserves file parts when BeforeTurn hooks or skill expansion replace the user message text in runTurn.	2026-03-09 10:26:31 +03:00
Ed Zynda	f3ea18ae3a	feat: add thinking model support with configurable reasoning levels Add extended thinking/reasoning support for Anthropic and OpenAI models: - ThinkingLevel type (off/minimal/low/medium/high) with token budgets - Stream reasoning deltas via OnReasoningDelta through SDK→TUI event pipeline - Render thinking blocks in StreamComponent (muted italic, collapsible) - ctrl+t toggles thinking visibility, shift+tab cycles thinking level - /thinking slash command with tab-completion for level names - --thinking-level CLI flag and config file support - Map ThinkingLevel to OpenAI ReasoningEffort for Responses API - Auto-bump Anthropic max_tokens when thinking budget exceeds it - Fix ResponseCompleteEvent prematurely resetting stream in streaming mode - Status bar displays current thinking level	2026-03-07 21:27:46 +03:00
Ed Zynda	24ea2c94e3	feat: add OpenAI Responses API support for codex/gpt-5/o3/o4 models Enable fantasy's Responses API path (WithUseResponsesAPI) for the OpenAI provider so that models like gpt-5.3-codex, codex-mini-latest, o3, o4-mini, and other Responses-only models work correctly. - Enable WithUseResponsesAPI on both createOpenAIProvider and createAutoRoutedOpenAIProvider - Build provider options for reasoning models (reasoning_summary, encrypted reasoning content) matching crush's coordinator behaviour - Thread ProviderOptions from provider creation through to the fantasy agent in NewAgent, SetModel, and the SDK Complete path - Pass generation parameters (Temperature, MaxTokens, TopP, TopK) to the fantasy agent for all providers (previously only Ollama) - Fix extension tool schema for Responses API: parse Parameters JSON Schema string into fantasy ToolInfo format, ensure Required is never nil (OpenAI rejects null, expects empty array)	2026-03-07 11:03:10 +03:00
Ed Zynda	23c16bb197	feat: add tool mgmt, model mgmt, options, event bus, LLM completion, steer mode, and 10 example extensions Phase 2+3 extension API additions: - Tool management: GetAllTools, SetActiveTools (plan-mode support) - Model management: SetModel, GetAvailableModels, ModelChangedEvent - Extension options: RegisterOption, GetOption, SetOption (env/config/default) - Inter-extension event bus: OnCustomEvent, EmitCustomEvent - Direct LLM completion: ctx.Complete with streaming/blocking modes - Steer delivery mode: CancelAndSend for interrupt-and-redirect New example extensions (10): - plan-mode.go: read-only exploration with /plan toggle - summarize.go: conversation summarization via ctx.Complete - bookmark.go: persistent bookmarks via AppendEntry/GetEntries - auto-commit.go: auto-commit on exit using last assistant message - permission-gate.go: confirm dangerous bash commands - protected-paths.go: block writes to .env, .git/, secrets/ - notify.go: desktop notifications on agent completion - inline-bash.go: !{cmd} expansion in prompts - pirate.go: system prompt persona injection - project-rules.go: load .kit/rules/*.md into system prompt Always-wrap tools through runner for SetActiveTools disabled-tool checking. Removed phase1/phase2 test extensions from examples.	2026-03-02 14:31:35 +03:00
Ed Zynda	2cf7464e76	combine startup info into single system message block Merge Context, Skills, and tool counts into one KIT System block instead of separate styled sections. Add separate MCP and extension tool counts to Agent, only displaying each when > 0.	2026-02-27 17:19:13 +03:00
Ed Zynda	35452cc21b	remove dead code and add Plan 10 for full app-as-SDK-consumer integration Delete legacy session files (manager.go, session.go) and unused ParseModelName() — all orphaned after Plans 00-09. Add Plan 10 to close all deferred items: app uses kit.New(), executeStep() delegates to kit.PromptResult(), extension observation events route through SDK EventBus.	2026-02-27 13:33:19 +03:00
Ed Zynda	d8f40039fe	export tools and tool factories with WithWorkDir option (Plan 01) - Add ToolOption/WithWorkDir functional options pattern to internal/core - Update all 7 tool constructors to accept ...ToolOption and resolve paths relative to the configured working directory - Create pkg/kit/tools.go with public exports: individual constructors, bundles (AllTools, CodingTools, ReadOnlyTools), and WithWorkDir - Add CoreTools field to AgentConfig/AgentCreationOptions so callers can inject custom tool sets instead of hardcoding core.AllTools() - Add Tools field to kit.Options and GetTools() to kit.Kit - Fully backward compatible: no-arg calls use os.Getwd() as before	2026-02-27 11:37:46 +03:00
Ed Zynda	7a6c0aba61	wire extension slash commands and tools into TUI dispatch pipeline Extension-registered slash commands are now fully end-to-end: - Commands appear in autocomplete popup (category: Extensions) - Commands appear in /help under Extensions section - Commands dispatch via handleExtensionCommand with argument support - Command names normalized to /prefix at the cmd layer boundary Extension-registered tools now show in /tools: - Agent.GetTools() includes extraTools from extensions - Previously only core + MCP tools were returned Also adds RegisterTool and RegisterCommand examples to the kit extensions init template, and adds .kit/ to .gitignore.	2026-02-27 00:26:06 +03:00

1 2

88 Commits