open-webui

mirror of https://github.com/open-webui/open-webui.git synced 2026-06-13 19:20:05 +00:00

Author	SHA1	Message	Date
Classic298	02b2a391e9	fix: block private-IP webhook URLs to close SSRF on caller-controlled URL (#24587 ) * fix: block private-IP webhook URLs to close SSRF on caller-controlled URL post_webhook(url, ...) in utils/webhook.py forwards the URL straight to aiohttp.ClientSession.post with no SSRF gate. The URL is caller-controlled on two surfaces: - User notification settings under ENABLE_USER_WEBHOOKS=true — any authenticated user can set the URL their notifications POST to. - Automation notification triggers (calendar alerts, etc.). Without a gate, the URL can target cloud metadata (169.254.169.254 / fd00:ec2::254), localhost-bound services, RFC1918 internal hosts, or any other private address reachable from the server process. Blind SSRF — no response body returned to the caller — but enough to enumerate internal services via response timing / status codes, and on cloud deployments enough to issue requests against IMDSv1 if available. Call validate_url() at the top of post_webhook. The function blocks private/reserved IPs when ENABLE_RAG_LOCAL_WEB_FETCH is False (the default), is the project's chosen SSRF gate, and is already applied to the equivalent fetch surfaces (retrieval, image-load, OAuth profile picture). Operators who legitimately need to webhook to private IPs (internal monitoring, self-hosted Slack alternatives, etc.) can set ENABLE_RAG_LOCAL_WEB_FETCH=True — same opt-out as the other gated surfaces. Scope intentionally limited to webhooks. The OAuth discovery and external reranker paths cwanglab also flagged are admin-configured with intentional private-IP defaults (reranker defaults to http://localhost:8080/v1/rerank) and are out of scope per Rule 9 — the admin owns the URL choice and the operator opt-out exists for them too. Reported by cwanglab in GHSA-5x9f-85cg-w3hf (cluster canonical with six closed siblings: g36v-23gj-j69x, 6j8f-h58v-xgmw, xpwv-52pm-p8hj, v9gp-hv2c-9qv8, fw7w-jrw7-p3v9, x7xq-74rg-m8mf). Co-authored-by: cwanglab <cwanglab@users.noreply.github.com> * fix: also pass allow_redirects=False on webhook post_webhook session.post Companion to the previous commit. validate_url() only validates the initial URL; aiohttp's default allow_redirects=True would still follow a 302 to a private-IP target. Same redirect-bypass class as the rh5x cluster's five call sites, sixth call site to receive the same gate. Co-authored-by: cwanglab <cwanglab@users.noreply.github.com> --------- Co-authored-by: cwanglab <cwanglab@users.noreply.github.com>	2026-06-01 14:15:51 -07:00
Classic298	eb38389636	fix: delete Qdrant points by ID so memory deletions don't orphan vectors (#25495 ) The Qdrant backends implemented delete(ids=...) as a payload filter on metadata.id, but points are stored with the item id as the Qdrant point id (see _create_points), and not every point carries an id in its payload. Memory points store only {created_at} in metadata (KB metadata embeddings likewise), so deleting a single memory matched nothing and left an orphaned vector that kept being injected into RAG context. Delete by point id instead: PointIdsList for the standard backend, and a tenant-scoped HasIdCondition for multitenancy (point ids are unique, so tenant isolation is preserved). Filter-based deletion is unchanged. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 14:13:46 -07:00
Timothy Jaeryang Baek	eebbc48f80	refac Co-Authored-By: Jacob Leksan <63938553+jmleksan@users.noreply.github.com>	2026-06-01 14:13:28 -07:00
Timothy Jaeryang Baek	cff51f05f5	chore: format	2026-06-01 14:10:40 -07:00
Timothy Jaeryang Baek	a4735e46b9	refac Co-Authored-By: Syed Mustafa Quadri <175467872+code-quad3@users.noreply.github.com>	2026-06-01 14:09:54 -07:00
Timothy Jaeryang Baek	6c8dfd8175	refac	2026-06-01 14:08:34 -07:00
Timothy Jaeryang Baek	6fce92aa12	chore: format	2026-06-01 13:56:55 -07:00
Justin Williams	478bc9e3f1	fix(oauth): use Protected Resource Metadata scopes in static OAuth 2.1 flow (#24690 ) The static credentials OAuth flow currently sets scope=None, relying on the OAuth provider's default scopes. This breaks providers like GitHub that default to minimal/public-only access when no scope is requested. This change reads scopes_supported from the Protected Resource Metadata document (RFC 9728) and uses them in the authorization request. Unlike the Authorization Server's scopes_supported (a full catalog of every scope the AS can grant), the PRM scopes_supported represents what the specific resource requires — making it safe to request without breaking providers like Entra ID that reject broad scope requests. Fixes the regression introduced in `349ea4ea` where all scope handling was removed from the static flow.	2026-06-01 13:52:18 -07:00
Jacob Leksan	80da840ae5	refactor: move background tasks handler call to ensure consistent execution in chat response handlers (#24717 )	2026-06-01 13:50:15 -07:00
Timothy Jaeryang Baek	c7de057a4a	refac	2026-06-01 13:45:23 -07:00
Timothy Jaeryang Baek	750604a11d	refac	2026-06-01 13:43:05 -07:00
James Liounis	69c88e163d	feat(retrieval): add Perplexity attribution header (#24833 ) Signed-off-by: James Liounis <james.liounis@perplexity.ai>	2026-06-01 13:40:52 -07:00
Timothy Jaeryang Baek	c8eb8edca4	refac	2026-06-01 13:38:40 -07:00
Classic298	33e4e0dcc4	fix: gate chat_completion channel: branch on channel access + message scoping (#24725 ) * fix: gate chat_completion channel: branch on channel access + message scoping When chat_id starts with 'channel:' the chat-completion handler skips the chat ownership / storage block below it. Nothing replaced that gate. The downstream channel emitter in socket/main.py:_make_channel_ emitter writes to Messages.update_message_by_id using a caller-supplied message_id pulled from form_data['id'], with no membership check, no write-access check, and no validation that the message_id belongs to the channel. Net effect: any authenticated user could submit chat_id='channel:<any-channel-uuid>' + id='<any-message-uuid>' and overwrite that message with attacker-controlled LLM output. Cross- channel writes worked too — private channels, DMs, channels the caller has no access to. Original author attribution stayed intact on the overwritten row. Add the missing checks at the channel: branch: 1. Channel must exist (404 otherwise). 2. Non-admin caller must have write access to the channel — membership for group/dm channels, AccessGrants permission='write' for others. 3. The message_id (if supplied) must belong to the same channel — a caller with write access to channel A cannot use this path to overwrite a message in channel B. Behaviour change is limited to callers who were exploiting the gap: legitimate flows that supply a message_id under their own channel membership continue to work unchanged. Co-authored-by: sfwani <sfwani@users.noreply.github.com> * chore: trim verbose comment on channel: branch gate --------- Co-authored-by: sfwani <sfwani@users.noreply.github.com>	2026-06-01 13:37:32 -07:00
Timothy Jaeryang Baek	7f7cd21018	refac	2026-06-01 13:34:50 -07:00
Craig	ce4dca47cb	fix: apply RAG_EMBEDDING_QUERY_PREFIX to memory search queries (#24921 ) The query_memory endpoint embeds the search query without the configured RAG_EMBEDDING_QUERY_PREFIX, while every RAG retrieval path in retrieval/utils.py correctly passes it. Instruction-tuned embedding models (e.g. Qwen3-Embedding) produce poor results without the prefix, causing memory search to return semantically unrelated results.	2026-06-01 13:23:15 -07:00
Timothy Jaeryang Baek	160a6694e4	refac	2026-06-01 13:20:33 -07:00
Timothy Jaeryang Baek	778dba1d6b	refac	2026-06-01 13:18:44 -07:00
Timothy Jaeryang Baek	27fb20c13a	refac	2026-06-01 13:15:21 -07:00
Timothy Jaeryang Baek	eb4eebc3ce	refac	2026-06-01 13:10:19 -07:00
Timothy Jaeryang Baek	d64ef1803d	refac Co-Authored-By: Zaid Marji <91486926+zaid-marji@users.noreply.github.com>	2026-06-01 13:07:49 -07:00
Timothy Jaeryang Baek	01810e32ad	refac	2026-06-01 13:02:48 -07:00
Lukáš Kucharczyk	42c2393f8e	Update Kagi API endpoint and request method (#25015 ) Co-authored-by: russelg <russelg@users.noreply.github.com>	2026-06-01 12:48:44 -07:00
Timothy Jaeryang Baek	4297c02b12	refac	2026-06-01 12:44:16 -07:00
Timothy Jaeryang Baek	9035601bdb	refac	2026-06-01 12:41:30 -07:00
Timothy Jaeryang Baek	e3ab4bd212	refac Co-Authored-By: Zixin Yu <183055163+ivvi0927@users.noreply.github.com>	2026-06-01 12:37:34 -07:00
Timothy Jaeryang Baek	fd76b51ab2	refac Co-Authored-By: Classic298 <27028174+Classic298@users.noreply.github.com>	2026-06-01 12:27:08 -07:00
Timothy Jaeryang Baek	c0f1aa2919	refac	2026-06-01 12:24:45 -07:00
rileydes-improving	567c4aabe9	feat: add support for Valkey vector database (#24769 ) * feat: add support for Valkey vector database Signed-off-by: Riley Des <riley.desserre@improving.com> * feat: add CLIENT SETNAME to Valkey vector store connections Set client_name on GlideClientConfiguration for both the main client and batch client so connections are identifiable in CLIENT LIST, monitoring dashboards, and CloudWatch metrics. Signed-off-by: Riley Des <riley.desserre@improving.com> --------- Signed-off-by: Riley Des <riley.desserre@improving.com>	2026-06-01 12:20:01 -07:00
Timothy Jaeryang Baek	8644532f5b	refac Co-Authored-By: Classic298 <27028174+Classic298@users.noreply.github.com>	2026-06-01 12:13:04 -07:00
Classic298	9a3eea6448	fix: bind prompt history/version ops to the authorized prompt (#25056 ) The history diff, delete, and version-restore routes authorize the URL prompt_id but then act on a caller-supplied history/version id without checking it belongs to that prompt (IDOR). Filter by prompt_id in compute_diff and delete_history_entry, and reject a cross-prompt version_id in update_prompt_version. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-01 12:07:52 -07:00
Classic298	e623081b2b	fix: handle list-shape data in Firecrawl /search response (#24712 ) Firecrawl /search returns either `{"data": [...]}` (flat list — v1, and what frost19k reported on #23966) or `{"data": {"web": [...]}}` (v2, current production). The parser only handled the dict shape: data = response.get('data') or {} results = data.get('web') or [] On a list-shape response, `data.get('web')` raised AttributeError, caught by the function's outer try/except, and `search_firecrawl` silently returned []. Web search worked against v2 endpoints but is one upstream-format-change away from failing closed again. Accept either.	2026-06-01 12:07:31 -07:00
Classic298	507b8b213c	refac: mirror native FC code_interpreter authz gates onto legacy XML-tag path (#24724 ) The native function-calling tool resolver in utils/tools.py applies five gates before exposing execute_code as a builtin tool: builtin-category enable, ENABLE_CODE_INTERPRETER global config, model capability, features.code_interpreter request flag, and the per-user features.code_interpreter permission. The legacy XML-tag detection path in streaming_chat_response_handler applied only the request-flag gate. Brings the legacy path to parity by running the same five-gate check before activating tag detection. Behaviour change is limited to deployments that previously relied on the asymmetry — admins who set ENABLE_CODE_INTERPRETER=False or revoked the per-user permission, on the legacy tool-calling mode, with the client supplying features.code_interpreter=true. Any of those three conditions met now correctly disables tag detection. Co-authored-by: sfwani <sfwani@users.noreply.github.com>	2026-06-01 12:07:15 -07:00
Timothy Jaeryang Baek	c93f071700	refac	2026-06-01 11:58:16 -07:00
Classic298	76947ff926	fix: reject collection names with unsafe characters in RAG ACL (#24982 ) Open WebUI's collection ACL accepted any unknown name as a legacy/ephemeral collection. In Milvus multi-tenancy mode that name becomes the `resource_id` and is interpolated unescaped into a SQL-like Milvus expression — `resource_id == '<name>'` — so a name like x' or resource_id != '' or resource_id == 'x turns the filter into a tautology and returns every tenant's chunks from the shared collection. All collection names Open WebUI generates are UUIDs, SHA-256 hex digests, or fixed-prefix variants of those — they all fit [A-Za-z0-9_-]. Add a strict format check in filter_accessible_collections (utils.py) that drops any name outside that set before any ACL or vector-store lookup, applied even on the admin bypass path. _validate_collection_access then surfaces the dropped name as a 403. As defense in depth, MilvusClient now validates resource_id at every expression-construction site and escapes single quotes / backslashes in any other string interpolated into a filter (delete ids, metadata filter values). Non-string filter values are typed-checked instead of str()-formatted. Co-authored-by: Claude <noreply@anthropic.com>	2026-06-01 11:48:43 -07:00
Classic298	a089842368	fix: don't revert replace/outlet content on chat save (#25485 ) update_chat_by_id re-derived assistant content from `output` on every save (serialize_output) so frontend edits to output items reflect in content. But it ran unconditionally, so content set independently of output — an __event_emitter__ {"type":"replace"} from an Action, or an outlet filter footer — was reverted to the original output-derived text on the next save. The reload reads chat.chat directly, so the change vanished after navigating away (regression vs 0.9.2, which predates the output mechanism). Re-derive only when the message's `output` actually changed versus what's stored, which still reflects genuine output edits but leaves independently-set content intact. Fixes #24585 Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 11:45:05 -07:00
Classic298	558ea2a152	fix: don't block first-admin signup on stale ENABLE_SIGNUP (#24821 ) Symptom On a fresh install (zero users) the frontend shows the mandatory "Create Admin Account" onboarding screen, but POST /api/v1/auths/signup returns 403 ACCESS_PROHIBITED ("You do not have permission to access this resource."). Wiping the database does not help when the config layer is backed by Redis (the value survives in the Redis/valkey volume), or when only the user table is cleared (the config row survives in Postgres). The instance is then unrecoverable through the UI. Root cause signup_handler() auto-sets request.app.state.config.ENABLE_SIGNUP = False immediately after the first admin is created. That value is persisted by the config layer (the Postgres config table, and Redis when REDIS_URL is set). On a later zero-user database the persisted False is read back, so ENABLE_SIGNUP resolves False even though no users exist. The old gate was: if WEBUI_AUTH: if not ENABLE_SIGNUP or not ENABLE_LOGIN_FORM: if has_users or not ENABLE_INITIAL_ADMIN_SIGNUP: 403 ENABLE_INITIAL_ADMIN_SIGNUP defaults to False, so with zero users the inner test (has_users or not ENABLE_INITIAL_ADMIN_SIGNUP) is True, and a stale ENABLE_SIGNUP=False trips the outer test, producing a 403 on the only UI path that can create the first admin. The frontend decides to show onboarding purely from user_count == 0, so frontend and backend disagree and the instance bricks. Change Split the gate by has_users. Subsequent signups (has_users True) are unchanged: still gated by ENABLE_SIGNUP and ENABLE_LOGIN_FORM. The first user (has_users False, the bootstrap admin the onboarding screen invites) is gated only by the admin-chosen ENABLE_LOGIN_FORM (the documented SSO-only hard-disable) unless ENABLE_INITIAL_ADMIN_SIGNUP is set. It is no longer gated by ENABLE_SIGNUP, which in the zero-user state is never an admin decision but the post-first-admin auto-disable leaking across a database reset. Why this is safe (full case analysis) For WEBUI_AUTH the gate has 16 input combinations over (has_users, ENABLE_SIGNUP, ENABLE_LOGIN_FORM, ENABLE_INITIAL_ADMIN_SIGNUP). Old and new are identical in 15 of them: * All 8 has_users=True cases: both reduce to "403 iff not ENABLE_SIGNUP or not ENABLE_LOGIN_FORM". Unchanged. * 7 of the 8 has_users=False cases: identical. The only changed case is has_users=False, ENABLE_SIGNUP=False, ENABLE_LOGIN_FORM=True, ENABLE_INITIAL_ADMIN_SIGNUP=False: old behaviour 403, new behaviour allow. The new condition is a strict subset of the old (new-403 implies old-403), so the change never newly blocks any request that previously succeeded; it only stops blocking that one bootstrap state. That state has no legitimate deployment. With the login form enabled and zero users the onboarding form is already served, and the only operator-configurable way to keep the first signup closed (SSO-only: ENABLE_LOGIN_FORM=False, optionally with ENABLE_INITIAL_ADMIN_SIGNUP) is preserved byte for byte. ENABLE_SIGNUP=False with zero users is not an operator choice, it is the automatic post-first-admin disable, so the old behaviour there was purely a brick with no recovery path. No security control is weakened: ENABLE_LOGIN_FORM and ENABLE_INITIAL_ADMIN_SIGNUP keep their exact meaning, and the WEBUI_AUTH=False path is untouched. This is not Redis-specific: it reproduces with Redis disabled through the Postgres config table alone (clear the user table, keep the config row). Verification Drove the real signup endpoint across a 10-case matrix on freshly migrated databases, including the full end-to-end first-admin creation (returns role=admin, row persisted as admin) and the preserved SSO-only, subsequent-signup and no-auth behaviours. All pass. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-01 11:44:36 -07:00
Timothy Jaeryang Baek	ad9f2eeb15	refac	2026-06-01 11:34:46 -07:00
Timothy Jaeryang Baek	1bbb2b933d	refac	2026-06-01 11:08:58 -07:00
Classic298	cc15a01778	fix: don't crash on startup when stdout can't encode the banner (#25482 ) The startup banner uses Unicode box-drawing characters. On a stdout that can't encode them (Windows cp1252, or redirected/headless/pythonw output) print() raises UnicodeEncodeError and aborts startup. This blocks running open-webui serve headless on Windows. Guard the banner print and fall back to a plain ASCII line so startup always proceeds regardless of the console encoding. Fixes #24965 Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 10:41:43 -07:00
HW	bddadafa07	fix(images): pass content_type=None to r.json() to accept non-standard MIME types (#24838 ) aiohttp's ClientResponse.json() validates the Content-Type header against 'application/json' by default and raises ContentTypeError for any other value — including 'application/x-ndjson', which Ollama returns for its OpenAI-compatible /v1/images/generations endpoint. Pass content_type=None to skip this check while keeping all other parsing behaviour unchanged. The fix covers image generation (openai, gemini, automatic1111 engines) and image editing (openai, gemini engines).	2026-06-01 10:33:24 -07:00
Algorithm5838	309caa82fb	fix: persist outlet filter changes to message output (#24884 )	2026-06-01 10:24:40 -07:00
Classic298	b0fa4384ea	fix: cache path traversal via sibling-prefix bypass in serve_cache_file (#25086 ) serve_cache_file gated the resolved path with file_path.startswith(os.path.abspath(CACHE_DIR)) without a trailing os.sep, so any path resolving to a sibling whose name starts with the cache-dir basename (e.g. cache_backup, cached_models) passed the prefix check. Authenticated users could read files from such siblings via /cache/../<sibling>/<file>. Appending os.sep to the prefix closes the bypass; deep traversal and absolute paths were already correctly blocked. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-01 10:17:02 -07:00
Timothy Jaeryang Baek	07cbc91a8e	refac Co-Authored-By: Boris Rybalkin <ribalkin@gmail.com>	2026-06-01 10:16:01 -07:00
Timothy Jaeryang Baek	0e73f7af09	refac Co-Authored-By: bannert <58707896+bannert1337@users.noreply.github.com>	2026-06-01 10:13:50 -07:00
Classic298	02b65ea582	fix: don't hang terminal proxy when one forwarding pump exits first (#25479 ) The proxy gathered _client_to_upstream and _upstream_to_client with return_exceptions=True. When upstream sends a graceful CLOSE, _upstream_to_client returns but gather keeps waiting on _client_to_upstream, which is blocked in ws.receive() until the browser disconnects. The handler stays pending and the finally: session.close() cleanup is deferred, leaking a ClientSession and an open browser socket. Use asyncio.wait(return_when=FIRST_COMPLETED) and cancel the pending sibling, so the proxy unwinds as soon as either direction finishes. The pumps' bare except Exception already lets CancelledError (a BaseException) propagate, so cancellation is clean and they need no change. Fixes #25464 Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 10:10:04 -07:00
Classic298	83890f18b9	feat: cap profile image data URI size to bound model/avatar bloat (#25476 ) * feat: cap profile image data URI size to bound model/avatar bloat validate_profile_image_url() validated data-URI format (MIME allowlist, SVG rejection, scheme checks) but never its length, so a valid data:image/...;base64,<huge> passed for both custom-model icons and user avatars. Large inline images bloat Postgres and the Redis MODELS hash and degrade model-list latency. Add PROFILE_IMAGE_MAX_DATA_URI_SIZE (default 256 KiB, 0 disables) and reject oversized data URIs in the shared validator, so both model meta (ModelMeta.profile_image_url) and user avatars (UpdateProfileForm) are bounded at one chokepoint. ModelMeta already clears invalid values to None on read, so existing oversized icons stop propagating into the MODELS hash on the next refresh. Fixes #25468 Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix: default PROFILE_IMAGE_MAX_DATA_URI_SIZE to None (no cap) Per review: opt-in rather than a 256 KiB default. Unset leaves data URIs uncapped; the validator already skips the check on a falsy value. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 09:57:06 -07:00
Timothy Jaeryang Baek	6f0277db52	refac	2026-06-01 09:53:04 -07:00
Timothy Jaeryang Baek	55ca719bbf	refac	2026-06-01 09:48:29 -07:00
Timothy Jaeryang Baek	9e3e24e304	refac	2026-06-01 09:42:54 -07:00

1 2 3 4 5 ...

6212 Commits