open-webui

mirror of https://github.com/open-webui/open-webui.git synced 2026-06-13 19:20:05 +00:00

Author	SHA1	Message	Date
Timothy Jaeryang Baek	bfe5523c52	chore: bump	2026-06-01 13:54:10 -07:00
Timothy Jaeryang Baek	77c8c54b1e	refac	2026-06-01 13:53:19 -07:00
Justin Williams	478bc9e3f1	fix(oauth): use Protected Resource Metadata scopes in static OAuth 2.1 flow (#24690 ) The static credentials OAuth flow currently sets scope=None, relying on the OAuth provider's default scopes. This breaks providers like GitHub that default to minimal/public-only access when no scope is requested. This change reads scopes_supported from the Protected Resource Metadata document (RFC 9728) and uses them in the authorization request. Unlike the Authorization Server's scopes_supported (a full catalog of every scope the AS can grant), the PRM scopes_supported represents what the specific resource requires — making it safe to request without breaking providers like Entra ID that reject broad scope requests. Fixes the regression introduced in `349ea4ea` where all scope handling was removed from the static flow.	2026-06-01 13:52:18 -07:00
Timothy Jaeryang Baek	675e9bee5a	refac	2026-06-01 13:51:03 -07:00
Jacob Leksan	80da840ae5	refactor: move background tasks handler call to ensure consistent execution in chat response handlers (#24717 )	2026-06-01 13:50:15 -07:00
Chane Lu	974b13839e	fix: polyfill readable stream async iteration for Safari PDF extraction (#25473 )	2026-06-01 13:47:57 -07:00
Classic298	886c132a10	Update CHANGELOG.md (#25453 ) * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md	2026-06-01 13:46:17 -07:00
Timothy Jaeryang Baek	c7de057a4a	refac	2026-06-01 13:45:23 -07:00
Timothy Jaeryang Baek	750604a11d	refac	2026-06-01 13:43:05 -07:00
James Liounis	69c88e163d	feat(retrieval): add Perplexity attribution header (#24833 ) Signed-off-by: James Liounis <james.liounis@perplexity.ai>	2026-06-01 13:40:52 -07:00
Timothy Jaeryang Baek	c8eb8edca4	refac	2026-06-01 13:38:40 -07:00
Classic298	33e4e0dcc4	fix: gate chat_completion channel: branch on channel access + message scoping (#24725 ) * fix: gate chat_completion channel: branch on channel access + message scoping When chat_id starts with 'channel:' the chat-completion handler skips the chat ownership / storage block below it. Nothing replaced that gate. The downstream channel emitter in socket/main.py:_make_channel_ emitter writes to Messages.update_message_by_id using a caller-supplied message_id pulled from form_data['id'], with no membership check, no write-access check, and no validation that the message_id belongs to the channel. Net effect: any authenticated user could submit chat_id='channel:<any-channel-uuid>' + id='<any-message-uuid>' and overwrite that message with attacker-controlled LLM output. Cross- channel writes worked too — private channels, DMs, channels the caller has no access to. Original author attribution stayed intact on the overwritten row. Add the missing checks at the channel: branch: 1. Channel must exist (404 otherwise). 2. Non-admin caller must have write access to the channel — membership for group/dm channels, AccessGrants permission='write' for others. 3. The message_id (if supplied) must belong to the same channel — a caller with write access to channel A cannot use this path to overwrite a message in channel B. Behaviour change is limited to callers who were exploiting the gap: legitimate flows that supply a message_id under their own channel membership continue to work unchanged. Co-authored-by: sfwani <sfwani@users.noreply.github.com> * chore: trim verbose comment on channel: branch gate --------- Co-authored-by: sfwani <sfwani@users.noreply.github.com>	2026-06-01 13:37:32 -07:00
Timothy Jaeryang Baek	7f7cd21018	refac	2026-06-01 13:34:50 -07:00
Craig	ce4dca47cb	fix: apply RAG_EMBEDDING_QUERY_PREFIX to memory search queries (#24921 ) The query_memory endpoint embeds the search query without the configured RAG_EMBEDDING_QUERY_PREFIX, while every RAG retrieval path in retrieval/utils.py correctly passes it. Instruction-tuned embedding models (e.g. Qwen3-Embedding) produce poor results without the prefix, causing memory search to return semantically unrelated results.	2026-06-01 13:23:15 -07:00
Timothy Jaeryang Baek	160a6694e4	refac	2026-06-01 13:20:33 -07:00
Timothy Jaeryang Baek	4705c2d988	refac	2026-06-01 13:20:26 -07:00
Timothy Jaeryang Baek	778dba1d6b	refac	2026-06-01 13:18:44 -07:00
Timothy Jaeryang Baek	27fb20c13a	refac	2026-06-01 13:15:21 -07:00
Timothy Jaeryang Baek	eb4eebc3ce	refac	2026-06-01 13:10:19 -07:00
Timothy Jaeryang Baek	d64ef1803d	refac Co-Authored-By: Zaid Marji <91486926+zaid-marji@users.noreply.github.com>	2026-06-01 13:07:49 -07:00
Timothy Jaeryang Baek	346dab3d8f	refac	2026-06-01 13:03:13 -07:00
Timothy Jaeryang Baek	01810e32ad	refac	2026-06-01 13:02:48 -07:00
Mr. Meowgi	a9da054572	feat: add skills management to chat component (#25037 ) - Introduced skills functionality in Chat.svelte, MessageInput.svelte, and related components. - Added SkillsModal for displaying and managing available skills. - Updated state management to include selectedSkillIds and integrate skills API. - Enhanced UI to show available skills and their descriptions. - Updated translations to support skills-related text.	2026-06-01 12:53:30 -07:00
Lukáš Kucharczyk	42c2393f8e	Update Kagi API endpoint and request method (#25015 ) Co-authored-by: russelg <russelg@users.noreply.github.com>	2026-06-01 12:48:44 -07:00
Timothy Jaeryang Baek	e90a618f45	refac	2026-06-01 12:47:59 -07:00
Timothy Jaeryang Baek	4297c02b12	refac	2026-06-01 12:44:16 -07:00
Timothy Jaeryang Baek	9035601bdb	refac	2026-06-01 12:41:30 -07:00
Timothy Jaeryang Baek	e3ab4bd212	refac Co-Authored-By: Zixin Yu <183055163+ivvi0927@users.noreply.github.com>	2026-06-01 12:37:34 -07:00
Timothy Jaeryang Baek	fd76b51ab2	refac Co-Authored-By: Classic298 <27028174+Classic298@users.noreply.github.com>	2026-06-01 12:27:08 -07:00
Timothy Jaeryang Baek	c0f1aa2919	refac	2026-06-01 12:24:45 -07:00
rileydes-improving	567c4aabe9	feat: add support for Valkey vector database (#24769 ) * feat: add support for Valkey vector database Signed-off-by: Riley Des <riley.desserre@improving.com> * feat: add CLIENT SETNAME to Valkey vector store connections Set client_name on GlideClientConfiguration for both the main client and batch client so connections are identifiable in CLIENT LIST, monitoring dashboards, and CloudWatch metrics. Signed-off-by: Riley Des <riley.desserre@improving.com> --------- Signed-off-by: Riley Des <riley.desserre@improving.com>	2026-06-01 12:20:01 -07:00
Timothy Jaeryang Baek	8644532f5b	refac Co-Authored-By: Classic298 <27028174+Classic298@users.noreply.github.com>	2026-06-01 12:13:04 -07:00
Classic298	9a3eea6448	fix: bind prompt history/version ops to the authorized prompt (#25056 ) The history diff, delete, and version-restore routes authorize the URL prompt_id but then act on a caller-supplied history/version id without checking it belongs to that prompt (IDOR). Filter by prompt_id in compute_diff and delete_history_entry, and reject a cross-prompt version_id in update_prompt_version. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-01 12:07:52 -07:00
Classic298	e623081b2b	fix: handle list-shape data in Firecrawl /search response (#24712 ) Firecrawl /search returns either `{"data": [...]}` (flat list — v1, and what frost19k reported on #23966) or `{"data": {"web": [...]}}` (v2, current production). The parser only handled the dict shape: data = response.get('data') or {} results = data.get('web') or [] On a list-shape response, `data.get('web')` raised AttributeError, caught by the function's outer try/except, and `search_firecrawl` silently returned []. Web search worked against v2 endpoints but is one upstream-format-change away from failing closed again. Accept either.	2026-06-01 12:07:31 -07:00
Classic298	507b8b213c	refac: mirror native FC code_interpreter authz gates onto legacy XML-tag path (#24724 ) The native function-calling tool resolver in utils/tools.py applies five gates before exposing execute_code as a builtin tool: builtin-category enable, ENABLE_CODE_INTERPRETER global config, model capability, features.code_interpreter request flag, and the per-user features.code_interpreter permission. The legacy XML-tag detection path in streaming_chat_response_handler applied only the request-flag gate. Brings the legacy path to parity by running the same five-gate check before activating tag detection. Behaviour change is limited to deployments that previously relied on the asymmetry — admins who set ENABLE_CODE_INTERPRETER=False or revoked the per-user permission, on the legacy tool-calling mode, with the client supplying features.code_interpreter=true. Any of those three conditions met now correctly disables tag detection. Co-authored-by: sfwani <sfwani@users.noreply.github.com>	2026-06-01 12:07:15 -07:00
Timothy Jaeryang Baek	c93f071700	refac	2026-06-01 11:58:16 -07:00
Classic298	76947ff926	fix: reject collection names with unsafe characters in RAG ACL (#24982 ) Open WebUI's collection ACL accepted any unknown name as a legacy/ephemeral collection. In Milvus multi-tenancy mode that name becomes the `resource_id` and is interpolated unescaped into a SQL-like Milvus expression — `resource_id == '<name>'` — so a name like x' or resource_id != '' or resource_id == 'x turns the filter into a tautology and returns every tenant's chunks from the shared collection. All collection names Open WebUI generates are UUIDs, SHA-256 hex digests, or fixed-prefix variants of those — they all fit [A-Za-z0-9_-]. Add a strict format check in filter_accessible_collections (utils.py) that drops any name outside that set before any ACL or vector-store lookup, applied even on the admin bypass path. _validate_collection_access then surfaces the dropped name as a 403. As defense in depth, MilvusClient now validates resource_id at every expression-construction site and escapes single quotes / backslashes in any other string interpolated into a filter (delete ids, metadata filter values). Non-string filter values are typed-checked instead of str()-formatted. Co-authored-by: Claude <noreply@anthropic.com>	2026-06-01 11:48:43 -07:00
Classic298	a089842368	fix: don't revert replace/outlet content on chat save (#25485 ) update_chat_by_id re-derived assistant content from `output` on every save (serialize_output) so frontend edits to output items reflect in content. But it ran unconditionally, so content set independently of output — an __event_emitter__ {"type":"replace"} from an Action, or an outlet filter footer — was reverted to the original output-derived text on the next save. The reload reads chat.chat directly, so the change vanished after navigating away (regression vs 0.9.2, which predates the output mechanism). Re-derive only when the message's `output` actually changed versus what's stored, which still reflects genuine output edits but leaves independently-set content intact. Fixes #24585 Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 11:45:05 -07:00
Classic298	558ea2a152	fix: don't block first-admin signup on stale ENABLE_SIGNUP (#24821 ) Symptom On a fresh install (zero users) the frontend shows the mandatory "Create Admin Account" onboarding screen, but POST /api/v1/auths/signup returns 403 ACCESS_PROHIBITED ("You do not have permission to access this resource."). Wiping the database does not help when the config layer is backed by Redis (the value survives in the Redis/valkey volume), or when only the user table is cleared (the config row survives in Postgres). The instance is then unrecoverable through the UI. Root cause signup_handler() auto-sets request.app.state.config.ENABLE_SIGNUP = False immediately after the first admin is created. That value is persisted by the config layer (the Postgres config table, and Redis when REDIS_URL is set). On a later zero-user database the persisted False is read back, so ENABLE_SIGNUP resolves False even though no users exist. The old gate was: if WEBUI_AUTH: if not ENABLE_SIGNUP or not ENABLE_LOGIN_FORM: if has_users or not ENABLE_INITIAL_ADMIN_SIGNUP: 403 ENABLE_INITIAL_ADMIN_SIGNUP defaults to False, so with zero users the inner test (has_users or not ENABLE_INITIAL_ADMIN_SIGNUP) is True, and a stale ENABLE_SIGNUP=False trips the outer test, producing a 403 on the only UI path that can create the first admin. The frontend decides to show onboarding purely from user_count == 0, so frontend and backend disagree and the instance bricks. Change Split the gate by has_users. Subsequent signups (has_users True) are unchanged: still gated by ENABLE_SIGNUP and ENABLE_LOGIN_FORM. The first user (has_users False, the bootstrap admin the onboarding screen invites) is gated only by the admin-chosen ENABLE_LOGIN_FORM (the documented SSO-only hard-disable) unless ENABLE_INITIAL_ADMIN_SIGNUP is set. It is no longer gated by ENABLE_SIGNUP, which in the zero-user state is never an admin decision but the post-first-admin auto-disable leaking across a database reset. Why this is safe (full case analysis) For WEBUI_AUTH the gate has 16 input combinations over (has_users, ENABLE_SIGNUP, ENABLE_LOGIN_FORM, ENABLE_INITIAL_ADMIN_SIGNUP). Old and new are identical in 15 of them: * All 8 has_users=True cases: both reduce to "403 iff not ENABLE_SIGNUP or not ENABLE_LOGIN_FORM". Unchanged. * 7 of the 8 has_users=False cases: identical. The only changed case is has_users=False, ENABLE_SIGNUP=False, ENABLE_LOGIN_FORM=True, ENABLE_INITIAL_ADMIN_SIGNUP=False: old behaviour 403, new behaviour allow. The new condition is a strict subset of the old (new-403 implies old-403), so the change never newly blocks any request that previously succeeded; it only stops blocking that one bootstrap state. That state has no legitimate deployment. With the login form enabled and zero users the onboarding form is already served, and the only operator-configurable way to keep the first signup closed (SSO-only: ENABLE_LOGIN_FORM=False, optionally with ENABLE_INITIAL_ADMIN_SIGNUP) is preserved byte for byte. ENABLE_SIGNUP=False with zero users is not an operator choice, it is the automatic post-first-admin disable, so the old behaviour there was purely a brick with no recovery path. No security control is weakened: ENABLE_LOGIN_FORM and ENABLE_INITIAL_ADMIN_SIGNUP keep their exact meaning, and the WEBUI_AUTH=False path is untouched. This is not Redis-specific: it reproduces with Redis disabled through the Postgres config table alone (clear the user table, keep the config row). Verification Drove the real signup endpoint across a 10-case matrix on freshly migrated databases, including the full end-to-end first-admin creation (returns role=admin, row persisted as admin) and the preserved SSO-only, subsequent-signup and no-auth behaviours. All pass. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-01 11:44:36 -07:00
Timothy Jaeryang Baek	ad9f2eeb15	refac	2026-06-01 11:34:46 -07:00
G30	1010fd3142	fix(knowledge): remove premature drag-and-drop upload toast (#25484 )	2026-06-01 11:09:58 -07:00
Timothy Jaeryang Baek	1bbb2b933d	refac	2026-06-01 11:08:58 -07:00
Classic298	cc15a01778	fix: don't crash on startup when stdout can't encode the banner (#25482 ) The startup banner uses Unicode box-drawing characters. On a stdout that can't encode them (Windows cp1252, or redirected/headless/pythonw output) print() raises UnicodeEncodeError and aborts startup. This blocks running open-webui serve headless on Windows. Guard the banner print and fall back to a plain ASCII line so startup always proceeds regardless of the console encoding. Fixes #24965 Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-01 10:41:43 -07:00
G30	ceb1cbc009	fix(ui): guard JSON.parse(localStorage) calls with try/catch to prevent UI crashes (#25481 )	2026-06-01 10:33:54 -07:00
HW	bddadafa07	fix(images): pass content_type=None to r.json() to accept non-standard MIME types (#24838 ) aiohttp's ClientResponse.json() validates the Content-Type header against 'application/json' by default and raises ContentTypeError for any other value — including 'application/x-ndjson', which Ollama returns for its OpenAI-compatible /v1/images/generations endpoint. Pass content_type=None to skip this check while keeping all other parsing behaviour unchanged. The fix covers image generation (openai, gemini, automatic1111 engines) and image editing (openai, gemini engines).	2026-06-01 10:33:24 -07:00
Timothy Jaeryang Baek	936d5f2676	refac	2026-06-01 10:26:39 -07:00
Algorithm5838	309caa82fb	fix: persist outlet filter changes to message output (#24884 )	2026-06-01 10:24:40 -07:00
Classic298	b0fa4384ea	fix: cache path traversal via sibling-prefix bypass in serve_cache_file (#25086 ) serve_cache_file gated the resolved path with file_path.startswith(os.path.abspath(CACHE_DIR)) without a trailing os.sep, so any path resolving to a sibling whose name starts with the cache-dir basename (e.g. cache_backup, cached_models) passed the prefix check. Authenticated users could read files from such siblings via /cache/../<sibling>/<file>. Appending os.sep to the prefix closes the bypass; deep traversal and absolute paths were already correctly blocked. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-01 10:17:02 -07:00
Timothy Jaeryang Baek	07cbc91a8e	refac Co-Authored-By: Boris Rybalkin <ribalkin@gmail.com>	2026-06-01 10:16:01 -07:00
Timothy Jaeryang Baek	0e73f7af09	refac Co-Authored-By: bannert <58707896+bannert1337@users.noreply.github.com>	2026-06-01 10:13:50 -07:00

1 2 3 4 5 ...

16755 Commits