update

📝 docs(test): add Zustand store action testing best practices guide
Added comprehensive testing guide for Zustand store actions based on generateAIChat refactoring experience. ## New Guide Content **File**: `.cursor/rules/testing-guide/zustand-store-action-test.mdc` ### Key Topics Covered 1. **Test Layering Principles** 🎯 - Each test layer should only spy on direct dependencies - Avoid cross-layer spying - Clear separation of concerns 2. **Mock Strategy** 🎭 - Per-test mocking (recommended) - Avoid global mocks causing implicit coupling - Only spy common services in beforeEach 3. **File Organization** 📁 - `fixtures.ts`: Test constants and mock data factories - `helpers.ts`: Reusable test utility functions - `[action].test.ts`: Actual test files 4. **Template Code** 📝 - Basic action test template - Internal methods testing - Streaming/async flow testing - Toggle/loading state testing 5. **Common Issues & Solutions** ⚠️ - React state update warnings - Cross-layer spy problems - Mock type mismatches - Global mock pollution 6. **Real Case Study** 🎓 - generateAIChat refactoring experience - Before/after comparisons - Coverage improvement: 54.44% → 82.03% ### Benefits ✅ Clear test architecture guidelines ✅ Ready-to-use template code ✅ Common pitfalls prevention ✅ Real-world examples Updated `rules-index.mdc` to include the new guide. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
2026-06-20 22:26:05 +00:00 · 2025-10-14 21:08:39 +08:00 · 2025-10-14 20:53:52 +08:00 · 2025-10-14 20:45:14 +08:00 · 2025-10-14 10:27:29 +00:00 · 2025-10-14 10:26:28 +00:00
737 changed files with 59893 additions and 7074 deletions
@@ -0,0 +1,175 @@
+---
+description: Guide for adding environment variables to configure user settings
+alwaysApply: false
+---
+
+# Adding Environment Variable for User Settings
+
+Add server-side environment variables to configure default values for user settings.
+
+**Priority**: User Custom > Server Env Var > Hardcoded Default
+
+## Steps
+
+### 1. Define Environment Variable
+
+Create `src/envs/<domain>.ts`:
+
+```typescript
+import { createEnv } from '@t3-oss/env-nextjs';
+import { z } from 'zod';
+
+export const get<Domain>Config = () => {
+  return createEnv({
+    server: {
+      YOUR_ENV_VAR: z.coerce.number().min(MIN).max(MAX).optional(),
+    },
+    runtimeEnv: {
+      YOUR_ENV_VAR: process.env.YOUR_ENV_VAR,
+    },
+  });
+};
+
+export const <domain>Env = get<Domain>Config();
+```
+
+### 2. Update Type (Optional)
+
+**Skip this step if the domain field already exists in `GlobalServerConfig`.**
+
+Add to `packages/types/src/serverConfig.ts`:
+
+```typescript
+export interface GlobalServerConfig {
+  <domain>?: {
+    <settingName>?: <type>;
+  };
+}
+```
+
+**Prefer reusing existing types** from `packages/types/src/user/settings` with `PartialDeep`:
+
+```typescript
+import { User<Domain>Config } from './user/settings';
+
+export interface GlobalServerConfig {
+  <domain>?: PartialDeep<User<Domain>Config>;
+}
+```
+
+### 3. Assemble Server Config (Optional)
+
+**Skip this step if the domain field already exists in server config.**
+
+In `src/server/globalConfig/index.ts`:
+
+```typescript
+import { <domain>Env } from '@/envs/<domain>';
+import { cleanObject } from '@/utils/object';
+
+export const getServerGlobalConfig = async () => {
+  const config: GlobalServerConfig = {
+    // ...
+    <domain>: cleanObject({
+      <settingName>: <domain>Env.YOUR_ENV_VAR,
+    }),
+  };
+  return config;
+};
+```
+
+If the domain already exists, just add the new field to the existing `cleanObject()`:
+
+```typescript
+<domain>: cleanObject({
+  existingField: <domain>Env.EXISTING_VAR,
+  <settingName>: <domain>Env.YOUR_ENV_VAR, // Add this line
+}),
+```
+
+### 4. Merge to User Store (Optional)
+
+**Skip this step if the domain field already exists in `serverSettings`.**
+
+In `src/store/user/slices/common/action.ts`, add to `serverSettings`:
+
+```typescript
+const serverSettings: PartialDeep<UserSettings> = {
+  defaultAgent: serverConfig.defaultAgent,
+  <domain>: serverConfig.<domain>, // Add this line
+  // ...
+};
+```
+
+### 5. Update .env.example
+
+```bash
+# <Description> (range/options, default: X)
+# YOUR_ENV_VAR=<example>
+```
+
+### 6. Update Documentation
+
+Update both English and Chinese documentation:
+- `docs/self-hosting/environment-variables/basic.mdx`
+- `docs/self-hosting/environment-variables/basic.zh-CN.mdx`
+
+Add new section or subsection with environment variable details (type, description, default, example, range/constraints).
+
+## Type Reuse
+
+**Prefer reusing existing types** from `packages/types/src/user/settings` instead of defining inline types in `serverConfig.ts`.
+
+```typescript
+// ✅ Good - reuse existing type
+import { UserImageConfig } from './user/settings';
+
+export interface GlobalServerConfig {
+  image?: PartialDeep<UserImageConfig>;
+}
+
+// ❌ Bad - inline type definition
+export interface GlobalServerConfig {
+  image?: {
+    defaultImageNum?: number;
+  };
+}
+```
+
+## Example: AI_IMAGE_DEFAULT_IMAGE_NUM
+
+```typescript
+// src/envs/image.ts
+export const getImageConfig = () => {
+  return createEnv({
+    server: {
+      AI_IMAGE_DEFAULT_IMAGE_NUM: z.coerce.number().min(1).max(20).optional(),
+    },
+    runtimeEnv: {
+      AI_IMAGE_DEFAULT_IMAGE_NUM: process.env.AI_IMAGE_DEFAULT_IMAGE_NUM,
+    },
+  });
+};
+
+// packages/types/src/serverConfig.ts
+import { UserImageConfig } from './user/settings';
+
+export interface GlobalServerConfig {
+  image?: PartialDeep<UserImageConfig>;
+}
+
+// src/server/globalConfig/index.ts
+image: cleanObject({
+  defaultImageNum: imageEnv.AI_IMAGE_DEFAULT_IMAGE_NUM,
+}),
+
+// src/store/user/slices/common/action.ts
+const serverSettings: PartialDeep<UserSettings> = {
+  image: serverConfig.image,
+  // ...
+};
+
+// .env.example
+# AI_IMAGE_DEFAULT_IMAGE_NUM=4
+```
+
@@ -0,0 +1,25 @@
+---
+globs: packages/database/migrations/**/*
+alwaysApply: false
+---
+
+# Database Migrations Guide
+
+## Defensive Programming - Use Idempotent Clauses
+
+Always use defensive clauses to make migrations idempotent:
+
+```sql
+-- ✅ Good: Idempotent operations
+ALTER TABLE "users" ADD COLUMN IF NOT EXISTS "avatar" text;
+DROP TABLE IF EXISTS "old_table";
+CREATE INDEX IF NOT EXISTS "users_email_idx" ON "users" ("email");
+ALTER TABLE "posts" DROP COLUMN IF EXISTS "deprecated_field";
+
+-- ❌ Bad: Non-idempotent operations
+ALTER TABLE "users" ADD COLUMN "avatar" text;
+DROP TABLE "old_table";
+CREATE INDEX "users_email_idx" ON "users" ("email");
+```
+
+**Important**: After modifying migration SQL (e.g., adding `IF NOT EXISTS` clauses), run `bun run db:generate-client` to update the hash in `packages/database/src/core/migrations.json`.
@@ -6,18 +6,16 @@ alwaysApply: true

 You are developing an open-source, modern-design AI chat framework: lobehub(previous lobe-chat).

-support platforms:
+Supported platforms:

 - web desktop/mobile
 - desktop(electron)
- mobile app(react native). coming soon
+- mobile app(react native), coming soon

 logo emoji: 🤯

 ## Project Technologies Stack

-read [package.json](mdc:package.json) to know all npm packages you can use.
-
 - Next.js 15
 - react 19
 - TypeScript
@@ -33,6 +31,6 @@ read [package.json](mdc:package.json) to know all npm packages you can use.
 - dayjs for time library
 - lodash-es for utility library
 - TRPC for type safe backend
- PGLite for client DB and PostgreSQL for backend DB
+- PGLite for client DB and Neon PostgreSQL for backend DB
 - Drizzle ORM
 - Vitest for testing
@@ -5,11 +5,11 @@ alwaysApply: false

 # LobeChat Project Structure

-note: some not very important files are not shown for simplicity.
-
 ## Complete Project Structure

-this project use common monorepo structure. The workspace packages name use `@lobechat/` namespace.
+This project uses common monorepo structure. The workspace packages name use `@lobechat/` namespace.
+
+**note**: some not very important files are not shown for simplicity.

 ```plaintext
 lobe-chat/
@@ -28,10 +28,12 @@ lobe-chat/
 │   │   │   ├── schemas/
 │   │   │   └── repositories/
 │   ├── model-bank/
+│   │   └── src/
+│   │       └── aiModels/
 │   ├── model-runtime/
 │   │   └── src/
-│   │       ├── openai/
-│   │       └── anthropic/
+│   │       ├── core/
+│   │       └── providers/
 │   ├── types/
 │   │   └── src/
 │   │       ├── message/
@@ -96,14 +98,14 @@ lobe-chat/
 - UI Components: `src/components`, `src/features`
 - Global providers: `src/layout`
 - Zustand stores: `src/store`
- Client Services: `src/services/`
+- Client Services: `src/services/` cross-platform services
  - clientDB: `src/services/<domain>/client.ts`
  - serverDB: `src/services/<domain>/server.ts`
 - API Routers:
  - `src/app/(backend)/webapi` (REST)
  - `src/server/routers/{edge|lambda|async|desktop|tools}` (tRPC)
 - Server:
-  - Services(can access serverDB): `src/server/services`
+  - Services(can access serverDB): `src/server/services` server-used-only services
  - Modules(can't access db): `src/server/modules` (Server only Third-party Service Module)
 - Database:
  - Schema (Drizzle): `packages/database/src/schemas`
@@ -113,8 +115,8 @@ lobe-chat/

 ## Data Flow Architecture

- **Browser/PWA**: React UI → Client Service → Direct Model Access → PGLite (Web WASM)
- **Server**: React UI → Client Service → tRPC Lambda → Server Services → PostgreSQL (Remote)
+- **Web with ClientDB**: React UI → Client Service → Direct Model Access → PGLite (Web WASM)
+- **Web with ServerDB**: React UI → Client Service → tRPC Lambda → Server Services → PostgreSQL (Remote)
 - **Desktop**:
  - Cloud sync disabled: Electron UI → Client Service → tRPC Lambda → Local Server Services → PGLite (Node WASM)
  - Cloud sync enabled: Electron UI → Client Service → tRPC Lambda → Cloud Server Services → PostgreSQL (Remote)
@@ -39,5 +39,6 @@ All following rules are saved under `.cursor/rules/` directory:
 ## Testing

 - `testing-guide/testing-guide.mdc` – Comprehensive testing guide for Vitest
+- `testing-guide/zustand-store-action-test.mdc` – Zustand store action testing best practices
 - `testing-guide/electron-ipc-test.mdc` – Electron IPC interface testing strategy
 - `testing-guide/db-model-test.mdc` – Database Model testing guide
@@ -0,0 +1,410 @@
+---
+globs: src/store/**/__tests__/*.test.ts
+alwaysApply: false
+---
+
+# 🏪 Zustand Store Action Testing Guide
+
+Testing guide for Zustand store actions under `src/store`. This guide is based on lessons learned from the `generateAIChat` refactoring practice.
+
+## Core Principles
+
+### 1. Test Layering Principle 🎯
+
+**Each layer should only test direct dependencies, never spy across layers**
+
+```
+❌ Bad Example - Cross-layer spying
+describe('internal_coreProcessMessage', () => {
+  it('test', async () => {
+    // ❌ Skipping internal_fetchAIChatMessage, directly spying on lower-level service
+    const streamSpy = vi.spyOn(chatService, 'createAssistantMessageStream');
+  });
+});
+
+✅ Good Example - Spy on direct dependencies
+describe('internal_coreProcessMessage', () => {
+  it('test', async () => {
+    // ✅ Only spy on directly called methods
+    const fetchSpy = vi.spyOn(result.current, 'internal_fetchAIChatMessage')
+      .mockResolvedValue({ isFunctionCall: false, content: 'response' });
+  });
+});
+```
+
+### 2. Mocking Strategy 🎭
+
+#### Per-Test Mocking (Recommended)
+
+```typescript
+// ✅ Spy on-demand in each test
+describe('myAction', () => {
+  it('should do something', async () => {
+    // Only spy when needed in this specific test
+    const serviceSpy = vi.spyOn(someService, 'method').mockResolvedValue(result);
+
+    // Test logic...
+
+    serviceSpy.mockRestore(); // Optional: cleanup
+  });
+});
+```
+
+#### Avoid Global Mocks
+
+```typescript
+// ❌ Avoid globally spying on everything in beforeEach
+beforeEach(() => {
+  spyOnEverything(); // Creates implicit coupling between tests
+});
+
+// ✅ Only spy on base services that almost all tests need
+beforeEach(() => {
+  spyOnMessageService(); // Most tests need this
+  // Other services should be spied on-demand within tests
+});
+```
+
+## Action Test Templates 📝
+
+### Basic Action Test
+
+```typescript
+import { act, renderHook } from '@testing-library/react';
+import { afterEach, beforeEach, describe, expect, it, vi } from 'vitest';
+
+import { useStore } from '../store';
+
+// Mock zustand
+vi.mock('zustand/traditional');
+
+// Test constants
+const TEST_IDS = {
+  DATA_ID: 'test-data-id',
+} as const;
+
+// Mock data factory
+const createMockData = (overrides = {}) => ({
+  id: TEST_IDS.DATA_ID,
+  status: 'initial',
+  ...overrides,
+});
+
+beforeEach(() => {
+  vi.clearAllMocks();
+
+  // Setup common mocks that most tests need
+  act(() => {
+    useStore.setState({
+      refreshData: vi.fn(),
+      internalMethod: vi.fn(),
+    });
+  });
+});
+
+afterEach(() => {
+  vi.restoreAllMocks();
+});
+
+describe('myAction', () => {
+  describe('validation', () => {
+    it('should return early when conditions not met', async () => {
+      act(() => {
+        useStore.setState({ requiredData: undefined });
+      });
+
+      const { result } = renderHook(() => useStore());
+
+      await act(async () => {
+        await result.current.myAction();
+      });
+
+      expect(result.current.internalMethod).not.toHaveBeenCalled();
+    });
+  });
+
+  describe('main flow', () => {
+    it('should process data correctly', async () => {
+      const { result } = renderHook(() => useStore());
+      const mockData = createMockData();
+
+      await act(async () => {
+        await result.current.myAction(mockData);
+      });
+
+      expect(result.current.internalMethod).toHaveBeenCalledWith(
+        expect.objectContaining({
+          id: TEST_IDS.DATA_ID,
+          status: 'processed',
+        }),
+      );
+    });
+  });
+
+  describe('error handling', () => {
+    it('should handle errors gracefully', async () => {
+      const { result } = renderHook(() => useStore());
+
+      vi.spyOn(result.current, 'internalMethod').mockRejectedValue(
+        new Error('Test error'),
+      );
+
+      await act(async () => {
+        await result.current.myAction();
+      });
+
+      expect(result.current.errorState).toBeDefined();
+    });
+  });
+});
+```
+
+### Testing Internal Methods
+
+```typescript
+// Save the real implementation for testing
+const realInternalMethod = useStore.getState().internal_method;
+
+describe('internal_method', () => {
+  it('should call correct dependencies', async () => {
+    // Restore the real implementation
+    act(() => {
+      useStore.setState({ internal_method: realInternalMethod });
+    });
+
+    const { result } = renderHook(() => useStore());
+
+    // ✅ Spy on direct dependencies
+    const dependencySpy = vi
+      .spyOn(result.current, 'internal_dependency')
+      .mockResolvedValue(expectedResult);
+
+    await act(async () => {
+      await result.current.internal_method(input);
+    });
+
+    expect(dependencySpy).toHaveBeenCalledWith(
+      expect.objectContaining({ /* expected params */ }),
+    );
+  });
+});
+```
+
+### Testing Streaming/Async Flows
+
+```typescript
+describe('streamingAction', () => {
+  it('should handle streaming chunks', async () => {
+    const { result } = renderHook(() => useStore());
+    const dispatchSpy = vi.spyOn(result.current, 'internal_dispatch');
+
+    // Mock streaming service
+    const streamSpy = vi
+      .spyOn(streamService, 'stream')
+      .mockImplementation(async ({ onChunk, onFinish }) => {
+        await onChunk?.({ type: 'data', content: 'chunk1' });
+        await onChunk?.({ type: 'data', content: 'chunk2' });
+        await onFinish?.('complete');
+      });
+
+    await act(async () => {
+      await result.current.streamingAction();
+    });
+
+    expect(dispatchSpy).toHaveBeenCalledWith(
+      expect.objectContaining({
+        type: 'update',
+        value: expect.objectContaining({ content: 'chunk1' }),
+      }),
+    );
+
+    streamSpy.mockRestore();
+  });
+});
+```
+
+### Testing Toggle/Loading States
+
+```typescript
+describe('internal_toggleLoading', () => {
+  it('should enable loading state with abort controller', () => {
+    const { result } = renderHook(() => useStore());
+
+    act(() => {
+      result.current.internal_toggleLoading(true, TEST_IDS.ITEM_ID, 'action');
+    });
+
+    const state = useStore.getState();
+    expect(state.loadingIds).toEqual([TEST_IDS.ITEM_ID]);
+    expect(state.abortController).toBeInstanceOf(AbortController);
+  });
+
+  it('should disable loading state and clear abort controller', () => {
+    const { result } = renderHook(() => useStore());
+
+    act(() => {
+      result.current.internal_toggleLoading(true, TEST_IDS.ITEM_ID, 'start');
+      result.current.internal_toggleLoading(false, undefined, 'stop');
+    });
+
+    const state = useStore.getState();
+    expect(state.loadingIds).toEqual([]);
+    expect(state.abortController).toBeUndefined();
+  });
+});
+```
+
+## Common Issues and Solutions ⚠️
+
+### Issue 1: React State Update Warning
+
+```typescript
+// ❌ Wrong: setState without wrapping
+useStore.setState({ data: newData });
+
+// ✅ Correct: Wrap all setState with act()
+act(() => {
+  useStore.setState({ data: newData });
+});
+```
+
+### Issue 2: Cross-Layer Spying
+
+```typescript
+// ❌ Wrong: Spy on lower-level services across layers
+describe('highLevelAction', () => {
+  const lowLevelServiceSpy = vi.spyOn(lowLevelService, 'method');
+});
+
+// ✅ Correct: Spy on direct dependencies
+describe('highLevelAction', () => {
+  const directDependencySpy = vi.spyOn(result.current, 'directMethod');
+});
+```
+
+### Issue 3: Mock Type Mismatch
+
+```typescript
+// ❌ Wrong: Return type doesn't match
+vi.spyOn(service, 'method').mockResolvedValue('string');
+// But method returns Response
+
+// ✅ Correct: Return correct type
+vi.spyOn(service, 'method').mockResolvedValue(new Response('string'));
+```
+
+### Issue 4: Global Mock Pollution
+
+```typescript
+// ❌ Wrong: Spy on all services in beforeEach
+beforeEach(() => {
+  spyOnServiceA();
+  spyOnServiceB();
+  spyOnServiceC(); // Creates coupling between tests
+});
+
+// ✅ Correct: Spy on-demand
+beforeEach(() => {
+  spyOnCommonService(); // Only spy on common services
+});
+
+describe('specific test', () => {
+  it('test', () => {
+    const specificSpy = vi.spyOn(specificService, 'method'); // Spy on-demand
+  });
+});
+```
+
+## Test Coverage Goals 📊
+
+### Coverage Requirements
+
+- **Minimum target**: 70%
+- **Recommended target**: 85%+
+- **Excellent target**: 90%+
+
+### Check Coverage
+
+```bash
+# Run coverage for a single test file
+bunx vitest run --coverage 'src/store/[domain]/__tests__/[action].test.ts'
+
+# View coverage for a specific file
+bunx vitest run --coverage --silent='passed-only' 'src/store/[domain]/__tests__/[action].test.ts' | grep "[action].ts"
+```
+
+### Priority Test Scenarios
+
+1. ✅ **Main Flow**: Normal business flow
+2. ✅ **Edge Cases**: Empty data, undefined values, boundary values
+3. ✅ **Error Handling**: Exception scenarios, failure retries
+4. ✅ **State Management**: Loading, Toggle, Abort
+5. ⚠️ **Corner Cases**: Optional, but don't write meaningless tests just for coverage
+
+## Real-World Case: generateAIChat Refactoring 🎓
+
+### Problems Before Refactoring
+
+```typescript
+// ❌ Problem 1: Cross-layer spying
+describe('internal_coreProcessMessage', () => {
+  const streamSpy = vi.spyOn(chatService, 'createAssistantMessageStream');
+  // Skipped the internal_fetchAIChatMessage layer
+});
+
+// ❌ Problem 2: Mocking wrong objects
+describe('internal_fetchAIChatMessage', () => {
+  vi.stubGlobal('fetch', ...); // But doesn't actually call fetch
+});
+
+// ❌ Problem 3: Global spy pollution
+beforeEach(() => {
+  spyOnChatService(); // All tests now have this spy
+});
+```
+
+### Solutions After Refactoring
+
+```typescript
+// ✅ Solution 1: Spy on direct dependencies
+describe('internal_coreProcessMessage', () => {
+  const fetchSpy = vi
+    .spyOn(result.current, 'internal_fetchAIChatMessage')
+    .mockResolvedValue({ isFunctionCall: false, content: 'response' });
+});
+
+// ✅ Solution 2: Mock correct service
+describe('internal_fetchAIChatMessage', () => {
+  const streamSpy = vi
+    .spyOn(chatService, 'createAssistantMessageStream')
+    .mockImplementation(async ({ onMessageHandle, onFinish }) => {
+      await onMessageHandle?.({ type: 'text', text: 'response' });
+      await onFinish?.('response', {});
+    });
+});
+
+// ✅ Solution 3: Spy on-demand
+beforeEach(() => {
+  spyOnMessageService(); // Only spy on common services
+  // Spy on chatService on-demand in tests
+});
+```
+
+### Refactoring Results
+
+- 📈 Coverage improvement: 54.44% → 82.03% (+27.59%)
+- ✅ Test pass rate: 52/52 (100%)
+- 🎯 Type errors: 6 → 0
+- 📝 Clearer tests: Explicit test layering
+
+## Best Practices Checklist ✅
+
+Check before testing:
+
+- [ ] Following test layering principle?
+- [ ] Mock objects match actual calls?
+- [ ] Avoiding global spy pollution?
+- [ ] All setState wrapped with act()?
+- [ ] Tests sufficiently atomic?
+- [ ] Test descriptions clear?
+- [ ] Coverage meets target?
@@ -29,16 +29,11 @@ alwaysApply: false

 ## Code Structure and Readability

- Refactor repeated logic into reusable functions.
 - Prefer object destructuring when accessing and using properties.
 - Use consistent, descriptive naming; avoid obscure abbreviations.
 - Use semantically meaningful variable, function, and class names.
 - Replace magic numbers or strings with well-named constants.
- Keep meaningful code comments; do not remove them when applying edits. Update comments when behavior changes.
- Ensure JSDoc comments accurately reflect the implementation.
- Look for opportunities to simplify or modernize code with the latest JavaScript/TypeScript features where it improves clarity.
 - Defer formatting to tooling; ignore purely formatting-only issues and autofixable lint problems.
- Respect project Prettier settings.

 ## UI and Theming

@@ -50,15 +45,14 @@ alwaysApply: false
 ## Performance

 - Prefer `for…of` loops to index-based `for` loops when feasible.
- Decide whether callbacks should be debounced or throttled based on UX and performance needs.
- Reuse existing npm packages rather than reinventing the wheel (e.g., `lodash-es/omit`).
+- Reuse existing utils inside `packages/utils` or installed npm packages rather than reinventing the wheel.
 - Query only the required columns from a database rather than selecting entire rows.

 ## Time and Consistency

 - Instead of calling `Date.now()` multiple times, assign it to a constant once and reuse it to ensure consistency and improve readability.

-## Some logging rules
+## Logging

 - Never log user private information like api key, etc
 - Don't use `import { log } from 'debug'` to log messages, because it will directly log the message to the console.
@@ -169,6 +169,13 @@ OPENAI_API_KEY=sk-xxxxxxxxx

 # FAL_API_KEY=fal-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

+########################################
+######### AI Image Settings ############
+########################################
+
+# Default image generation count (range: 1-20, default: 4)
+# AI_IMAGE_DEFAULT_IMAGE_NUM=4
+
 ### Nebius ###

 # NEBIUS_API_KEY=sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
@@ -86,7 +86,19 @@ jobs:
                 [Original content]
                 </details>

-            4. Update using gh tool:
+            4. CRITICAL RULES to prevent hallucination and ensure accuracy:
+
+               - The "Original Content" section MUST contain the EXACT, UNMODIFIED original text byte-for-byte. NEVER add, remove, modify, or hallucinate ANY content in this section.
+               - Code blocks, error logs, JSON structures, and other technical content MUST appear in BOTH the translated section AND the original content section WITHOUT ANY MODIFICATION.
+               - When translating content with code/logs/JSON:
+                 * Copy the code/logs/JSON blocks identically to both sections
+                 * Only translate the natural language text (e.g., Chinese, Japanese) surrounding the code blocks
+                 * Keep all technical content (URLs, variable names, error messages in English) unchanged
+               - ALWAYS verify the "Original Content" section matches the source text exactly before updating
+               - If you detect any discrepancy, retrieve the original content again to ensure accuracy
+               - Pay special attention to the end of comments - do not drop or hallucinate the last sentences
+
+            5. Update using gh tool:

               - Choose the correct command based on the Event type in environment information:
                 - If Event is 'issues': gh issue edit [ISSUE_NUMBER] --title "[English title]" --body "[Translated content + Original content]"
@@ -36,7 +36,7 @@ jobs:
      - name: Install bun
        uses: oven-sh/setup-bun@v2
        with:
-          bun-version: ${{ secrets.BUN_VERSION }}
+          bun-version: 1.2.23

      - name: Install deps
        run: bun i
@@ -95,7 +95,7 @@ jobs:
    runs-on: ${{ matrix.os }}
    strategy:
      matrix:
-        os: [macos-latest, macos-13, windows-2025, ubuntu-latest]
+        os: [macos-latest, macos-15-intel, windows-2025, ubuntu-latest]
    steps:
      - uses: actions/checkout@v5
        with:
@@ -129,11 +129,11 @@ jobs:
        run: npm run desktop:build
        env:
          # 设置更新通道，PR构建为nightly，否则为stable
-          UPDATE_CHANNEL: 'nightly'
+          UPDATE_CHANNEL: "nightly"
          APP_URL: http://localhost:3015
-          DATABASE_URL: 'postgresql://postgres@localhost:5432/postgres'
+          DATABASE_URL: "postgresql://postgres@localhost:5432/postgres"
          # 默认添加一个加密 SECRET
-          KEY_VAULTS_SECRET: 'oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE='
+          KEY_VAULTS_SECRET: "oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE="
          # macOS 签名和公证配置
          CSC_LINK: ${{ secrets.APPLE_CERTIFICATE_BASE64 }}
          CSC_KEY_PASSWORD: ${{ secrets.APPLE_CERTIFICATE_PASSWORD }}
@@ -152,10 +152,10 @@ jobs:
        run: npm run desktop:build
        env:
          # 设置更新通道，PR构建为nightly，否则为stable
-          UPDATE_CHANNEL: 'nightly'
+          UPDATE_CHANNEL: "nightly"
          APP_URL: http://localhost:3015
-          DATABASE_URL: 'postgresql://postgres@localhost:5432/postgres'
-          KEY_VAULTS_SECRET: 'oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE='
+          DATABASE_URL: "postgresql://postgres@localhost:5432/postgres"
+          KEY_VAULTS_SECRET: "oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE="
          NEXT_PUBLIC_DESKTOP_PROJECT_ID: ${{ secrets.UMAMI_NIGHTLY_DESKTOP_PROJECT_ID }}
          NEXT_PUBLIC_DESKTOP_UMAMI_BASE_URL: ${{ secrets.UMAMI_NIGHTLY_DESKTOP_BASE_URL }}
          # 将 TEMP 和 TMP 目录设置到 C 盘
@@ -168,10 +168,10 @@ jobs:
        run: npm run desktop:build
        env:
          # 设置更新通道，PR构建为nightly，否则为stable
-          UPDATE_CHANNEL: 'nightly'
+          UPDATE_CHANNEL: "nightly"
          APP_URL: http://localhost:3015
-          DATABASE_URL: 'postgresql://postgres@localhost:5432/postgres'
-          KEY_VAULTS_SECRET: 'oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE='
+          DATABASE_URL: "postgresql://postgres@localhost:5432/postgres"
+          KEY_VAULTS_SECRET: "oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE="
          NEXT_PUBLIC_DESKTOP_PROJECT_ID: ${{ secrets.UMAMI_NIGHTLY_DESKTOP_PROJECT_ID }}
          NEXT_PUBLIC_DESKTOP_UMAMI_BASE_URL: ${{ secrets.UMAMI_NIGHTLY_DESKTOP_BASE_URL }}

@@ -188,7 +188,7 @@ jobs:
            else
              ARCH_SUFFIX="x64"
            fi
-            
+
            mv latest-mac.yml "latest-mac-${ARCH_SUFFIX}.yml"
            echo "✅ Renamed latest-mac.yml to latest-mac-${ARCH_SUFFIX}.yml (detected: $SYSTEM_ARCH)"
            ls -la latest-mac-*.yml
@@ -234,7 +234,7 @@ jobs:
      - name: Install bun
        uses: oven-sh/setup-bun@v2
        with:
-          bun-version: ${{ secrets.BUN_VERSION }}
+          bun-version: 1.2.23

      # 下载所有平台的构建产物
      - name: Download artifacts
@@ -0,0 +1,52 @@
+name: E2E CI
+permissions:
+  contents: read
+
+on:
+    pull_request:
+    push:
+
+concurrency:
+    group: e2e-${{ github.ref }}
+    cancel-in-progress: true
+
+jobs:
+    e2e:
+        name: Test Web App
+        runs-on: ubuntu-latest
+        timeout-minutes: 25
+        steps:
+            - name: Checkout
+              uses: actions/checkout@v4
+
+            - name: Setup Bun
+              uses: oven-sh/setup-bun@v2
+              with:
+                  bun-version: latest
+
+            - name: Install dependencies (bun)
+              run: bun install
+
+            - name: Install Playwright browsers (with system deps)
+              run: bunx playwright install --with-deps chromium
+
+            - name: Run E2E tests
+              env:
+                  PORT: 3010
+              run: bun run e2e
+
+            - name: Upload Playwright HTML report (on failure)
+              if: failure()
+              uses: actions/upload-artifact@v4
+              with:
+                  name: playwright-report
+                  path: playwright-report
+                  if-no-files-found: ignore
+
+            - name: Upload Playwright traces (on failure)
+              if: failure()
+              uses: actions/upload-artifact@v4
+              with:
+                  name: test-results
+                  path: test-results
+                  if-no-files-found: ignore
@@ -19,7 +19,7 @@ jobs:
        uses: actions/checkout@v5

      - name: Lock closed issues after 7 days of inactivity
-        uses: actions/github-script@v7
+        uses: actions/github-script@v8
        with:
          script: |
            const lockScript = require('./.github/scripts/lock-closed-issues.js');
@@ -32,7 +32,7 @@ jobs:
      - name: Install bun
        uses: oven-sh/setup-bun@v2
        with:
-          bun-version: ${{ secrets.BUN_VERSION }}
+          bun-version: 1.2.23

      - name: Install deps
        run: bun i
@@ -82,7 +82,7 @@ jobs:
    runs-on: ${{ matrix.os }}
    strategy:
      matrix:
-        os: [macos-latest, macos-13, windows-2025, ubuntu-latest]
+        os: [macos-latest, macos-15-intel, windows-2025, ubuntu-latest]
    steps:
      - uses: actions/checkout@v5
        with:
@@ -116,9 +116,9 @@ jobs:
        run: npm run desktop:build
        env:
          APP_URL: http://localhost:3015
-          DATABASE_URL: 'postgresql://postgres@localhost:5432/postgres'
+          DATABASE_URL: "postgresql://postgres@localhost:5432/postgres"
          # 默认添加一个加密 SECRET
-          KEY_VAULTS_SECRET: 'oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE='
+          KEY_VAULTS_SECRET: "oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE="
          # macOS 签名和公证配置
          CSC_LINK: ${{ secrets.APPLE_CERTIFICATE_BASE64 }}
          CSC_KEY_PASSWORD: ${{ secrets.APPLE_CERTIFICATE_PASSWORD }}
@@ -137,8 +137,8 @@ jobs:
        run: npm run desktop:build
        env:
          APP_URL: http://localhost:3015
-          DATABASE_URL: 'postgresql://postgres@localhost:5432/postgres'
-          KEY_VAULTS_SECRET: 'oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE='
+          DATABASE_URL: "postgresql://postgres@localhost:5432/postgres"
+          KEY_VAULTS_SECRET: "oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE="
          NEXT_PUBLIC_DESKTOP_PROJECT_ID: ${{ secrets.UMAMI_BETA_DESKTOP_PROJECT_ID }}
          NEXT_PUBLIC_DESKTOP_UMAMI_BASE_URL: ${{ secrets.UMAMI_BETA_DESKTOP_BASE_URL }}

@@ -152,8 +152,8 @@ jobs:
        run: npm run desktop:build
        env:
          APP_URL: http://localhost:3015
-          DATABASE_URL: 'postgresql://postgres@localhost:5432/postgres'
-          KEY_VAULTS_SECRET: 'oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE='
+          DATABASE_URL: "postgresql://postgres@localhost:5432/postgres"
+          KEY_VAULTS_SECRET: "oLXWIiR/AKF+rWaqy9lHkrYgzpATbW3CtJp3UfkVgpE="
          NEXT_PUBLIC_DESKTOP_PROJECT_ID: ${{ secrets.UMAMI_BETA_DESKTOP_PROJECT_ID }}
          NEXT_PUBLIC_DESKTOP_UMAMI_BASE_URL: ${{ secrets.UMAMI_BETA_DESKTOP_BASE_URL }}

@@ -170,7 +170,7 @@ jobs:
            else
              ARCH_SUFFIX="x64"
            fi
-            
+
            mv latest-mac.yml "latest-mac-${ARCH_SUFFIX}.yml"
            echo "✅ Renamed latest-mac.yml to latest-mac-${ARCH_SUFFIX}.yml (detected: $SYSTEM_ARCH)"
            ls -la latest-mac-*.yml
@@ -216,7 +216,7 @@ jobs:
      - name: Install bun
        uses: oven-sh/setup-bun@v2
        with:
-          bun-version: ${{ secrets.BUN_VERSION }}
+          bun-version: 1.2.23

      # 下载所有平台的构建产物
      - name: Download artifacts
@@ -41,7 +41,7 @@ jobs:
      - name: Install bun
        uses: oven-sh/setup-bun@v2
        with:
-          bun-version: ${{ secrets.BUN_VERSION }}
+          bun-version: 1.2.23

      - name: Install deps
        run: bun i
@@ -71,7 +71,7 @@ jobs:
      - name: Install bun
        uses: oven-sh/setup-bun@v2
        with:
-          bun-version: ${{ secrets.BUN_VERSION }}
+          bun-version: 1.2.23

      - name: Install deps
        run: bun i
@@ -104,7 +104,7 @@ jobs:
      - name: Install bun
        uses: oven-sh/setup-bun@v2
        with:
-          bun-version: ${{ secrets.BUN_VERSION }}
+          bun-version: 1.2.23

      - name: Install deps
        run: bun i
@@ -148,7 +148,7 @@ jobs:
      - name: Install bun
        uses: oven-sh/setup-bun@v2
        with:
-          bun-version: ${{ secrets.BUN_VERSION }}
+          bun-version: 1.2.23

      - name: Install deps
        run: bun i
@@ -25,7 +25,7 @@ module.exports = defineConfig({
  ],
  temperature: 0,
  saveImmediately: true,
-  modelName: 'gpt-4.1-mini',
+  modelName: 'chatgpt-4o-latest',
  experimental: {
    jsonMode: true,
  },
@@ -2,6 +2,625 @@

 # Changelog

+### [Version 1.137.5](https://github.com/lobehub/lobe-chat/compare/v1.137.4...v1.137.5)
+
+<sup>Released on **2025-10-14**</sup>
+
+#### 💄 Styles
+
+- **misc**: Add imagen model to vertex ai.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Add imagen model to vertex ai, closes [#9699](https://github.com/lobehub/lobe-chat/issues/9699) ([3b2a2c1](https://github.com/lobehub/lobe-chat/commit/3b2a2c1))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.137.4](https://github.com/lobehub/lobe-chat/compare/v1.137.3...v1.137.4)
+
+<sup>Released on **2025-10-14**</sup>
+
+#### 🐛 Bug Fixes
+
+- **misc**: Prevent Vertex AI JSON credentials from being split by comma.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **misc**: Prevent Vertex AI JSON credentials from being split by comma, closes [#9703](https://github.com/lobehub/lobe-chat/issues/9703) [#9477](https://github.com/lobehub/lobe-chat/issues/9477) ([189081d](https://github.com/lobehub/lobe-chat/commit/189081d))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.137.3](https://github.com/lobehub/lobe-chat/compare/v1.137.2...v1.137.3)
+
+<sup>Released on **2025-10-14**</sup>
+
+#### 🐛 Bug Fixes
+
+- **misc**: Fix mcp server connect issue and refactor web search implement, fix tools calling long name length >64 issue.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **misc**: Fix mcp server connect issue and refactor web search implement, closes [#9694](https://github.com/lobehub/lobe-chat/issues/9694) ([15ebcb4](https://github.com/lobehub/lobe-chat/commit/15ebcb4))
+- **misc**: Fix tools calling long name length >64 issue, closes [#9697](https://github.com/lobehub/lobe-chat/issues/9697) ([cb98604](https://github.com/lobehub/lobe-chat/commit/cb98604))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.137.2](https://github.com/lobehub/lobe-chat/compare/v1.137.1...v1.137.2)
+
+<sup>Released on **2025-10-14**</sup>
+
+#### 🐛 Bug Fixes
+
+- **misc**: Fix the Worker URL cross-origin issue.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **misc**: Fix the Worker URL cross-origin issue, closes [#9624](https://github.com/lobehub/lobe-chat/issues/9624) ([d379112](https://github.com/lobehub/lobe-chat/commit/d379112))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.137.1](https://github.com/lobehub/lobe-chat/compare/v1.137.0...v1.137.1)
+
+<sup>Released on **2025-10-14**</sup>
+
+#### 💄 Styles
+
+- **misc**: Change the user chatItem maxWidth should use flex 1.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Change the user chatItem maxWidth should use flex 1, closes [#9689](https://github.com/lobehub/lobe-chat/issues/9689) ([cfd5221](https://github.com/lobehub/lobe-chat/commit/cfd5221))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+## [Version 1.137.0](https://github.com/lobehub/lobe-chat/compare/v1.136.13...v1.137.0)
+
+<sup>Released on **2025-10-12**</sup>
+
+#### ✨ Features
+
+- **misc**: Add new setting for default image num.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's improved
+
+- **misc**: Add new setting for default image num, closes [#9618](https://github.com/lobehub/lobe-chat/issues/9618) ([de7368b](https://github.com/lobehub/lobe-chat/commit/de7368b))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.13](https://github.com/lobehub/lobe-chat/compare/v1.136.12...v1.136.13)
+
+<sup>Released on **2025-10-12**</sup>
+
+#### 🐛 Bug Fixes
+
+- **misc**: Fix input cannot send markdown.
+
+#### 💄 Styles
+
+- **misc**: Optimize OpenRouter modelFetch endpoint, update i18n.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **misc**: Fix input cannot send markdown, closes [#9674](https://github.com/lobehub/lobe-chat/issues/9674) ([2518d7e](https://github.com/lobehub/lobe-chat/commit/2518d7e))
+
+#### Styles
+
+- **misc**: Optimize OpenRouter modelFetch endpoint, closes [#9671](https://github.com/lobehub/lobe-chat/issues/9671) ([0038a64](https://github.com/lobehub/lobe-chat/commit/0038a64))
+- **misc**: Update i18n, closes [#9665](https://github.com/lobehub/lobe-chat/issues/9665) ([02096ea](https://github.com/lobehub/lobe-chat/commit/02096ea))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.12](https://github.com/lobehub/lobe-chat/compare/v1.136.11...v1.136.12)
+
+<sup>Released on **2025-10-11**</sup>
+
+#### 💄 Styles
+
+- **misc**: Add more AWS regions, Update infini-ai models.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Add more AWS regions, closes [#9644](https://github.com/lobehub/lobe-chat/issues/9644) ([4a82daf](https://github.com/lobehub/lobe-chat/commit/4a82daf))
+- **misc**: Update infini-ai models, closes [#9646](https://github.com/lobehub/lobe-chat/issues/9646) ([5274225](https://github.com/lobehub/lobe-chat/commit/5274225))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.11](https://github.com/lobehub/lobe-chat/compare/v1.136.10...v1.136.11)
+
+<sup>Released on **2025-10-11**</sup>
+
+#### 💄 Styles
+
+- **misc**: Add capability inference for web search, image output and video recognition in model parsing and update UI form items to support search, imageOutput and video abilities.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Add capability inference for web search, image output and video recognition in model parsing and update UI form items to support search, imageOutput and video abilities, closes [#9022](https://github.com/lobehub/lobe-chat/issues/9022) ([4e44569](https://github.com/lobehub/lobe-chat/commit/4e44569))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.10](https://github.com/lobehub/lobe-chat/compare/v1.136.9...v1.136.10)
+
+<sup>Released on **2025-10-11**</sup>
+
+#### 💄 Styles
+
+- **misc**: Improve search experience.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Improve search experience, closes [#9661](https://github.com/lobehub/lobe-chat/issues/9661) ([8624f84](https://github.com/lobehub/lobe-chat/commit/8624f84))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.9](https://github.com/lobehub/lobe-chat/compare/v1.136.8...v1.136.9)
+
+<sup>Released on **2025-10-11**</sup>
+
+#### 💄 Styles
+
+- **misc**: Add lab to support disable/enable rich text.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Add lab to support disable/enable rich text, closes [#9652](https://github.com/lobehub/lobe-chat/issues/9652) ([658c294](https://github.com/lobehub/lobe-chat/commit/658c294))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.8](https://github.com/lobehub/lobe-chat/compare/v1.136.7...v1.136.8)
+
+<sup>Released on **2025-10-11**</sup>
+
+#### 🐛 Bug Fixes
+
+- **provider**: Add deepseek-v3.1-terminus to THINKING_MODELS.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **provider**: Add deepseek-v3.1-terminus to THINKING_MODELS, closes [#9653](https://github.com/lobehub/lobe-chat/issues/9653) [#9648](https://github.com/lobehub/lobe-chat/issues/9648) ([e9b5c69](https://github.com/lobehub/lobe-chat/commit/e9b5c69))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.7](https://github.com/lobehub/lobe-chat/compare/v1.136.6...v1.136.7)
+
+<sup>Released on **2025-10-11**</sup>
+
+#### 🐛 Bug Fixes
+
+- **misc**: Disable rich text in markdown editor.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **misc**: Disable rich text in markdown editor, closes [#9637](https://github.com/lobehub/lobe-chat/issues/9637) ([9349ce2](https://github.com/lobehub/lobe-chat/commit/9349ce2))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.6](https://github.com/lobehub/lobe-chat/compare/v1.136.5...v1.136.6)
+
+<sup>Released on **2025-10-11**</sup>
+
+#### 🐛 Bug Fixes
+
+- **bedrock**: Add parameter conflict handling for Claude 4+ models.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **bedrock**: Add parameter conflict handling for Claude 4+ models, closes [#9627](https://github.com/lobehub/lobe-chat/issues/9627) [#9523](https://github.com/lobehub/lobe-chat/issues/9523) ([54b6217](https://github.com/lobehub/lobe-chat/commit/54b6217))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.5](https://github.com/lobehub/lobe-chat/compare/v1.136.4...v1.136.5)
+
+<sup>Released on **2025-10-11**</sup>
+
+#### 🐛 Bug Fixes
+
+- **plugin-store**: Fix search functionality for old plugin store.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **plugin-store**: Fix search functionality for old plugin store, closes [#9651](https://github.com/lobehub/lobe-chat/issues/9651) [#9645](https://github.com/lobehub/lobe-chat/issues/9645) ([522fc09](https://github.com/lobehub/lobe-chat/commit/522fc09))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.4](https://github.com/lobehub/lobe-chat/compare/v1.136.3...v1.136.4)
+
+<sup>Released on **2025-10-10**</sup>
+
+#### 🐛 Bug Fixes
+
+- **misc**: Add 'gemini-2.5-flash-image' to disabled models Thinking.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **misc**: Add 'gemini-2.5-flash-image' to disabled models Thinking, closes [#9633](https://github.com/lobehub/lobe-chat/issues/9633) ([771b585](https://github.com/lobehub/lobe-chat/commit/771b585))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.3](https://github.com/lobehub/lobe-chat/compare/v1.136.2...v1.136.3)
+
+<sup>Released on **2025-10-10**</sup>
+
+#### 💄 Styles
+
+- **misc**: Add delete & regenerate hotkeys.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Add delete & regenerate hotkeys, closes [#9538](https://github.com/lobehub/lobe-chat/issues/9538) ([d948580](https://github.com/lobehub/lobe-chat/commit/d948580))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.2](https://github.com/lobehub/lobe-chat/compare/v1.136.1...v1.136.2)
+
+<sup>Released on **2025-10-10**</sup>
+
+#### 💄 Styles
+
+- **misc**: Update i18n.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Update i18n, closes [#9625](https://github.com/lobehub/lobe-chat/issues/9625) ([70d356d](https://github.com/lobehub/lobe-chat/commit/70d356d))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.136.1](https://github.com/lobehub/lobe-chat/compare/v1.136.0...v1.136.1)
+
+<sup>Released on **2025-10-09**</sup>
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+## [Version 1.136.0](https://github.com/lobehub/lobe-chat/compare/v1.135.6...v1.136.0)
+
+<sup>Released on **2025-10-09**</sup>
+
+#### ✨ Features
+
+- **misc**: Add new provider Cerebras.
+
+#### 🐛 Bug Fixes
+
+- **misc**: Fix standalone plugin rerender issue.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's improved
+
+- **misc**: Add new provider Cerebras, closes [#9559](https://github.com/lobehub/lobe-chat/issues/9559) ([9cceaad](https://github.com/lobehub/lobe-chat/commit/9cceaad))
+
+#### What's fixed
+
+- **misc**: Fix standalone plugin rerender issue, closes [#9611](https://github.com/lobehub/lobe-chat/issues/9611) [#9396](https://github.com/lobehub/lobe-chat/issues/9396) ([7ab30fc](https://github.com/lobehub/lobe-chat/commit/7ab30fc))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.135.6](https://github.com/lobehub/lobe-chat/compare/v1.135.5...v1.135.6)
+
+<sup>Released on **2025-10-08**</sup>
+
+#### 🐛 Bug Fixes
+
+- **desktop**: Macos26 small icon.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### What's fixed
+
+- **desktop**: Macos26 small icon, closes [#9421](https://github.com/lobehub/lobe-chat/issues/9421) ([ca03342](https://github.com/lobehub/lobe-chat/commit/ca03342))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.135.5](https://github.com/lobehub/lobe-chat/compare/v1.135.4...v1.135.5)
+
+<sup>Released on **2025-10-08**</sup>
+
+#### 💄 Styles
+
+- **misc**: Update i18n.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Update i18n, closes [#9602](https://github.com/lobehub/lobe-chat/issues/9602) ([ed267a4](https://github.com/lobehub/lobe-chat/commit/ed267a4))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.135.4](https://github.com/lobehub/lobe-chat/compare/v1.135.3...v1.135.4)
+
+<sup>Released on **2025-10-07**</sup>
+
+#### ♻ Code Refactoring
+
+- **misc**: Refactor chat item.
+
+#### 💄 Styles
+
+- **misc**: Add GPT-5 pro model.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Code refactoring
+
+- **misc**: Refactor chat item, closes [#9599](https://github.com/lobehub/lobe-chat/issues/9599) ([1f36158](https://github.com/lobehub/lobe-chat/commit/1f36158))
+
+#### Styles
+
+- **misc**: Add GPT-5 pro model, closes [#9594](https://github.com/lobehub/lobe-chat/issues/9594) ([775f30b](https://github.com/lobehub/lobe-chat/commit/775f30b))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
+### [Version 1.135.3](https://github.com/lobehub/lobe-chat/compare/v1.135.2...v1.135.3)
+
+<sup>Released on **2025-10-07**</sup>
+
+#### 💄 Styles
+
+- **misc**: Improve Korean translate.
+
+<br/>
+
+<details>
+<summary><kbd>Improvements and Fixes</kbd></summary>
+
+#### Styles
+
+- **misc**: Improve Korean translate, closes [#9597](https://github.com/lobehub/lobe-chat/issues/9597) ([319fbfb](https://github.com/lobehub/lobe-chat/commit/319fbfb))
+
+</details>
+
+<div align="right">
+
+[![](https://img.shields.io/badge/-BACK_TO_TOP-151515?style=flat-square)](#readme-top)
+
+</div>
+
 ### [Version 1.135.2](https://github.com/lobehub/lobe-chat/compare/v1.135.1...v1.135.2)

 <sup>Released on **2025-10-06**</sup>
@@ -31,28 +31,18 @@ This repository adopts a monorepo structure.

 see @.cursor/rules/typescript.mdc

-### Modify Code Rules
-
- **Code Language**:
-  - For files with existing Chinese comments: Continue using Chinese to maintain consistency
-  - For new files or files without Chinese comments: MUST use American English.
-    - eg: new react tsx file and new test file
- Conservative for existing code, modern approaches for new features
-
 ### Testing

-Testing work follows the Rule-Aware Task Execution system above.
-
- **Required Rule**: `testing-guide/testing-guide.mdc`
+- **Required Rule**: read `@.cursor/rules/testing-guide/testing-guide.mdc` before writing tests
 - **Command**:
  - web: `bunx vitest run --silent='passed-only' '[file-path-pattern]'`
  - packages(eg: database): `cd packages/database && bunx vitest run --silent='passed-only' '[file-path-pattern]'`

 **Important**:

- wrapped the file path in single quotes to avoid shell expansion
+- wrap the file path in single quotes to avoid shell expansion
 - Never run `bun run test` etc to run tests, this will run all tests and cost about 10mins
- If try to fix the same test twice, but still failed, stop and ask for help.
+- If trying to fix the same test twice, but still failed, stop and ask for help.

 ### Typecheck

@@ -61,15 +51,9 @@ Testing work follows the Rule-Aware Task Execution system above.
 ### i18n

 - **Keys**: Add to `src/locales/default/namespace.ts`
- **Dev**: Translate `locales/zh-CN/namespace.json` locale file only for preview
+- **Dev**: Translate `locales/zh-CN/namespace.json` and `locales/en-US/namespace.json` locales file only for dev preview
 - DON'T run `pnpm i18n`, let CI auto handle it

 ## Rules Index

-Some useful rules of this project. Read them when needed.
-
-**IMPORTANT**: All rule files referenced in this document are located in the `.cursor/rules/` directory. Throughout this document, rule files are referenced by their filename only for brevity.
-
-### 📋 Complete Rule Files
-
 Some useful project rules are listed in @.cursor/rules/rules-index.mdc
@@ -142,7 +142,8 @@ ENV ACCESS_CODE="" \
    DEFAULT_AGENT_CONFIG="" \
    SYSTEM_AGENT="" \
    FEATURE_FLAGS="" \
-    PROXY_URL=""
+    PROXY_URL="" \
+    ENABLE_AUTH_PROTECTION=""

 # Model Variables
 ENV \
@@ -255,7 +256,7 @@ ENV \
    # 302.AI
    AI302_API_KEY="" AI302_MODEL_LIST="" \
    # FAL
-    FAL_API_KEY="" FAL_MODEL_LIST="" \
+    ENABLED_FAL="" FAL_API_KEY="" FAL_MODEL_LIST="" \
    # BFL
    BFL_API_KEY="" BFL_MODEL_LIST="" \
    # Vercel AI Gateway
@@ -39,6 +39,8 @@ ARG USE_CN_MIRROR
 ARG NEXT_PUBLIC_BASE_PATH
 ARG NEXT_PUBLIC_SERVICE_MODE
 ARG NEXT_PUBLIC_ENABLE_NEXT_AUTH
+ARG NEXT_PUBLIC_ENABLE_CLERK_AUTH
+ARG NEXT_PUBLIC_CLERK_PUBLISHABLE_KEY
 ARG NEXT_PUBLIC_SENTRY_DSN
 ARG NEXT_PUBLIC_ANALYTICS_POSTHOG
 ARG NEXT_PUBLIC_POSTHOG_HOST
@@ -53,6 +55,9 @@ ENV NEXT_PUBLIC_BASE_PATH="${NEXT_PUBLIC_BASE_PATH}" \

 ENV NEXT_PUBLIC_SERVICE_MODE="${NEXT_PUBLIC_SERVICE_MODE:-server}" \
    NEXT_PUBLIC_ENABLE_NEXT_AUTH="${NEXT_PUBLIC_ENABLE_NEXT_AUTH:-1}" \
+    NEXT_PUBLIC_ENABLE_CLERK_AUTH="${NEXT_PUBLIC_ENABLE_CLERK_AUTH:-0}" \
+    NEXT_PUBLIC_CLERK_PUBLISHABLE_KEY="${NEXT_PUBLIC_CLERK_PUBLISHABLE_KEY}" \
+    CLERK_WEBHOOK_SECRET="whsec_xxx" \
    APP_URL="http://app.com" \
    DATABASE_DRIVER="node" \
    DATABASE_URL="postgres://postgres:password@localhost:5432/postgres" \
@@ -166,7 +171,8 @@ ENV ACCESS_CODE="" \
    DEFAULT_AGENT_CONFIG="" \
    SYSTEM_AGENT="" \
    FEATURE_FLAGS="" \
-    PROXY_URL=""
+    PROXY_URL="" \
+    ENABLE_AUTH_PROTECTION=""

 # Database
 ENV KEY_VAULTS_SECRET="" \
@@ -178,13 +184,19 @@ ENV NEXT_AUTH_SECRET="" \
    NEXT_AUTH_SSO_PROVIDERS="" \
    NEXTAUTH_URL=""

+# Clerk
+ENV CLERK_SECRET_KEY="" \
+    CLERK_WEBHOOK_SECRET=""
+
 # S3
 ENV NEXT_PUBLIC_S3_DOMAIN="" \
    S3_PUBLIC_DOMAIN="" \
    S3_ACCESS_KEY_ID="" \
    S3_BUCKET="" \
    S3_ENDPOINT="" \
-    S3_SECRET_ACCESS_KEY=""
+    S3_SECRET_ACCESS_KEY="" \
+    S3_ENABLE_PATH_STYLE="" \
+    S3_SET_ACL=""

 # Model Variables
 ENV \
@@ -297,11 +309,13 @@ ENV \
    # 302.AI
    AI302_API_KEY="" AI302_MODEL_LIST="" \
    # FAL
-    FAL_API_KEY="" FAL_MODEL_LIST="" \
+    ENABLED_FAL="" FAL_API_KEY="" FAL_MODEL_LIST="" \
    # BFL
    BFL_API_KEY="" BFL_MODEL_LIST="" \
    # Vercel AI Gateway
-    VERCELAIGATEWAY_API_KEY="" VERCELAIGATEWAY_MODEL_LIST=""
+    VERCELAIGATEWAY_API_KEY="" VERCELAIGATEWAY_MODEL_LIST="" \
+    # Cerebras
+    CEREBRAS_API_KEY="" CEREBRAS_MODEL_LIST=""

 USER nextjs

@@ -144,7 +144,8 @@ ENV ACCESS_CODE="" \
    DEFAULT_AGENT_CONFIG="" \
    SYSTEM_AGENT="" \
    FEATURE_FLAGS="" \
-    PROXY_URL=""
+    PROXY_URL="" \
+    ENABLE_AUTH_PROTECTION=""

 # Model Variables
 ENV \
@@ -253,7 +254,7 @@ ENV \
    # 302.AI
    AI302_API_KEY="" AI302_MODEL_LIST="" \
    # FAL
-    FAL_API_KEY="" FAL_MODEL_LIST="" \
+    ENABLED_FAL="" FAL_API_KEY="" FAL_MODEL_LIST="" \
    # BFL
    BFL_API_KEY="" BFL_MODEL_LIST="" \
    # Vercel AI Gateway
@@ -1,5 +1,7 @@
 const dotenv = require('dotenv');
+const fs = require('node:fs/promises');
 const os = require('node:os');
+const path = require('node:path');

 dotenv.config();

@@ -32,11 +34,50 @@ const getProtocolScheme = () => {

 const protocolScheme = getProtocolScheme();

+// Determine icon file based on version type
+const getIconFileName = () => {
+  if (isNightly) return 'Icon-nightly';
+  if (isBeta) return 'Icon-beta';
+  return 'Icon';
+};
+
 /**
 * @type {import('electron-builder').Configuration}
 * @see https://www.electron.build/configuration
 */
 const config = {
+  /**
+   * AfterPack hook to copy pre-generated Liquid Glass Assets.car for macOS 26+
+   * @see https://github.com/electron-userland/electron-builder/issues/9254
+   * @see https://github.com/MultiboxLabs/flow-browser/pull/159
+   * @see https://github.com/electron/packager/pull/1806
+   */
+  afterPack: async (context) => {
+    // Only process macOS builds
+    if (context.electronPlatformName !== 'darwin') {
+      return;
+    }
+
+    const iconFileName = getIconFileName();
+    const assetsCarSource = path.join(__dirname, 'build', `${iconFileName}.Assets.car`);
+    const resourcesPath = path.join(
+      context.appOutDir,
+      `${context.packager.appInfo.productFilename}.app`,
+      'Contents',
+      'Resources',
+    );
+    const assetsCarDest = path.join(resourcesPath, 'Assets.car');
+
+    try {
+      await fs.access(assetsCarSource);
+      await fs.copyFile(assetsCarSource, assetsCarDest);
+      console.log(`✅ Copied Liquid Glass icon: ${iconFileName}.Assets.car`);
+    } catch {
+      // Non-critical: Assets.car not found or copy failed
+      // App will use fallback .icns icon on all macOS versions
+      console.log(`⏭️  Skipping Assets.car (not found or copy failed)`);
+    }
+  },
  appId: isNightly
    ? 'com.lobehub.lobehub-desktop-nightly'
    : isBeta
@@ -81,6 +122,7 @@ const config = {
    compression: 'maximum',
    entitlementsInherit: 'build/entitlements.mac.plist',
    extendInfo: {
+      CFBundleIconName: 'AppIcon',
      CFBundleURLTypes: [
        {
          CFBundleURLName: 'LobeHub Protocol',
@@ -1,4 +1,173 @@
 [
+  {
+    "children": {
+      "improvements": ["Add imagen model to vertex ai."]
+    },
+    "date": "2025-10-14",
+    "version": "1.137.5"
+  },
+  {
+    "children": {
+      "fixes": ["Prevent Vertex AI JSON credentials from being split by comma."]
+    },
+    "date": "2025-10-14",
+    "version": "1.137.4"
+  },
+  {
+    "children": {
+      "fixes": [
+        "Fix mcp server connect issue and refactor web search implement, fix tools calling long name length >64 issue."
+      ]
+    },
+    "date": "2025-10-14",
+    "version": "1.137.3"
+  },
+  {
+    "children": {
+      "fixes": ["Fix the Worker URL cross-origin issue."]
+    },
+    "date": "2025-10-14",
+    "version": "1.137.2"
+  },
+  {
+    "children": {
+      "improvements": ["Change the user chatItem maxWidth should use flex 1."]
+    },
+    "date": "2025-10-14",
+    "version": "1.137.1"
+  },
+  {
+    "children": {
+      "features": ["Add new setting for default image num."]
+    },
+    "date": "2025-10-12",
+    "version": "1.137.0"
+  },
+  {
+    "children": {
+      "fixes": ["Fix input cannot send markdown."],
+      "improvements": ["Optimize OpenRouter modelFetch endpoint, update i18n."]
+    },
+    "date": "2025-10-12",
+    "version": "1.136.13"
+  },
+  {
+    "children": {
+      "improvements": ["Add more AWS regions, Update infini-ai models."]
+    },
+    "date": "2025-10-11",
+    "version": "1.136.12"
+  },
+  {
+    "children": {
+      "improvements": [
+        "Add capability inference for web search, image output and video recognition in model parsing and update UI form items to support search, imageOutput and video abilities."
+      ]
+    },
+    "date": "2025-10-11",
+    "version": "1.136.11"
+  },
+  {
+    "children": {
+      "improvements": ["Improve search experience."]
+    },
+    "date": "2025-10-11",
+    "version": "1.136.10"
+  },
+  {
+    "children": {
+      "improvements": ["Add lab to support disable/enable rich text."]
+    },
+    "date": "2025-10-11",
+    "version": "1.136.9"
+  },
+  {
+    "children": {},
+    "date": "2025-10-11",
+    "version": "1.136.8"
+  },
+  {
+    "children": {
+      "fixes": ["Disable rich text in markdown editor."]
+    },
+    "date": "2025-10-11",
+    "version": "1.136.7"
+  },
+  {
+    "children": {},
+    "date": "2025-10-11",
+    "version": "1.136.6"
+  },
+  {
+    "children": {},
+    "date": "2025-10-11",
+    "version": "1.136.5"
+  },
+  {
+    "children": {
+      "fixes": ["Add 'gemini-2.5-flash-image' to disabled models Thinking."]
+    },
+    "date": "2025-10-10",
+    "version": "1.136.4"
+  },
+  {
+    "children": {
+      "improvements": ["Add delete & regenerate hotkeys."]
+    },
+    "date": "2025-10-10",
+    "version": "1.136.3"
+  },
+  {
+    "children": {
+      "improvements": ["Update i18n."]
+    },
+    "date": "2025-10-10",
+    "version": "1.136.2"
+  },
+  {
+    "children": {},
+    "date": "2025-10-09",
+    "version": "1.136.1"
+  },
+  {
+    "children": {
+      "features": ["Add new provider Cerebras."],
+      "fixes": ["Fix standalone plugin rerender issue."]
+    },
+    "date": "2025-10-09",
+    "version": "1.136.0"
+  },
+  {
+    "children": {},
+    "date": "2025-10-08",
+    "version": "1.135.6"
+  },
+  {
+    "children": {
+      "improvements": ["Update i18n."]
+    },
+    "date": "2025-10-08",
+    "version": "1.135.5"
+  },
+  {
+    "children": {
+      "improvements": ["Add GPT-5 pro model."]
+    },
+    "date": "2025-10-07",
+    "version": "1.135.4"
+  },
+  {
+    "children": {
+      "improvements": ["Improve Korean translate."]
+    },
+    "date": "2025-10-07",
+    "version": "1.135.3"
+  },
+  {
+    "children": {},
+    "date": "2025-10-06",
+    "version": "1.135.2"
+  },
  {
    "children": {
      "improvements": ["Improve styles and fix tools calling condition."]
@@ -5,7 +5,9 @@ LobeChat is built on the Next.js framework and uses TypeScript as the primary de
 1. Routing: Define routes (`src/app`).
 2. Data Structure: Define data structures (`src/types`).
 3. Business Logic Implementation: Zustand store (`src/store`).
-4. Page Display: Write static components/pages (`src/app/<new-page>/features/<new-feature>.tsx`).
+4. Page Display: Write static components/pages. Create features in:
+   - `src/features/<feature-name>/` for **shared global features** (used across multiple pages)
+   - `src/app/<new-page>/features/<feature-name>/` for **page-specific features** (only used in this page)
 5. Function Binding: Bind the store with page triggers (`const [state, function] = useNewStore(s => [s.state, s.function])`).

 Taking the "Chat Messages" feature as an example, here are the brief steps to implement this feature:
@@ -60,7 +62,8 @@ export const useChatStore = create<ChatState>((set) => ({
 In `src/app/<new-page>/features/<new-feature>.tsx`, we need to create a new page or component to display "Chat Messages". In this file, we can use the Zustand Store created earlier and Ant Design components to build the UI:

 ```jsx
-// src/features/chat/index.tsx
+// src/app/chat/features/ChatPage/index.tsx
+// Note: Use src/app/<page>/features/ for page-specific components
 import { List, Typography } from 'antd';
 import { useChatStore } from 'src/store/chatStore';

@@ -82,6 +85,12 @@ const ChatPage = () => {
 export default ChatPage;
 ```

+> **Note on Feature Organization**: LobeChat uses two patterns for organizing features:
+> - **Global features** (`src/features/`): Shared components like `ChatInput`, `Conversation` used across the app
+> - **Page-specific features** (`src/app/<page>/features/`): Components used only within a specific page route
+>
+> Choose based on reusability. If unsure, start with page-specific and refactor to global if needed elsewhere.
+
 ## 5. Function Binding

 In a page or component, we need to bind the Zustand Store's state and methods to the UI. In the example above, we have already bound the `messages` state to the `dataSource` property of the list. Now, we also need a method to add new messages. We can define this method in the Zustand Store and then use it in the page or component:
@@ -5,7 +5,9 @@ LobeChat 基于 Next.js 框架构建，使用 TypeScript 作为主要开发语
 1. 路由：定义路由 (`src/app`)
 2. 数据结构： 定义数据结构 ( `src/types` )
 3. 业务功能实现： zustand store (`src/store`)
-4. 页面展示：书写静态组件 / 页面 (`src/app/<new-page>/features/<new-feature>.tsx`)
+4. 页面展示：书写静态组件 / 页面。根据以下方式创建功能组件：
+   - `src/features/<feature-name>/` 用于 **全局共享功能**（跨多个页面使用）
+   - `src/app/<new-page>/features/<feature-name>/` 用于 **页面专属功能**（仅在当前页面使用）
 5. 功能绑定：绑定 store 与页面的触发 (`const  [state,function]= useNewStore(s=>[s.state,s.function])`)

 我们以 "会话消息" 功能为例，以下是实现这个功能的简要步骤：
@@ -60,7 +62,8 @@ export const useChatStore = create<ChatState>((set) => ({
 在 `src/app/<new-page>/features/<new-feature>.tsx` 中，我们需要创建一个新的页面或组件来显示 "会话消息"。在这个文件中，我们可以使用上面创建的 Zustand Store，以及 Ant Design 的组件来构建 UI：

 ```jsx
-// src/features/chat/index.tsx
+// src/app/chat/features/ChatPage/index.tsx
+// 注意：使用 src/app/<page>/features/ 放置页面专属组件
 import { List, Typography } from 'antd';
 import { useChatStore } from 'src/store/chatStore';

@@ -82,6 +85,12 @@ const ChatPage = () => {
 export default ChatPage;
 ```

+> **关于功能组件组织方式的说明**：LobeChat 使用两种模式来组织功能组件：
+> - **全局功能**（`src/features/`）：跨应用共享的组件，如 `ChatInput`、`Conversation` 等
+> - **页面专属功能**（`src/app/<page>/features/`）：仅在特定页面路由中使用的组件
+>
+> 根据可复用性选择合适的方式。如果不确定，可以先放在页面专属位置，需要时再重构为全局共享。
+
 ## 5. 功能绑定

 在页面或组件中，我们需要将 Zustand Store 的状态和方法绑定到 UI 上。在上面的示例中，我们已经将 `messages` 状态绑定到了列表的 `dataSource` 属性上。现在，我们还需要一个方法来添加新的消息。我们可以在 Zustand Store 中定义这个方法，然后在页面或组件中使用它：
@@ -945,6 +945,7 @@ table user_settings {
  system_agent jsonb
  default_agent jsonb
  tool jsonb
+  image jsonb
 }

 table users {
@@ -965,6 +966,127 @@ table users {
  updated_at "timestamp with time zone" [not null, default: `now()`]
 }

+table user_memories {
+  id varchar(255) [pk, not null]
+  user_id text
+  memory_category varchar(255)
+  memory_layer varchar(255)
+  memory_type varchar(255)
+  title varchar(255)
+  summary text
+  summary_vector_1024 vector(1024)
+  details text
+  details_vector_1024 vector(1024)
+  status varchar(255)
+  accessed_count bigint [default: 0]
+  last_accessed_at "timestamp with time zone" [not null]
+  accessed_at "timestamp with time zone" [not null, default: `now()`]
+  created_at "timestamp with time zone" [not null, default: `now()`]
+  updated_at "timestamp with time zone" [not null, default: `now()`]
+
+  indexes {
+    summary_vector_1024 [name: 'user_memories_summary_vector_1024_index']
+    details_vector_1024 [name: 'user_memories_details_vector_1024_index']
+  }
+}
+
+table user_memories_contexts {
+  id varchar(255) [pk, not null]
+  user_memory_ids jsonb
+  labels jsonb
+  extracted_labels jsonb
+  associated_objects jsonb
+  associated_subjects jsonb
+  title text
+  title_vector vector(1024)
+  description text
+  description_vector vector(1024)
+  type varchar(255)
+  current_status text
+  score_impact numeric [default: 0]
+  score_urgency numeric [default: 0]
+  accessed_at "timestamp with time zone" [not null, default: `now()`]
+  created_at "timestamp with time zone" [not null, default: `now()`]
+  updated_at "timestamp with time zone" [not null, default: `now()`]
+
+  indexes {
+    title_vector [name: 'user_memories_contexts_title_vector_index']
+    description_vector [name: 'user_memories_contexts_description_vector_index']
+    type [name: 'user_memories_contexts_type_index']
+  }
+}
+
+table user_memories_experiences {
+  id varchar(255) [pk, not null]
+  user_memory_id text
+  labels jsonb
+  extracted_labels jsonb
+  type varchar(255)
+  situation text
+  situation_vector vector(1024)
+  reasoning text
+  possible_outcome text
+  action text
+  action_vector vector(1024)
+  key_learning text
+  key_learning_vector vector(1024)
+  metadata jsonb
+  score_confidence real [default: 0]
+  accessed_at "timestamp with time zone" [not null, default: `now()`]
+  created_at "timestamp with time zone" [not null, default: `now()`]
+  updated_at "timestamp with time zone" [not null, default: `now()`]
+
+  indexes {
+    situation_vector [name: 'user_memories_experiences_situation_vector_index']
+    action_vector [name: 'user_memories_experiences_action_vector_index']
+    key_learning_vector [name: 'user_memories_experiences_key_learning_vector_index']
+    type [name: 'user_memories_experiences_type_index']
+  }
+}
+
+table user_memories_identities {
+  current_focuses text
+  description text
+  description_vector vector(1024)
+  experience text
+  extracted_labels jsonb
+  id varchar(255) [pk, not null]
+  labels jsonb
+  relationship text
+  role text
+  type varchar(255)
+  user_memory_id text
+  accessed_at "timestamp with time zone" [not null, default: `now()`]
+  created_at "timestamp with time zone" [not null, default: `now()`]
+  updated_at "timestamp with time zone" [not null, default: `now()`]
+
+  indexes {
+    description_vector [name: 'user_memories_identities_description_vector_index']
+    type [name: 'user_memories_identities_type_index']
+  }
+}
+
+table user_memories_preferences {
+  id varchar(255) [pk, not null]
+  context_id varchar(255)
+  user_memory_id varchar(255)
+  labels jsonb
+  extracted_labels jsonb
+  extracted_scopes jsonb
+  conclusion_directives text
+  conclusion_directives_vector vector(1024)
+  type varchar(255)
+  suggestions text
+  score_priority numeric [default: 0]
+  accessed_at "timestamp with time zone" [not null, default: `now()`]
+  created_at "timestamp with time zone" [not null, default: `now()`]
+  updated_at "timestamp with time zone" [not null, default: `now()`]
+
+  indexes {
+    conclusion_directives_vector [name: 'user_memories_preferences_conclusion_directives_vector_index']
+  }
+}
+
 ref: agents_files.file_id > files.id

 ref: agents_files.agent_id > agents.id
@@ -31,20 +31,30 @@ You can achieve various feature combinations using the above configuration synta
  their default values).
 </Callout>

-| Configuration Item        | Description                                     | Default Value |
-| ------------------------- | ----------------------------------------------- | ------------- |
-| `webrtc_sync`             | Enables WebRTC sync functionality.              | Disabled      |
-| `language_model_settings` | Enables language model settings.                | Enabled       |
-| `openai_api_key`          | Allows users to customize the OpenAI API Key.   | Enabled       |
-| `openai_proxy_url`        | Allows users to customize the OpenAI proxy URL. | Enabled       |
-| `create_session`          | Allows users to create sessions.                | Enabled       |
-| `edit_agent`              | Allows users to edit assistants.                | Enabled       |
-| `dalle`                   | Enables the DALL-E functionality.               | Enabled       |
-| `check_updates`           | Allows checking for updates.                    | Enabled       |
-| `welcome_suggest`         | Displays welcome suggestions.                   | Enabled       |
-| `market`                  | Enables the assistant market functionality.     | Enabled       |
-| `speech_to_text`          | Enables speech-to-text functionality.           | Enabled       |
-| `knowledge_base`          | Enables the knowledge base functionality.       | Enabled       |
-| `clerk_sign_up`           | Enables the Clerk SignUp functionality.         | Enabled       |
+| Configuration Item        | Description                                                                                              | Default Value |
+| ------------------------- | -------------------------------------------------------------------------------------------------------- | ------------- |
+| `check_updates`           | Allows checking for updates.                                                                             | Enabled       |
+| `pin_list`                | Controls pinned agent list display in sidebar.                                                           | Disabled      |
+| `language_model_settings` | Enables language model settings.                                                                         | Enabled       |
+| `provider_settings`       | Controls model provider settings display.                                                                | Enabled       |
+| `openai_api_key`          | Allows users to customize the OpenAI API Key.                                                            | Enabled       |
+| `openai_proxy_url`        | Allows users to customize the OpenAI proxy URL.                                                          | Enabled       |
+| `api_key_manage`          | Controls access to API key management page (/profile/apikey).                                            | Disabled      |
+| `create_session`          | Allows users to create sessions.                                                                         | Enabled       |
+| `edit_agent`              | Allows users to edit assistants.                                                                         | Enabled       |
+| `plugins`                 | Controls plugin functionality in chat and agent settings.                                                | Enabled       |
+| `dalle`                   | Enables the DALL-E functionality.                                                                        | Enabled       |
+| `ai_image`                | Controls AI image generation feature and page (/image).                                                  | Enabled       |
+| `speech_to_text`          | Enables speech-to-text functionality.                                                                    | Enabled       |
+| `token_counter`           | Reserved for token counter display.                                                                      | Enabled       |
+| `welcome_suggest`         | Displays welcome suggestions.                                                                            | Enabled       |
+| `changelog`               | Controls changelog modal/page display.                                                                   | Enabled       |
+| `clerk_sign_up`           | Enables the Clerk SignUp functionality.                                                                  | Enabled       |
+| `market`                  | Enables the assistant market functionality.                                                              | Enabled       |
+| `knowledge_base`          | Enables the knowledge base functionality.                                                                | Enabled       |
+| `rag_eval`                | Controls RAG evaluation feature (/repos/\[id]/evals).                                                    | Disabled      |
+| `cloud_promotion`         | Controls cloud service promotion link display in user menu.                                              | Disabled      |
+| `commercial_hide_github`  | Hides GitHub-related links in settings footer (requires commercial license).                             | Disabled      |
+| `commercial_hide_docs`    | Hides documentation and help menu including changelog, docs, and feedback (requires commercial license). | Disabled      |

 You can always check the [featureFlags](https://github.com/lobehub/lobe-chat/blob/main/src/config/featureFlags/schema.ts) to get the latest list of feature flags.
@@ -28,20 +28,30 @@ tags:
  关键字，你需要手动控制所有的功能标志（否则它们会采用对应的默认值）。
 </Callout>

-| 配置项                       | 解释                      | 默认值 |
-| ------------------------- | ----------------------- | --- |
-| `webrtc_sync`             | 启用 WebRTC 同步功能。         | 关闭  |
-| `language_model_settings` | 启用语言模型设置。               | 开启  |
-| `openai_api_key`          | 允许用户自定义 OpenAI API Key。 | 开启  |
-| `openai_proxy_url`        | 允许用户自定义 OpenAI 代理 URL。  | 开启  |
-| `create_session`          | 允许用户创建会话。               | 开启  |
-| `edit_agent`              | 允许用户编辑助手。               | 开启  |
-| `dalle`                   | 启用 DALL-E 功能。           | 开启  |
-| `check_updates`           | 允许检查更新。                 | 开启  |
-| `welcome_suggest`         | 显示欢迎建议。                 | 开启  |
-| `market`                  | 启用助手市场功能。               | 开启  |
-| `speech_to_text`          | 启用语音转文本功能。              | 开启  |
-| `knowledge_base`          | 启用知识库功能。                | 开启  |
-| `clerk_sign_up`           | 启用 Clerk 注册功能。          | 开启  |
+| 配置项                       | 解释                                   | 默认值 |
+| ------------------------- | ------------------------------------ | --- |
+| `check_updates`           | 允许检查更新。                              | 开启  |
+| `pin_list`                | 控制侧边栏中置顶助手列表的显示。                     | 关闭  |
+| `language_model_settings` | 启用语言模型设置。                            | 开启  |
+| `provider_settings`       | 控制模型供应商设置的显示。                        | 开启  |
+| `openai_api_key`          | 允许用户自定义 OpenAI API Key。              | 开启  |
+| `openai_proxy_url`        | 允许用户自定义 OpenAI 代理 URL。               | 开启  |
+| `api_key_manage`          | 控制 API 密钥管理页面 (/profile/apikey) 的访问。 | 关闭  |
+| `create_session`          | 允许用户创建会话。                            | 开启  |
+| `edit_agent`              | 允许用户编辑助手。                            | 开启  |
+| `plugins`                 | 控制聊天和助手设置中的插件功能。                     | 开启  |
+| `dalle`                   | 启用 DALL-E 功能。                        | 开启  |
+| `ai_image`                | 控制 AI 图像生成功能和页面 (/image)。            | 开启  |
+| `speech_to_text`          | 启用语音转文本功能。                           | 开启  |
+| `token_counter`           | 保留用于令牌计数器显示。                         | 开启  |
+| `welcome_suggest`         | 显示欢迎建议。                              | 开启  |
+| `changelog`               | 控制更新日志弹窗 / 页面的显示。                    | 开启  |
+| `clerk_sign_up`           | 启用 Clerk 注册功能。                       | 开启  |
+| `market`                  | 启用助手市场功能。                            | 开启  |
+| `knowledge_base`          | 启用知识库功能。                             | 开启  |
+| `rag_eval`                | 控制 RAG 评估功能 (/repos/\[id]/evals)。    | 关闭  |
+| `cloud_promotion`         | 控制用户菜单中云服务推广链接的显示。                   | 关闭  |
+| `commercial_hide_github`  | 隐藏设置页面底部的 GitHub 相关链接（需要商业授权）。       | 关闭  |
+| `commercial_hide_docs`    | 隐藏文档和帮助菜单，包括更新日志、文档和反馈（需要商业授权）。      | 关闭  |

 你可以随时检查 [featureFlags](https://github.com/lobehub/lobe-chat/blob/main/src/config/featureFlags/schema.ts) 以获取最新的特性标志列表。
@@ -152,6 +152,18 @@ For specific content, please refer to the [Feature Flags](/docs/self-hosting/adv
 - Default: -
 - Example: `https://cdn.example.com`

+## AI Image
+
+### `AI_IMAGE_DEFAULT_IMAGE_NUM`
+
+- Type: Optional
+- Description: Sets the default number of images to generate for AI image generation. Users can still override this value in their settings.
+- Default: `4`
+- Example: `6`
+- Range: `1-20`
+
+This environment variable allows administrators to customize the default image generation count for their deployment. The value must be between 1 and 20. If not set, it defaults to 4. Users can still adjust this value in their personal settings.
+
 ## Plugin Service

 ### `PLUGINS_INDEX_URL`
@@ -148,6 +148,18 @@ LobeChat 在部署时提供了一些额外的配置项，你可以使用环境
 - 默认值：-
 - 示例：`https://cdn.example.com`

+## AI 图像
+
+### `AI_IMAGE_DEFAULT_IMAGE_NUM`
+
+- 类型：可选
+- 描述：设置 AI 图像生成的默认图片数量。用户仍可在个人设置中覆盖此值。
+- 默认值：`4`
+- 示例：`6`
+- 范围：`1-20`
+
+此环境变量允许管理员为其部署自定义默认图片生成数量。值必须在 1 到 20 之间。如果未设置，默认为 4。用户仍可在个人设置中调整此值。
+
 ## 插件服务

 ### `PLUGINS_INDEX_URL`
@@ -717,4 +717,20 @@ NewAPI is a multi-provider model aggregation service that supports automatic mod
 - Default: `-`
 - Example: `-all,+vercel-model-1,+vercel-model-2=vercel-special`

+## Cerebras
+
+### `CEREBRAS_API_KEY`
+
+- Type: Required
+- Description: This is the API key you applied for in the Cerebras service.
+- Default: -
+- Example: `csk-xxxxxx...xxxxxx`
+
+### `CEREBRAS_MODEL_LIST`
+
+- Type: Optional
+- Description: Used to control the Cerebras model list. Use `+` to add a model, `-` to hide a model, and `model_name=display_name` to customize the display name of a model. Separate multiple entries with commas. The definition syntax follows the same rules as other providers' model lists.
+- Default: `-`
+- Example: `-all,+cerebras-model-1,+cerebras-model-2=cerebras-special`
+
 [model-list]: /docs/self-hosting/advanced/model-list
@@ -720,4 +720,20 @@ LobeChat 在部署时提供了丰富的模型服务商相关的环境变量，
 - 默认值：`-`
 - 示例：`-all,+vercel-model-1,+vercel-model-2=vercel-special`

+## Cerebras
+
+### `CEREBRAS_API_KEY`
+
+- 类型：必选
+- 描述：这是你在 Cerebras 服务中申请的 API 密钥
+- 默认值：-
+- 示例：`csk-xxxxxx...xxxxxx`
+
+### `CEREBRAS_MODEL_LIST`
+
+- 类型：可选
+- 描述：用来控制 Cerebras 模型列表，使用 `+` 增加一个模型，使用 `-` 来隐藏一个模型，使用 `模型名=展示名` 来自定义模型的展示名，用英文逗号隔开。模型定义语法规则与其他 provider 保持一致。
+- 默认值：`-`
+- 示例：`-all,+cerebras-model-1,+cerebras-model-2=cerebras-special`
+
 [model-list]: /zh/docs/self-hosting/advanced/model-list
@@ -29,11 +29,6 @@ LobeChat integrates `next-auth`, a flexible and powerful identity verification l
 - **Social Login**: Support quick login via various social platforms.
 - **Data Security**: Ensure the security and privacy of user data.

-<Callout type={'warning'}>
-  Due to workload constraints, integration of next-auth with a server-side database has not been
-  implemented yet. If you need to use a server-side database, please use Clerk.
-</Callout>
-
 <Callout type={'info'}>
  For information on using Next-Auth, you can refer to [Authentication Services - Next
  Auth](/docs/self-hosting/advanced/authentication#next-auth).
@@ -25,11 +25,6 @@ LobeChat 集成了 `next-auth`，一个灵活且强大的身份验证库，支
 - **社交登录**：支持多种社交平台的快捷登录。
 - **数据安全**：保障用户数据的安全性和隐私性。

-<Callout type={'warning'}>
-  由于工作量原因，目前还没有实现 next-auth 与服务端数据库的集成，如果需要使用服务端数据库，请使用
-  Clerk 。
-</Callout>
-
 <Callout type={'info'}>
  关于 Next-Auth 的使用，可以查阅 [身份验证服务 - Next
  Auth](/zh/docs/self-hosting/advanced/authentication#next-auth)。
@@ -0,0 +1,73 @@
+import { expect, test } from '@playwright/test';
+
+// 覆盖核心可访问路径（含重定向来源）
+const baseRoutes: string[] = [
+  '/',
+  '/chat',
+  '/discover',
+  '/image',
+  '/files',
+  '/repos', // next.config.ts -> /files
+  '/changelog',
+];
+
+// settings 路由改为通过 query 参数控制 active tab
+// 参考 SettingsTabs: about, agent, common, hotkey, llm, provider, proxy, storage, system-agent, tts
+const settingsTabs = [
+  'common',
+  'llm',
+  'provider',
+  'about',
+  'hotkey',
+  'proxy',
+  'storage',
+  'tts',
+  'system-agent',
+  'agent',
+];
+
+const routes: string[] = [...baseRoutes, ...settingsTabs.map((key) => `/settings?active=${key}`)];
+
+// CI 环境下跳过容易不稳定或受特性开关影响的路由
+const ciSkipPaths = new Set<string>([
+  '/image',
+  '/changelog',
+  '/settings?active=common',
+  '/settings?active=llm',
+]);
+
+// @ts-ignore
+async function assertNoPageErrors(page: Parameters<typeof test>[0]['page']) {
+  const pageErrors: Error[] = [];
+  const consoleErrors: string[] = [];
+
+  page.on('pageerror', (err: Error) => pageErrors.push(err));
+  page.on('console', (msg: any) => {
+    if (msg.type() === 'error') consoleErrors.push(msg.text());
+  });
+
+  // 仅校验页面级错误，忽略控制台 error 以提升稳定性
+  expect
+    .soft(pageErrors, `page errors: ${pageErrors.map((e) => e.message).join('\n')}`)
+    .toHaveLength(0);
+}
+
+test.describe('Smoke: core routes', () => {
+  for (const path of routes) {
+    test(`should open ${path} without error`, async ({ page }) => {
+      if (process.env.CI && ciSkipPaths.has(path)) test.skip(true, 'skip flaky route on CI');
+      const response = await page.goto(path, { waitUntil: 'commit' });
+      // 2xx 或 3xx 视为可接受（允许中间件/重定向）
+      const status = response?.status() ?? 0;
+      expect(status, `unexpected status for ${path}: ${status}`).toBeLessThan(400);
+
+      // 一般错误标题防御
+      await expect(page).not.toHaveTitle(/not found|error/i);
+
+      // body 可见
+      await expect(page.locator('body')).toBeVisible();
+
+      await assertNoPageErrors(page);
+    });
+  }
+});
@@ -236,6 +236,7 @@
  },
  "information": "المجتمع والمعلومات",
  "installPWA": "تثبيت تطبيق المتصفح",
+  "labs": "المختبرات",
  "lang": {
    "ar": "العربية",
    "bg-BG": "البلغارية",
@@ -7,6 +7,14 @@
    "desc": "مسح الرسائل والملفات المرفوعة في المحادثة الحالية",
    "title": "مسح رسائل المحادثة"
  },
+  "deleteAndRegenerateMessage": {
+    "desc": "حذف الرسالة الأخيرة وإعادة إنشائها",
+    "title": "حذف وإعادة إنشاء"
+  },
+  "deleteLastMessage": {
+    "desc": "حذف الرسالة الأخيرة",
+    "title": "حذف الرسالة الأخيرة"
+  },
  "desktop": {
    "openSettings": {
      "desc": "افتح صفحة إعدادات التطبيق",
@@ -0,0 +1,14 @@
+{
+  "desc": "سنقوم بتحديث الميزات الجديدة التي نستكشفها من وقت لآخر، ندعوك لتجربتها!",
+  "features": {
+    "groupChat": {
+      "desc": "تفعيل إمكانية تنسيق المحادثات الجماعية متعددة الوكلاء.",
+      "title": "دردشة جماعية (متعددة الوكلاء)"
+    },
+    "inputMarkdown": {
+      "desc": "عرض Markdown في منطقة الإدخال بشكل فوري (مثل النص العريض، كتل الشيفرة، الجداول، وغيرها).",
+      "title": "عرض Markdown في حقل الإدخال"
+    }
+  },
+  "title": "المختبر"
+}
@@ -284,11 +284,19 @@
          "placeholder": "يرجى إدخال معرف النموذج، مثل gpt-4o أو claude-3.5-sonnet",
          "title": "معرف النموذج"
        },
+        "imageOutput": {
+          "extra": "سيؤدي هذا الإعداد فقط إلى تفعيل قدرة النموذج على توليد الصور، وتعتمد النتيجة بالكامل على قدرات النموذج نفسه. يُرجى اختبار ما إذا كان النموذج يدعم توليد الصور بشكل فعّال.",
+          "title": "يدعم توليد الصور"
+        },
        "modalTitle": "تكوين النموذج المخصص",
        "reasoning": {
          "extra": "هذا الإعداد سيفتح فقط قدرة النموذج على التفكير العميق، التأثير الفعلي يعتمد بالكامل على النموذج نفسه، يرجى اختبار ما إذا كان هذا النموذج يمتلك القدرة على التفكير العميق القابل للاستخدام",
          "title": "يدعم التفكير العميق"
        },
+        "search": {
+          "extra": "سيؤدي هذا الإعداد فقط إلى تفعيل قدرة محرك البحث المدمج في النموذج على الاتصال بالإنترنت. تعتمد إمكانية استخدام محرك البحث المدمج على قدرات النموذج نفسه. يُرجى اختبار ما إذا كان محرك البحث المدمج في النموذج يعمل بشكل فعّال.",
+          "title": "يدعم البحث عبر الإنترنت"
+        },
        "tokens": {
          "extra": "تعيين الحد الأقصى لعدد الرموز المدعومة من قبل النموذج",
          "title": "أقصى نافذة سياق",
@@ -309,6 +317,10 @@
          "placeholder": "يرجى اختيار نوع النموذج",
          "title": "نوع النموذج"
        },
+        "video": {
+          "extra": "سيؤدي هذا الإعداد فقط إلى تفعيل إعدادات التعرف على الفيديو داخل التطبيق. وتعتمد إمكانية التعرف على الفيديو بالكامل على قدرات النموذج نفسه. يُرجى اختبار ما إذا كان النموذج يدعم التعرف على الفيديو بشكل فعّال.",
+          "title": "يدعم التعرف على الفيديو"
+        },
        "vision": {
          "extra": "سيؤدي هذا التكوين إلى فتح إعدادات تحميل الصور في التطبيق، ما إذا كان يدعم التعرف يعتمد بالكامل على النموذج نفسه، يرجى اختبار قابلية استخدام التعرف البصري لهذا النموذج بنفسك",
          "title": "دعم التعرف البصري"
@@ -704,6 +704,9 @@
  "azure-DeepSeek-R1-0528": {
    "description": "مقدم من مايكروسوفت؛ تم ترقية نموذج DeepSeek R1 بإصدار فرعي، الإصدار الحالي هو DeepSeek-R1-0528. في التحديث الأخير، حسّن DeepSeek R1 بشكل كبير عمق الاستدلال وقدرات التنبؤ من خلال زيادة موارد الحوسبة وإدخال آليات تحسين الخوارزميات في مرحلة ما بعد التدريب. النموذج يحقق أداءً ممتازًا في اختبارات معيارية متعددة مثل الرياضيات والبرمجة والمنطق العام، وأداؤه الكلي يقترب من النماذج الرائدة مثل O3 و Gemini 2.5 Pro."
  },
+  "baichuan-m2-32b": {
+    "description": "Baichuan M2 32B هو نموذج خبراء هجين أطلقته شركة Baichuan Intelligence، يتمتع بقدرات استدلال قوية."
+  },
  "baichuan/baichuan2-13b-chat": {
    "description": "Baichuan-13B هو نموذج لغوي كبير مفتوح المصدر قابل للاستخدام التجاري تم تطويره بواسطة Baichuan Intelligence، ويحتوي على 13 مليار معلمة، وقد حقق أفضل النتائج في المعايير الصينية والإنجليزية."
  },
@@ -728,12 +731,6 @@
  "charglm-4": {
    "description": "CharGLM-4 مصمم خصيصًا للأدوار والشعور بالرفقة، يدعم الذاكرة متعددة الجولات الطويلة والحوار المخصص، ويستخدم على نطاق واسع."
  },
-  "chatglm3": {
-    "description": "ChatGLM3 هو نموذج مغلق المصدر تم إصداره بواسطة مختبر KEG في جامعة تسينغهوا وشركة Zhizhu AI، وقد تم تدريبه مسبقًا على كميات هائلة من المعرفة المعرفية باللغتين الصينية والإنجليزية، وتم تحسينه وفقًا للاختيارات البشرية. مقارنة بالنموذج الأول، حقق تحسينات بنسبة 16٪ و 36٪ و 280٪ في MMLU و C-Eval و GSM8K على التوالي، وتصدر قائمة المهام الصينية C-Eval. يناسب هذا النموذج السيناريوهات التي تتطلب كميات كبيرة من المعرفة وقدرات الاستدلال والإبداع، مثل كتابة النصوص الإعلانية وكتابة الروايات وكتابة المحتوى المعرفي وتكوين الكود."
-  },
-  "chatglm3-6b-base": {
-    "description": "ChatGLM3-6b-base هو النموذج الأساسي المفتوح المصدر الأحدث من سلسلة ChatGLM التي طورتها شركة Zhìpǔ، ويحتوي على 6 مليارات معلمة."
-  },
  "chatgpt-4o-latest": {
    "description": "ChatGPT-4o هو نموذج ديناميكي يتم تحديثه في الوقت الحقيقي للحفاظ على أحدث إصدار. يجمع بين فهم اللغة القوي وقدرات التوليد، مما يجعله مناسبًا لمجموعة واسعة من التطبيقات، بما في ذلك خدمة العملاء والتعليم والدعم الفني."
  },
@@ -938,6 +935,9 @@
  "deepseek-ai/DeepSeek-V3.1-Terminus": {
    "description": "DeepSeek-V3.1-Terminus هو نسخة محدثة من نموذج V3.1 الذي أصدرته DeepSeek، ويصنف كنموذج لغة كبير لوكيل هجين. يركز هذا التحديث على إصلاح المشكلات التي أبلغ عنها المستخدمون وتحسين الاستقرار مع الحفاظ على القدرات الأصلية للنموذج. لقد حسّن بشكل ملحوظ اتساق اللغة، وقلل من الاستخدام المختلط للغة الصينية والإنجليزية والرموز غير الطبيعية. يدمج النموذج \"وضع التفكير\" و\"الوضع غير التفكيري\"، حيث يمكن للمستخدمين التبديل بينهما بسهولة عبر قوالب الدردشة لتناسب مهام مختلفة. كتحسين مهم، عزز V3.1-Terminus أداء وكيل الكود ووكيل البحث، مما يجعله أكثر موثوقية في استدعاء الأدوات وتنفيذ المهام المعقدة متعددة الخطوات."
  },
+  "deepseek-ai/DeepSeek-V3.2-Exp": {
+    "description": "نموذج DeepSeek V3.2 Exp هو نموذج بهيكلية استدلال هجينة، يدعم وضعي التفكير وغير التفكير."
+  },
  "deepseek-ai/deepseek-llm-67b-chat": {
    "description": "DeepSeek 67B هو نموذج متقدم تم تدريبه للحوار المعقد."
  },
@@ -947,6 +947,9 @@
  "deepseek-ai/deepseek-v3.1": {
    "description": "DeepSeek V3.1: نموذج استدلال من الجيل التالي يعزز القدرات على الاستدلال المعقد والتفكير التسلسلي، مناسب للمهام التي تتطلب تحليلاً عميقًا."
  },
+  "deepseek-ai/deepseek-v3.1-terminus": {
+    "description": "DeepSeek V3.1: نموذج الاستدلال من الجيل التالي، يعزز القدرة على الاستنتاج المعقد والتفكير المتسلسل، ومناسب للمهام التي تتطلب تحليلاً عميقاً."
+  },
  "deepseek-ai/deepseek-vl2": {
    "description": "DeepSeek-VL2 هو نموذج لغوي بصري مختلط الخبراء (MoE) تم تطويره بناءً على DeepSeekMoE-27B، يستخدم بنية MoE ذات تفعيل نادر، محققًا أداءً ممتازًا مع تفعيل 4.5 مليار معلمة فقط. يقدم هذا النموذج أداءً ممتازًا في مهام مثل الأسئلة البصرية، التعرف الضوئي على الأحرف، فهم الوثائق/الجداول/الرسوم البيانية، وتحديد المواقع البصرية."
  },
@@ -1028,6 +1031,9 @@
  "deepseek-v3.1": {
    "description": "DeepSeek-V3.1 هو نموذج استدلال هجين جديد أطلقته DeepSeek، يدعم وضعين للاستدلال: التفكير وعدم التفكير، مع كفاءة تفكير أعلى مقارنة بـ DeepSeek-R1-0528. بعد تحسين ما بعد التدريب، تم تعزيز استخدام أدوات الوكيل وأداء مهام الوكيل بشكل كبير. يدعم نافذة سياق تصل إلى 128 ألف، وطول إخراج يصل إلى 64 ألف رمز."
  },
+  "deepseek-v3.1-terminus": {
+    "description": "DeepSeek-V3.1-Terminus هو إصدار محسن من نموذج اللغة الكبير أطلقته DeepSeek، ومُصمم خصيصًا للأجهزة الطرفية."
+  },
  "deepseek-v3.1:671b": {
    "description": "DeepSeek V3.1: نموذج استدلال من الجيل التالي يعزز القدرات على الاستدلال المعقد والتفكير التسلسلي، مناسب للمهام التي تتطلب تحليلاً عميقًا."
  },
@@ -1190,6 +1196,12 @@
  "ernie-4.0-turbo-8k-preview": {
    "description": "نموذج اللغة الكبير الرائد الذي طورته بايدو، والذي يظهر أداءً ممتازًا بشكل شامل، ويستخدم على نطاق واسع في مشاهد المهام المعقدة في مختلف المجالات؛ يدعم الاتصال التلقائي بمكونات البحث من بايدو، مما يضمن تحديث معلومات الإجابة. مقارنةً بـ ERNIE 4.0، يظهر أداءً أفضل."
  },
+  "ernie-4.5-21b-a3b": {
+    "description": "ERNIE 4.5 21B A3B هو نموذج خبراء هجين أطلقته Baidu Wenxin، يتمتع بقدرات قوية في الاستدلال ودعم متعدد اللغات."
+  },
+  "ernie-4.5-300b-a47b": {
+    "description": "ERNIE 4.5 300B A47B هو نموذج خبراء هجين فائق الحجم أطلقته Baidu Wenxin، يتميز بقدرات استدلال فائقة."
+  },
  "ernie-4.5-8k-preview": {
    "description": "نموذج ونسين 4.5 هو نموذج أساسي جديد متعدد الوسائط تم تطويره ذاتيًا بواسطة بايدو، من خلال نمذجة متعددة الوسائط لتحقيق تحسين متزامن، ويظهر قدرة ممتازة على الفهم متعدد الوسائط؛ يتمتع بقدرات لغوية متقدمة، مع تحسين شامل في الفهم، والتوليد، والمنطق، والذاكرة، مع تحسين كبير في إزالة الأوهام، والاستدلال المنطقي، وقدرات البرمجة."
  },
@@ -1446,7 +1458,7 @@
    "description": "GLM-4-0520 هو أحدث إصدار من النموذج، مصمم للمهام المعقدة والمتنوعة، ويظهر أداءً ممتازًا."
  },
  "glm-4-9b-chat": {
-    "description": "يظهر GLM-4-9B-Chat أداءً عاليًا في مجالات متعددة مثل الدلالات والرياضيات والاستدلال والترميز والمعرفة. كما أنه مزود بقدرات تصفح الويب وتنفيذ الشيفرات واستدعاء الأدوات المخصصة واستدلال النصوص الطويلة. يدعم 26 لغة بما في ذلك اليابانية والكورية والألمانية."
+    "description": "يُظهر GLM-4-9B-Chat أداءً عاليًا في مجالات الدلالة، والرياضيات، والاستدلال، والبرمجة، والمعرفة. كما يدعم تصفح الويب، وتنفيذ الأكواد، واستدعاء الأدوات المخصصة، والاستدلال على النصوص الطويلة. يدعم 26 لغة من بينها اليابانية والكورية والألمانية."
  },
  "glm-4-air": {
    "description": "GLM-4-Air هو إصدار ذو قيمة عالية، يتمتع بأداء قريب من GLM-4، ويقدم سرعة عالية وسعرًا معقولًا."
@@ -1529,6 +1541,9 @@
  "glm-zero-preview": {
    "description": "يمتلك GLM-Zero-Preview قدرة قوية على الاستدلال المعقد، ويظهر أداءً ممتازًا في مجالات الاستدلال المنطقي، والرياضيات، والبرمجة."
  },
+  "glm4.6:355b": {
+    "description": "نموذج GLM-4.6 (355B) الرائد الأحدث من Zhipu يتفوق بشكل شامل على الجيل السابق في الترميز المتقدم، ومعالجة النصوص الطويلة، والاستدلال، وقدرات الوكلاء الذكيين، لا سيما في مجال البرمجة حيث يتماشى مع Claude Sonnet 4، ليصبح من أفضل نماذج الترميز في الصين."
+  },
  "google/gemini-2.0-flash": {
    "description": "Gemini 2.0 Flash يقدم ميزات الجيل التالي وتحسينات تشمل سرعة فائقة، استخدام أدوات مدمجة، توليد متعدد الوسائط، ونافذة سياق تصل إلى مليون رمز."
  },
@@ -1730,14 +1745,23 @@
  "gpt-5-nano": {
    "description": "أسرع وأكفأ نسخة من GPT-5 من حيث التكلفة. مثالية للتطبيقات التي تتطلب استجابة سريعة وحساسة للتكلفة."
  },
+  "gpt-5-pro": {
+    "description": "يستخدم GPT-5 pro قدرة حسابية أكبر للتفكير بشكل أعمق، ويواصل تقديم إجابات أفضل باستمرار."
+  },
  "gpt-audio": {
    "description": "GPT Audio هو نموذج دردشة عام موجه لإدخال وإخراج الصوت، ويدعم استخدام الصوت في واجهة برمجة تطبيقات Chat Completions."
  },
  "gpt-image-1": {
    "description": "نموذج توليد الصور متعدد الوسائط الأصلي من ChatGPT"
  },
+  "gpt-image-1-mini": {
+    "description": "نسخة منخفضة التكلفة من GPT Image 1، تدعم إدخال النصوص والصور بشكل أصلي وتوليد مخرجات على شكل صور."
+  },
  "gpt-oss-120b": {
-    "description": "GPT-OSS-120B MXFP4: هيكل Transformer محسّن بالكمية، يحافظ على أداء قوي حتى في ظل محدودية الموارد."
+    "description": "يتطلب هذا النموذج تقديم طلب للتجربة. GPT-OSS-120B هو نموذج لغة مفتوح المصدر واسع النطاق أطلقته OpenAI، يتمتع بقدرات قوية في توليد النصوص."
+  },
+  "gpt-oss-20b": {
+    "description": "يتطلب هذا النموذج تقديم طلب للتجربة. GPT-OSS-20B هو نموذج لغة مفتوح المصدر متوسط الحجم أطلقته OpenAI، يتميز بكفاءة عالية في توليد النصوص."
  },
  "gpt-oss:120b": {
    "description": "GPT-OSS 120B هو نموذج لغة كبير مفتوح المصدر أصدرته OpenAI، يستخدم تقنية التكميم MXFP4، ويعتبر نموذجًا رائدًا. يتطلب تشغيله بيئة متعددة وحدات معالجة الرسومات أو محطة عمل عالية الأداء، ويتميز بأداء متفوق في الاستدلال المعقد، وتوليد الأكواد، ومعالجة اللغات المتعددة، ويدعم استدعاء الدوال المتقدمة وتكامل الأدوات."
@@ -1748,9 +1772,6 @@
  "gpt-realtime": {
    "description": "نموذج عام في الوقت الحقيقي يدعم الإدخال والإخراج النصي والصوتي، ويدعم أيضًا إدخال الصور."
  },
-  "grok-2-1212": {
-    "description": "لقد تم تحسين هذا النموذج في الدقة، والامتثال للتعليمات، والقدرة على التعامل مع لغات متعددة."
-  },
  "grok-2-image-1212": {
    "description": "نموذج توليد الصور الأحدث لدينا قادر على توليد صور حيوية وواقعية بناءً على الأوامر النصية. يبرع في مجالات التسويق، وسائل التواصل الاجتماعي، والترفيه."
  },
@@ -1760,15 +1781,9 @@
  "grok-3": {
    "description": "نموذج رائد، بارع في استخراج البيانات، البرمجة، وتلخيص النصوص لتطبيقات المؤسسات، يمتلك معرفة عميقة في مجالات المالية، الطب، القانون، والعلوم."
  },
-  "grok-3-fast": {
-    "description": "نموذج رائد، بارع في استخراج البيانات، البرمجة، وتلخيص النصوص لتطبيقات المؤسسات، يمتلك معرفة عميقة في مجالات المالية، الطب، القانون، والعلوم."
-  },
  "grok-3-mini": {
    "description": "نموذج خفيف الوزن، يفكر قبل المحادثة. سريع وذكي، مناسب للمهام المنطقية التي لا تتطلب معرفة متخصصة عميقة، ويستطيع تتبع مسار التفكير الأصلي."
  },
-  "grok-3-mini-fast": {
-    "description": "نموذج خفيف الوزن، يفكر قبل المحادثة. سريع وذكي، مناسب للمهام المنطقية التي لا تتطلب معرفة متخصصة عميقة، ويستطيع تتبع مسار التفكير الأصلي."
-  },
  "grok-4": {
    "description": "نموذجنا الرائد الأحدث والأقوى، يتميز بأداء ممتاز في معالجة اللغة الطبيعية، الحسابات الرياضية، والاستدلال — إنه لاعب شامل مثالي."
  },
@@ -1851,7 +1866,7 @@
    "description": "أحدث نموذج تفكير عميق متعدد الوسائط t1-vision من Hunyuan، يدعم سلسلة التفكير الأصلية متعددة الوسائط، مع تحسين شامل مقارنة بالإصدار الافتراضي السابق."
  },
  "hunyuan-t1-vision-20250916": {
-    "description": "نموذج تفكير عميق متعدد الوسائط من Hunyuan، يدعم سلاسل تفكير طويلة أصلية متعددة الوسائط، بارع في معالجة مختلف سيناريوهات الاستدلال بالصور، مع تحسين شامل مقارنة بنموذج التفكير السريع في مسائل العلوم."
+    "description": "أحدث إصدار من نموذج Hunyuan t1-vision للتفكير البصري العميق، يقدم تحسينات شاملة مقارنة بالإصدار السابق في مهام الأسئلة والأجوبة العامة على الصور والنصوص، التحديد البصري، التعرف البصري على الحروف (OCR)، الرسوم البيانية، حل المسائل المصورة، والإبداع البصري، مع تحسين ملحوظ في دعم اللغة الإنجليزية واللغات الأقل استخدامًا."
  },
  "hunyuan-turbo": {
    "description": "نسخة المعاينة من الجيل الجديد من نموذج اللغة الكبير، يستخدم هيكل نموذج الخبراء المختلط (MoE) الجديد، مما يوفر كفاءة استدلال أسرع وأداء أقوى مقارنة بـ hunyuan-pro."
@@ -1964,6 +1979,9 @@
  "kimi-k2-0905-preview": {
    "description": "نموذج kimi-k2-0905-preview يدعم طول سياق 256k، يتمتع بقدرات ترميز وكيل أقوى، وجمالية وعملية أفضل في الشيفرة الأمامية، وفهم سياق محسن."
  },
+  "kimi-k2-instruct": {
+    "description": "Kimi K2 Instruct هو نموذج لغة كبير أطلقته Moonshot AI، يتمتع بقدرة فائقة على معالجة السياقات الطويلة."
+  },
  "kimi-k2-turbo-preview": {
    "description": "kimi-k2 هو نموذج أساسي بمعمارية MoE يتمتع بقدرات قوية للغاية في البرمجة وقدرات الوكيل (Agent)، بإجمالي معلمات يبلغ 1 تريليون والمعلمات المُفعَّلة 32 مليار. في اختبارات الأداء المعيارية للفئات الرئيسية مثل الاستدلال المعرفي العام والبرمجة والرياضيات والوكلاء (Agent)، تفوق أداء نموذج K2 على النماذج المفتوحة المصدر السائدة الأخرى."
  },
@@ -1985,9 +2003,6 @@
  "lite": {
    "description": "سبارك لايت هو نموذج لغوي كبير خفيف الوزن، يتميز بتأخير منخفض للغاية وكفاءة عالية في المعالجة، وهو مجاني تمامًا ومفتوح، ويدعم وظيفة البحث عبر الإنترنت في الوقت الحقيقي. تجعل خصائص استجابته السريعة منه مثاليًا لتطبيقات الاستدلال على الأجهزة ذات القدرة الحاسوبية المنخفضة وضبط النماذج، مما يوفر للمستخدمين قيمة ممتازة من حيث التكلفة وتجربة ذكية، خاصة في مجالات الأسئلة والأجوبة المعرفية، وتوليد المحتوى، وسيناريوهات البحث."
  },
-  "llama-2-7b-chat": {
-    "description": "Llama2 هو سلسلة من النماذج اللغوية الكبيرة (LLM) التي طورتها Meta وأطلقتها كمصدر مفتوح، وهي تتكون من نماذج توليد نص مسبقة التدريب ومتخصصة بحجم يتراوح من 7 مليارات إلى 70 مليار معلمة. على مستوى العمارة، Llama2 هو نموذج لغوي تراجعي تلقائي يستخدم معمارية محول محسنة. الإصدارات المعدلة تستخدم التدريب الدقيق تحت الإشراف (SFT) والتعلم التقويمي مع تعزيزات من البشر (RLHF) لتوافق تفضيلات البشر فيما يتعلق بالفائدة والأمان. أظهر Llama2 أداءً أفضل بكثير من سلسلة Llama في العديد من المجموعات الأكاديمية، مما قدم إلهامًا لتصميم وتطوير العديد من النماذج الأخرى."
-  },
  "llama-3.1-70b-versatile": {
    "description": "Llama 3.1 70B يوفر قدرة استدلال ذكائي أقوى، مناسب للتطبيقات المعقدة، يدعم معالجة حسابية ضخمة ويضمن الكفاءة والدقة."
  },
@@ -2012,8 +2027,8 @@
  "llama-3.2-vision-instruct": {
    "description": "تم تحسين نموذج Llama 3.2-Vision المعدل للتعليمات للتعرف البصري، والاستدلال على الصور، ووصف الصور، والإجابة على الأسئلة العامة المتعلقة بالصور."
  },
-  "llama-3.3-70b-instruct": {
-    "description": "Llama 3.3 هو النموذج الأكثر تقدمًا في سلسلة Llama، وهو نموذج لغوي مفتوح المصدر متعدد اللغات، يوفر تجربة أداء تنافس نموذج 405B بتكلفة منخفضة للغاية. يعتمد على هيكل Transformer، وتم تحسين فائدته وأمانه من خلال التعديل الدقيق تحت الإشراف (SFT) والتعلم المعزز من خلال التغذية الراجعة البشرية (RLHF). تم تحسين نسخة التعديل الخاصة به لتكون مثالية للحوار متعدد اللغات، حيث يتفوق في العديد من المعايير الصناعية على العديد من نماذج الدردشة المفتوحة والمغلقة. تاريخ انتهاء المعرفة هو ديسمبر 2023."
+  "llama-3.3-70b": {
+    "description": "Llama 3.3 70B: نموذج Llama متوسط إلى كبير الحجم، يوازن بين قدرات الاستدلال والكفاءة الإنتاجية."
  },
  "llama-3.3-70b-versatile": {
    "description": "ميتّا لاما 3.3 هو نموذج لغة كبير متعدد اللغات (LLM) يضم 70 مليار (إدخال نص/إخراج نص) من النموذج المدرب مسبقًا والمعدل وفقًا للتعليمات. تم تحسين نموذج لاما 3.3 المعدل وفقًا للتعليمات للاستخدامات الحوارية متعددة اللغات ويتفوق على العديد من النماذج المتاحة مفتوحة المصدر والمغلقة في المعايير الصناعية الشائعة."
@@ -2021,6 +2036,12 @@
  "llama-3.3-instruct": {
    "description": "تم تحسين نموذج Llama 3.3 المعدل للتعليمات خصيصًا لسيناريوهات المحادثة، حيث تفوق على العديد من نماذج الدردشة مفتوحة المصدر الحالية في اختبارات المعايير الصناعية الشائعة."
  },
+  "llama-4-maverick-17b-128e-instruct": {
+    "description": "Llama 4 Maverick: نموذج عالي الأداء من سلسلة Llama، مناسب لمهام الاستدلال المتقدم، حل المشكلات المعقدة، وتنفيذ التعليمات."
+  },
+  "llama-4-scout-17b-16e-instruct": {
+    "description": "Llama 4 Scout: نموذج عالي الأداء من سلسلة Llama، مثالي للسيناريوهات التي تتطلب إنتاجية عالية وزمن استجابة منخفض."
+  },
  "llama3-70b-8192": {
    "description": "Meta Llama 3 70B يوفر قدرة معالجة معقدة لا مثيل لها، مصمم خصيصًا للمشاريع ذات المتطلبات العالية."
  },
@@ -2036,6 +2057,9 @@
  "llama3.1": {
    "description": "Llama 3.1 هو النموذج الرائد الذي أطلقته Meta، يدعم ما يصل إلى 405B من المعلمات، ويمكن تطبيقه في مجالات الحوار المعقد، والترجمة متعددة اللغات، وتحليل البيانات."
  },
+  "llama3.1-8b": {
+    "description": "Llama 3.1 8B: إصدار خفيف ومنخفض التأخير من Llama، مناسب للاستدلال التفاعلي الخفيف عبر الإنترنت."
+  },
  "llama3.1:405b": {
    "description": "Llama 3.1 هو النموذج الرائد الذي أطلقته Meta، يدعم ما يصل إلى 405B من المعلمات، ويمكن تطبيقه في مجالات الحوار المعقد، والترجمة متعددة اللغات، وتحليل البيانات."
  },
@@ -2067,7 +2091,7 @@
    "description": "سبارك ماكس 32K مزود بقدرة معالجة سياق كبيرة، مع فهم أقوى للسياق وقدرة على الاستدلال المنطقي، يدعم إدخال نصوص تصل إلى 32K توكن، مما يجعله مناسبًا لقراءة الوثائق الطويلة، والأسئلة والأجوبة المعرفية الخاصة، وغيرها من السيناريوهات."
  },
  "megrez-3b-instruct": {
-    "description": "Megrez-3B-Instruct هو نموذج لغة كبير تم تدريبه بشكل مستقل من قبل شركة ووون تشينغ. يهدف Megrez-3B-Instruct إلى تقديم حل ذكاء على جهاز نهائي سريع وصغير وسهل الاستخدام من خلال مفهوم التكامل بين البرمجيات والأجهزة."
+    "description": "Megrez 3B Instruct هو نموذج صغير الحجم وعالي الكفاءة أطلقته شركة Wuwen Xinqiong."
  },
  "meta-llama-3-70b-instruct": {
    "description": "نموذج قوي بحجم 70 مليار معلمة يتفوق في التفكير، والترميز، وتطبيقات اللغة الواسعة."
@@ -2624,6 +2648,12 @@
  "pro-128k": {
    "description": "سبارك برو 128K مزود بقدرة معالجة سياق كبيرة جدًا، قادر على معالجة ما يصل إلى 128K من معلومات السياق، مما يجعله مناسبًا بشكل خاص للتحليل الشامل ومعالجة الروابط المنطقية طويلة الأمد في المحتوى الطويل، ويمكنه تقديم منطق سلس ومتسق ودعم متنوع للاقتباسات في الاتصالات النصية المعقدة."
  },
+  "pro-deepseek-r1": {
+    "description": "نموذج مخصص لخدمات المؤسسات، يشمل خدمات متزامنة."
+  },
+  "pro-deepseek-v3": {
+    "description": "نموذج مخصص لخدمات المؤسسات، يشمل خدمات متزامنة."
+  },
  "qvq-72b-preview": {
    "description": "نموذج QVQ هو نموذج بحث تجريبي تم تطويره بواسطة فريق Qwen، يركز على تعزيز قدرات الاستدلال البصري، خاصة في مجال الاستدلال الرياضي."
  },
@@ -2633,6 +2663,12 @@
  "qvq-plus": {
    "description": "نموذج استدلال بصري يدعم الإدخال البصري وإخراج سلسلة التفكير. النسخة بلس التي تلت نموذج qvq-max، تتميز بسرعة استدلال أعلى وتوازن أفضل بين الأداء والتكلفة مقارنة بنموذج qvq-max."
  },
+  "qwen-3-32b": {
+    "description": "Qwen 3 32B: نموذج من سلسلة Qwen يتميز بأداء ممتاز في المهام متعددة اللغات والبرمجة، مناسب للاستخدام الإنتاجي متوسط النطاق."
+  },
+  "qwen-3-coder-480b": {
+    "description": "Qwen 3 Coder 480B: نموذج طويل السياق مخصص لتوليد الشيفرات والمهام البرمجية المعقدة."
+  },
  "qwen-coder-plus": {
    "description": "نموذج Tongyi Qianwen للبرمجة."
  },
@@ -2753,12 +2789,6 @@
  "qwen2": {
    "description": "Qwen2 هو نموذج لغوي كبير من الجيل الجديد من Alibaba، يدعم أداءً ممتازًا لتلبية احتياجات التطبيقات المتنوعة."
  },
-  "qwen2-72b-instruct": {
-    "description": "Qwen2 هو سلسلة نماذج لغوية كبيرة جديدة تم إطلاقها من قبل فريق Qwen. تعتمد هذه النماذج على هندسة Transformer وتستخدم دالة التنشيط SwiGLU، وتحيز الانتباه QKV (attention QKV bias)، وانتباه الاستفسار الجماعي (group query attention)، وخلط انتباه النافذة المتزحلقة والانتباه الكامل (mixture of sliding window attention and full attention). بالإضافة إلى ذلك، قام فريق Qwen بتحسين مجزئ يتكيف مع العديد من اللغات الطبيعية والأكواد."
-  },
-  "qwen2-7b-instruct": {
-    "description": "Qwen2 هو سلسلة نماذج لغوية كبيرة جديدة تم طرحها من قبل فريق Qwen. يعتمد هذا النموذج على هندسة Transformer، ويستخدم دالة التنشيط SwiGLU، وتحيز QKV للانتباه (attention QKV bias)، وانتباه الاستفسار الجماعي (group query attention)، وخلط انتباه النافذة المتزحلقة والانتباه الكامل. بالإضافة إلى ذلك، قام فريق Qwen بتحسين المقطّع الذي يتكيف مع العديد من اللغات الطبيعية والأكواد."
-  },
  "qwen2.5": {
    "description": "Qwen2.5 هو الجيل الجديد من نماذج اللغة الكبيرة من Alibaba، يدعم احتياجات التطبيقات المتنوعة بأداء ممتاز."
  },
@@ -2897,6 +2927,12 @@
  "qwen3-next-80b-a3b-thinking": {
    "description": "نموذج مفتوح المصدر من الجيل الجديد لوضع التفكير مبني على Qwen3، يتميز بتحسين في الالتزام بالتعليمات مقارنة بالإصدار السابق (Tongyi Qianwen 3-235B-A22B-Thinking-2507)، مع ردود ملخصة وأكثر إيجازًا من النموذج."
  },
+  "qwen3-vl-235b-a22b-instruct": {
+    "description": "Qwen3 VL 235B A22B Instruct هو نموذج متعدد الوسائط أطلقته Tongyi Qianwen، يدعم الفهم البصري والاستدلال."
+  },
+  "qwen3-vl-235b-a22b-thinking": {
+    "description": "Qwen3 VL 235B A22B Thinking هو نموذج استدلال متعدد الوسائط أطلقته Tongyi Qianwen، يدعم الفهم البصري والاستدلال."
+  },
  "qwen3-vl-plus": {
    "description": "Tongyi Qianwen VL هو نموذج توليد نصوص يمتلك قدرات فهم بصرية (صور)، لا يقتصر على التعرف الضوئي على الحروف (OCR)، بل يمكنه أيضًا التلخيص والاستدلال، مثل استخراج خصائص من صور المنتجات، وحل المسائل بناءً على صور التمارين."
  },
@@ -3014,6 +3050,9 @@
  "step-r1-v-mini": {
    "description": "هذا النموذج هو نموذج استدلال كبير يتمتع بقدرة قوية على فهم الصور، يمكنه معالجة المعلومات النصية والصورية، ويخرج نصوصًا بعد تفكير عميق. يظهر هذا النموذج أداءً بارزًا في مجال الاستدلال البصري، كما يمتلك قدرات رياضية، برمجية، ونصية من الدرجة الأولى. طول السياق هو 100k."
  },
+  "step3": {
+    "description": "Step3 هو نموذج متعدد الوسائط أطلقته Jiexue Xingchen، يتمتع بقدرات قوية في الفهم البصري."
+  },
  "stepfun-ai/step3": {
    "description": "Step3 هو نموذج استدلال متعدد الوسائط متقدم أصدرته شركة 阶跃星辰 (StepFun). بُني على بنية مزيج الخبراء (MoE) التي تضم 321 مليار معلمة إجمالية و38 مليار معلمة تنشيط. صُمم النموذج بنهج من الطرف إلى الطرف ليقلل تكلفة فك الترميز، مع تقديم أداء رائد في الاستدلال البصري-اللغوي. من خلال التصميم التعاوني لآلية انتباه تفكيك متعدد المصفوفات (MFA) وفصل الانتباه عن شبكة التغذية الأمامية (AFD)، يحافظ Step3 على كفاءة ممتازة على كل من المسرعات الرائدة والمسرعات منخفضة التكلفة. في مرحلة ما قبل التدريب عالج Step3 أكثر من 20 تريليون توكن نصي و4 تريليون توكن مختلط نص-صورة، مغطياً أكثر من عشر لغات. حقق النموذج أداءً متقدماً بين نماذج المصدر المفتوح في عدة معايير قياسية تشمل الرياضيات والبرمجة والمهام متعددة الوسائط."
  },
@@ -3137,9 +3176,6 @@
  "xai/grok-4": {
    "description": "أحدث وأعظم نموذج رائد من xAI، يقدم أداءً لا مثيل له في اللغة الطبيعية، الرياضيات، والاستدلال — الخيار المثالي متعدد الاستخدامات."
  },
-  "yi-1.5-34b-chat": {
-    "description": "يي-1.5 هو إصدار مُحدّث من يي. تم تدريبه بشكل مُسبق باستخدام مكتبة بيانات عالية الجودة تحتوي على 500 مليار علامة (Token) على يي، وتم تحسينه أيضًا باستخدام 3 ملايين مثال متنوع للتدريب الدقيق."
-  },
  "yi-large": {
    "description": "نموذج جديد بمليارات المعلمات، يوفر قدرة قوية على الإجابة وتوليد النصوص."
  },
@@ -32,6 +32,9 @@
  "bfl": {
    "description": "مختبر أبحاث رائد في مقدمة الذكاء الاصطناعي، يبني البنية التحتية البصرية للمستقبل."
  },
+  "cerebras": {
+    "description": "Cerebras هو نظام استدلال ذكاء اصطناعي يعتمد على نظام CS-3 المخصص، ويهدف إلى تقديم أسرع خدمات النماذج اللغوية الكبيرة (LLM) في العالم مع استجابة فورية وقدرة معالجة عالية. تم تصميمه خصيصًا للقضاء على التأخير وتسريع سير العمل المعقد للذكاء الاصطناعي مثل توليد الشيفرات في الوقت الحقيقي والمهام التفاعلية."
+  },
  "cloudflare": {
    "description": "تشغيل نماذج التعلم الآلي المدفوعة بوحدات معالجة الرسوميات بدون خادم على شبكة Cloudflare العالمية."
  },
@@ -294,6 +294,13 @@
    },
    "title": "الإعدادات العامة"
  },
+  "settingImage": {
+    "defaultCount": {
+      "desc": "اضبط عدد الصور الافتراضي عند إنشاء مهمة جديدة في لوحة توليد الصور.",
+      "label": "عدد الصور الافتراضي",
+      "title": "إعدادات الرسم بالذكاء الاصطناعي"
+    }
+  },
  "settingModel": {
    "enableMaxTokens": {
      "title": "تمكين الحد الأقصى للردود"
@@ -549,6 +556,7 @@
    "common": "إعدادات عامة",
    "experiment": "تجربة",
    "hotkey": "اختصارات لوحة المفاتيح",
+    "image": "الرسم بالذكاء الاصطناعي",
    "llm": "نموذج اللغة",
    "provider": "مزود خدمة الذكاء الاصطناعي",
    "proxy": "وكيل الشبكة",
@@ -236,6 +236,7 @@
  },
  "information": "Общност и информация",
  "installPWA": "Инсталиране на PWA",
+  "labs": "Лаборатория",
  "lang": {
    "ar": "Арабски",
    "bg-BG": "български",
@@ -7,6 +7,14 @@
    "desc": "Изтриване на текущите съобщения и качените файлове в сесията",
    "title": "Изтриване на съобщенията в сесията"
  },
+  "deleteAndRegenerateMessage": {
+    "desc": "Изтриване на последното съобщение и повторно генериране",
+    "title": "Изтрий и генерирай отново"
+  },
+  "deleteLastMessage": {
+    "desc": "Изтриване на последното съобщение",
+    "title": "Изтрий последното съобщение"
+  },
  "desktop": {
    "openSettings": {
      "desc": "Отворете страницата с настройки на приложението",
@@ -0,0 +1,14 @@
+{
+  "desc": "Тук периодично ще актуализираме новите функции, които изследваме. Добре дошли да ги изпробвате!",
+  "features": {
+    "groupChat": {
+      "desc": "Активиране на възможността за координация в групов чат с множество интелигентни агенти.",
+      "title": "Групов чат (множество агенти)"
+    },
+    "inputMarkdown": {
+      "desc": "Реално време визуализация на Markdown в полето за въвеждане (удебелен текст, кодови блокове, таблици и др.).",
+      "title": "Markdown визуализация в полето за въвеждане"
+    }
+  },
+  "title": "Лаборатория"
+}
@@ -284,11 +284,19 @@
          "placeholder": "Моля, въведете идентификатор на модела, например gpt-4o или claude-3.5-sonnet",
          "title": "ID на модела"
        },
+        "imageOutput": {
+          "extra": "Тази конфигурация ще активира само способността на модела да генерира изображения. Конкретният резултат зависи изцяло от самия модел. Моля, тествайте сами дали моделът има способност за генериране на изображения.",
+          "title": "Поддържа генериране на изображения"
+        },
        "modalTitle": "Конфигурация на персонализиран модел",
        "reasoning": {
          "extra": "Тази конфигурация ще активира само способността на модела за дълбоко мислене, конкретният ефект зависи изцяло от самия модел, моля, тествайте сами дали моделът притежава налична способност за дълбоко мислене",
          "title": "Поддръжка на дълбоко мислене"
        },
+        "search": {
+          "extra": "Тази конфигурация ще активира само възможността за онлайн търсене чрез вградения търсач на модела. Дали се поддържа вграден търсач зависи от самия модел. Моля, тествайте сами дали тази функция е налична.",
+          "title": "Поддържа онлайн търсене"
+        },
        "tokens": {
          "extra": "Настройте максималния брой токени, поддържани от модела",
          "title": "Максимален контекстуален прозорец",
@@ -309,6 +317,10 @@
          "placeholder": "Моля, изберете тип модел",
          "title": "Тип модел"
        },
+        "video": {
+          "extra": "Тази конфигурация ще активира само настройките за разпознаване на видео в приложението. Дали се поддържа разпознаване зависи изцяло от самия модел. Моля, тествайте сами дали моделът поддържа разпознаване на видео.",
+          "title": "Поддържа разпознаване на видео"
+        },
        "vision": {
          "extra": "Тази конфигурация ще активира само конфигурацията за качване на изображения в приложението, дали поддържа разпознаване зависи изцяло от самия модел, моля, тествайте наличността на визуалната разпознаваемост на този модел.",
          "title": "Поддръжка на визуално разпознаване"
@@ -704,6 +704,9 @@
  "azure-DeepSeek-R1-0528": {
    "description": "Доставен от Microsoft; моделът DeepSeek R1 е получил малка версия ъпгрейд, текущата версия е DeepSeek-R1-0528. В най-новата актуализация DeepSeek R1 значително подобрява дълбочината на разсъждение и способността за извод чрез увеличаване на изчислителните ресурси и въвеждане на алгоритмична оптимизация в следтренировъчния етап. Този модел се представя отлично в множество бенчмаркове като математика, програмиране и обща логика, като общата му производителност вече е близка до водещи модели като O3 и Gemini 2.5 Pro."
  },
+  "baichuan-m2-32b": {
+    "description": "Baichuan M2 32B е хибриден експертен модел, разработен от Baichuan Intelligence, с мощни способности за извеждане на заключения."
+  },
  "baichuan/baichuan2-13b-chat": {
    "description": "Baichuan-13B е отворен, комерсиален голям езиков модел, разработен от Baichuan Intelligence, с 13 милиарда параметри, който постига най-добрите резултати в своя размер на авторитетни бенчмаркове на китайски и английски."
  },
@@ -728,12 +731,6 @@
  "charglm-4": {
    "description": "CharGLM-4 е проектиран за ролеви игри и емоционално придружаване, поддържащ дългосрочна памет и персонализирани диалози, с широко приложение."
  },
-  "chatglm3": {
-    "description": "ChatGLM3 е закритоизточен модел, обявен от интелигентната платформа AI и лабораторията KEG на Университета в Тайхуа. Той е претрениран с голям обем на китайски и английски идентификатори и е подложен на тренировка за съответствие с хуманите предпочитания. Сравнено с първата версия на модела, ChatGLM3 постига подобрения от 16%, 36% и 280% в MMLU, C-Eval и GSM8K съответно, и е класифициран на първо място в китайския рейтинг C-Eval. Този модел е подходящ за сценарии, които изискват високи стандарти за знания, умения за разсъждаване и креативност, като например създаване на рекламни текстове, писане на романи, научно-популярно писане и генериране на код."
-  },
-  "chatglm3-6b-base": {
-    "description": "ChatGLM3-6b-base е последната генерация на редицата ChatGLM, разработена от компанията Zhipu, с 6 милиарда параметри и е открит източник."
-  },
  "chatgpt-4o-latest": {
    "description": "ChatGPT-4o е динамичен модел, който се актуализира в реално време, за да поддържа най-новата версия. Той комбинира мощно разбиране на езика и генериране на текст, подходящ за мащабни приложения, включително обслужване на клиенти, образование и техническа поддръжка."
  },
@@ -938,6 +935,9 @@
  "deepseek-ai/DeepSeek-V3.1-Terminus": {
    "description": "DeepSeek-V3.1-Terminus е обновена версия на модела V3.1, пусната от DeepSeek, позиционирана като хибриден интелигентен голям езиков модел. Тази актуализация запазва оригиналните възможности на модела, като се фокусира върху отстраняване на проблеми, посочени от потребителите, и подобряване на стабилността. Значително е подобрена езиковата последователност, намалено е смесването на китайски и английски и появата на аномални символи. Моделът интегрира „режим на мислене“ и „режим без мислене“, като потребителите могат гъвкаво да превключват между тях чрез чат шаблони за различни задачи. Като важна оптимизация, V3.1-Terminus подобрява производителността на кодовия агент и търсещия агент, което ги прави по-надеждни при извикване на инструменти и изпълнение на многократни сложни задачи."
  },
+  "deepseek-ai/DeepSeek-V3.2-Exp": {
+    "description": "Моделът DeepSeek V3.2 Exp е с хибридна архитектура за извеждане на заключения и поддържа както мисловен, така и немисловен режим."
+  },
  "deepseek-ai/deepseek-llm-67b-chat": {
    "description": "DeepSeek 67B е напреднал модел, обучен за диалози с висока сложност."
  },
@@ -947,6 +947,9 @@
  "deepseek-ai/deepseek-v3.1": {
    "description": "DeepSeek V3.1: следващо поколение модел за разсъждение, подобряващ способностите за сложни разсъждения и свързано мислене, подходящ за задачи, изискващи задълбочен анализ."
  },
+  "deepseek-ai/deepseek-v3.1-terminus": {
+    "description": "DeepSeek V3.1: следващо поколение модел за разсъждение, с подобрени способности за сложни логически връзки и аналитично мислене, подходящ за задачи, изискващи задълбочен анализ."
+  },
  "deepseek-ai/deepseek-vl2": {
    "description": "DeepSeek-VL2 е визуален езиков модел, разработен на базата на DeepSeekMoE-27B, който използва архитектура на смесени експерти (MoE) с рядка активация, постигайки изключителна производителност с активирани само 4.5B параметри. Моделът показва отлични резултати в множество задачи, включително визуални въпроси и отговори, оптично разпознаване на символи, разбиране на документи/таблици/графики и визуална локализация."
  },
@@ -1028,6 +1031,9 @@
  "deepseek-v3.1": {
    "description": "DeepSeek-V3.1 е новият хибриден модел за разсъждение на DeepSeek, който поддържа два режима на разсъждение: мислене и немислене, с по-висока ефективност на мислене в сравнение с DeepSeek-R1-0528. След оптимизация чрез пост-тренировка, използването на агентски инструменти и изпълнението на задачи от интелигентни агенти са значително подобрени. Поддържа контекстен прозорец до 128k и максимална дължина на изхода до 64k токена."
  },
+  "deepseek-v3.1-terminus": {
+    "description": "DeepSeek-V3.1-Terminus е оптимизирана версия на голям езиков модел от DeepSeek, създаден специално за крайни устройства."
+  },
  "deepseek-v3.1:671b": {
    "description": "DeepSeek V3.1: следващо поколение модел за разсъждение, подобряващ способностите за сложни разсъждения и свързано мислене, подходящ за задачи, изискващи задълбочен анализ."
  },
@@ -1190,6 +1196,12 @@
  "ernie-4.0-turbo-8k-preview": {
    "description": "Флагманският голям езиков модел, разработен от Baidu, с отлични общи резултати, широко приложим в сложни задачи в различни области; поддържа автоматично свързване с плъгина за търсене на Baidu, осигурявайки актуалност на информацията. В сравнение с ERNIE 4.0, показва по-добри резултати."
  },
+  "ernie-4.5-21b-a3b": {
+    "description": "ERNIE 4.5 21B A3B е хибриден експертен модел, разработен от Baidu Wenxin, с мощни способности за извеждане на заключения и поддръжка на множество езици."
+  },
+  "ernie-4.5-300b-a47b": {
+    "description": "ERNIE 4.5 300B A47B е мащабен хибриден експертен модел от Baidu Wenxin, отличаващ се с изключителни способности за извеждане на заключения."
+  },
  "ernie-4.5-8k-preview": {
    "description": "Моделът Ernie 4.5 е ново поколение оригинален много модален основен модел, разработен от Baidu, който постига съвместна оптимизация чрез многомодално моделиране, с отлични способности за разбиране на много модалности; предлага усъвършенствани езикови способности, с подобрено разбиране, генериране, логика и памет, значително подобрени способности за избягване на халюцинации, логическо разсъждение и код."
  },
@@ -1446,7 +1458,7 @@
    "description": "GLM-4-0520 е най-новата версия на модела, проектирана за високо сложни и разнообразни задачи, с отлични резултати."
  },
  "glm-4-9b-chat": {
-    "description": "GLM-4-9B-Chat показва висока производителност в множество области, включително семантика, математика, логическо разсъждение, код и знания. Също така предлага уеб браузинг, изпълнение на код, извикване на персонализирани инструменти и разсъждение върху дълги текстове. Поддържа 26 езика, включително японски, корейски и немски."
+    "description": "GLM-4-9B-Chat показва висока производителност в области като семантика, математика, логическо мислене, програмиране и общи знания. Поддържа също така уеб браузване, изпълнение на код, извикване на персонализирани инструменти и извеждане на заключения от дълги текстове. Поддържа 26 езика, включително японски, корейски и немски."
  },
  "glm-4-air": {
    "description": "GLM-4-Air е икономичен вариант, с производителност близка до GLM-4, предлагаща бързина и достъпна цена."
@@ -1529,6 +1541,9 @@
  "glm-zero-preview": {
    "description": "GLM-Zero-Preview притежава мощни способности за сложни разсъждения, показвайки отлични резултати в логическото разсъждение, математиката и програмирането."
  },
+  "glm4.6:355b": {
+    "description": "Най-новият флагмански модел на Zhipu — GLM-4.6 (355B) — значително надминава предшествениците си в напреднало програмиране, обработка на дълги текстове, логическо разсъждение и способности на интелигентни агенти. Особено в програмирането се изравнява с Claude Sonnet 4, превръщайки се в водещ модел за кодиране в Китай."
+  },
  "google/gemini-2.0-flash": {
    "description": "Gemini 2.0 Flash предлага следващо поколение функции и подобрения, включително изключителна скорост, вградена употреба на инструменти, мултимодално генериране и контекстен прозорец от 1 милион токена."
  },
@@ -1730,14 +1745,23 @@
  "gpt-5-nano": {
    "description": "Най-бързата и най-икономична версия на GPT-5. Отлично подходяща за приложения, изискващи бърз отговор и чувствителни към разходите."
  },
+  "gpt-5-pro": {
+    "description": "GPT-5 pro използва повече изчислителна мощност за по-задълбочено мислене и постоянно предоставя по-добри отговори."
+  },
  "gpt-audio": {
    "description": "GPT Audio е универсален чат модел, ориентиран към аудио вход и изход, поддържащ използване на аудио I/O в Chat Completions API."
  },
  "gpt-image-1": {
    "description": "Роден мултимодален модел за генериране на изображения ChatGPT."
  },
+  "gpt-image-1-mini": {
+    "description": "По-икономична версия на GPT Image 1, с вградена поддръжка за вход от текст и изображение и генериране на изходно изображение."
+  },
  "gpt-oss-120b": {
-    "description": "GPT-OSS-120B MXFP4 квантизиран трансформър модел, който запазва силна производителност при ограничени ресурси."
+    "description": "Този модел изисква заявка за достъп. GPT-OSS-120B е отворен голям езиков модел, разработен от OpenAI, с мощни способности за генериране на текст."
+  },
+  "gpt-oss-20b": {
+    "description": "Този модел изисква заявка за достъп. GPT-OSS-20B е отворен среден езиков модел, разработен от OpenAI, с ефективни способности за генериране на текст."
  },
  "gpt-oss:120b": {
    "description": "GPT-OSS 120B е голям отворен езиков модел, публикуван от OpenAI, използващ технологията за квантуване MXFP4, предназначен за флагмански клас модели. Изисква многократни GPU или високопроизводителна работна станция за работа, с изключителни възможности в сложни разсъждения, генериране на код и многоезична обработка, поддържайки усъвършенствано извикване на функции и интеграция на инструменти."
@@ -1748,9 +1772,6 @@
  "gpt-realtime": {
    "description": "Универсален модел в реално време, поддържащ текстов и аудио вход и изход, както и вход на изображения."
  },
-  "grok-2-1212": {
-    "description": "Този модел е подобрен по отношение на точност, спазване на инструкции и многоезични способности."
-  },
  "grok-2-image-1212": {
    "description": "Нашият най-нов модел за генериране на изображения може да създава живи и реалистични изображения въз основа на текстови подсказки. Той се представя отлично в маркетинг, социални медии и развлекателни области."
  },
@@ -1760,15 +1781,9 @@
  "grok-3": {
    "description": "Флагмански модел, експертен в извличане на данни, програмиране и обобщаване на текст за корпоративни приложения, с дълбоки знания в областите финанси, медицина, право и наука."
  },
-  "grok-3-fast": {
-    "description": "Флагмански модел, експертен в извличане на данни, програмиране и обобщаване на текст за корпоративни приложения, с дълбоки знания в областите финанси, медицина, право и наука."
-  },
  "grok-3-mini": {
    "description": "Лек модел, който мисли преди разговор. Работи бързо и интелигентно, подходящ за логически задачи без нужда от дълбоки специализирани знания и позволява проследяване на оригиналния мисловен процес."
  },
-  "grok-3-mini-fast": {
-    "description": "Лек модел, който мисли преди разговор. Работи бързо и интелигентно, подходящ за логически задачи без нужда от дълбоки специализирани знания и позволява проследяване на оригиналния мисловен процес."
-  },
  "grok-4": {
    "description": "Нашият най-нов и най-мощен флагмански модел, който се отличава с изключителни резултати в обработката на естествен език, математическите изчисления и разсъжденията — перфектен универсален играч."
  },
@@ -1851,7 +1866,7 @@
    "description": "Най-новият мултимодален дълбок мислещ модел t1-vision на Hunyuan, който поддържа оригинални мултимодални вериги на мисълта и предлага цялостно подобрение спрямо предишната версия по подразбиране."
  },
  "hunyuan-t1-vision-20250916": {
-    "description": "Модел за дълбоко мислене с мултимодално разбиране Hunyuan, поддържащ родна мултимодална дълга мисловна верига, експертен в обработката на различни визуални задачи за разсъждение, с цялостно подобрение спрямо бързия мисловен модел при научни проблеми."
+    "description": "Най-новият модел за визуално дълбоко мислене Hunyuan t1-vision предлага цялостни подобрения спрямо предишната версия в задачи като общи въпроси и отговори по изображения, визуално локализиране, OCR, графики, решаване на задачи по снимка и творческо писане по изображение. Значително е подобрена и поддръжката на английски и по-малко разпространени езици."
  },
  "hunyuan-turbo": {
    "description": "Предварителна версия на новото поколение голям езиков модел на HunYuan, използваща нова структура на смесен експертен модел (MoE), с по-бърза скорост на извеждане и по-силни резултати в сравнение с hunyuan-pro."
@@ -1964,6 +1979,9 @@
  "kimi-k2-0905-preview": {
    "description": "Моделът kimi-k2-0905-preview има контекстна дължина от 256k, с по-силни способности за агентно кодиране, по-изразителна естетика и практичност на фронтенд кода, както и по-добро разбиране на контекста."
  },
+  "kimi-k2-instruct": {
+    "description": "Kimi K2 Instruct е голям езиков модел, разработен от Moonshot AI, с изключителна способност за обработка на дълъг контекст."
+  },
  "kimi-k2-turbo-preview": {
    "description": "Kimi-k2 е базов модел с MoE архитектура, който притежава изключителни възможности за работа с код и агентни функции. Общият брой параметри е 1T, а активните параметри са 32B. В бенчмарковете за основни категории като общо знание и разсъждение, програмиране, математика и агентни задачи, моделът K2 превъзхожда другите водещи отворени модели."
  },
@@ -1985,9 +2003,6 @@
  "lite": {
    "description": "Spark Lite е лек модел на голям език, с изключително ниска латентност и ефективна обработка, напълно безплатен и отворен, поддържащ функции за онлайн търсене в реално време. Неговите бързи отговори го правят отличен за приложения на нискомощни устройства и фина настройка на модели, предоставяйки на потребителите отлична рентабилност и интелигентно изживяване, особено в контекста на въпроси и отговори, генериране на съдържание и търсене."
  },
-  "llama-2-7b-chat": {
-    "description": "Llama2 е серия от големи модели за език (LLM), разработени и с отворен код от Meta. Това е набор от генеративни текстови модели с различен размер, от 7 милиарда до 70 милиарда параметри, които са претренирани и майсторски оптимизирани. Архитектурно, Llama2 е автоматично регресивен езиков модел, използващ оптимизирана трансформаторна архитектура. Подобренията включват супервизирано майсторско трениране (SFT) и подкрепено с учене с награди (RLHF) за подреждане на предпочтенията на хората за полезност и безопасност. Llama2 демонстрира значително подобрени резултати върху множество академични набори от данни, което предоставя възможности за дизайн и развитие на много други модели."
-  },
  "llama-3.1-70b-versatile": {
    "description": "Llama 3.1 70B предлага по-мощни способности за разсъждение на AI, подходящи за сложни приложения, поддържащи множество изчислителни обработки и осигуряващи ефективност и точност."
  },
@@ -2012,8 +2027,8 @@
  "llama-3.2-vision-instruct": {
    "description": "Моделът Llama 3.2-Vision с инструкции е оптимизиран за визуално разпознаване, изводи от изображения, описание на изображения и отговаряне на общи въпроси, свързани с изображения."
  },
-  "llama-3.3-70b-instruct": {
-    "description": "Llama 3.3 е най-напредналият многоезичен отворен езиков модел от серията Llama, който предлага производителност, сравнима с 405B моделите, на изключително ниска цена. Базиран на структурата Transformer и подобрен чрез супервизирано фино настройване (SFT) и обучение с човешка обратна връзка (RLHF) за повишаване на полезността и безопасността. Неговата версия, оптимизирана за инструкции, е специално проектирана за многоезични диалози и показва по-добри резултати от много от отворените и затворените чат модели в множество индустриални бенчмаркове. Краен срок за знания: декември 2023."
+  "llama-3.3-70b": {
+    "description": "Llama 3.3 70B: средно до голямо Llama решение, съчетаващо логическо разсъждение и висока производителност."
  },
  "llama-3.3-70b-versatile": {
    "description": "Meta Llama 3.3 е многоезичен модел за генерация на език (LLM) с 70B (вход/изход на текст), който е предварително обучен и е пригоден за указания. Чистият текстов модел на Llama 3.3 е оптимизиран за многоезични диалогови случаи и надминава много налични отворени и затворени чат модели на стандартни индустриални тестове."
@@ -2021,6 +2036,12 @@
  "llama-3.3-instruct": {
    "description": "Моделата Llama 3.3 с фина настройка за инструкции е оптимизирана за диалогови сценарии и надминава много съществуващи модели с отворен код в общи отраслови бенчмарк тестове."
  },
+  "llama-4-maverick-17b-128e-instruct": {
+    "description": "Llama 4 Maverick: високопроизводителен модел от серията Llama, подходящ за напреднало разсъждение, решаване на сложни задачи и следване на инструкции."
+  },
+  "llama-4-scout-17b-16e-instruct": {
+    "description": "Llama 4 Scout: високопроизводителен модел от серията Llama, оптимизиран за сценарии с висока пропускателна способност и ниска латентност."
+  },
  "llama3-70b-8192": {
    "description": "Meta Llama 3 70B предлага ненадмината способност за обработка на сложност, проектирана за високи изисквания."
  },
@@ -2036,6 +2057,9 @@
  "llama3.1": {
    "description": "Llama 3.1 е водещ модел, представен от Meta, поддържащ до 405B параметри, приложим в области като сложни диалози, многоезичен превод и анализ на данни."
  },
+  "llama3.1-8b": {
+    "description": "Llama 3.1 8B: лек и с ниска латентност вариант на Llama, идеален за онлайн разсъждение и интерактивни приложения."
+  },
  "llama3.1:405b": {
    "description": "Llama 3.1 е водещ модел, представен от Meta, поддържащ до 405B параметри, приложим в области като сложни диалози, многоезичен превод и анализ на данни."
  },
@@ -2067,7 +2091,7 @@
    "description": "Spark Max 32K е конфигуриран с голяма способност за обработка на контекст, с по-силно разбиране на контекста и логическо разсъждение, поддържащ текстови входове до 32K токена, подходящ за четене на дълги документи, частни въпроси и отговори и други сценарии."
  },
  "megrez-3b-instruct": {
-    "description": "Megrez-3B-Instruct е голям езиков модел, напълно обучен от безкрайната връху чиповете. Megrez-3B-Instruct се стреми чрез концепцията за съвместно хардуерно-софтуерно взаимодействие да създаде решение за крайните устройства, което е бързо за извършване, компактно и лесно за използване."
+    "description": "Megrez 3B Instruct е ефективен модел с малък брой параметри, разработен от Wuwen Xinqiong."
  },
  "meta-llama-3-70b-instruct": {
    "description": "Мощен модел с 70 милиарда параметри, отличаващ се в разсъждения, кодиране и широки езикови приложения."
@@ -2624,6 +2648,12 @@
  "pro-128k": {
    "description": "Spark Pro 128K е конфигуриран с изключителна способност за обработка на контекст, способен да обработва до 128K контекстна информация, особено подходящ за дълги текстове, изискващи цялостен анализ и дългосрочна логическа свързаност, предоставяйки гладка и последователна логика и разнообразна поддръжка на цитати в сложни текстови комуникации."
  },
+  "pro-deepseek-r1": {
+    "description": "Специализиран модел за корпоративни услуги, включващ паралелна обработка."
+  },
+  "pro-deepseek-v3": {
+    "description": "Специализиран модел за корпоративни услуги, включващ паралелна обработка."
+  },
  "qvq-72b-preview": {
    "description": "QVQ моделът е експериментален изследователски модел, разработен от екипа на Qwen, фокусиран върху повишаване на визуалните способности за разсъждение, особено в областта на математическото разсъждение."
  },
@@ -2633,6 +2663,12 @@
  "qvq-plus": {
    "description": "Модел за визуално разсъждение. Поддържа визуален вход и изход на мисловни вериги. Версия plus, пусната след модела qvq-max, предлага по-бързо разсъждение и по-добър баланс между ефективност и разходи в сравнение с qvq-max."
  },
+  "qwen-3-32b": {
+    "description": "Qwen 3 32B: модел от серията Qwen с отлична производителност при многоезични и програмни задачи, подходящ за средномащабна продукционна употреба."
+  },
+  "qwen-3-coder-480b": {
+    "description": "Qwen 3 Coder 480B: модел с дълъг контекст, предназначен за генериране на код и сложни програмни задачи."
+  },
  "qwen-coder-plus": {
    "description": "Tongyi Qianwen модел за кодиране."
  },
@@ -2753,12 +2789,6 @@
  "qwen2": {
    "description": "Qwen2 е новото поколение голям езиков модел на Alibaba, предлагащ отлична производителност за разнообразни приложения."
  },
-  "qwen2-72b-instruct": {
-    "description": "Qwen2 е новият серий на големи модели за език, предложен от екипа Qwen. Той се основава на архитектурата Transformer и използва SwiGLU активационна функция, внимание QKV смещение (attention QKV bias), групово запитване на внимание (group query attention), смесени техники за внимание с превъртващи се прозорци (mixture of sliding window attention) и пълно внимание. Освен това, екипът Qwen също е подобрал токенизатора, който поддържа множество езици и код."
-  },
-  "qwen2-7b-instruct": {
-    "description": "Qwen2 е новият серийен модел за големи езици, представен от екипа Qwen. Той се основава на архитектурата Transformer и използва SwiGLU активационна функция, внимание с QKV смещение (attention QKV bias), групово внимание за заявки (group query attention), смесени техники за обръщане на внимание с превъртващи се прозорци (mixture of sliding window attention) и пълно внимание. Освен това, екипът Qwen е подобрил токенизатора, който поддържа множество езици и код."
-  },
  "qwen2.5": {
    "description": "Qwen2.5 е новото поколение мащабен езиков модел на Alibaba, който предлага отлична производителност, за да отговори на разнообразни приложни нужди."
  },
@@ -2897,6 +2927,12 @@
  "qwen3-next-80b-a3b-thinking": {
    "description": "Базирано на Qwen3, ново поколение отворен модел с мисловен режим, който подобрява спазването на инструкции и предоставя по-кратки и точни обобщения в сравнение с предишната версия (Tongyi Qianwen 3-235B-A22B-Thinking-2507)."
  },
+  "qwen3-vl-235b-a22b-instruct": {
+    "description": "Qwen3 VL 235B A22B Instruct е мултимодален модел, разработен от Tongyi Qianwen, който поддържа визуално разбиране и извеждане на заключения."
+  },
+  "qwen3-vl-235b-a22b-thinking": {
+    "description": "Qwen3 VL 235B A22B Thinking е мултимодален модел за извеждане на заключения, разработен от Tongyi Qianwen, с поддръжка на визуално разбиране и логическо мислене."
+  },
  "qwen3-vl-plus": {
    "description": "Tongyi Qianwen VL е текстов генеративен модел с визуални (изображения) разбирания, който не само може да извършва OCR (разпознаване на текст в изображения), но и да обобщава и прави изводи, например извличане на атрибути от снимки на продукти или решаване на задачи по математика от изображения."
  },
@@ -3014,6 +3050,9 @@
  "step-r1-v-mini": {
    "description": "Този модел е мощен модел за разсъждение с отлични способности за разбиране на изображения, способен да обработва информация от изображения и текст, и след дълбочинно разсъждение да генерира текстово съдържание. Моделът показва изключителни резултати в областта на визуалните разсъждения, като същевременно притежава първокласни способности в математиката, кода и текстовите разсъждения. Дължината на контекста е 100k."
  },
+  "step3": {
+    "description": "Step3 е мултимодален модел, разработен от StepStar, с мощни способности за визуално разбиране."
+  },
  "stepfun-ai/step3": {
    "description": "Step3 е авангарден мултимодален модел за разсъждение, публикуван от StepFun (阶跃星辰). Той е изграден върху архитектура на смес от експерти (MoE) с общо 321 милиарда параметъра и 38 милиарда активни параметъра. Моделът е с енд-ту-енд дизайн, целящ минимизиране на разходите за декодиране, като същевременно предоставя водещи резултати във визуално-лингвистичното разсъждение. Чрез кооперативния дизайн на многоматрично факторизирано внимание (MFA) и декуплиране на внимание и FFN (AFD), Step3 поддържа отлична ефективност както на флагмански, така и на по-бюджетни ускорители. По време на предварителното обучение Step3 е обработил над 20 трилиона текстови токена и 4 трилиона смесени текстово-изображенчески токена, обхващайки повече от десет езика. Моделът постига водещи резултати сред отворените модели в множество бенчмаркове, включително математика, код и мултимодални задачи."
  },
@@ -3137,9 +3176,6 @@
  "xai/grok-4": {
    "description": "Най-новият и най-велик флагмански модел на xAI, предоставящ ненадмината производителност в естествен език, математика и разсъждения — перфектният универсален играч."
  },
-  "yi-1.5-34b-chat": {
-    "description": "Yi-1.5 е обновена версия на Yi. Тя използва висококачествен корпус от 500B токена за продължителна предварителна обучение на Yi и е майсторски подобрявана с 3M разнообразни примера за fino-tuning."
-  },
  "yi-large": {
    "description": "Новият модел с хиляда милиарда параметри предлага изключителни способности за отговори и генериране на текст."
  },
@@ -32,6 +32,9 @@
  "bfl": {
    "description": "Водеща изследователска лаборатория за авангарден изкуствен интелект, която изгражда визуалната инфраструктура на утрешния ден."
  },
+  "cerebras": {
+    "description": "Cerebras е AI платформа за извеждане, базирана на специализираната си система CS-3, създадена да предоставя най-бързите в света услуги за големи езикови модели (LLM) с незабавен отговор и висок капацитет на обработка. Тя е проектирана да елиминира закъсненията и да ускори сложни AI работни процеси, като генериране на код в реално време и изпълнение на агентски задачи."
+  },
  "cloudflare": {
    "description": "Работа с модели на машинно обучение, задвижвани от безсървърни GPU, в глобалната мрежа на Cloudflare."
  },
@@ -294,6 +294,13 @@
    },
    "title": "Общи настройки"
  },
+  "settingImage": {
+    "defaultCount": {
+      "desc": "Задайте броя на изображенията по подразбиране, които да се генерират при създаване на нова задача в панела за генериране на изображения.",
+      "label": "Брой изображения по подразбиране",
+      "title": "Настройки за AI рисуване"
+    }
+  },
  "settingModel": {
    "enableMaxTokens": {
      "title": "Активиране на ограничението за максимален брой токени"
@@ -549,6 +556,7 @@
    "common": "Общи настройки",
    "experiment": "Експеримент",
    "hotkey": "Бързи клавиши",
+    "image": "AI рисуване",
    "llm": "Езиков модел",
    "provider": "AI доставчик",
    "proxy": "Мрежов прокси",
@@ -236,6 +236,7 @@
  },
  "information": "Community und Informationen",
  "installPWA": "Installiere die Browser-App",
+  "labs": "Labore",
  "lang": {
    "ar": "Arabisch",
    "bg-BG": "Bulgarisch",
@@ -7,6 +7,14 @@
    "desc": "Aktuelle Nachrichten und hochgeladene Dateien im Gespräch löschen",
    "title": "Gesprächsnachrichten löschen"
  },
+  "deleteAndRegenerateMessage": {
+    "desc": "Letzte Nachricht löschen und neu generieren",
+    "title": "Löschen und neu generieren"
+  },
+  "deleteLastMessage": {
+    "desc": "Letzte Nachricht löschen",
+    "title": "Letzte Nachricht löschen"
+  },
  "desktop": {
    "openSettings": {
      "desc": "Öffnet die Anwendungseinstellungsseite",
@@ -0,0 +1,14 @@
+{
+  "desc": "Hier werden wir regelmäßig neue Funktionen vorstellen, die wir gerade erforschen – probieren Sie sie gerne aus!",
+  "features": {
+    "groupChat": {
+      "desc": "Aktivieren Sie die Koordination von Gruppenchats mit mehreren KI-Agenten.",
+      "title": "Gruppenchats (mehrere Agenten)"
+    },
+    "inputMarkdown": {
+      "desc": "Echtzeit-Rendering von Markdown im Eingabefeld (Fettdruck, Codeblöcke, Tabellen usw.).",
+      "title": "Markdown-Rendering im Eingabefeld"
+    }
+  },
+  "title": "Labor"
+}
@@ -284,11 +284,19 @@
          "placeholder": "Bitte Modell-ID eingeben, z. B. gpt-4o oder claude-3.5-sonnet",
          "title": "Modell-ID"
        },
+        "imageOutput": {
+          "extra": "Diese Einstellung aktiviert lediglich die Fähigkeit des Modells zur Bildgenerierung. Die tatsächliche Leistung hängt vollständig vom Modell selbst ab. Bitte testen Sie selbst, ob das Modell über eine nutzbare Bildgenerierungsfunktion verfügt.",
+          "title": "Bildgenerierung unterstützen"
+        },
        "modalTitle": "Benutzerdefinierte Modellkonfiguration",
        "reasoning": {
          "extra": "Diese Konfiguration aktiviert nur die Fähigkeit des Modells zu tiefem Denken. Die tatsächlichen Ergebnisse hängen vollständig vom Modell selbst ab. Bitte testen Sie selbst, ob das Modell über die Fähigkeit zum tiefen Denken verfügt.",
          "title": "Unterstützung für tiefes Denken"
        },
+        "search": {
+          "extra": "Diese Einstellung aktiviert lediglich die integrierte Internetsuchfunktion des Modells. Ob eine integrierte Suchmaschine unterstützt wird, hängt vom Modell selbst ab. Bitte testen Sie selbst, ob die Suchfunktion des Modells verfügbar ist.",
+          "title": "Internetsuche unterstützen"
+        },
        "tokens": {
          "extra": "Maximale Token-Anzahl für das Modell festlegen",
          "title": "Maximales Kontextfenster",
@@ -309,6 +317,10 @@
          "placeholder": "Bitte Modelltyp auswählen",
          "title": "Modelltyp"
        },
+        "video": {
+          "extra": "Diese Einstellung aktiviert lediglich die Videorekognition innerhalb der Anwendung. Ob die Erkennung unterstützt wird, hängt vollständig vom Modell selbst ab. Bitte testen Sie selbst, ob das Modell über eine nutzbare Videorekognitionsfunktion verfügt.",
+          "title": "Videoerkennung unterstützen"
+        },
        "vision": {
          "extra": "Diese Konfiguration aktiviert nur die Bild-Upload-Funktionalität in der Anwendung. Ob die Erkennung unterstützt wird, hängt vollständig vom Modell selbst ab. Bitte testen Sie die Verwendbarkeit der visuellen Erkennungsfähigkeit des Modells selbst.",
          "title": "Visuelle Erkennung unterstützen"
@@ -704,6 +704,9 @@
  "azure-DeepSeek-R1-0528": {
    "description": "Bereitgestellt von Microsoft; Das DeepSeek R1 Modell wurde in einer kleinen Versionsaktualisierung verbessert, die aktuelle Version ist DeepSeek-R1-0528. Im neuesten Update wurde die Rechentiefe und Inferenzfähigkeit von DeepSeek R1 durch Erhöhung der Rechenressourcen und Einführung eines Algorithmus-Optimierungsmechanismus in der Nachtrainingsphase erheblich gesteigert. Dieses Modell zeigt hervorragende Leistungen in mehreren Benchmark-Tests wie Mathematik, Programmierung und allgemeiner Logik und nähert sich in der Gesamtleistung führenden Modellen wie O3 und Gemini 2.5 Pro an."
  },
+  "baichuan-m2-32b": {
+    "description": "Baichuan M2 32B ist ein hybrides Expertenmodell von Baichuan Intelligence mit leistungsstarken Fähigkeiten im logischen Schlussfolgern."
+  },
  "baichuan/baichuan2-13b-chat": {
    "description": "Baichuan-13B ist ein Open-Source-Sprachmodell mit 13 Milliarden Parametern, das von Baichuan Intelligence entwickelt wurde und in autorisierten chinesischen und englischen Benchmarks die besten Ergebnisse in seiner Größenordnung erzielt hat."
  },
@@ -728,12 +731,6 @@
  "charglm-4": {
    "description": "CharGLM-4 wurde speziell für Rollenspiele und emotionale Begleitung entwickelt, unterstützt extrem lange Mehrfachgedächtnisse und personalisierte Dialoge und findet breite Anwendung."
  },
-  "chatglm3": {
-    "description": "ChatGLM3 ist ein proprietäres Modell, das von der KI-Forschungsgruppe Zhipu AI und dem KEG-Labor der Tsinghua-Universität veröffentlicht wurde. Es wurde durch umfangreiche Vortrainings mit chinesischen und englischen Bezeichnern sowie durch die Anpassung an menschliche Präferenzen entwickelt. Im Vergleich zum ersten Modell erzielte es Verbesserungen von 16 %, 36 % und 280 % in den Benchmarks MMLU, C-Eval und GSM8K und steht an der Spitze der chinesischen Aufgabenliste C-Eval. Es eignet sich für Szenarien, die hohe Anforderungen an das Wissensvolumen, die Inferenzfähigkeit und die Kreativität stellen, wie z. B. die Erstellung von Werbetexten, das Schreiben von Romanen, wissensbasiertes Schreiben und die Generierung von Code."
-  },
-  "chatglm3-6b-base": {
-    "description": "ChatGLM3-6b-base ist das neueste Modell der ChatGLM-Serie mit 6 Milliarden Parametern, entwickelt von Zhipu."
-  },
  "chatgpt-4o-latest": {
    "description": "ChatGPT-4o ist ein dynamisches Modell, das in Echtzeit aktualisiert wird, um die neueste Version zu gewährleisten. Es kombiniert starke Sprachverständnis- und Generierungsfähigkeiten und eignet sich für großangelegte Anwendungsszenarien, einschließlich Kundenservice, Bildung und technische Unterstützung."
  },
@@ -938,6 +935,9 @@
  "deepseek-ai/DeepSeek-V3.1-Terminus": {
    "description": "DeepSeek-V3.1-Terminus ist eine aktualisierte Version des V3.1-Modells von DeepSeek, positioniert als hybrides Agenten-Großsprachmodell. Dieses Update konzentriert sich darauf, auf Nutzerfeedback basierende Probleme zu beheben und die Stabilität zu verbessern, während die ursprünglichen Modellfähigkeiten erhalten bleiben. Es verbessert deutlich die Sprachkonsistenz und reduziert das Vermischen von Chinesisch und Englisch sowie das Auftreten ungewöhnlicher Zeichen. Das Modell integriert den „Denkmodus“ (Thinking Mode) und den „Nicht-Denkmodus“ (Non-thinking Mode), zwischen denen Nutzer flexibel über Chatvorlagen wechseln können, um unterschiedlichen Aufgaben gerecht zu werden. Als wichtige Optimierung verbessert V3.1-Terminus die Leistung des Code-Agenten und des Such-Agenten, wodurch diese bei Werkzeugaufrufen und der Ausführung mehrstufiger komplexer Aufgaben zuverlässiger sind."
  },
+  "deepseek-ai/DeepSeek-V3.2-Exp": {
+    "description": "Das DeepSeek V3.2 Exp Modell ist eine hybride Inferenzarchitektur, die sowohl Denk- als auch Nicht-Denk-Modi unterstützt."
+  },
  "deepseek-ai/deepseek-llm-67b-chat": {
    "description": "DeepSeek 67B ist ein fortschrittliches Modell, das für komplexe Dialoge trainiert wurde."
  },
@@ -947,6 +947,9 @@
  "deepseek-ai/deepseek-v3.1": {
    "description": "DeepSeek V3.1: Ein Inferenzmodell der nächsten Generation, das komplexe Schlussfolgerungen und verknüpfte Denkfähigkeiten verbessert und sich für Aufgaben eignet, die tiefgehende Analysen erfordern."
  },
+  "deepseek-ai/deepseek-v3.1-terminus": {
+    "description": "DeepSeek V3.1: Das nächste Generation von Inferenzmodellen mit verbesserter Fähigkeit zum komplexen Schlussfolgern und vernetztem Denken – ideal für Aufgaben, die tiefgehende Analysen erfordern."
+  },
  "deepseek-ai/deepseek-vl2": {
    "description": "DeepSeek-VL2 ist ein hybrides Expertenmodell (MoE) für visuelle Sprache, das auf DeepSeekMoE-27B basiert und eine spärliche Aktivierung der MoE-Architektur verwendet, um außergewöhnliche Leistungen bei der Aktivierung von nur 4,5 Milliarden Parametern zu erzielen. Dieses Modell zeigt hervorragende Leistungen in mehreren Aufgaben, darunter visuelle Fragenbeantwortung, optische Zeichenerkennung, Dokument-/Tabellen-/Diagrammverständnis und visuelle Lokalisierung."
  },
@@ -1028,6 +1031,9 @@
  "deepseek-v3.1": {
    "description": "DeepSeek-V3.1 ist ein neu eingeführtes hybrides Inferenzmodell von DeepSeek, das zwei Inferenzmodi unterstützt: Denkmodus und Nicht-Denkmodus. Es ist effizienter im Denkprozess als DeepSeek-R1-0528. Durch Post-Training-Optimierung wurden die Nutzung von Agenten-Tools und die Leistung bei Agentenaufgaben erheblich verbessert. Unterstützt ein Kontextfenster von 128k und eine maximale Ausgabelänge von 64k Tokens."
  },
+  "deepseek-v3.1-terminus": {
+    "description": "DeepSeek-V3.1-Terminus ist eine optimierte Version des großen Sprachmodells von DeepSeek, speziell für Endgeräte entwickelt."
+  },
  "deepseek-v3.1:671b": {
    "description": "DeepSeek V3.1: Ein Inferenzmodell der nächsten Generation, das komplexe Schlussfolgerungen und verknüpfte Denkfähigkeiten verbessert und sich für Aufgaben eignet, die tiefgehende Analysen erfordern."
  },
@@ -1190,6 +1196,12 @@
  "ernie-4.0-turbo-8k-preview": {
    "description": "Das von Baidu entwickelte Flaggschiff-Modell für große Sprachmodelle zeigt hervorragende Gesamtergebnisse und ist weit verbreitet in komplexen Aufgabenbereichen anwendbar; es unterstützt die automatische Anbindung an das Baidu-Suchplugin, um die Aktualität der Antwortinformationen zu gewährleisten. Im Vergleich zu ERNIE 4.0 bietet es eine bessere Leistung."
  },
+  "ernie-4.5-21b-a3b": {
+    "description": "ERNIE 4.5 21B A3B ist ein hybrides Expertenmodell von Baidu Wenxin mit herausragenden Fähigkeiten im logischen Denken und in der Mehrsprachigkeit."
+  },
+  "ernie-4.5-300b-a47b": {
+    "description": "ERNIE 4.5 300B A47B ist ein großskaliges hybrides Expertenmodell von Baidu Wenxin mit exzellenten Fähigkeiten im logischen Schlussfolgern."
+  },
  "ernie-4.5-8k-preview": {
    "description": "Das ERNIE 4.5 Modell ist ein neu entwickeltes, natives multimodales Basis-Modell von Baidu, das durch die gemeinsame Modellierung mehrerer Modalitäten eine synergistische Optimierung erreicht und über hervorragende multimodale Verständnisfähigkeiten verfügt; es bietet verbesserte Sprachfähigkeiten, umfassende Verbesserungen in Verständnis, Generierung, Logik und Gedächtnis, sowie signifikante Verbesserungen in der Vermeidung von Halluzinationen, logischen Schlussfolgerungen und Programmierfähigkeiten."
  },
@@ -1446,7 +1458,7 @@
    "description": "GLM-4-0520 ist die neueste Modellversion, die für hochkomplexe und vielfältige Aufgaben konzipiert wurde und hervorragende Leistungen zeigt."
  },
  "glm-4-9b-chat": {
-    "description": "GLM-4-9B-Chat zeigt in den Bereichen Semantik, Mathematik, Schlussfolgerungen, Code und Wissen eine hohe Leistung. Es verfügt auch über Funktionen wie Web-Browsing, Code-Ausführung, benutzerdefinierte Toolaufrufe und langes Textverständnis. Es unterstützt 26 Sprachen, darunter Japanisch, Koreanisch und Deutsch."
+    "description": "GLM-4-9B-Chat bietet hohe Leistung in Bereichen wie Semantik, Mathematik, logisches Denken, Programmierung und Wissen. Es unterstützt Web-Browsing, Code-Ausführung, benutzerdefinierte Tool-Nutzung und Langtext-Inferenz. Unterstützt 26 Sprachen, darunter Japanisch, Koreanisch und Deutsch."
  },
  "glm-4-air": {
    "description": "GLM-4-Air ist eine kosteneffiziente Version, die in der Leistung nahe am GLM-4 liegt und schnelle Geschwindigkeiten zu einem erschwinglichen Preis bietet."
@@ -1529,6 +1541,9 @@
  "glm-zero-preview": {
    "description": "GLM-Zero-Preview verfügt über starke Fähigkeiten zur komplexen Schlussfolgerung und zeigt hervorragende Leistungen in den Bereichen logisches Denken, Mathematik und Programmierung."
  },
+  "glm4.6:355b": {
+    "description": "Das neueste Flaggschiffmodell GLM-4.6 (355B) von Zhipu übertrifft seinen Vorgänger in den Bereichen fortgeschrittenes Codieren, Verarbeitung langer Texte, logisches Schlussfolgern und Agentenfähigkeiten deutlich. Besonders im Bereich Programmierung erreicht es das Niveau von Claude Sonnet 4 und zählt damit zu den führenden Coding-Modellen in China."
+  },
  "google/gemini-2.0-flash": {
    "description": "Gemini 2.0 Flash bietet Funktionen der nächsten Generation und Verbesserungen, darunter herausragende Geschwindigkeit, integrierte Werkzeugnutzung, multimodale Generierung und ein Kontextfenster von 1 Million Tokens."
  },
@@ -1730,14 +1745,23 @@
  "gpt-5-nano": {
    "description": "Die schnellste und kostengünstigste Version von GPT-5. Besonders geeignet für Anwendungen, die schnelle Reaktionen und Kostenbewusstsein erfordern."
  },
+  "gpt-5-pro": {
+    "description": "GPT-5 Pro nutzt mehr Rechenleistung für tiefgreifendere Überlegungen und liefert kontinuierlich bessere Antworten."
+  },
  "gpt-audio": {
    "description": "GPT Audio ist ein universelles Chatmodell für Audioeingabe und -ausgabe, das Audio-I/O in der Chat Completions API unterstützt."
  },
  "gpt-image-1": {
    "description": "ChatGPT natives multimodales Bildgenerierungsmodell"
  },
+  "gpt-image-1-mini": {
+    "description": "Kostengünstigere Version von GPT Image 1 mit nativer Unterstützung für Text- und Bildeingaben sowie Bildausgaben."
+  },
  "gpt-oss-120b": {
-    "description": "GPT-OSS-120B MXFP4 quantisierte Transformer-Struktur, die auch bei begrenzten Ressourcen starke Leistung beibehält."
+    "description": "Dieses Modell erfordert eine Zugangsanfrage. GPT-OSS-120B ist ein quelloffenes, großskaliges Sprachmodell von OpenAI mit leistungsstarken Textgenerierungsfähigkeiten."
+  },
+  "gpt-oss-20b": {
+    "description": "Dieses Modell erfordert eine Zugangsanfrage. GPT-OSS-20B ist ein quelloffenes, mittelgroßes Sprachmodell von OpenAI mit effizienter Textgenerierung."
  },
  "gpt-oss:120b": {
    "description": "GPT-OSS 120B ist ein von OpenAI veröffentlichtes großes Open-Source-Sprachmodell, das die MXFP4-Quantisierungstechnologie verwendet und als Flaggschiff-Modell gilt. Es erfordert den Betrieb auf Multi-GPU- oder Hochleistungs-Workstation-Umgebungen und bietet herausragende Leistungen bei komplexen Inferenzaufgaben, Codegenerierung und mehrsprachiger Verarbeitung. Es unterstützt fortgeschrittene Funktionsaufrufe und die Integration von Werkzeugen."
@@ -1748,9 +1772,6 @@
  "gpt-realtime": {
    "description": "Universelles Echtzeitmodell, das Echtzeit-Text- und Audioeingabe/-ausgabe unterstützt und zudem Bildinput ermöglicht."
  },
-  "grok-2-1212": {
-    "description": "Dieses Modell hat Verbesserungen in Bezug auf Genauigkeit, Befolgung von Anweisungen und Mehrsprachigkeit erfahren."
-  },
  "grok-2-image-1212": {
    "description": "Unser neuestes Bildgenerierungsmodell kann lebendige und realistische Bilder basierend auf Text-Prompts erzeugen. Es zeigt hervorragende Leistungen in den Bereichen Marketing, soziale Medien und Unterhaltung."
  },
@@ -1760,15 +1781,9 @@
  "grok-3": {
    "description": "Ein Flaggschiffmodell, spezialisiert auf Datenextraktion, Programmierung und Textzusammenfassung für Unternehmensanwendungen, mit tiefgreifendem Wissen in den Bereichen Finanzen, Medizin, Recht und Wissenschaft."
  },
-  "grok-3-fast": {
-    "description": "Ein Flaggschiffmodell, spezialisiert auf Datenextraktion, Programmierung und Textzusammenfassung für Unternehmensanwendungen, mit tiefgreifendem Wissen in den Bereichen Finanzen, Medizin, Recht und Wissenschaft."
-  },
  "grok-3-mini": {
    "description": "Ein leichtgewichtiges Modell, das vor der Antwort nachdenkt. Es arbeitet schnell und intelligent, eignet sich für logische Aufgaben ohne tiefgehendes Fachwissen und ermöglicht die Nachverfolgung des ursprünglichen Denkprozesses."
  },
-  "grok-3-mini-fast": {
-    "description": "Ein leichtgewichtiges Modell, das vor der Antwort nachdenkt. Es arbeitet schnell und intelligent, eignet sich für logische Aufgaben ohne tiefgehendes Fachwissen und ermöglicht die Nachverfolgung des ursprünglichen Denkprozesses."
-  },
  "grok-4": {
    "description": "Unser neuestes und leistungsstärkstes Flaggschiffmodell, das in der Verarbeitung natürlicher Sprache, mathematischen Berechnungen und logischem Denken herausragende Leistungen erbringt – ein perfekter Allrounder."
  },
@@ -1851,7 +1866,7 @@
    "description": "Die neueste Version des hunyuan t1-vision multimodalen tiefen Denkmodells unterstützt native multimodale Chain-of-Thought-Mechanismen und bietet im Vergleich zur vorherigen Standardversion umfassende Verbesserungen."
  },
  "hunyuan-t1-vision-20250916": {
-    "description": "Hunyuan multimodales Verständnis- und Tiefdenkmodell, unterstützt native multimodale lange Gedankenketten, ist spezialisiert auf verschiedene Bildinferenzszenarien und verbessert sich im Vergleich zum Schnelldenkmodell umfassend bei naturwissenschaftlichen Problemen."
+    "description": "Die neueste Version des Hunyuan t1-vision Modells für visuelles, tiefes Denken bietet im Vergleich zur Vorgängerversion umfassende Verbesserungen bei allgemeinen Bild-Text-Fragen, visueller Lokalisierung, OCR, Diagrammverarbeitung, Aufgabenlösung per Foto und kreativer Bildinterpretation. Die Fähigkeiten in Englisch und kleineren Sprachen wurden deutlich optimiert."
  },
  "hunyuan-turbo": {
    "description": "Die Vorschauversion des neuen großen Sprachmodells von Hunyuan verwendet eine neuartige hybride Expertenmodellstruktur (MoE) und bietet im Vergleich zu Hunyuan-Pro eine schnellere Inferenz und bessere Leistung."
@@ -1964,6 +1979,9 @@
  "kimi-k2-0905-preview": {
    "description": "Das Modell kimi-k2-0905-preview hat eine Kontextlänge von 256k, verfügt über stärkere Agentic-Coding-Fähigkeiten, eine herausragendere Ästhetik und Praktikabilität von Frontend-Code sowie ein besseres Kontextverständnis."
  },
+  "kimi-k2-instruct": {
+    "description": "Kimi K2 Instruct ist ein großes Sprachmodell von Moonshot AI mit der Fähigkeit zur Verarbeitung extrem langer Kontexte."
+  },
  "kimi-k2-turbo-preview": {
    "description": "kimi-k2 ist ein Basis-Modell mit MoE-Architektur und besonders starken Fähigkeiten im Bereich Code und Agenten. Es verfügt über insgesamt 1T Parameter und 32B aktivierte Parameter. In Benchmark-Tests der wichtigsten Kategorien – allgemeines Wissens-Reasoning, Programmierung, Mathematik und Agenten – übertrifft das K2-Modell die Leistung anderer gängiger Open‑Source‑Modelle."
  },
@@ -1985,9 +2003,6 @@
  "lite": {
    "description": "Spark Lite ist ein leichtgewichtiges großes Sprachmodell mit extrem niedriger Latenz und effizienter Verarbeitung, das vollständig kostenlos und offen ist und Echtzeitsuchfunktionen unterstützt. Seine schnelle Reaktionsfähigkeit macht es besonders geeignet für Inferenzanwendungen und Modellanpassungen auf Geräten mit geringer Rechenleistung und bietet den Nutzern ein hervorragendes Kosten-Nutzen-Verhältnis sowie ein intelligentes Erlebnis, insbesondere in den Bereichen Wissensabfragen, Inhaltserstellung und Suchszenarien."
  },
-  "llama-2-7b-chat": {
-    "description": "Llama2 ist eine Serie großer Sprachmodelle (LLM), die von Meta entwickelt und als Open Source veröffentlicht wurden. Diese Serie umfasst generative Textmodelle mit einer Parameteranzahl von 7 Milliarden bis 70 Milliarden, die vortrainiert und feinjustiert wurden. Architekturtechnisch ist Llama2 ein autoregressives Sprachmodell, das eine optimierte Transformer-Architektur verwendet. Die angepassten Versionen nutzen überwachte Feinabstimmung (SFT) und Reinforcement Learning mit menschlichem Feedback (RLHF), um den menschlichen Vorlieben für Nützlichkeit und Sicherheit zu entsprechen. Llama2 übertrifft die Leistung der Llama-Serie in mehreren akademischen Datensätzen und bietet Inspiration für die Entwicklung und Gestaltung vieler anderer Modelle."
-  },
  "llama-3.1-70b-versatile": {
    "description": "Llama 3.1 70B bietet leistungsstarke KI-Schlussfolgerungsfähigkeiten, die für komplexe Anwendungen geeignet sind und eine hohe Rechenverarbeitung bei gleichzeitiger Effizienz und Genauigkeit unterstützen."
  },
@@ -2012,8 +2027,8 @@
  "llama-3.2-vision-instruct": {
    "description": "Das Llama 3.2-Vision-Instruct-Modell ist optimiert für visuelle Erkennung, Bildschlussfolgerungen, Bildbeschreibungen und das Beantworten von allgemeinen Fragen, die mit Bildern zusammenhängen."
  },
-  "llama-3.3-70b-instruct": {
-    "description": "Llama 3.3 ist das fortschrittlichste mehrsprachige Open-Source-Sprachmodell der Llama-Serie, das eine Leistung bietet, die mit einem 405B-Modell vergleichbar ist, und das zu extrem niedrigen Kosten. Es basiert auf der Transformer-Architektur und verbessert die Nützlichkeit und Sicherheit durch überwachte Feinabstimmung (SFT) und verstärkendes Lernen mit menschlichem Feedback (RLHF). Die auf Anweisungen optimierte Version ist speziell für mehrsprachige Dialoge optimiert und übertrifft in mehreren Branchenbenchmarks viele Open-Source- und geschlossene Chat-Modelle. Das Wissensdatum endet im Dezember 2023."
+  "llama-3.3-70b": {
+    "description": "Llama 3.3 70B: Ein mittelgroßes Llama-Modell, das eine ausgewogene Kombination aus logischem Denken und hoher Verarbeitungskapazität bietet."
  },
  "llama-3.3-70b-versatile": {
    "description": "Das Meta Llama 3.3 ist ein mehrsprachiges, großes Sprachmodell (LLM), das aus einem vortrainierten und anweisungsorientierten generativen Modell mit 70B (Text-Eingabe/Text-Ausgabe) besteht. Das anweisungsorientierte Modell von Llama 3.3 ist für mehrsprachige Dialoganwendungen optimiert und übertrifft viele verfügbare Open-Source- und Closed-Source-Chat-Modelle bei gängigen Branchenbenchmarks."
@@ -2021,6 +2036,12 @@
  "llama-3.3-instruct": {
    "description": "Das Llama 3.3 Instruct-Modell ist für Dialogszenarien optimiert und übertrifft in gängigen Branchenbenchmarks viele bestehende Open-Source-Chatmodelle."
  },
+  "llama-4-maverick-17b-128e-instruct": {
+    "description": "Llama 4 Maverick: Ein leistungsstarkes Modell der Llama-Serie, ideal für fortgeschrittenes logisches Denken, komplexe Problemlösungen und Aufgaben mit Anweisungsbefolgung."
+  },
+  "llama-4-scout-17b-16e-instruct": {
+    "description": "Llama 4 Scout: Ein leistungsstarkes Modell der Llama-Serie, optimiert für Szenarien mit hoher Verarbeitungsgeschwindigkeit und niedriger Latenz."
+  },
  "llama3-70b-8192": {
    "description": "Meta Llama 3 70B bietet unvergleichliche Fähigkeiten zur Verarbeitung von Komplexität und ist maßgeschneidert für Projekte mit hohen Anforderungen."
  },
@@ -2036,6 +2057,9 @@
  "llama3.1": {
    "description": "Llama 3.1 ist ein führendes Modell von Meta, das bis zu 405B Parameter unterstützt und in den Bereichen komplexe Dialoge, mehrsprachige Übersetzungen und Datenanalysen eingesetzt werden kann."
  },
+  "llama3.1-8b": {
+    "description": "Llama 3.1 8B: Eine kompakte, latenzarme Variante des Llama-Modells, ideal für leichte Online-Inferenz- und Interaktionsszenarien."
+  },
  "llama3.1:405b": {
    "description": "Llama 3.1 ist ein führendes Modell von Meta, das bis zu 405B Parameter unterstützt und in den Bereichen komplexe Dialoge, mehrsprachige Übersetzungen und Datenanalysen eingesetzt werden kann."
  },
@@ -2067,7 +2091,7 @@
    "description": "Spark Max 32K bietet eine große Kontextverarbeitungsfähigkeit mit verbesserter Kontextverständnis und logischer Schlussfolgerungsfähigkeit und unterstützt Texteingaben von bis zu 32K Tokens, was es ideal für das Lesen langer Dokumente und private Wissensabfragen macht."
  },
  "megrez-3b-instruct": {
-    "description": "Megrez-3B-Instruct ist ein großes Sprachmodell, das vollständig von Wuxin XinQiong trainiert wurde. Megrez-3B-Instruct zielt darauf ab, durch die Idee der Hardware-Software-Kooperation eine schnelle Inferenz, ein kompaktes Design und eine benutzerfreundliche Endgerätlösung zu schaffen."
+    "description": "Megrez 3B Instruct ist ein effizientes Modell mit geringer Parameteranzahl, entwickelt von Wuwen Xinqiong."
  },
  "meta-llama-3-70b-instruct": {
    "description": "Ein leistungsstarkes Modell mit 70 Milliarden Parametern, das in den Bereichen Schlussfolgerungen, Programmierung und breiten Sprachanwendungen herausragt."
@@ -2624,6 +2648,12 @@
  "pro-128k": {
    "description": "Spark Pro 128K verfügt über eine außergewöhnliche Kontextverarbeitungsfähigkeit und kann bis zu 128K Kontextinformationen verarbeiten, was es besonders geeignet für die Analyse langer Texte und die Verarbeitung langfristiger logischer Zusammenhänge macht. Es bietet in komplexen Textkommunikationen flüssige und konsistente Logik sowie vielfältige Unterstützung für Zitate."
  },
+  "pro-deepseek-r1": {
+    "description": "Modell für exklusive Unternehmensdienste, inklusive paralleler Serviceunterstützung."
+  },
+  "pro-deepseek-v3": {
+    "description": "Modell für exklusive Unternehmensdienste, inklusive paralleler Serviceunterstützung."
+  },
  "qvq-72b-preview": {
    "description": "Das QVQ-Modell ist ein experimentelles Forschungsmodell, das vom Qwen-Team entwickelt wurde und sich auf die Verbesserung der visuellen Schlussfolgerungsfähigkeiten konzentriert, insbesondere im Bereich der mathematischen Schlussfolgerungen."
  },
@@ -2633,6 +2663,12 @@
  "qvq-plus": {
    "description": "Visuelles Schlussfolgerungsmodell. Unterstützt visuelle Eingaben und Denkprozess-Ausgaben. Die Plus-Version, die auf dem qvq-max-Modell basiert, bietet schnellere Inferenzgeschwindigkeit sowie ein ausgewogeneres Verhältnis von Leistung und Kosten."
  },
+  "qwen-3-32b": {
+    "description": "Qwen 3 32B: Ein Modell der Qwen-Serie mit starker Leistung bei mehrsprachigen und Programmieraufgaben, geeignet für mittelgroße produktive Einsätze."
+  },
+  "qwen-3-coder-480b": {
+    "description": "Qwen 3 Coder 480B: Ein Modell mit langem Kontext, das für Codegenerierung und komplexe Programmieraufgaben entwickelt wurde."
+  },
  "qwen-coder-plus": {
    "description": "Tongyi Qianwen Codierungsmodell."
  },
@@ -2753,12 +2789,6 @@
  "qwen2": {
    "description": "Qwen2 ist das neue große Sprachmodell von Alibaba, das mit hervorragender Leistung eine Vielzahl von Anwendungsanforderungen unterstützt."
  },
-  "qwen2-72b-instruct": {
-    "description": "Qwen2 ist die neueste Generation von Sprachmodellen, die vom Qwen-Team entwickelt wurde. Es basiert auf der Transformer-Architektur und verwendet Techniken wie die SwiGLU-Aktivierungsfunktion, die Aufmerksamkeits-QKV-Bias (attention QKV bias), die gruppenbasierte Abfrageaufmerksamkeit (group query attention) und eine Mischung aus rutschendem Fenster und voller Aufmerksamkeit (mixture of sliding window attention and full attention). Darüber hinaus hat das Qwen-Team den Tokenizer verbessert, der für die Verarbeitung von natürlicher Sprache und Code optimiert ist."
-  },
-  "qwen2-7b-instruct": {
-    "description": "Qwen2 ist die neueste Serie von großen Sprachmodellen, die vom Qwen-Team entwickelt wurde. Es basiert auf der Transformer-Architektur und verwendet Techniken wie die SwiGLU-Aktivierungsfunktion, die Aufmerksamkeits-QKV-Bias (attention QKV bias), die Gruppenabfrageaufmerksamkeit (group query attention) und eine Mischung aus rutschendem Fenster und voller Aufmerksamkeit (mixture of sliding window attention and full attention). Zudem hat das Qwen-Team den Tokenizer verbessert, um mehrere natürliche Sprachen und Code besser zu verarbeiten."
-  },
  "qwen2.5": {
    "description": "Qwen2.5 ist das neue, groß angelegte Sprachmodell der Alibaba-Gruppe, das hervorragende Leistungen zur Unterstützung vielfältiger Anwendungsbedürfnisse bietet."
  },
@@ -2897,6 +2927,12 @@
  "qwen3-next-80b-a3b-thinking": {
    "description": "Ein neues Open-Source-Modell der nächsten Generation im Denkmodus basierend auf Qwen3. Im Vergleich zur vorherigen Version (Tongyi Qianwen 3-235B-A22B-Thinking-2507) wurde die Befehlsbefolgung verbessert und die Modellantworten sind prägnanter zusammengefasst."
  },
+  "qwen3-vl-235b-a22b-instruct": {
+    "description": "Qwen3 VL 235B A22B Instruct ist ein multimodales Modell von Tongyi Qianwen mit Unterstützung für visuelles Verständnis und logisches Denken."
+  },
+  "qwen3-vl-235b-a22b-thinking": {
+    "description": "Qwen3 VL 235B A22B Thinking ist ein multimodales Inferenzmodell von Tongyi Qianwen mit Unterstützung für visuelles Verständnis und logisches Denken."
+  },
  "qwen3-vl-plus": {
    "description": "Tongyi Qianwen VL ist ein Textgenerierungsmodell mit visuellen (Bild-)Verständnisfähigkeiten. Es kann nicht nur OCR (Texterkennung in Bildern) durchführen, sondern auch weiterführende Zusammenfassungen und Schlussfolgerungen ziehen, z. B. Attribute aus Produktfotos extrahieren oder Aufgaben anhand von Übungsbildern lösen."
  },
@@ -3014,6 +3050,9 @@
  "step-r1-v-mini": {
    "description": "Dieses Modell ist ein leistungsstarkes Schlussfolgerungsmodell mit starker Bildverständnisfähigkeit, das in der Lage ist, Bild- und Textinformationen zu verarbeiten und nach tiefem Denken Textinhalte zu generieren. Es zeigt herausragende Leistungen im Bereich der visuellen Schlussfolgerung und verfügt über erstklassige Fähigkeiten in Mathematik, Programmierung und Textschlussfolgerung. Die Kontextlänge beträgt 100k."
  },
+  "step3": {
+    "description": "Step3 ist ein multimodales Modell von StepStar mit leistungsstarken Fähigkeiten im visuellen Verständnis."
+  },
  "stepfun-ai/step3": {
    "description": "Step3 ist ein wegweisendes multimodales Inferenzmodell, veröffentlicht von StepFun (阶跃星辰). Es basiert auf einer Mixture-of-Experts-(MoE)-Architektur mit insgesamt 321 Milliarden Parametern und 38 Milliarden Aktivierungsparametern. Das Modell ist als End-to-End-System konzipiert, um die Decodierungskosten zu minimieren und gleichzeitig erstklassige Leistung bei visuell-sprachlicher Inferenz zu bieten. Durch die synergistische Kombination von Multi-Matrix-Factorization-Attention (MFA) und Attention-FFN-Dekopplung (AFD) erzielt Step3 sowohl auf High-End- als auch auf ressourcenbeschränkten Beschleunigern hohe Effizienz. In der Vortrainingsphase verarbeitete Step3 mehr als 20 Billionen Text-Tokens und 4 Billionen multimodale (Bild‑Text) Tokens und deckt dabei über zehn Sprachen ab. Das Modell erzielt in zahlreichen Benchmarks — etwa in Mathematik, Programmierung und Multimodalität — führende Ergebnisse unter den Open‑Source‑Modellen."
  },
@@ -3137,9 +3176,6 @@
  "xai/grok-4": {
    "description": "xAIs neuestes und bestes Flaggschiffmodell mit unvergleichlicher Leistung in natürlicher Sprache, Mathematik und Inferenz – der perfekte Allrounder."
  },
-  "yi-1.5-34b-chat": {
-    "description": "Yi-1.5 ist eine verbesserte Version von Yi. Es wurde mit einem hochwertigen Korpus von 500B Tokens auf Yi fortlaufend vortrainiert und auf 3M diversen Feinabstimmungsbeispielen feinjustiert."
-  },
  "yi-large": {
    "description": "Das brandneue Modell mit einer Billion Parametern bietet außergewöhnliche Frage- und Textgenerierungsfähigkeiten."
  },
@@ -32,6 +32,9 @@
  "bfl": {
    "description": "Ein führendes, an vorderster Front tätiges KI-Forschungslabor, das die visuelle Infrastruktur von morgen gestaltet."
  },
+  "cerebras": {
+    "description": "Cerebras ist eine KI-Inferenzplattform, die auf dem spezialisierten CS-3-System basiert. Sie wurde entwickelt, um weltweit die schnellsten, in Echtzeit reagierenden und hochdurchsatzfähigen LLM-Dienste bereitzustellen. Ziel ist es, Latenzen zu eliminieren und komplexe KI-Workflows wie die Echtzeit-Codegenerierung und Agentenaufgaben zu beschleunigen."
+  },
  "cloudflare": {
    "description": "Führen Sie von serverlosen GPUs betriebene Machine-Learning-Modelle im globalen Netzwerk von Cloudflare aus."
  },
@@ -294,6 +294,13 @@
    },
    "title": "Allgemeine Einstellungen"
  },
+  "settingImage": {
+    "defaultCount": {
+      "desc": "Legen Sie die Standardanzahl der Bilder fest, die beim Erstellen einer neuen Aufgabe im Bildgenerierungs-Panel erzeugt werden.",
+      "label": "Standardanzahl der Bilder",
+      "title": "AI-Zeichnungseinstellungen"
+    }
+  },
  "settingModel": {
    "enableMaxTokens": {
      "title": "Maximale Token pro Antwort aktivieren"
@@ -549,6 +556,7 @@
    "common": "Allgemeine Einstellungen",
    "experiment": "Experiment",
    "hotkey": "Tastenkombinationen",
+    "image": "AI-Zeichnung",
    "llm": "Sprachmodell",
    "provider": "KI-Dienstanbieter",
    "proxy": "Netzwerkproxy",
@@ -236,6 +236,7 @@
  },
  "information": "Community and News",
  "installPWA": "Install browser app",
+  "labs": "Labs",
  "lang": {
    "ar": "Arabic",
    "bg-BG": "Bulgarian",
@@ -7,6 +7,14 @@
    "desc": "Clear the messages and uploaded files from the current conversation",
    "title": "Clear Conversation Messages"
  },
+  "deleteAndRegenerateMessage": {
+    "desc": "Delete the last message and regenerate",
+    "title": "Delete and Regenerate"
+  },
+  "deleteLastMessage": {
+    "desc": "Delete the last message",
+    "title": "Delete Last Message"
+  },
  "desktop": {
    "openSettings": {
      "desc": "Open the application settings page",
@@ -0,0 +1,14 @@
+{
+  "desc": "Here you'll find occasional updates on new features we're exploring—feel free to try them out!",
+  "features": {
+    "groupChat": {
+      "desc": "Enable multi-agent group chat coordination.",
+      "title": "Group Chat (Multi-Agent)"
+    },
+    "inputMarkdown": {
+      "desc": "Render Markdown in the input area in real time (bold text, code blocks, tables, etc.).",
+      "title": "Input Markdown Rendering"
+    }
+  },
+  "title": "Labs"
+}
@@ -284,11 +284,19 @@
          "placeholder": "Please enter the model ID, e.g., gpt-4o or claude-3.5-sonnet",
          "title": "Model ID"
        },
+        "imageOutput": {
+          "extra": "This setting enables the model's image generation capability only. The actual performance depends entirely on the model itself. Please test the model to determine if it supports image generation.",
+          "title": "Supports Image Generation"
+        },
        "modalTitle": "Custom Model Configuration",
        "reasoning": {
          "extra": "This configuration will enable the model's deep thinking capabilities, and the specific effects depend entirely on the model itself. Please test whether this model has usable deep thinking abilities.",
          "title": "Support Deep Thinking"
        },
+        "search": {
+          "extra": "This setting enables the model's built-in web search capability. Whether the built-in search engine is supported depends on the model itself. Please test the model to verify the availability of this feature.",
+          "title": "Supports Web Search"
+        },
        "tokens": {
          "extra": "Set the maximum number of tokens supported by the model",
          "title": "Maximum Context Window",
@@ -309,6 +317,10 @@
          "placeholder": "Please select a model type",
          "title": "Model Type"
        },
+        "video": {
+          "extra": "This setting enables video recognition configuration within the application. Whether video recognition is supported depends entirely on the model itself. Please test the model to verify the availability of this feature.",
+          "title": "Supports Video Recognition"
+        },
        "vision": {
          "extra": "This configuration will only enable image upload capabilities in the application. Whether recognition is supported depends entirely on the model itself. Please test the visual recognition capabilities of the model yourself.",
          "title": "Support Vision"
@@ -704,6 +704,9 @@
  "azure-DeepSeek-R1-0528": {
    "description": "Deployed and provided by Microsoft; the DeepSeek R1 model has undergone a minor version upgrade, currently at DeepSeek-R1-0528. In the latest update, DeepSeek R1 significantly improves inference depth and reasoning ability by increasing computational resources and introducing algorithmic optimizations in the post-training phase. This model excels in benchmarks including mathematics, programming, and general logic, with overall performance approaching leading models such as O3 and Gemini 2.5 Pro."
  },
+  "baichuan-m2-32b": {
+    "description": "Baichuan M2 32B is a Mixture of Experts model developed by Baichuan Intelligence, featuring powerful reasoning capabilities."
+  },
  "baichuan/baichuan2-13b-chat": {
    "description": "Baichuan-13B is an open-source, commercially usable large language model developed by Baichuan Intelligence, containing 13 billion parameters, achieving the best results in its size on authoritative Chinese and English benchmarks."
  },
@@ -728,12 +731,6 @@
  "charglm-4": {
    "description": "CharGLM-4 is designed for role-playing and emotional companionship, supporting ultra-long multi-turn memory and personalized dialogue, with wide-ranging applications."
  },
-  "chatglm3": {
-    "description": "ChatGLM3 is a closed-source model released by Zhipu AI and Tsinghua KEG Lab. It has been pre-trained on a massive amount of Chinese and English identifiers and fine-tuned with human preference alignment. Compared to the first-generation model, it has achieved improvements of 16%, 36%, and 280% in MMLU, C-Eval, and GSM8K, respectively, and topped the Chinese task leaderboard C-Eval. It is suitable for scenarios that require a high level of knowledge, reasoning, and creativity, such as advertising copywriting, novel writing, knowledge-based writing, and code generation."
-  },
-  "chatglm3-6b-base": {
-    "description": "ChatGLM3-6b-base is the latest generation of the ChatGLM series, a 6 billion parameter open-source base model developed by Zhipu."
-  },
  "chatgpt-4o-latest": {
    "description": "ChatGPT-4o is a dynamic model that updates in real-time to stay current with the latest version. It combines powerful language understanding and generation capabilities, making it suitable for large-scale applications, including customer service, education, and technical support."
  },
@@ -938,6 +935,9 @@
  "deepseek-ai/DeepSeek-V3.1-Terminus": {
    "description": "DeepSeek-V3.1-Terminus is an updated version of the V3.1 model released by DeepSeek, positioned as a hybrid agent large language model. This update focuses on fixing user-reported issues and improving stability while maintaining the model's original capabilities. It significantly enhances language consistency, reducing the mixing of Chinese and English and the occurrence of abnormal characters. The model integrates both \"Thinking Mode\" and \"Non-thinking Mode,\" allowing users to switch flexibly between chat templates to suit different tasks. As a key optimization, V3.1-Terminus improves the performance of the Code Agent and Search Agent, making tool invocation and multi-step complex task execution more reliable."
  },
+  "deepseek-ai/DeepSeek-V3.2-Exp": {
+    "description": "The DeepSeek V3.2 Exp model adopts a hybrid reasoning architecture, supporting both reflective and non-reflective modes."
+  },
  "deepseek-ai/deepseek-llm-67b-chat": {
    "description": "DeepSeek 67B is an advanced model trained for highly complex conversations."
  },
@@ -947,6 +947,9 @@
  "deepseek-ai/deepseek-v3.1": {
    "description": "DeepSeek V3.1: The next-generation reasoning model that enhances complex reasoning and chain-of-thought capabilities, suitable for tasks requiring in-depth analysis."
  },
+  "deepseek-ai/deepseek-v3.1-terminus": {
+    "description": "DeepSeek V3.1: A next-generation reasoning model designed to enhance complex reasoning and chain-of-thought capabilities, ideal for tasks requiring in-depth analysis."
+  },
  "deepseek-ai/deepseek-vl2": {
    "description": "DeepSeek-VL2 is a mixture of experts (MoE) visual language model developed based on DeepSeekMoE-27B, employing a sparsely activated MoE architecture that achieves outstanding performance while activating only 4.5 billion parameters. This model excels in various tasks, including visual question answering, optical character recognition, document/table/chart understanding, and visual localization."
  },
@@ -1028,6 +1031,9 @@
  "deepseek-v3.1": {
    "description": "DeepSeek-V3.1 is a newly launched hybrid reasoning model by DeepSeek, supporting two reasoning modes: thinking and non-thinking. It offers higher thinking efficiency compared to DeepSeek-R1-0528. With post-training optimization, the use of Agent tools and agent task performance have been significantly enhanced. It supports a 128k context window and an output length of up to 64k tokens."
  },
+  "deepseek-v3.1-terminus": {
+    "description": "DeepSeek-V3.1-Terminus is an optimized large language model developed by DeepSeek, specifically tailored for terminal devices."
+  },
  "deepseek-v3.1:671b": {
    "description": "DeepSeek V3.1: The next-generation reasoning model that enhances complex reasoning and chain-of-thought capabilities, suitable for tasks requiring in-depth analysis."
  },
@@ -1190,6 +1196,12 @@
  "ernie-4.0-turbo-8k-preview": {
    "description": "Baidu's flagship ultra-large-scale language model, demonstrating outstanding overall performance, widely applicable to complex task scenarios across various fields; it supports automatic integration with Baidu search plugins to ensure the timeliness of Q&A information. It performs better than ERNIE 4.0 in terms of performance."
  },
+  "ernie-4.5-21b-a3b": {
+    "description": "ERNIE 4.5 21B A3B is a Mixture of Experts model from Baidu's Wenxin series, offering strong reasoning and multilingual capabilities."
+  },
+  "ernie-4.5-300b-a47b": {
+    "description": "ERNIE 4.5 300B A47B is a large-scale Mixture of Experts model from Baidu's Wenxin series, delivering exceptional reasoning performance."
+  },
  "ernie-4.5-8k-preview": {
    "description": "ERNIE 4.5 is Baidu's self-developed next-generation native multimodal foundational model, achieving collaborative optimization through joint modeling of multiple modalities, with excellent multimodal understanding capabilities; it features enhanced language abilities, with significant improvements in understanding, generation, logic, and memory, as well as reduced hallucinations and improved logical reasoning and coding capabilities."
  },
@@ -1446,7 +1458,7 @@
    "description": "GLM-4-0520 is the latest model version designed for highly complex and diverse tasks, demonstrating outstanding performance."
  },
  "glm-4-9b-chat": {
-    "description": "GLM-4-9B-Chat demonstrates high performance across various aspects, including semantics, mathematics, reasoning, coding, and knowledge. It also features web browsing, code execution, custom tool invocation, and long text reasoning, supporting 26 languages including Japanese, Korean, and German."
+    "description": "GLM-4-9B-Chat demonstrates high performance across semantics, mathematics, reasoning, coding, and knowledge. It also supports web browsing, code execution, custom tool invocation, and long-text reasoning. Supports 26 languages including Japanese, Korean, and German."
  },
  "glm-4-air": {
    "description": "GLM-4-Air is a cost-effective version with performance close to GLM-4, offering fast speed at an affordable price."
@@ -1529,6 +1541,9 @@
  "glm-zero-preview": {
    "description": "GLM-Zero-Preview possesses strong complex reasoning abilities, excelling in logical reasoning, mathematics, programming, and other fields."
  },
+  "glm4.6:355b": {
+    "description": "GLM-4.6 (355B), the latest flagship model from Zhipu, delivers comprehensive improvements over its predecessor in advanced coding, long-text processing, reasoning, and agent capabilities. It is particularly aligned with Claude Sonnet 4 in programming performance, making it one of the top coding models in China."
+  },
  "google/gemini-2.0-flash": {
    "description": "Gemini 2.0 Flash offers next-generation features and improvements, including exceptional speed, built-in tool usage, multimodal generation, and a 1 million token context window."
  },
@@ -1730,14 +1745,23 @@
  "gpt-5-nano": {
    "description": "The fastest and most cost-efficient version of GPT-5. Perfectly suited for applications requiring rapid responses and cost sensitivity."
  },
+  "gpt-5-pro": {
+    "description": "GPT-5 Pro leverages greater computational power for deeper reasoning and consistently delivers improved answers."
+  },
  "gpt-audio": {
    "description": "GPT Audio is a general-purpose chat model designed for audio input and output, supporting audio I/O in the Chat Completions API."
  },
  "gpt-image-1": {
    "description": "ChatGPT native multimodal image generation model."
  },
+  "gpt-image-1-mini": {
+    "description": "A more cost-effective version of GPT Image 1, natively supporting both text and image inputs with image generation output."
+  },
  "gpt-oss-120b": {
-    "description": "GPT-OSS-120B MXFP4 quantized Transformer architecture, delivering strong performance even under resource constraints."
+    "description": "Access to this model requires an application. GPT-OSS-120B is an open-source large-scale language model released by OpenAI, known for its powerful text generation capabilities."
+  },
+  "gpt-oss-20b": {
+    "description": "Access to this model requires an application. GPT-OSS-20B is an open-source mid-sized language model from OpenAI, offering efficient text generation."
  },
  "gpt-oss:120b": {
    "description": "GPT-OSS 120B is a large open-source language model released by OpenAI, employing MXFP4 quantization technology as a flagship model. It requires multi-GPU or high-performance workstation environments to operate and delivers outstanding performance in complex reasoning, code generation, and multilingual processing, supporting advanced function calls and tool integration."
@@ -1748,9 +1772,6 @@
  "gpt-realtime": {
    "description": "A general-purpose real-time model supporting real-time text and audio input/output, as well as image input."
  },
-  "grok-2-1212": {
-    "description": "This model has improved in accuracy, instruction adherence, and multilingual capabilities."
-  },
  "grok-2-image-1212": {
    "description": "Our latest image generation model can create vivid and realistic images based on text prompts. It performs excellently in image generation for marketing, social media, and entertainment."
  },
@@ -1760,15 +1781,9 @@
  "grok-3": {
    "description": "A flagship model skilled in data extraction, programming, and text summarization for enterprise applications, with deep knowledge in finance, healthcare, law, and science."
  },
-  "grok-3-fast": {
-    "description": "A flagship model skilled in data extraction, programming, and text summarization for enterprise applications, with deep knowledge in finance, healthcare, law, and science."
-  },
  "grok-3-mini": {
    "description": "A lightweight model that thinks before responding. It runs fast and intelligently, suitable for logical tasks that do not require deep domain knowledge, and can provide raw thought trajectories."
  },
-  "grok-3-mini-fast": {
-    "description": "A lightweight model that thinks before responding. It runs fast and intelligently, suitable for logical tasks that do not require deep domain knowledge, and can provide raw thought trajectories."
-  },
  "grok-4": {
    "description": "Our latest and most powerful flagship model, excelling in natural language processing, mathematical computation, and reasoning — a perfect all-rounder."
  },
@@ -1851,7 +1866,7 @@
    "description": "The latest Hunyuan t1-vision multimodal deep thinking model supports native long Chain-of-Thought reasoning across modalities, comprehensively improving over the previous default version."
  },
  "hunyuan-t1-vision-20250916": {
-    "description": "Hunyuan multimodal deep reasoning model supporting native long chains of thought across modalities, excelling in various image reasoning scenarios and significantly improving performance on science problems compared to fast thinking models."
+    "description": "The latest Hunyuan t1-vision model excels in visual deep reasoning. Compared to the previous version, it offers significant enhancements in general image-text Q&A, visual localization, OCR, chart interpretation, problem-solving from photos, and image-based creative tasks, with notable improvements in English and low-resource language capabilities."
  },
  "hunyuan-turbo": {
    "description": "The preview version of the next-generation Hunyuan large language model, featuring a brand-new mixed expert model (MoE) structure, which offers faster inference efficiency and stronger performance compared to Hunyuan Pro."
@@ -1964,6 +1979,9 @@
  "kimi-k2-0905-preview": {
    "description": "The kimi-k2-0905-preview model has a context length of 256k, featuring stronger Agentic Coding capabilities, more outstanding aesthetics and practicality of frontend code, and better context understanding."
  },
+  "kimi-k2-instruct": {
+    "description": "Kimi K2 Instruct is a large language model developed by Moonshot AI, featuring ultra-long context processing capabilities."
+  },
  "kimi-k2-turbo-preview": {
    "description": "Kimi-K2 is a Mixture-of-Experts (MoE) foundation model with exceptional coding and agent capabilities, featuring 1T total parameters and 32B activated parameters. In benchmark evaluations across core categories — general knowledge reasoning, programming, mathematics, and agent tasks — the K2 model outperforms other leading open-source models."
  },
@@ -1985,9 +2003,6 @@
  "lite": {
    "description": "Spark Lite is a lightweight large language model with extremely low latency and efficient processing capabilities, completely free and open, supporting real-time online search functionality. Its quick response feature makes it excel in inference applications and model fine-tuning on low-power devices, providing users with excellent cost-effectiveness and intelligent experiences, particularly in knowledge Q&A, content generation, and search scenarios."
  },
-  "llama-2-7b-chat": {
-    "description": "Llama2 is a series of large language models (LLMs) developed and open-sourced by Meta. This series includes generative text models of varying sizes, ranging from 7 billion to 70 billion parameters, which have been pre-trained and fine-tuned. Architecturally, Llama2 is an autoregressive language model that uses an optimized transformer architecture. The fine-tuned versions leverage supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for usefulness and safety. Llama2 outperforms the Llama series on multiple academic datasets and provides valuable insights for the design and development of other models."
-  },
  "llama-3.1-70b-versatile": {
    "description": "Llama 3.1 70B provides enhanced AI reasoning capabilities, suitable for complex applications, supporting extensive computational processing while ensuring efficiency and accuracy."
  },
@@ -2012,8 +2027,8 @@
  "llama-3.2-vision-instruct": {
    "description": "The Llama 3.2-Vision instruction-tuned model is optimized for visual recognition, image reasoning, image captioning, and answering general questions related to images."
  },
-  "llama-3.3-70b-instruct": {
-    "description": "Llama 3.3 is the most advanced multilingual open-source large language model in the Llama series, offering performance comparable to a 405B model at an extremely low cost. Based on the Transformer architecture, it enhances usability and safety through supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF). Its instruction-tuned version is optimized for multilingual dialogue and outperforms many open-source and closed chat models on various industry benchmarks. Knowledge cutoff date is December 2023."
+  "llama-3.3-70b": {
+    "description": "Llama 3.3 70B: A mid-to-large scale Llama model that balances reasoning power and throughput."
  },
  "llama-3.3-70b-versatile": {
    "description": "Meta Llama 3.3 is a multilingual large language model (LLM) with 70 billion parameters (text input/text output), featuring pre-training and instruction-tuning. The instruction-tuned pure text model of Llama 3.3 is optimized for multilingual conversational use cases and outperforms many available open-source and closed chat models on common industry benchmarks."
@@ -2021,6 +2036,12 @@
  "llama-3.3-instruct": {
    "description": "The Llama 3.3 instruction-tuned model is optimized for conversational scenarios, outperforming many existing open-source chat models on common industry benchmarks."
  },
+  "llama-4-maverick-17b-128e-instruct": {
+    "description": "Llama 4 Maverick: A high-performance model in the Llama series, ideal for advanced reasoning, complex problem-solving, and instruction-following tasks."
+  },
+  "llama-4-scout-17b-16e-instruct": {
+    "description": "Llama 4 Scout: A high-performance Llama model optimized for scenarios requiring high throughput and low latency."
+  },
  "llama3-70b-8192": {
    "description": "Meta Llama 3 70B provides unparalleled complexity handling capabilities, tailored for high-demand projects."
  },
@@ -2036,6 +2057,9 @@
  "llama3.1": {
    "description": "Llama 3.1 is a leading model launched by Meta, supporting up to 405B parameters, applicable in complex dialogues, multilingual translation, and data analysis."
  },
+  "llama3.1-8b": {
+    "description": "Llama 3.1 8B: A lightweight, low-latency variant of Llama, well-suited for real-time inference and interactive applications."
+  },
  "llama3.1:405b": {
    "description": "Llama 3.1 is a leading model launched by Meta, supporting up to 405B parameters, applicable in complex dialogues, multilingual translation, and data analysis."
  },
@@ -2067,7 +2091,7 @@
    "description": "Spark Max 32K is configured with large context processing capabilities, enhanced contextual understanding, and logical reasoning abilities, supporting text input of 32K tokens, suitable for long document reading, private knowledge Q&A, and other scenarios."
  },
  "megrez-3b-instruct": {
-    "description": "Megrez-3B-Instruct is a large language model fully trained by Wuwen Xin Qiong. Megrez-3B-Instruct aims to create an ultra-fast, compact, and easy-to-use intelligent solution for edge devices through the concept of hardware-software co-design."
+    "description": "Megrez 3B Instruct is a compact and efficient model developed by Wuwen Xinqiong."
  },
  "meta-llama-3-70b-instruct": {
    "description": "A powerful 70-billion parameter model excelling in reasoning, coding, and broad language applications."
@@ -2624,6 +2648,12 @@
  "pro-128k": {
    "description": "Spark Pro 128K is equipped with an extra-large context processing capability, able to handle up to 128K of contextual information, making it particularly suitable for long-form content that requires comprehensive analysis and long-term logical connections, providing smooth and consistent logic and diverse citation support in complex text communication."
  },
+  "pro-deepseek-r1": {
+    "description": "Enterprise-exclusive service model with concurrent service support."
+  },
+  "pro-deepseek-v3": {
+    "description": "Enterprise-exclusive service model with concurrent service support."
+  },
  "qvq-72b-preview": {
    "description": "The QVQ model is an experimental research model developed by the Qwen team, focusing on enhancing visual reasoning capabilities, particularly in the field of mathematical reasoning."
  },
@@ -2633,6 +2663,12 @@
  "qvq-plus": {
    "description": "A visual reasoning model supporting visual inputs and chain-of-thought outputs. The plus version, succeeding the qvq-max model, offers faster reasoning speed and a more balanced trade-off between performance and cost."
  },
+  "qwen-3-32b": {
+    "description": "Qwen 3 32B: A strong performer in multilingual and coding tasks, suitable for medium-scale production use."
+  },
+  "qwen-3-coder-480b": {
+    "description": "Qwen 3 Coder 480B: A long-context model designed for code generation and complex programming tasks."
+  },
  "qwen-coder-plus": {
    "description": "Tongyi Qianwen coding model."
  },
@@ -2753,12 +2789,6 @@
  "qwen2": {
    "description": "Qwen2 is Alibaba's next-generation large-scale language model, supporting diverse application needs with excellent performance."
  },
-  "qwen2-72b-instruct": {
-    "description": "Qwen2 is the new generation of large language model series introduced by the Qwen team. It is based on the Transformer architecture and incorporates technologies such as the SwiGLU activation function, attention QKV bias, group query attention, a mixture of sliding window attention, and full attention. Additionally, the Qwen team has improved the tokenizer to better adapt to multiple natural languages and code."
-  },
-  "qwen2-7b-instruct": {
-    "description": "Qwen2 is the new generation of large language model series introduced by the Qwen team. It is based on the Transformer architecture and incorporates technologies such as the SwiGLU activation function, attention QKV bias, group query attention, a mixture of sliding window attention, and full attention. Additionally, the Qwen team has improved the tokenizer to better adapt to multiple natural languages and code."
-  },
  "qwen2.5": {
    "description": "Qwen2.5 is Alibaba's next-generation large-scale language model, supporting diverse application needs with outstanding performance."
  },
@@ -2897,6 +2927,12 @@
  "qwen3-next-80b-a3b-thinking": {
    "description": "A new generation of thinking mode open-source model based on Qwen3. Compared to the previous version (Tongyi Qianwen 3-235B-A22B-Thinking-2507), it features improved instruction-following capabilities and more concise model-generated summaries."
  },
+  "qwen3-vl-235b-a22b-instruct": {
+    "description": "Qwen3 VL 235B A22B Instruct is a multimodal model developed by Tongyi Qianwen, supporting visual understanding and reasoning."
+  },
+  "qwen3-vl-235b-a22b-thinking": {
+    "description": "Qwen3 VL 235B A22B Thinking is a multimodal reasoning model from Tongyi Qianwen, designed for visual understanding and reasoning tasks."
+  },
  "qwen3-vl-plus": {
    "description": "Tongyi Qianwen VL is a text generation model with visual (image) understanding capabilities. It can perform OCR (image text recognition) and further summarize and reason, such as extracting attributes from product photos or solving problems based on exercise images."
  },
@@ -3014,6 +3050,9 @@
  "step-r1-v-mini": {
    "description": "This model is a powerful reasoning model with strong image understanding capabilities, able to process both image and text information, generating text content after deep reasoning. It excels in visual reasoning while also possessing first-tier capabilities in mathematics, coding, and text reasoning. The context length is 100k."
  },
+  "step3": {
+    "description": "Step3 is a multimodal model developed by StepStar, offering advanced visual understanding capabilities."
+  },
  "stepfun-ai/step3": {
    "description": "Step3 is a cutting-edge multimodal reasoning model released by StepFun. It is built on a mixture-of-experts (MoE) architecture with 321B total parameters and 38B active parameters. The model adopts an end-to-end design to minimize decoding cost while delivering top-tier performance in visual-language reasoning. Through the combined design of Multi-Matrix Factorized Attention (MFA) and Attention-FFN Decoupling (AFD), Step3 maintains exceptional efficiency on both high-end and low-end accelerators. During pretraining, Step3 processed over 20 trillion text tokens and 4 trillion image-text mixed tokens, covering more than a dozen languages. The model achieves leading performance among open-source models across benchmarks in mathematics, code, and multimodal tasks."
  },
@@ -3137,9 +3176,6 @@
  "xai/grok-4": {
    "description": "xAI's latest and greatest flagship model, delivering unparalleled performance in natural language, mathematics, and reasoning—an ideal all-rounder."
  },
-  "yi-1.5-34b-chat": {
-    "description": "Yi-1.5 is an upgraded version of Yi. It continues pre-training on Yi using a high-quality corpus of 500B tokens and is fine-tuned on 3M diverse samples."
-  },
  "yi-large": {
    "description": "A new trillion-parameter model, providing super strong question-answering and text generation capabilities."
  },
@@ -32,6 +32,9 @@
  "bfl": {
    "description": "A leading, cutting-edge artificial intelligence research lab building the visual infrastructure of tomorrow."
  },
+  "cerebras": {
+    "description": "Cerebras is an AI inference platform built on its dedicated CS-3 system, designed to deliver the world's fastest, real-time, high-throughput LLM services. It is specifically engineered to eliminate latency and accelerate complex AI workflows such as real-time code generation and agent-based tasks."
+  },
  "cloudflare": {
    "description": "Run serverless GPU-powered machine learning models on Cloudflare's global network."
  },
@@ -294,6 +294,13 @@
    },
    "title": "General Settings"
  },
+  "settingImage": {
+    "defaultCount": {
+      "desc": "Set the default number of images generated when creating a new task in the image generation panel.",
+      "label": "Default Image Count",
+      "title": "AI Drawing Settings"
+    }
+  },
  "settingModel": {
    "enableMaxTokens": {
      "title": "Enable Max Tokens Limit"
@@ -549,6 +556,7 @@
    "common": "Common Settings",
    "experiment": "Experiment",
    "hotkey": "Hotkeys",
+    "image": "AI Drawing",
    "llm": "Language Model",
    "provider": "AI Service Provider",
    "proxy": "Network Proxy",
@@ -236,6 +236,7 @@
  },
  "information": "Comunidad e Información",
  "installPWA": "Instalar la aplicación del navegador",
+  "labs": "Laboratorio",
  "lang": {
    "ar": "árabe",
    "bg-BG": "búlgaro",
@@ -7,6 +7,14 @@
    "desc": "Eliminar los mensajes y archivos subidos de la conversación actual",
    "title": "Eliminar mensajes de la conversación"
  },
+  "deleteAndRegenerateMessage": {
+    "desc": "Eliminar el último mensaje y volver a generarlo",
+    "title": "Eliminar y regenerar"
+  },
+  "deleteLastMessage": {
+    "desc": "Eliminar el último mensaje",
+    "title": "Eliminar el último mensaje"
+  },
  "desktop": {
    "openSettings": {
      "desc": "Abrir la página de configuración de la aplicación",
@@ -0,0 +1,14 @@
+{
+  "desc": "Aquí actualizaremos periódicamente las nuevas funciones que estamos explorando. ¡Te invitamos a probarlas!",
+  "features": {
+    "groupChat": {
+      "desc": "Activa la capacidad de coordinación de chats grupales con múltiples agentes.",
+      "title": "Chat grupal (multiagente)"
+    },
+    "inputMarkdown": {
+      "desc": "Renderiza Markdown en tiempo real en el área de entrada (negrita, bloques de código, tablas, etc.).",
+      "title": "Renderizado Markdown en el campo de entrada"
+    }
+  },
+  "title": "Laboratorio"
+}
@@ -284,11 +284,19 @@
          "placeholder": "Introduce el id del modelo, por ejemplo gpt-4o o claude-3.5-sonnet",
          "title": "ID del modelo"
        },
+        "imageOutput": {
+          "extra": "Esta configuración solo habilitará la capacidad del modelo para generar imágenes. El efecto específico depende completamente del propio modelo. Por favor, prueba si el modelo tiene la capacidad de generar imágenes de manera utilizable.",
+          "title": "Soporte para generación de imágenes"
+        },
        "modalTitle": "Configuración del modelo personalizado",
        "reasoning": {
          "extra": "Esta configuración solo activará la capacidad de pensamiento profundo del modelo, el efecto específico depende completamente del modelo en sí, por favor, pruebe si este modelo tiene la capacidad de pensamiento profundo utilizable",
          "title": "Soporte para pensamiento profundo"
        },
+        "search": {
+          "extra": "Esta configuración solo habilitará la capacidad de búsqueda en línea del motor de búsqueda incorporado del modelo. Si se admite o no depende del propio modelo. Por favor, prueba si el motor de búsqueda incorporado del modelo es utilizable.",
+          "title": "Soporte para búsqueda en línea"
+        },
        "tokens": {
          "extra": "Establecer el número máximo de tokens que el modelo puede soportar",
          "title": "Máximo de ventana de contexto",
@@ -309,6 +317,10 @@
          "placeholder": "Por favor, seleccione el tipo de modelo",
          "title": "Tipo de modelo"
        },
+        "video": {
+          "extra": "Esta configuración solo habilitará la configuración de reconocimiento de video en la aplicación. Si se admite el reconocimiento depende completamente del propio modelo. Por favor, prueba si el modelo tiene la capacidad de reconocimiento de video utilizable.",
+          "title": "Soporte para reconocimiento de video"
+        },
        "vision": {
          "extra": "Esta configuración solo habilitará la configuración de carga de imágenes en la aplicación, si se admite el reconocimiento depende completamente del modelo en sí, prueba la disponibilidad de la capacidad de reconocimiento visual de este modelo.",
          "title": "Soporte para reconocimiento visual"
@@ -704,6 +704,9 @@
  "azure-DeepSeek-R1-0528": {
    "description": "Desplegado y proporcionado por Microsoft; el modelo DeepSeek R1 ha recibido una actualización menor, la versión actual es DeepSeek-R1-0528. En la última actualización, DeepSeek R1 ha mejorado significativamente la profundidad de inferencia y la capacidad de deducción mediante el aumento de recursos computacionales y la introducción de mecanismos de optimización algorítmica en la fase posterior al entrenamiento. Este modelo destaca en múltiples pruebas de referencia en matemáticas, programación y lógica general, y su rendimiento general se acerca a modelos líderes como O3 y Gemini 2.5 Pro."
  },
+  "baichuan-m2-32b": {
+    "description": "Baichuan M2 32B es un modelo de expertos mixto desarrollado por Baichuan Intelligence, con potentes capacidades de razonamiento."
+  },
  "baichuan/baichuan2-13b-chat": {
    "description": "Baichuan-13B es un modelo de lenguaje de gran escala de código abierto y comercializable desarrollado por Baichuan Intelligence, que cuenta con 13 mil millones de parámetros y ha logrado los mejores resultados en benchmarks autorizados en chino e inglés."
  },
@@ -728,12 +731,6 @@
  "charglm-4": {
    "description": "CharGLM-4 está diseñado para el juego de roles y la compañía emocional, soportando memoria de múltiples turnos de larga duración y diálogos personalizados, con aplicaciones amplias."
  },
-  "chatglm3": {
-    "description": "ChatGLM3 es un modelo de código cerrado desarrollado por Zhipu AI y el Laboratorio KEG de Tsinghua. Ha sido preentrenado con una gran cantidad de identificadores en chino e inglés y ajustado a las preferencias humanas. En comparación con el modelo de primera generación, ha logrado mejoras del 16%, 36% y 280% en MMLU, C-Eval y GSM8K, respectivamente, y ha alcanzado el primer lugar en el ranking de tareas en chino C-Eval. Es adecuado para escenarios que requieren un alto nivel de conocimiento, capacidad de razonamiento y creatividad, como la redacción de anuncios, la escritura de novelas, la redacción de contenido de conocimiento y la generación de código."
-  },
-  "chatglm3-6b-base": {
-    "description": "ChatGLM3-6b-base es el modelo base de la última generación de la serie ChatGLM, desarrollado por Zhipu, con una escala de 6.000 millones de parámetros y de código abierto."
-  },
  "chatgpt-4o-latest": {
    "description": "ChatGPT-4o es un modelo dinámico que se actualiza en tiempo real para mantener la versión más actual. Combina una poderosa comprensión y generación de lenguaje, adecuado para aplicaciones a gran escala, incluyendo servicio al cliente, educación y soporte técnico."
  },
@@ -938,6 +935,9 @@
  "deepseek-ai/DeepSeek-V3.1-Terminus": {
    "description": "DeepSeek-V3.1-Terminus es una versión actualizada del modelo V3.1 lanzado por DeepSeek, posicionada como un modelo de lenguaje grande con agentes híbridos. Esta actualización mantiene las capacidades originales del modelo, enfocándose en corregir problemas reportados por los usuarios y mejorar la estabilidad. Mejora significativamente la coherencia del lenguaje, reduciendo la mezcla de chino e inglés y la aparición de caracteres anómalos. El modelo integra el “Modo de pensamiento” y el “Modo sin pensamiento”, permitiendo a los usuarios cambiar flexiblemente mediante plantillas de chat para adaptarse a diferentes tareas. Como optimización importante, V3.1-Terminus mejora el rendimiento del agente de código y del agente de búsqueda, haciéndolos más confiables en la invocación de herramientas y en la ejecución de tareas complejas de múltiples pasos."
  },
+  "deepseek-ai/DeepSeek-V3.2-Exp": {
+    "description": "El modelo DeepSeek V3.2 Exp es una arquitectura de inferencia híbrida que admite tanto el modo reflexivo como el no reflexivo."
+  },
  "deepseek-ai/deepseek-llm-67b-chat": {
    "description": "DeepSeek 67B es un modelo avanzado entrenado para diálogos de alta complejidad."
  },
@@ -947,6 +947,9 @@
  "deepseek-ai/deepseek-v3.1": {
    "description": "DeepSeek V3.1: modelo de inferencia de próxima generación que mejora las capacidades de razonamiento complejo y pensamiento en cadena, ideal para tareas que requieren análisis profundo."
  },
+  "deepseek-ai/deepseek-v3.1-terminus": {
+    "description": "DeepSeek V3.1: un modelo de inferencia de nueva generación que mejora la capacidad de razonamiento complejo y pensamiento en cadena, ideal para tareas que requieren un análisis profundo."
+  },
  "deepseek-ai/deepseek-vl2": {
    "description": "DeepSeek-VL2 es un modelo de lenguaje visual de expertos mixtos (MoE) desarrollado sobre DeepSeekMoE-27B, que utiliza una arquitectura MoE de activación dispersa, logrando un rendimiento excepcional al activar solo 4.5B de parámetros. Este modelo destaca en múltiples tareas como preguntas visuales, reconocimiento óptico de caracteres, comprensión de documentos/tablas/gráficos y localización visual."
  },
@@ -1028,6 +1031,9 @@
  "deepseek-v3.1": {
    "description": "DeepSeek-V3.1 es un nuevo modelo híbrido de razonamiento lanzado por DeepSeek, que soporta dos modos de razonamiento: con pensamiento y sin pensamiento, con una eficiencia de pensamiento superior a DeepSeek-R1-0528. Tras una optimización post-entrenamiento, el uso de herramientas Agent y el rendimiento en tareas inteligentes han mejorado significativamente. Soporta una ventana de contexto de 128k y una longitud máxima de salida de 64k tokens."
  },
+  "deepseek-v3.1-terminus": {
+    "description": "DeepSeek-V3.1-Terminus es una versión optimizada del modelo de lenguaje a gran escala lanzado por DeepSeek, especialmente diseñado para dispositivos terminales."
+  },
  "deepseek-v3.1:671b": {
    "description": "DeepSeek V3.1: modelo de inferencia de próxima generación que mejora las capacidades de razonamiento complejo y pensamiento en cadena, ideal para tareas que requieren análisis profundo."
  },
@@ -1190,6 +1196,12 @@
  "ernie-4.0-turbo-8k-preview": {
    "description": "El modelo de lenguaje grande de bandera de Baidu, desarrollado internamente, de ultra gran escala, muestra un rendimiento excepcional en general, siendo ampliamente aplicable en escenarios de tareas complejas en diversos campos; soporta la integración automática con el plugin de búsqueda de Baidu, garantizando la actualidad de la información de preguntas y respuestas. En comparación con ERNIE 4.0, presenta un rendimiento superior."
  },
+  "ernie-4.5-21b-a3b": {
+    "description": "ERNIE 4.5 21B A3B es un modelo de expertos mixto desarrollado por Wenxin de Baidu, con potentes capacidades de razonamiento y soporte multilingüe."
+  },
+  "ernie-4.5-300b-a47b": {
+    "description": "ERNIE 4.5 300B A47B es un modelo de expertos mixto a gran escala desarrollado por Wenxin de Baidu, con capacidades de razonamiento excepcionales."
+  },
  "ernie-4.5-8k-preview": {
    "description": "El modelo grande Wenxin 4.5 es un nuevo modelo base multimodal nativo desarrollado por Baidu, que logra una optimización colaborativa a través de modelado conjunto de múltiples modalidades, con excelentes capacidades de comprensión multimodal; presenta una capacidad lingüística más avanzada, con mejoras en comprensión, generación, lógica y memoria, así como una notable reducción de alucinaciones y mejoras en razonamiento lógico y capacidades de codificación."
  },
@@ -1446,7 +1458,7 @@
    "description": "GLM-4-0520 es la última versión del modelo, diseñada para tareas altamente complejas y diversas, con un rendimiento excepcional."
  },
  "glm-4-9b-chat": {
-    "description": "GLM-4-9B-Chat muestra un alto rendimiento en semántica, matemáticas, razonamiento, código y conocimiento. También cuenta con navegación web, ejecución de código, llamadas a herramientas personalizadas y razonamiento de textos largos. Soporta 26 idiomas, incluidos japonés, coreano y alemán."
+    "description": "GLM-4-9B-Chat ofrece un alto rendimiento en semántica, matemáticas, razonamiento, programación y conocimiento. También admite navegación web, ejecución de código, uso de herramientas personalizadas e inferencia de textos largos. Compatible con 26 idiomas, incluidos japonés, coreano y alemán."
  },
  "glm-4-air": {
    "description": "GLM-4-Air es una versión de alto costo-beneficio, con un rendimiento cercano al GLM-4, ofreciendo velocidad y precios asequibles."
@@ -1529,6 +1541,9 @@
  "glm-zero-preview": {
    "description": "GLM-Zero-Preview posee una poderosa capacidad de razonamiento complejo, destacándose en áreas como razonamiento lógico, matemáticas y programación."
  },
+  "glm4.6:355b": {
+    "description": "El modelo insignia más reciente de Zhipu, GLM-4.6 (355B), supera ampliamente a su predecesor en codificación avanzada, procesamiento de textos largos, razonamiento y capacidades de agentes inteligentes. En particular, su habilidad para programar está a la par con Claude Sonnet 4, consolidándose como el modelo de codificación líder en China."
+  },
  "google/gemini-2.0-flash": {
    "description": "Gemini 2.0 Flash ofrece funcionalidades de próxima generación y mejoras, incluyendo velocidad sobresaliente, uso integrado de herramientas, generación multimodal y una ventana de contexto de 1 millón de tokens."
  },
@@ -1730,14 +1745,23 @@
  "gpt-5-nano": {
    "description": "Versión más rápida y económica de GPT-5. Perfecta para escenarios que requieren respuestas rápidas y son sensibles al costo."
  },
+  "gpt-5-pro": {
+    "description": "GPT-5 pro utiliza más capacidad de cómputo para pensar de forma más profunda y ofrecer respuestas de mayor calidad de manera constante."
+  },
  "gpt-audio": {
    "description": "GPT Audio es un modelo de chat general para entrada y salida de audio, compatible con el uso de audio I/O en la API de Chat Completions."
  },
  "gpt-image-1": {
    "description": "Modelo nativo multimodal de generación de imágenes de ChatGPT."
  },
+  "gpt-image-1-mini": {
+    "description": "Una versión más económica de GPT Image 1, con soporte nativo para entrada de texto e imagen y generación de salida en formato de imagen."
+  },
  "gpt-oss-120b": {
-    "description": "GPT-OSS-120B MXFP4: estructura Transformer cuantificada que mantiene un rendimiento sólido incluso con recursos limitados."
+    "description": "Este modelo requiere solicitud para su uso. GPT-OSS-120B es un modelo de lenguaje de código abierto a gran escala desarrollado por OpenAI, con potentes capacidades de generación de texto."
+  },
+  "gpt-oss-20b": {
+    "description": "Este modelo requiere solicitud para su uso. GPT-OSS-20B es un modelo de lenguaje de código abierto de tamaño medio desarrollado por OpenAI, con capacidades eficientes de generación de texto."
  },
  "gpt-oss:120b": {
    "description": "GPT-OSS 120B es un modelo de lenguaje abierto de gran escala lanzado por OpenAI, que emplea la tecnología de cuantificación MXFP4, siendo un modelo insignia. Requiere múltiples GPU o estaciones de trabajo de alto rendimiento para su ejecución, y ofrece un rendimiento sobresaliente en razonamiento complejo, generación de código y procesamiento multilingüe, soportando llamadas avanzadas a funciones e integración de herramientas."
@@ -1748,9 +1772,6 @@
  "gpt-realtime": {
    "description": "Modelo universal en tiempo real que soporta entrada y salida de texto y audio, además de entrada de imágenes."
  },
-  "grok-2-1212": {
-    "description": "Este modelo ha mejorado en precisión, cumplimiento de instrucciones y capacidades multilingües."
-  },
  "grok-2-image-1212": {
    "description": "Nuestro último modelo de generación de imágenes puede crear imágenes vívidas y realistas a partir de indicaciones textuales. Destaca en generación de imágenes para marketing, redes sociales y entretenimiento."
  },
@@ -1760,15 +1781,9 @@
  "grok-3": {
    "description": "Modelo insignia, experto en extracción de datos, programación y resumen de texto para aplicaciones empresariales, con profundo conocimiento en finanzas, medicina, derecho y ciencias."
  },
-  "grok-3-fast": {
-    "description": "Modelo insignia, experto en extracción de datos, programación y resumen de texto para aplicaciones empresariales, con profundo conocimiento en finanzas, medicina, derecho y ciencias."
-  },
  "grok-3-mini": {
    "description": "Modelo ligero que piensa antes de responder. Rápido e inteligente, adecuado para tareas lógicas que no requieren conocimientos profundos de dominio y capaz de proporcionar la trayectoria original del pensamiento."
  },
-  "grok-3-mini-fast": {
-    "description": "Modelo ligero que piensa antes de responder. Rápido e inteligente, adecuado para tareas lógicas que no requieren conocimientos profundos de dominio y capaz de proporcionar la trayectoria original del pensamiento."
-  },
  "grok-4": {
    "description": "Nuestro modelo insignia más reciente y potente, que destaca en procesamiento de lenguaje natural, cálculo matemático y razonamiento — un competidor versátil y perfecto."
  },
@@ -1851,7 +1866,7 @@
    "description": "La última versión del modelo de pensamiento profundo multimodal t1-vision de Hunyuan, que soporta cadenas de pensamiento nativas multimodales, con mejoras integrales respecto a la versión predeterminada anterior."
  },
  "hunyuan-t1-vision-20250916": {
-    "description": "Modelo de pensamiento profundo multimodal Hunyuan, que soporta cadenas de pensamiento nativas multimodales prolongadas, especializado en diversos escenarios de razonamiento con imágenes, y que mejora integralmente el rendimiento en problemas científicos en comparación con el modelo de pensamiento rápido."
+    "description": "La última versión del modelo de pensamiento visual profundo Hunyuan t1-vision ofrece mejoras integrales respecto a su versión anterior en tareas como preguntas y respuestas generales sobre imágenes, localización visual, OCR, interpretación de gráficos, resolución de problemas a partir de fotos y creación visual. También se ha optimizado notablemente su rendimiento en inglés y lenguas minoritarias."
  },
  "hunyuan-turbo": {
    "description": "Versión preliminar de la nueva generación del modelo de lenguaje de Hunyuan, que utiliza una nueva estructura de modelo de expertos mixtos (MoE), con una eficiencia de inferencia más rápida y un rendimiento más fuerte en comparación con Hunyuan-Pro."
@@ -1964,6 +1979,9 @@
  "kimi-k2-0905-preview": {
    "description": "El modelo kimi-k2-0905-preview tiene una longitud de contexto de 256k, con una mayor capacidad de codificación agentiva, una estética y funcionalidad mejoradas en el código frontend, y una mejor comprensión del contexto."
  },
+  "kimi-k2-instruct": {
+    "description": "Kimi K2 Instruct es un modelo de lenguaje a gran escala desarrollado por Moonshot AI, con capacidad para manejar contextos extremadamente largos."
+  },
  "kimi-k2-turbo-preview": {
    "description": "kimi-k2 es un modelo base con arquitectura MoE que ofrece potentes capacidades para código y agentes, con 1T parámetros totales y 32B parámetros activados. En las pruebas de referencia en categorías principales como razonamiento de conocimiento general, programación, matemáticas y agentes, el rendimiento del modelo K2 supera al de otros modelos de código abierto más extendidos."
  },
@@ -1985,9 +2003,6 @@
  "lite": {
    "description": "Spark Lite es un modelo de lenguaje grande y ligero, con una latencia extremadamente baja y una capacidad de procesamiento eficiente, completamente gratuito y de código abierto, que admite funciones de búsqueda en línea en tiempo real. Su característica de respuesta rápida lo hace destacar en aplicaciones de inferencia y ajuste de modelos en dispositivos de baja potencia, brindando a los usuarios una excelente relación costo-beneficio y experiencia inteligente, especialmente en escenarios de preguntas y respuestas, generación de contenido y búsqueda."
  },
-  "llama-2-7b-chat": {
-    "description": "Llama2 es una serie de modelos de lenguaje de gran escala (LLM) desarrollados y publicados por Meta, que incluye modelos de texto generativo preentrenados y ajustados de diferentes tamaños, desde 7 mil millones hasta 70 mil millones de parámetros. A nivel de arquitectura, Llama2 es un modelo de lenguaje autoregresivo que utiliza una arquitectura de transformador optimizada. Las versiones ajustadas utilizan un ajuste de fine-tuning supervisado (SFT) y aprendizaje por refuerzo con retroalimentación humana (RLHF) para alinear las preferencias de utilidad y seguridad humanas. Llama2 supera a la serie Llama en varios conjuntos de datos académicos, proporcionando ideas para el diseño y desarrollo de numerosos otros modelos."
-  },
  "llama-3.1-70b-versatile": {
    "description": "Llama 3.1 70B ofrece una capacidad de razonamiento AI más potente, adecuada para aplicaciones complejas, soportando un procesamiento computacional extenso y garantizando eficiencia y precisión."
  },
@@ -2012,8 +2027,8 @@
  "llama-3.2-vision-instruct": {
    "description": "El modelo Llama 3.2-Vision con ajuste fino de instrucciones está optimizado para reconocimiento visual, razonamiento sobre imágenes, descripción de imágenes y respuesta a preguntas generales relacionadas con imágenes."
  },
-  "llama-3.3-70b-instruct": {
-    "description": "Llama 3.3 es el modelo de lenguaje de código abierto multilingüe más avanzado de la serie Llama, que ofrece un rendimiento comparable al modelo de 405B a un costo extremadamente bajo. Basado en la estructura Transformer, y mejorado en utilidad y seguridad a través de ajuste fino supervisado (SFT) y aprendizaje por refuerzo con retroalimentación humana (RLHF). Su versión ajustada para instrucciones está optimizada para diálogos multilingües, superando a muchos modelos de chat de código abierto y cerrado en múltiples benchmarks de la industria. La fecha límite de conocimiento es diciembre de 2023."
+  "llama-3.3-70b": {
+    "description": "Llama 3.3 70B: un modelo Llama de tamaño medio-grande que equilibra capacidad de razonamiento y rendimiento."
  },
  "llama-3.3-70b-versatile": {
    "description": "El modelo de lenguaje multilingüe Meta Llama 3.3 (LLM) es un modelo generativo preentrenado y ajustado para instrucciones de 70B (entrada/salida de texto). El modelo de texto puro ajustado para instrucciones de Llama 3.3 está optimizado para casos de uso de conversación multilingüe y supera a muchos modelos de chat de código abierto y cerrado en benchmarks industriales comunes."
@@ -2021,6 +2036,12 @@
  "llama-3.3-instruct": {
    "description": "El modelo de instrucción Llama 3.3, optimizado para escenarios de diálogo, supera a muchos modelos de chat de código abierto existentes en pruebas de referencia comunes de la industria."
  },
+  "llama-4-maverick-17b-128e-instruct": {
+    "description": "Llama 4 Maverick: un modelo de alto rendimiento de la serie Llama, ideal para razonamiento avanzado, resolución de problemas complejos y tareas de seguimiento de instrucciones."
+  },
+  "llama-4-scout-17b-16e-instruct": {
+    "description": "Llama 4 Scout: un modelo de alto rendimiento de la serie Llama, diseñado para escenarios que requieren alto rendimiento y baja latencia."
+  },
  "llama3-70b-8192": {
    "description": "Meta Llama 3 70B proporciona una capacidad de procesamiento de complejidad inigualable, diseñado a medida para proyectos de alta demanda."
  },
@@ -2036,6 +2057,9 @@
  "llama3.1": {
    "description": "Llama 3.1 es el modelo líder lanzado por Meta, que admite hasta 405B de parámetros, aplicable en diálogos complejos, traducción multilingüe y análisis de datos."
  },
+  "llama3.1-8b": {
+    "description": "Llama 3.1 8B: una variante ligera y de baja latencia de Llama, adecuada para inferencia en línea y escenarios interactivos de bajo consumo."
+  },
  "llama3.1:405b": {
    "description": "Llama 3.1 es el modelo líder lanzado por Meta, que admite hasta 405B de parámetros, aplicable en diálogos complejos, traducción multilingüe y análisis de datos."
  },
@@ -2067,7 +2091,7 @@
    "description": "Spark Max 32K está equipado con una capacidad de procesamiento de contexto grande, con una comprensión contextual más fuerte y habilidades de razonamiento lógico, soportando entradas de texto de 32K tokens, adecuado para la lectura de documentos largos, preguntas y respuestas de conocimiento privado y otros escenarios."
  },
  "megrez-3b-instruct": {
-    "description": "Megrez-3B-Instruct es un modelo de lenguaje grande entrenado completamente de forma autónoma por Wúwèn Xīnqióng. Megrez-3B-Instruct tiene como objetivo crear una solución de inteligencia periférica rápida, compacta y fácil de usar, basada en el concepto de colaboración entre hardware y software."
+    "description": "Megrez 3B Instruct es un modelo eficiente de bajo número de parámetros desarrollado por Wuwen Xinqiong."
  },
  "meta-llama-3-70b-instruct": {
    "description": "Un poderoso modelo de 70 mil millones de parámetros que sobresale en razonamiento, codificación y amplias aplicaciones de lenguaje."
@@ -2624,6 +2648,12 @@
  "pro-128k": {
    "description": "Spark Pro 128K está equipado con una capacidad de procesamiento de contexto extragrande, capaz de manejar hasta 128K de información contextual, especialmente adecuado para el análisis completo y el manejo de relaciones lógicas a largo plazo en contenido extenso, proporcionando una lógica fluida y coherente y un soporte diverso de citas en comunicaciones de texto complejas."
  },
+  "pro-deepseek-r1": {
+    "description": "Modelo exclusivo para servicios empresariales, incluye servicio concurrente."
+  },
+  "pro-deepseek-v3": {
+    "description": "Modelo exclusivo para servicios empresariales, incluye servicio concurrente."
+  },
  "qvq-72b-preview": {
    "description": "El modelo QVQ es un modelo de investigación experimental desarrollado por el equipo de Qwen, enfocado en mejorar la capacidad de razonamiento visual, especialmente en el ámbito del razonamiento matemático."
  },
@@ -2633,6 +2663,12 @@
  "qvq-plus": {
    "description": "Modelo de razonamiento visual. Soporta entrada visual y salida en cadena de pensamiento. Versión plus lanzada tras el modelo qvq-max, con mayor velocidad de razonamiento y un equilibrio mejorado entre eficacia y coste en comparación con qvq-max."
  },
+  "qwen-3-32b": {
+    "description": "Qwen 3 32B: el modelo de la serie Qwen ofrece un excelente rendimiento en tareas multilingües y de codificación, ideal para aplicaciones de producción a escala media."
+  },
+  "qwen-3-coder-480b": {
+    "description": "Qwen 3 Coder 480B: un modelo de contexto largo diseñado para generación de código y tareas complejas de programación."
+  },
  "qwen-coder-plus": {
    "description": "Modelo de código Tongyi Qianwen."
  },
@@ -2753,12 +2789,6 @@
  "qwen2": {
    "description": "Qwen2 es el nuevo modelo de lenguaje a gran escala de Alibaba, que ofrece un rendimiento excepcional para satisfacer diversas necesidades de aplicación."
  },
-  "qwen2-72b-instruct": {
-    "description": "Qwen2 es la nueva serie de modelos de lenguaje de gran escala presentada por el equipo de Qwen. Se basa en la arquitectura Transformer y utiliza funciones de activación SwiGLU, sesgo de atención QKV (attention QKV bias), atención de consulta grupal (group query attention), una mezcla de atención de ventana deslizante y atención completa (mixture of sliding window attention and full attention). Además, el equipo de Qwen ha mejorado el tokenizador para adaptarse a múltiples lenguajes naturales y códigos."
-  },
-  "qwen2-7b-instruct": {
-    "description": "Qwen2 es una nueva serie de modelos de lenguaje de gran escala desarrollada por el equipo de Qwen. Se basa en la arquitectura Transformer y utiliza funciones de activación SwiGLU, sesgo de atención QKV (attention QKV bias), atención de consulta grupal (group query attention), una mezcla de atención de ventana deslizante y atención completa (mixture of sliding window attention and full attention). Además, el equipo de Qwen ha mejorado el tokenizador para adaptarse a múltiples lenguajes naturales y códigos."
-  },
  "qwen2.5": {
    "description": "Qwen2.5 es la nueva generación de modelos de lenguaje a gran escala de Alibaba, que ofrece un rendimiento excepcional para satisfacer diversas necesidades de aplicación."
  },
@@ -2897,6 +2927,12 @@
  "qwen3-next-80b-a3b-thinking": {
    "description": "Modelo de código abierto de nueva generación basado en Qwen3 en modo reflexivo, que mejora la capacidad de seguir instrucciones y ofrece respuestas más concisas en comparación con la versión anterior (Tongyi Qianwen 3-235B-A22B-Thinking-2507)."
  },
+  "qwen3-vl-235b-a22b-instruct": {
+    "description": "Qwen3 VL 235B A22B Instruct es un modelo multimodal desarrollado por Tongyi Qianwen, compatible con comprensión visual y razonamiento."
+  },
+  "qwen3-vl-235b-a22b-thinking": {
+    "description": "Qwen3 VL 235B A22B Thinking es un modelo de razonamiento multimodal desarrollado por Tongyi Qianwen, compatible con comprensión visual y razonamiento."
+  },
  "qwen3-vl-plus": {
    "description": "Tongyi Qianwen VL es un modelo generativo de texto con capacidad de comprensión visual (imágenes). No solo puede realizar OCR (reconocimiento de texto en imágenes), sino también resumir y razonar, por ejemplo, extrayendo atributos de fotos de productos o resolviendo problemas a partir de imágenes de ejercicios."
  },
@@ -3014,6 +3050,9 @@
  "step-r1-v-mini": {
    "description": "Este modelo es un gran modelo de inferencia con una poderosa capacidad de comprensión de imágenes, capaz de procesar información de imágenes y texto, generando contenido textual tras un profundo razonamiento. Este modelo destaca en el campo del razonamiento visual, además de poseer capacidades de razonamiento matemático, de código y textual de primer nivel. La longitud del contexto es de 100k."
  },
+  "step3": {
+    "description": "Step3 es un modelo multimodal desarrollado por StepStar, con potentes capacidades de comprensión visual."
+  },
  "stepfun-ai/step3": {
    "description": "Step3 es un modelo de inferencia multimodal de vanguardia publicado por 阶跃星辰 (StepFun), construido sobre una arquitectura Mixture-of-Experts (MoE) con 321B de parámetros totales y 38B de parámetros de activación. El modelo presenta un diseño de extremo a extremo orientado a minimizar el coste de decodificación, al tiempo que ofrece un rendimiento de primer nivel en razonamiento visual-lingüístico. Gracias al diseño sinérgico entre la atención por descomposición de múltiples matrices (MFA) y el desacoplamiento atención‑FFN (AFD), Step3 mantiene una eficiencia sobresaliente tanto en aceleradores de gama alta como de gama baja. En la fase de preentrenamiento, Step3 procesó más de 20T de tokens de texto y 4T de tokens mixtos imagen-texto, abarcando más de una decena de idiomas. El modelo ha alcanzado niveles líderes entre los modelos de código abierto en múltiples benchmarks, incluidos matemáticas, código y tareas multimodales."
  },
@@ -3137,9 +3176,6 @@
  "xai/grok-4": {
    "description": "El modelo insignia más reciente y avanzado de xAI, que ofrece un rendimiento inigualable en lenguaje natural, matemáticas y razonamiento, siendo un competidor perfecto y versátil."
  },
-  "yi-1.5-34b-chat": {
-    "description": "Yi-1.5 es una versión mejorada de Yi. Utiliza un corpus de alta calidad de 500B tokens para continuar el preentrenamiento de Yi y se微调 en 3M muestras de ajuste fino diversificadas."
-  },
  "yi-large": {
    "description": "Modelo de mil millones de parámetros completamente nuevo, que ofrece capacidades excepcionales de preguntas y respuestas y generación de texto."
  },
@@ -32,6 +32,9 @@
  "bfl": {
    "description": "Laboratorio líder en investigación de inteligencia artificial de vanguardia, construyendo la infraestructura visual del mañana."
  },
+  "cerebras": {
+    "description": "Cerebras es una plataforma de inferencia de IA basada en su sistema especializado CS-3, diseñada para ofrecer el servicio de LLM más rápido del mundo, con respuesta en tiempo real y alto rendimiento. Está especialmente concebida para eliminar la latencia y acelerar flujos de trabajo complejos de IA, como la generación de código en tiempo real y tareas de agentes."
+  },
  "cloudflare": {
    "description": "Ejecuta modelos de aprendizaje automático impulsados por GPU sin servidor en la red global de Cloudflare."
  },
@@ -294,6 +294,13 @@
    },
    "title": "Configuración Común"
  },
+  "settingImage": {
+    "defaultCount": {
+      "desc": "Establece la cantidad predeterminada de imágenes al crear una nueva tarea en el panel de generación de imágenes.",
+      "label": "Cantidad de imágenes predeterminada",
+      "title": "Configuración de dibujo con IA"
+    }
+  },
  "settingModel": {
    "enableMaxTokens": {
      "title": "Activar límite de tokens por respuesta"
@@ -549,6 +556,7 @@
    "common": "Configuración común",
    "experiment": "Experimento",
    "hotkey": "Atajos de teclado",
+    "image": "Dibujo con IA",
    "llm": "Modelo de lenguaje",
    "provider": "Proveedor de servicios de IA",
    "proxy": "Proxy de red",
@@ -236,6 +236,7 @@
  },
  "information": "جامعه و اطلاعات",
  "installPWA": "نصب برنامه وب پیشرو (PWA)",
+  "labs": "آزمایشگاه‌ها",
  "lang": {
    "ar": "عربی",
    "bg-BG": "بلغاری",
@@ -7,6 +7,14 @@
    "desc": "حذف پیام‌ها و فایل‌های بارگذاری شده در جلسه جاری",
    "title": "حذف پیام‌های جلسه"
  },
+  "deleteAndRegenerateMessage": {
+    "desc": "آخرین پیام را حذف کرده و دوباره تولید کن",
+    "title": "حذف و تولید مجدد"
+  },
+  "deleteLastMessage": {
+    "desc": "آخرین پیام را حذف کن",
+    "title": "حذف آخرین پیام"
+  },
  "desktop": {
    "openSettings": {
      "desc": "باز کردن صفحه تنظیمات برنامه",
@@ -0,0 +1,14 @@
+{
+  "desc": "در اینجا به‌طور دوره‌ای ویژگی‌های جدیدی که در حال آزمایش آن‌ها هستیم به‌روزرسانی می‌شود. خوشحال می‌شویم آن‌ها را امتحان کنید!",
+  "features": {
+    "groupChat": {
+      "desc": "قابلیت هماهنگی چت گروهی با چند عامل هوشمند را فعال کنید.",
+      "title": "گفت‌وگوی گروهی (چند عاملی)"
+    },
+    "inputMarkdown": {
+      "desc": "رندر هم‌زمان Markdown (مانند بولد، بلوک کد، جدول و غیره) در ناحیه ورودی.",
+      "title": "رندر Markdown در کادر ورودی"
+    }
+  },
+  "title": "آزمایشگاه"
+}
@@ -284,11 +284,19 @@
          "placeholder": "لطفاً شناسه مدل را وارد کنید، مانند gpt-4o یا claude-3.5-sonnet",
          "title": "شناسه مدل"
        },
+        "imageOutput": {
+          "extra": "این پیکربندی تنها قابلیت تولید تصویر توسط مدل را فعال می‌کند. نتیجه نهایی کاملاً به توانایی‌های خود مدل بستگی دارد. لطفاً خودتان بررسی کنید که آیا این مدل توانایی تولید تصویر را دارد یا خیر.",
+          "title": "پشتیبانی از تولید تصویر"
+        },
        "modalTitle": "پیکربندی مدل سفارشی",
        "reasoning": {
          "extra": "این تنظیم فقط قابلیت تفکر عمیق مدل را فعال می‌کند و تأثیر دقیق آن کاملاً به خود مدل بستگی دارد، لطفاً خودتان آزمایش کنید که آیا این مدل قابلیت تفکر عمیق قابل استفاده را دارد یا خیر",
          "title": "پشتیبانی از تفکر عمیق"
        },
+        "search": {
+          "extra": "این پیکربندی تنها قابلیت جستجوی آنلاین از طریق موتور جستجوی داخلی مدل را فعال می‌کند. پشتیبانی از موتور جستجوی داخلی به توانایی‌های خود مدل بستگی دارد. لطفاً خودتان بررسی کنید که آیا این مدل از موتور جستجوی داخلی پشتیبانی می‌کند یا خیر.",
+          "title": "پشتیبانی از جستجوی آنلاین"
+        },
        "tokens": {
          "extra": "حداکثر تعداد توکن‌های پشتیبانی شده توسط مدل را تنظیم کنید",
          "title": "حداکثر پنجره زمینه",
@@ -309,6 +317,10 @@
          "placeholder": "لطفاً نوع مدل را انتخاب کنید",
          "title": "نوع مدل"
        },
+        "video": {
+          "extra": "این پیکربندی تنها قابلیت شناسایی ویدیو در برنامه را فعال می‌کند. پشتیبانی از شناسایی ویدیو کاملاً به توانایی‌های خود مدل بستگی دارد. لطفاً خودتان بررسی کنید که آیا این مدل توانایی شناسایی ویدیو را دارد یا خیر.",
+          "title": "پشتیبانی از شناسایی ویدیو"
+        },
        "vision": {
          "extra": "این پیکربندی تنها قابلیت بارگذاری تصویر در برنامه را فعال می‌کند، اینکه آیا شناسایی پشتیبانی می‌شود به خود مدل بستگی دارد، لطفاً قابلیت استفاده از شناسایی بصری این مدل را آزمایش کنید",
          "title": "پشتیبانی از شناسایی بصری"
@@ -704,6 +704,9 @@
  "azure-DeepSeek-R1-0528": {
    "description": "ارائه شده توسط مایکروسافت؛ مدل DeepSeek R1 به‌روزرسانی‌های جزئی دریافت کرده است و نسخه فعلی آن DeepSeek-R1-0528 می‌باشد. در آخرین به‌روزرسانی، DeepSeek R1 با افزایش منابع محاسباتی و معرفی مکانیزم بهینه‌سازی الگوریتم در مرحله پس‌آموزش، عمق استنتاج و توانایی پیش‌بینی را به طور قابل توجهی بهبود بخشیده است. این مدل در آزمون‌های معیار مختلفی مانند ریاضیات، برنامه‌نویسی و منطق عمومی عملکرد برجسته‌ای دارد و عملکرد کلی آن به مدل‌های پیشرو مانند O3 و Gemini 2.5 Pro نزدیک شده است."
  },
+  "baichuan-m2-32b": {
+    "description": "Baichuan M2 32B یک مدل متخصص ترکیبی است که توسط Baichuan Intelligence ارائه شده و دارای توانایی استدلالی قدرتمندی می‌باشد."
+  },
  "baichuan/baichuan2-13b-chat": {
    "description": "Baichuan-13B یک مدل زبان بزرگ متن باز و قابل تجاری با 130 میلیارد پارامتر است که در آزمون‌های معتبر چینی و انگلیسی بهترین عملکرد را در اندازه مشابه به دست آورده است."
  },
@@ -728,12 +731,6 @@
  "charglm-4": {
    "description": "CharGLM-4 به‌طور خاص برای نقش‌آفرینی و همراهی عاطفی طراحی شده است و از حافظه چند دور طولانی و گفتگوی شخصی‌سازی شده پشتیبانی می‌کند و کاربردهای گسترده‌ای دارد."
  },
-  "chatglm3": {
-    "description": "ChatGLM3 یک مدل بسته‌شده است که توسط هوش مصنوعی Zhima و آزمایشگاه KEG دانشگاه Tsinghua منتشر شده است. این مدل با پیش‌آموزش بر روی مجموعه‌ای وسیع از نمادهای چینی و انگلیسی و همچنین آموزش مطابق با ترجیحات انسانی، نسبت به نسل اول مدل، بهبود‌های 16٪، 36٪ و 280٪ در MMLU، C-Eval و GSM8K به دست آورده است و در رتبه‌بندی وظایف چینی C-Eval رتبه اول را کسب کرده است. این مدل برای صحنه‌هایی که نیاز به مقدار زیادی دانش، توانایی استدلال و خلاقیت دارند، مانند نوشتن متن تبلیغاتی، نویسندگی داستان، نوشتن محتوای دانشگاهی و تولید کد مناسب است."
-  },
-  "chatglm3-6b-base": {
-    "description": "ChatGLM3-6b-base یک مدل پایه منبع باز با مقیاس ۶ میلیارد پارامتر از نسل جدید سری ChatGLM است که توسط شرکت Zhizhu (智谱) توسعه یافته است."
-  },
  "chatgpt-4o-latest": {
    "description": "ChatGPT-4o یک مدل پویا است که به‌صورت زنده به‌روزرسانی می‌شود تا همیشه نسخه‌ی جدید و به‌روز باشد. این مدل ترکیبی از توانایی‌های قوی در درک و تولید زبان است و برای کاربردهای گسترده مانند خدمات مشتری، آموزش و پشتیبانی فنی مناسب است."
  },
@@ -938,6 +935,9 @@
  "deepseek-ai/DeepSeek-V3.1-Terminus": {
    "description": "DeepSeek-V3.1-Terminus نسخه به‌روزرسانی شده مدل V3.1 منتشر شده توسط DeepSeek است که به عنوان یک مدل زبان بزرگ با عامل ترکیبی شناخته می‌شود. این به‌روزرسانی ضمن حفظ قابلیت‌های اصلی مدل، بر رفع مشکلات گزارش شده توسط کاربران و افزایش پایداری تمرکز دارد. این نسخه به طور قابل توجهی انسجام زبانی را بهبود بخشیده و از بروز ترکیب زبان چینی و انگلیسی و کاراکترهای نامتعارف کاسته است. مدل شامل حالت «تفکر» (Thinking Mode) و «غیرتفکر» (Non-thinking Mode) است که کاربران می‌توانند از طریق قالب‌های گفتگو به صورت انعطاف‌پذیر بین آن‌ها جابجا شوند تا با وظایف مختلف سازگار شوند. به عنوان یک بهینه‌سازی مهم، V3.1-Terminus عملکرد عامل کد (Code Agent) و عامل جستجو (Search Agent) را تقویت کرده است تا در فراخوانی ابزارها و اجرای وظایف پیچیده چندمرحله‌ای قابل اعتمادتر باشد."
  },
+  "deepseek-ai/DeepSeek-V3.2-Exp": {
+    "description": "مدل DeepSeek V3.2 Exp یک معماری ترکیبی برای استدلال است که از هر دو حالت تفکر و غیرتفکر پشتیبانی می‌کند."
+  },
  "deepseek-ai/deepseek-llm-67b-chat": {
    "description": "DeepSeek LLM Chat (67B) یک مدل نوآورانه هوش مصنوعی است که توانایی درک عمیق زبان و تعامل را فراهم می‌کند."
  },
@@ -947,6 +947,9 @@
  "deepseek-ai/deepseek-v3.1": {
    "description": "DeepSeek V3.1: مدل استنتاج نسل بعدی که توانایی‌های استنتاج پیچیده و تفکر زنجیره‌ای را بهبود بخشیده و برای وظایفی که نیاز به تحلیل عمیق دارند مناسب است."
  },
+  "deepseek-ai/deepseek-v3.1-terminus": {
+    "description": "DeepSeek V3.1: نسل جدیدی از مدل‌های استنتاج که توانایی استدلال پیچیده و تفکر زنجیره‌ای را بهبود می‌بخشد، مناسب برای وظایفی که نیاز به تحلیل عمیق دارند."
+  },
  "deepseek-ai/deepseek-vl2": {
    "description": "DeepSeek-VL2 یک مدل زبانی بصری مبتنی بر DeepSeekMoE-27B است که از معماری MoE با فعال‌سازی پراکنده استفاده می‌کند و در حالی که تنها 4.5 میلیارد پارامتر فعال است، عملکرد فوق‌العاده‌ای را ارائه می‌دهد. این مدل در چندین وظیفه از جمله پرسش و پاسخ بصری، شناسایی کاراکتر نوری، درک اسناد/جدول‌ها/نمودارها و مکان‌یابی بصری عملکرد عالی دارد."
  },
@@ -1028,6 +1031,9 @@
  "deepseek-v3.1": {
    "description": "DeepSeek-V3.1 یک مدل استدلال ترکیبی جدید از DeepSeek است که از دو حالت استدلال تفکری و غیرتفکری پشتیبانی می‌کند و نسبت به DeepSeek-R1-0528 در حالت تفکری کارایی بالاتری دارد. پس از آموزش تکمیلی، استفاده از ابزارهای Agent و عملکرد وظایف هوشمند به طور قابل توجهی بهبود یافته است. پشتیبانی از پنجره متنی 128k و طول خروجی تا 64k توکن."
  },
+  "deepseek-v3.1-terminus": {
+    "description": "DeepSeek-V3.1-Terminus نسخه‌ای بهینه‌سازی‌شده از مدل زبان بزرگ DeepSeek است که به‌طور خاص برای دستگاه‌های نهایی طراحی شده است."
+  },
  "deepseek-v3.1:671b": {
    "description": "DeepSeek V3.1: مدل استنتاج نسل بعدی که توانایی‌های استنتاج پیچیده و تفکر زنجیره‌ای را بهبود بخشیده و برای وظایفی که نیاز به تحلیل عمیق دارند مناسب است."
  },
@@ -1190,6 +1196,12 @@
  "ernie-4.0-turbo-8k-preview": {
    "description": "مدل زبان بزرگ فوق‌العاده پرچمدار خود توسعه یافته توسط بایدو، که عملکرد کلی آن بسیار خوب است و به طور گسترده‌ای در زمینه‌های مختلف برای وظایف پیچیده کاربرد دارد؛ از اتصال خودکار به افزونه جستجوی بایدو پشتیبانی می‌کند تا اطلاعات پرسش و پاسخ به روز باشد. نسبت به ERNIE 4.0 در عملکرد بهتر است."
  },
+  "ernie-4.5-21b-a3b": {
+    "description": "ERNIE 4.5 21B A3B یک مدل متخصص ترکیبی است که توسط Wenxin Baidu توسعه یافته و دارای توانایی‌های قوی در استدلال و پشتیبانی از چند زبان می‌باشد."
+  },
+  "ernie-4.5-300b-a47b": {
+    "description": "ERNIE 4.5 300B A47B یک مدل بسیار بزرگ متخصص ترکیبی است که توسط Wenxin Baidu ارائه شده و از توانایی استدلالی برجسته‌ای برخوردار است."
+  },
  "ernie-4.5-8k-preview": {
    "description": "مدل بزرگ 4.5 Ernie یک مدل پایه چندرسانه‌ای نسل جدید است که توسط بایدو به‌طور مستقل توسعه یافته و از طریق مدل‌سازی مشترک چندین حالت به بهینه‌سازی هم‌زمان دست می‌یابد و توانایی درک چندرسانه‌ای فوق‌العاده‌ای دارد؛ دارای توانایی‌های زبانی پیشرفته‌تر، درک، تولید، منطق و حافظه به‌طور کلی بهبود یافته و توانایی‌های حذف توهم، استدلال منطقی و کد به‌طور قابل توجهی افزایش یافته است."
  },
@@ -1446,7 +1458,7 @@
    "description": "GLM-4-0520 جدیدترین نسخه مدل است که برای وظایف بسیار پیچیده و متنوع طراحی شده و عملکردی عالی دارد."
  },
  "glm-4-9b-chat": {
-    "description": "GLM-4-9B-Chat در زمینه‌های معنایی، ریاضی، استدلال، کد و دانش عملکرد بالایی از خود نشان می‌دهد. همچنین دارای قابلیت مرور وب، اجرای کد، تماس با ابزارهای سفارشی و استدلال متن‌های طولانی است. از 26 زبان از جمله ژاپنی، کره‌ای و آلمانی پشتیبانی می‌کند."
+    "description": "GLM-4-9B-Chat عملکرد بالایی در زمینه‌های معناشناسی، ریاضیات، استدلال، کدنویسی و دانش دارد. همچنین از مرور وب، اجرای کد، فراخوانی ابزارهای سفارشی و استدلال متون بلند پشتیبانی می‌کند. این مدل از ۲۶ زبان از جمله ژاپنی، کره‌ای و آلمانی پشتیبانی می‌نماید."
  },
  "glm-4-air": {
    "description": "GLM-4-Air نسخه‌ای با صرفه اقتصادی است که عملکردی نزدیک به GLM-4 دارد و سرعت بالا و قیمت مناسبی را ارائه می‌دهد."
@@ -1529,6 +1541,9 @@
  "glm-zero-preview": {
    "description": "GLM-Zero-Preview دارای توانایی‌های پیچیده استدلال است و در زمینه‌های استدلال منطقی، ریاضیات، برنامه‌نویسی و غیره عملکرد عالی دارد."
  },
+  "glm4.6:355b": {
+    "description": "جدیدترین مدل پرچم‌دار Zhipu، GLM-4.6 (355B)، در زمینه‌های برنامه‌نویسی پیشرفته، پردازش متون طولانی، استدلال و توانایی‌های عامل هوشمند، به‌طور کامل از نسل قبلی پیشی گرفته است. به‌ویژه در توانایی کدنویسی با Claude Sonnet 4 هم‌تراز شده و به یکی از برترین مدل‌های کدنویسی در داخل کشور تبدیل شده است."
+  },
  "google/gemini-2.0-flash": {
    "description": "Gemini 2.0 Flash ویژگی‌ها و قابلیت‌های نسل بعدی را ارائه می‌دهد، از جمله سرعت عالی، استفاده داخلی از ابزارها، تولید چندرسانه‌ای و پنجره زمینه 1 میلیون توکن."
  },
@@ -1730,14 +1745,23 @@
  "gpt-5-nano": {
    "description": "سریع‌ترین و اقتصادی‌ترین نسخه GPT-5. بسیار مناسب برای کاربردهایی که نیاز به پاسخ سریع و حساسیت به هزینه دارند."
  },
+  "gpt-5-pro": {
+    "description": "GPT-5 pro با استفاده از محاسبات بیشتر، عمیق‌تر می‌اندیشد و به طور مداوم پاسخ‌های بهتری ارائه می‌دهد."
+  },
  "gpt-audio": {
    "description": "GPT Audio مدلی عمومی برای چت با ورودی و خروجی صوتی است که از استفاده از ورودی/خروجی صوتی در API تکمیل چت پشتیبانی می‌کند."
  },
  "gpt-image-1": {
    "description": "مدل تولید تصویر چندرسانه‌ای بومی ChatGPT"
  },
+  "gpt-image-1-mini": {
+    "description": "نسخه‌ای مقرون‌به‌صرفه‌تر از GPT Image 1 که به‌صورت بومی از ورودی‌های متنی و تصویری پشتیبانی می‌کند و خروجی تصویری تولید می‌نماید."
+  },
  "gpt-oss-120b": {
-    "description": "GPT-OSS-120B MXFP4: ساختار ترنسفورمر کوانتیزه شده که حتی در منابع محدود عملکرد قوی خود را حفظ می‌کند."
+    "description": "برای استفاده از این مدل نیاز به درخواست دسترسی می‌باشد. GPT-OSS-120B یک مدل زبان بزرگ متن‌باز از OpenAI است که توانایی بالایی در تولید متن دارد."
+  },
+  "gpt-oss-20b": {
+    "description": "برای استفاده از این مدل نیاز به درخواست دسترسی می‌باشد. GPT-OSS-20B یک مدل زبان میان‌رده متن‌باز از OpenAI است که توانایی تولید متن به‌صورت کارآمد را دارد."
  },
  "gpt-oss:120b": {
    "description": "GPT-OSS 120B یک مدل زبان بزرگ متن‌باز منتشر شده توسط OpenAI است که از فناوری کوانتیزاسیون MXFP4 بهره می‌برد و به عنوان مدل پرچمدار شناخته می‌شود. این مدل نیازمند محیطی با چند GPU یا ایستگاه کاری با عملکرد بالا برای اجرا است و در استدلال پیچیده، تولید کد و پردازش چندزبانه عملکردی برجسته دارد و از فراخوانی توابع پیشرفته و یکپارچه‌سازی ابزارها پشتیبانی می‌کند."
@@ -1748,9 +1772,6 @@
  "gpt-realtime": {
    "description": "مدل عمومی زمان واقعی که از ورودی و خروجی متنی و صوتی به صورت زنده پشتیبانی می‌کند و همچنین ورودی تصویر را نیز قبول می‌کند."
  },
-  "grok-2-1212": {
-    "description": "این مدل در دقت، پیروی از دستورات و توانایی چند زبانه بهبود یافته است."
-  },
  "grok-2-image-1212": {
    "description": "جدیدترین مدل تولید تصویر ما قادر است تصاویر زنده و واقعی را بر اساس متن توصیفی تولید کند. این مدل در زمینه تولید تصویر برای بازاریابی، رسانه‌های اجتماعی و سرگرمی عملکرد برجسته‌ای دارد."
  },
@@ -1760,15 +1781,9 @@
  "grok-3": {
    "description": "مدل پرچمدار که در استخراج داده، برنامه‌نویسی و خلاصه‌سازی متن برای کاربردهای سازمانی مهارت دارد و دانش عمیقی در حوزه‌های مالی، پزشکی، حقوقی و علمی دارد."
  },
-  "grok-3-fast": {
-    "description": "مدل پرچمدار که در استخراج داده، برنامه‌نویسی و خلاصه‌سازی متن برای کاربردهای سازمانی مهارت دارد و دانش عمیقی در حوزه‌های مالی، پزشکی، حقوقی و علمی دارد."
-  },
  "grok-3-mini": {
    "description": "مدل سبک‌وزن که قبل از پاسخگویی تفکر می‌کند. سریع و هوشمند اجرا می‌شود، مناسب برای وظایف منطقی که نیاز به دانش عمیق حوزه ندارند و می‌تواند مسیر تفکر اصلی را ارائه دهد."
  },
-  "grok-3-mini-fast": {
-    "description": "مدل سبک‌وزن که قبل از پاسخگویی تفکر می‌کند. سریع و هوشمند اجرا می‌شود، مناسب برای وظایف منطقی که نیاز به دانش عمیق حوزه ندارند و می‌تواند مسیر تفکر اصلی را ارائه دهد."
-  },
  "grok-4": {
    "description": "جدیدترین و قدرتمندترین مدل پرچمدار ما که در پردازش زبان طبیعی، محاسبات ریاضی و استدلال عملکردی برجسته دارد — یک انتخاب همه‌کاره بی‌نظیر است."
  },
@@ -1851,7 +1866,7 @@
    "description": "جدیدترین مدل تفکر عمیق چندرسانه‌ای t1-vision از Hunyuan که از زنجیره تفکر بلند چندرسانه‌ای بومی پشتیبانی می‌کند و نسبت به نسخه پیش‌فرض نسل قبلی به طور کامل بهبود یافته است."
  },
  "hunyuan-t1-vision-20250916": {
-    "description": "مدل تفکر عمیق چندرسانه‌ای Hunyuan که از زنجیره‌های فکری بومی چندرسانه‌ای پشتیبانی می‌کند و در پردازش انواع سناریوهای استنتاج تصویری مهارت دارد، در حل مسائل علمی نسبت به مدل تفکر سریع بهبود قابل توجهی یافته است."
+    "description": "جدیدترین نسخه مدل بینایی عمیق Hunyuan t1-vision، نسبت به نسخه قبلی در وظایفی مانند پرسش و پاسخ تصویری عمومی، مکان‌یابی بصری، OCR، نمودارها، حل مسائل از روی عکس و خلق آثار از روی تصویر، به‌طور جامع بهبود یافته و توانایی آن در زبان انگلیسی و زبان‌های کم‌کاربرد به‌طور چشمگیری ارتقا یافته است."
  },
  "hunyuan-turbo": {
    "description": "نسخه پیش‌نمایش مدل زبان بزرگ نسل جدید HunYuan که از ساختار مدل متخصص ترکیبی (MoE) جدید استفاده می‌کند. در مقایسه با hunyuan-pro، کارایی استنتاج سریع‌تر و عملکرد بهتری دارد."
@@ -1964,6 +1979,9 @@
  "kimi-k2-0905-preview": {
    "description": "مدل پیش‌نمایش kimi-k2-0905 دارای طول متن ۲۵۶ هزار توکنی است و توانایی‌های قوی‌تری در برنامه‌نویسی عامل‌محور، زیبایی و کاربردی بودن کدهای فرانت‌اند و درک بهتر متن دارد."
  },
+  "kimi-k2-instruct": {
+    "description": "Kimi K2 Instruct یک مدل زبان بزرگ است که توسط Moonshot AI توسعه یافته و دارای توانایی پردازش زمینه‌های بسیار طولانی می‌باشد."
+  },
  "kimi-k2-turbo-preview": {
    "description": "kimi-k2 یک مدل پایه با معماری MoE است که دارای توانمندی‌های بسیار قوی در حوزهٔ برنامه‌نویسی و عامل‌ها (Agent) می‌باشد. مجموع پارامترها 1T و پارامترهای فعال‌شده 32B است. در آزمون‌های بنچمارک در دسته‌های اصلی مانند استدلال دانش عمومی، برنامه‌نویسی، ریاضیات و Agent، عملکرد مدل K2 از سایر مدل‌های متن‌باز مرسوم پیشی گرفته است."
  },
@@ -1985,9 +2003,6 @@
  "lite": {
    "description": "Spark Lite یک مدل زبان بزرگ سبک است که دارای تأخیر بسیار کم و توانایی پردازش کارآمد می‌باشد. به‌طور کامل رایگان و باز است و از قابلیت جستجوی آنلاین در زمان واقعی پشتیبانی می‌کند. ویژگی پاسخ‌دهی سریع آن باعث می‌شود که در کاربردهای استنتاجی و تنظیم مدل در دستگاه‌های با توان محاسباتی پایین عملکرد برجسته‌ای داشته باشد و تجربه‌ای هوشمند و مقرون‌به‌صرفه برای کاربران فراهم کند. به‌ویژه در زمینه‌های پرسش و پاسخ دانش، تولید محتوا و جستجو عملکرد خوبی دارد."
  },
-  "llama-2-7b-chat": {
-    "description": "سری مدل‌های زبانی بزرگ (LLM) Llama2 توسط Meta توسعه یافته و به صورت متن‌باز منتشر شده است. این مجموعه شامل مدل‌های متنی تولیدی با مقیاس‌های مختلف از 7 میلیارد تا 70 میلیارد پارامتر است که پیش‌آموزش و ری‌آموزش داده شده‌اند. از نظر معماری، Llama2 یک مدل زبانی خودرگرسیو با استفاده از معماری تبدیل‌کننده بهینه‌شده است. نسخه‌های تنظیم‌شده از این مدل با استفاده از ری‌آموزش نظارت‌شده (SFT) و یادگیری تقویتی با بازخورد انسانی (RLHF) برای همگرایی با ترجیحات انسانی در مورد مفیدیت و ایمنی تنظیم شده‌اند. Llama2 نسبت به سری Llama در مجموعه‌های داده علمی مختلف عملکرد بهتری دارد و الهام بخش طراحی و توسعه مدل‌های دیگر بسیاری بوده است."
-  },
  "llama-3.1-70b-versatile": {
    "description": "لاما 3.1 70B توانایی استدلال هوش مصنوعی قوی‌تری را ارائه می‌دهد، مناسب برای برنامه‌های پیچیده، پشتیبانی از پردازش‌های محاسباتی فراوان و تضمین کارایی و دقت بالا."
  },
@@ -2012,8 +2027,8 @@
  "llama-3.2-vision-instruct": {
    "description": "مدل میکروآموزش Llama 3.2-Vision برای شناسایی بصری، استدلال تصویری، توصیف تصویر و پاسخ به سوالات مربوط به تصویر بهینه‌سازی شده است."
  },
-  "llama-3.3-70b-instruct": {
-    "description": "Llama 3.3 پیشرفته‌ترین مدل زبان چندزبانه و متن‌باز در سری Llama است که تجربه‌ای با هزینه بسیار پایین مشابه عملکرد مدل 405B را ارائه می‌دهد. این مدل بر اساس ساختار Transformer طراحی شده و از طریق تنظیم دقیق نظارتی (SFT) و یادگیری تقویتی با بازخورد انسانی (RLHF) بهبود کارایی و ایمنی یافته است. نسخه بهینه‌سازی شده آن برای مکالمات چندزبانه طراحی شده و در چندین معیار صنعتی از بسیاری از مدل‌های چت متن‌باز و بسته بهتر عمل می‌کند. تاریخ قطع دانش آن دسامبر 2023 است."
+  "llama-3.3-70b": {
+    "description": "Llama 3.3 70B: مدلی با اندازه متوسط تا بزرگ از سری Llama که تعادلی میان توانایی استدلال و بازدهی فراهم می‌کند."
  },
  "llama-3.3-70b-versatile": {
    "description": "مدل زبان بزرگ چند زبانه Meta Llama 3.3 (LLM) یک مدل تولیدی پیش‌آموزش دیده و تنظیم‌شده در 70B (ورودی متن/خروجی متن) است. مدل متن خالص Llama 3.3 برای کاربردهای گفتگوی چند زبانه بهینه‌سازی شده و در معیارهای صنعتی معمول در مقایسه با بسیاری از مدل‌های چت متن‌باز و بسته عملکرد بهتری دارد."
@@ -2021,6 +2036,12 @@
  "llama-3.3-instruct": {
    "description": "مدل آموزشی لاما ۳.۳ برای صحنه‌های گفت‌وگو بهینه‌سازی شده است و در معیارهای صنعتی معمول، بسیاری از مدل‌های چت منبع باز موجود را در برمی‌آید."
  },
+  "llama-4-maverick-17b-128e-instruct": {
+    "description": "Llama 4 Maverick: مدلی قدرتمند از سری Llama، مناسب برای استدلال پیشرفته، حل مسائل پیچیده و پیروی از دستورات."
+  },
+  "llama-4-scout-17b-16e-instruct": {
+    "description": "Llama 4 Scout: مدلی قدرتمند از سری Llama، مناسب برای سناریوهایی با نیاز به بازدهی بالا و تأخیر پایین."
+  },
  "llama3-70b-8192": {
    "description": "متا لاما ۳ ۷۰B توانایی پردازش پیچیدگی بی‌نظیری را ارائه می‌دهد و برای پروژه‌های با نیازهای بالا طراحی شده است."
  },
@@ -2036,6 +2057,9 @@
  "llama3.1": {
    "description": "Llama 3.1 مدل پیشرو ارائه شده توسط Meta است که از حداکثر 405 میلیارد پارامتر پشتیبانی می‌کند و می‌تواند در زمینه‌های مکالمات پیچیده، ترجمه چندزبانه و تحلیل داده‌ها به کار گرفته شود."
  },
+  "llama3.1-8b": {
+    "description": "Llama 3.1 8B: نسخه‌ای سبک و با تأخیر پایین از Llama، مناسب برای استنتاج آنلاین سبک و تعاملات بلادرنگ."
+  },
  "llama3.1:405b": {
    "description": "Llama 3.1 مدل پیشرو ارائه شده توسط Meta است که از 405 میلیارد پارامتر پشتیبانی می‌کند و می‌تواند در زمینه‌های مکالمات پیچیده، ترجمه چندزبانه و تحلیل داده‌ها به کار گرفته شود."
  },
@@ -2067,7 +2091,7 @@
    "description": "Spark Max 32K با قابلیت پردازش متن با زمینه بزرگ‌تر، توانایی درک و استدلال منطقی قوی‌تری دارد و از ورودی متنی تا 32K توکن پشتیبانی می‌کند. مناسب برای خواندن اسناد طولانی، پرسش و پاسخ با دانش خصوصی و موارد مشابه."
  },
  "megrez-3b-instruct": {
-    "description": "Megrez-3B-Instruct یک مدل زبانی بزرگ است که به طور کامل توسط شرکت ووونگ شیونگ آموزش داده شده است. هدف از Megrez-3B-Instruct ایجاد یک راه‌حل هوشمند از طریق هماهنگی سخت‌افزار و نرم‌افزار است که دارای استنتاج سریع، حجم کوچک و آسانی در استفاده باشد."
+    "description": "Megrez 3B Instruct یک مدل کم‌پارامتر و کارآمد است که توسط Wuwen Xinqiong ارائه شده است."
  },
  "meta-llama-3-70b-instruct": {
    "description": "یک مدل قدرتمند با ۷۰ میلیارد پارامتر که در استدلال، کدنویسی و کاربردهای گسترده زبانی عملکرد برجسته‌ای دارد."
@@ -2624,6 +2648,12 @@
  "pro-128k": {
    "description": "Spark Pro 128K با قابلیت پردازش متن بسیار بزرگ، قادر به پردازش تا 128K اطلاعات متنی است. این ویژگی به‌ویژه برای تحلیل کامل و پردازش ارتباطات منطقی طولانی‌مدت در محتوای متنی طولانی مناسب است و می‌تواند در ارتباطات متنی پیچیده، پشتیبانی از منطق روان و یکپارچه و ارجاعات متنوع را فراهم کند."
  },
+  "pro-deepseek-r1": {
+    "description": "مدل اختصاصی برای خدمات سازمانی، شامل پشتیبانی از سرویس‌های هم‌زمان."
+  },
+  "pro-deepseek-v3": {
+    "description": "مدل اختصاصی برای خدمات سازمانی، شامل پشتیبانی از سرویس‌های هم‌زمان."
+  },
  "qvq-72b-preview": {
    "description": "مدل QVQ یک مدل تحقیقاتی تجربی است که توسط تیم Qwen توسعه یافته و بر بهبود توانایی استدلال بصری، به‌ویژه در زمینه استدلال ریاضی تمرکز دارد."
  },
@@ -2633,6 +2663,12 @@
  "qvq-plus": {
    "description": "مدل استدلال بصری. پشتیبانی از ورودی‌های بصری و خروجی زنجیره تفکر، نسخه پلاس پس از مدل qvq-max، که نسبت به مدل qvq-max سرعت استدلال بالاتر و تعادل بهتری بین عملکرد و هزینه دارد."
  },
+  "qwen-3-32b": {
+    "description": "Qwen 3 32B: مدل سری Qwen با عملکرد عالی در وظایف چندزبانه و برنامه‌نویسی، مناسب برای استفاده در مقیاس متوسط تولیدی."
+  },
+  "qwen-3-coder-480b": {
+    "description": "Qwen 3 Coder 480B: مدلی با زمینه طولانی برای تولید کد و انجام وظایف پیچیده برنامه‌نویسی."
+  },
  "qwen-coder-plus": {
    "description": "مدل کد نویسی Tongyi Qianwen."
  },
@@ -2753,12 +2789,6 @@
  "qwen2": {
    "description": "Qwen2 مدل زبان بزرگ نسل جدید علی‌بابا است که با عملکرد عالی از نیازهای متنوع کاربردی پشتیبانی می‌کند."
  },
-  "qwen2-72b-instruct": {
-    "description": "Qwen2، سری جدیدی از مدل‌های زبانی بزرگ توسط تیم Qwen ارائه شده است. این مدل بر اساس معماری Transformer ساخته شده و از توابع فعال‌سازی SwiGLU، بایاس QKV توجه (attention QKV bias)، توجه سؤال گروهی (group query attention)، ترکیب توجه پنجره‌ای لغزشی و توجه کامل (mixture of sliding window attention and full attention) استفاده می‌کند. علاوه بر این، تیم Qwen بهبودی در تجزیه‌کننده‌هایی که برای تجزیه متن‌های طبیعی و کد مناسب هستند ایجاد کرده‌اند."
-  },
-  "qwen2-7b-instruct": {
-    "description": "Qwen2، سری جدیدی از مدل‌های زبانی بزرگ توسط تیم Qwen ارائه شده است. این مدل بر اساس معماری Transformer ساخته شده و از توابع فعال‌سازی SwiGLU، بایاس QKV توجه (attention QKV bias)، توجه سرویس‌گروهی (group query attention)، ترکیب توجه پنجره‌ای لغزشی و توجه کامل (mixture of sliding window attention and full attention) استفاده می‌کند. علاوه بر این، تیم Qwen بهبودی در تجزیه‌کننده‌هایی ارائه کرده‌اند که برای تجزیه متن‌های طبیعی و کد مناسب هستند."
-  },
  "qwen2.5": {
    "description": "Qwen2.5 نسل جدید مدل زبانی مقیاس بزرگ Alibaba است که با عملکرد عالی از نیازهای متنوع کاربردی پشتیبانی می‌کند."
  },
@@ -2897,6 +2927,12 @@
  "qwen3-next-80b-a3b-thinking": {
    "description": "مدل متن‌باز نسل جدید با حالت تفکر مبتنی بر Qwen3، که نسبت به نسخه قبلی (Tongyi Qianwen 3-235B-A22B-Thinking-2507) در پیروی از دستورات پیشرفت داشته و پاسخ‌های مدل خلاصه‌تر شده‌اند."
  },
+  "qwen3-vl-235b-a22b-instruct": {
+    "description": "Qwen3 VL 235B A22B Instruct یک مدل چندرسانه‌ای است که توسط Tongyi Qianwen توسعه یافته و از درک و استدلال بصری پشتیبانی می‌کند."
+  },
+  "qwen3-vl-235b-a22b-thinking": {
+    "description": "Qwen3 VL 235B A22B Thinking یک مدل چندرسانه‌ای استدلالی است که توسط Tongyi Qianwen توسعه یافته و از درک و استدلال بصری پشتیبانی می‌کند."
+  },
  "qwen3-vl-plus": {
    "description": "Tongyi Qianwen VL مدلی برای تولید متن با قابلیت درک بصری (تصویر) است که نه تنها می‌تواند OCR (تشخیص متن در تصویر) انجام دهد، بلکه قادر به خلاصه‌سازی و استنتاج بیشتر نیز هست، مانند استخراج ویژگی‌ها از عکس محصولات یا حل مسائل بر اساس تصاویر تمرین."
  },
@@ -3014,6 +3050,9 @@
  "step-r1-v-mini": {
    "description": "این مدل یک مدل استدلال بزرگ با توانایی‌های قوی در درک تصویر است که می‌تواند اطلاعات تصویری و متنی را پردازش کند و پس از تفکر عمیق، متن تولید کند. این مدل در زمینه استدلال بصری عملکرد برجسته‌ای دارد و همچنین دارای توانایی‌های ریاضی، کدنویسی و استدلال متنی در سطح اول است. طول متن زمینه‌ای 100k است."
  },
+  "step3": {
+    "description": "Step3 یک مدل چندرسانه‌ای است که توسط StepStar توسعه یافته و دارای توانایی قوی در درک بصری می‌باشد."
+  },
  "stepfun-ai/step3": {
    "description": "Step3 یک مدل استنتاج چندمودالی پیشرفته است که توسط شرکت StepFun منتشر شده است. این مدل بر پایهٔ معماری مخلوط متخصصان (MoE) با مجموع 321 میلیارد پارامتر و 38 میلیارد پارامتر فعال ساخته شده است. طراحی آن انتها‌به‌انتها است و هدفش کمینه‌سازی هزینهٔ رمزگشایی در حالی‌ست که در استدلال بینایی-زبانی عملکردی در سطح برتر ارائه می‌دهد. از طریق طراحی هم‌افزا مبتنی بر توجه چند-ماتریسی تجزیه‌شده (MFA) و جداسازی توجه و FFN (AFD)، Step3 قادر است کارایی برجسته‌ای را هم روی شتاب‌دهنده‌های رده‌پرچم‌دار و هم روی شتاب‌دهنده‌های سطح پایین حفظ کند. در مرحلهٔ پیش‌آموزش، Step3 بیش از 20T توکن متنی و 4T توکن ترکیبی تصویر-متن را پردازش کرده و بیش از ده زبان را پوشش داده است. این مدل در بنچ‌مارک‌های متعددی از جمله ریاضیات، کدنویسی و چندمودال در میان مدل‌های متن‌باز در جایگاه پیشرو قرار گرفته است."
  },
@@ -3137,9 +3176,6 @@
  "xai/grok-4": {
    "description": "جدیدترین و بزرگ‌ترین مدل پرچمدار xAI که عملکرد بی‌نظیری در زبان طبیعی، ریاضیات و استدلال ارائه می‌دهد — انتخابی کامل و همه‌کاره."
  },
-  "yi-1.5-34b-chat": {
-    "description": "Yi-1.5 نسخه‌ی به‌روزرسانی شده‌ی Yi است. این مدل با استفاده از یک مجموعه داده با کیفیت بالا شامل 500 میلیارد توکن برای پیش‌آموزی و 3 میلیون نمونه متنوع برای آموزش ریزی مجدداً آموزش داده شده است."
-  },
  "yi-large": {
    "description": "مدل جدید با میلیاردها پارامتر، ارائه‌دهنده توانایی‌های فوق‌العاده در پاسخ‌گویی و تولید متن."
  },
@@ -32,6 +32,9 @@
  "bfl": {
    "description": "آزمایشگاهی پیشرو در پژوهش‌های پیشرفتهٔ هوش مصنوعی که زیرساخت‌های بصریِ فردا را می‌سازد."
  },
+  "cerebras": {
+    "description": "Cerebras یک پلتفرم استنتاج هوش مصنوعی مبتنی بر سیستم اختصاصی CS-3 خود است که با هدف ارائه سریع‌ترین خدمات مدل‌های زبانی بزرگ (LLM) در جهان با پاسخ‌دهی آنی و توان عملیاتی بالا طراحی شده است. این پلتفرم به‌طور ویژه برای حذف تأخیر و تسریع جریان‌های کاری پیچیده هوش مصنوعی مانند تولید کد در زمان واقعی و انجام وظایف نمایندگی طراحی شده است."
+  },
  "cloudflare": {
    "description": "مدل‌های یادگیری ماشین مبتنی بر GPU بدون سرور را در شبکه جهانی Cloudflare اجرا کنید."
  },
@@ -294,6 +294,13 @@
    },
    "title": "تنظیمات عمومی"
  },
+  "settingImage": {
+    "defaultCount": {
+      "desc": "تعداد پیش‌فرض تصاویر تولیدشده هنگام ایجاد یک وظیفه جدید را در پنل تولید تصویر تنظیم کنید.",
+      "label": "تعداد پیش‌فرض تصاویر",
+      "title": "تنظیمات نقاشی هوش مصنوعی"
+    }
+  },
  "settingModel": {
    "enableMaxTokens": {
      "title": "فعال‌سازی محدودیت پاسخ"
@@ -549,6 +556,7 @@
    "common": "تنظیمات عمومی",
    "experiment": "آزمایش",
    "hotkey": "کلیدهای میانبر",
+    "image": "نقاشی هوش مصنوعی",
    "llm": "مدل زبان",
    "provider": "ارائه دهنده خدمات هوش مصنوعی",
    "proxy": "پروکسی شبکه",
@@ -236,6 +236,7 @@
  },
  "information": "Communauté et Informations",
  "installPWA": "Installer l'application du navigateur",
+  "labs": "Laboratoires",
  "lang": {
    "ar": "arabe",
    "bg-BG": "Bulgare",
@@ -7,6 +7,14 @@
    "desc": "Effacer les messages de la session actuelle et les fichiers téléchargés",
    "title": "Effacer les messages de la session"
  },
+  "deleteAndRegenerateMessage": {
+    "desc": "Supprimer le dernier message et régénérer",
+    "title": "Supprimer et régénérer"
+  },
+  "deleteLastMessage": {
+    "desc": "Supprimer le dernier message",
+    "title": "Supprimer le dernier message"
+  },
  "desktop": {
    "openSettings": {
      "desc": "Ouvrir la page des paramètres de l'application",
@@ -0,0 +1,14 @@
+{
+  "desc": "Nous mettons à jour régulièrement cette section avec de nouvelles fonctionnalités en cours d'exploration. N'hésitez pas à les essayer !",
+  "features": {
+    "groupChat": {
+      "desc": "Activez la coordination de discussions de groupe avec plusieurs agents intelligents.",
+      "title": "Discussion de groupe (multi-agents)"
+    },
+    "inputMarkdown": {
+      "desc": "Rendu en temps réel du Markdown dans la zone de saisie (gras, blocs de code, tableaux, etc.).",
+      "title": "Rendu Markdown dans le champ de saisie"
+    }
+  },
+  "title": "Laboratoire"
+}
@@ -284,11 +284,19 @@
          "placeholder": "Veuillez entrer l'identifiant du modèle, par exemple gpt-4o ou claude-3.5-sonnet",
          "title": "ID du modèle"
        },
+        "imageOutput": {
+          "extra": "Cette configuration activera uniquement la capacité du modèle à générer des images. L'efficacité dépend entièrement du modèle lui-même. Veuillez tester vous-même si le modèle est capable de générer des images utilisables.",
+          "title": "Prise en charge de la génération d'images"
+        },
        "modalTitle": "Configuration du modèle personnalisé",
        "reasoning": {
          "extra": "Cette configuration activera uniquement la capacité de réflexion approfondie du modèle. Les résultats dépendent entièrement du modèle lui-même, veuillez tester si ce modèle possède une capacité de réflexion approfondie utilisable.",
          "title": "Support de la réflexion approfondie"
        },
+        "search": {
+          "extra": "Cette configuration activera uniquement la capacité de recherche en ligne via le moteur de recherche intégré du modèle. La prise en charge dépend du modèle lui-même. Veuillez tester vous-même si le moteur de recherche intégré est fonctionnel.",
+          "title": "Prise en charge de la recherche en ligne"
+        },
        "tokens": {
          "extra": "Définir le nombre maximal de tokens pris en charge par le modèle",
          "title": "Fenêtre de contexte maximale",
@@ -309,6 +317,10 @@
          "placeholder": "Veuillez sélectionner un type de modèle",
          "title": "Type de modèle"
        },
+        "video": {
+          "extra": "Cette configuration activera uniquement la reconnaissance vidéo dans l'application. La capacité de reconnaissance dépend entièrement du modèle lui-même. Veuillez tester vous-même si le modèle est capable de reconnaître les vidéos.",
+          "title": "Prise en charge de la reconnaissance vidéo"
+        },
        "vision": {
          "extra": "Cette configuration n'activera que la configuration de téléchargement d'images dans l'application, la prise en charge de la reconnaissance dépend entièrement du modèle lui-même, veuillez tester la disponibilité des capacités de reconnaissance visuelle de ce modèle.",
          "title": "Reconnaissance visuelle prise en charge"
@@ -704,6 +704,9 @@
  "azure-DeepSeek-R1-0528": {
    "description": "Déployé et fourni par Microsoft ; le modèle DeepSeek R1 a bénéficié d'une mise à jour mineure, la version actuelle étant DeepSeek-R1-0528. Dans la dernière mise à jour, DeepSeek R1 a considérablement amélioré sa profondeur d'inférence et ses capacités de raisonnement grâce à l'augmentation des ressources de calcul et à l'introduction d'un mécanisme d'optimisation algorithmique en phase post-entraînement. Ce modèle excelle dans plusieurs benchmarks, notamment en mathématiques, programmation et logique générale, avec des performances globales proches des modèles de pointe tels que O3 et Gemini 2.5 Pro."
  },
+  "baichuan-m2-32b": {
+    "description": "Baichuan M2 32B est un modèle à experts mixtes développé par Baichuan Intelligence, doté de puissantes capacités de raisonnement."
+  },
  "baichuan/baichuan2-13b-chat": {
    "description": "Baichuan-13B est un modèle de langage open source et commercialisable développé par Baichuan Intelligence, contenant 13 milliards de paramètres, qui a obtenu les meilleurs résultats dans des benchmarks chinois et anglais de référence."
  },
@@ -728,12 +731,6 @@
  "charglm-4": {
    "description": "CharGLM-4 est conçu pour le jeu de rôle et l'accompagnement émotionnel, prenant en charge une mémoire multi-tours ultra-longue et des dialogues personnalisés, avec une large gamme d'applications."
  },
-  "chatglm3": {
-    "description": "ChatGLM3 est un modèle fermé développé par l'IA Zhipu et le laboratoire KEG de Tsinghua. Il a été pré-entraîné sur une grande quantité d'identifiants chinois et anglais et a été aligné sur les préférences humaines. Par rapport au modèle de première génération, il a amélioré ses performances de 16%, 36% et 280% sur MMLU, C-Eval et GSM8K respectivement, et est devenu le meilleur modèle sur le classement C-Eval pour les tâches en chinois. Il est adapté aux scénarios nécessitant une grande quantité de connaissances, des capacités de raisonnement et de créativité, tels que la rédaction de publicités, l'écriture de romans, la rédaction de contenu informatif et la génération de code."
-  },
-  "chatglm3-6b-base": {
-    "description": "ChatGLM3-6b-base est le modèle de base open source de la dernière génération de la série ChatGLM, développé par Zhipu, avec une taille de 6 milliards de paramètres."
-  },
  "chatgpt-4o-latest": {
    "description": "ChatGPT-4o est un modèle dynamique, mis à jour en temps réel pour rester à jour avec la dernière version. Il combine une compréhension et une génération de langage puissantes, adapté à des scénarios d'application à grande échelle, y compris le service client, l'éducation et le support technique."
  },
@@ -938,6 +935,9 @@
  "deepseek-ai/DeepSeek-V3.1-Terminus": {
    "description": "DeepSeek-V3.1-Terminus est une version mise à jour du modèle V3.1 publiée par DeepSeek, positionnée comme un grand modèle de langage hybride pour agents intelligents. Cette mise à jour conserve les capacités originales du modèle tout en se concentrant sur la correction des problèmes signalés par les utilisateurs et l'amélioration de la stabilité. Elle améliore significativement la cohérence linguistique, réduisant le mélange de chinois et d'anglais ainsi que l'apparition de caractères anormaux. Le modèle intègre un « mode réflexion » (Thinking Mode) et un « mode non-réflexion » (Non-thinking Mode), permettant aux utilisateurs de basculer facilement entre ces modes via des modèles de conversation adaptés à différentes tâches. En tant qu'optimisation majeure, V3.1-Terminus renforce les performances des agents de code (Code Agent) et de recherche (Search Agent), rendant leur appel d'outils et l'exécution de tâches complexes en plusieurs étapes plus fiables."
  },
+  "deepseek-ai/DeepSeek-V3.2-Exp": {
+    "description": "Le modèle DeepSeek V3.2 Exp adopte une architecture de raisonnement hybride, prenant en charge à la fois les modes de réflexion et non réflexifs."
+  },
  "deepseek-ai/deepseek-llm-67b-chat": {
    "description": "DeepSeek 67B est un modèle avancé formé pour des dialogues de haute complexité."
  },
@@ -947,6 +947,9 @@
  "deepseek-ai/deepseek-v3.1": {
    "description": "DeepSeek V3.1 : modèle de raisonnement de nouvelle génération, améliorant les capacités de raisonnement complexe et de réflexion en chaîne, adapté aux tâches nécessitant une analyse approfondie."
  },
+  "deepseek-ai/deepseek-v3.1-terminus": {
+    "description": "DeepSeek V3.1 : un modèle de raisonnement de nouvelle génération, améliorant les capacités de raisonnement complexe et de pensée en chaîne, idéal pour les tâches nécessitant une analyse approfondie."
+  },
  "deepseek-ai/deepseek-vl2": {
    "description": "DeepSeek-VL2 est un modèle de langage visuel à experts mixtes (MoE) développé sur la base de DeepSeekMoE-27B, utilisant une architecture MoE à activation sparse, réalisant des performances exceptionnelles tout en n'activant que 4,5 milliards de paramètres. Ce modèle excelle dans plusieurs tâches telles que la question-réponse visuelle, la reconnaissance optique de caractères, la compréhension de documents/tableaux/graphes et le positionnement visuel."
  },
@@ -1028,6 +1031,9 @@
  "deepseek-v3.1": {
    "description": "DeepSeek-V3.1 est un nouveau modèle d'inférence hybride lancé par DeepSeek, prenant en charge deux modes d'inférence : réfléchi et non réfléchi, avec une efficacité de réflexion supérieure à celle de DeepSeek-R1-0528. Optimisé par post-entraînement, l'utilisation des outils Agent et les performances des tâches des agents ont été grandement améliorées. Supporte une fenêtre contextuelle de 128k et une longueur de sortie maximale de 64k tokens."
  },
+  "deepseek-v3.1-terminus": {
+    "description": "DeepSeek-V3.1-Terminus est une version optimisée pour terminaux du grand modèle linguistique lancé par DeepSeek, spécialement conçu pour les appareils terminaux."
+  },
  "deepseek-v3.1:671b": {
    "description": "DeepSeek V3.1 : modèle de raisonnement de nouvelle génération, améliorant les capacités de raisonnement complexe et de réflexion en chaîne, adapté aux tâches nécessitant une analyse approfondie."
  },
@@ -1190,6 +1196,12 @@
  "ernie-4.0-turbo-8k-preview": {
    "description": "Le modèle de langage de très grande taille phare développé par Baidu, avec d'excellentes performances globales, largement applicable à des scénarios de tâches complexes dans divers domaines ; supporte l'intégration automatique avec le plugin de recherche Baidu, garantissant la pertinence des informations de réponse. Par rapport à ERNIE 4.0, il offre de meilleures performances."
  },
+  "ernie-4.5-21b-a3b": {
+    "description": "ERNIE 4.5 21B A3B est un modèle à experts mixtes développé par Wenxin de Baidu, offrant de puissantes capacités de raisonnement et de traitement multilingue."
+  },
+  "ernie-4.5-300b-a47b": {
+    "description": "ERNIE 4.5 300B A47B est un modèle à très grande échelle à experts mixtes lancé par Wenxin de Baidu, doté d'excellentes capacités de raisonnement."
+  },
  "ernie-4.5-8k-preview": {
    "description": "Le modèle ERNIE 4.5 est un nouveau modèle de base multimodal natif développé par Baidu, réalisant une optimisation collaborative grâce à la modélisation conjointe de plusieurs modalités, avec d'excellentes capacités de compréhension multimodale ; il possède des capacités linguistiques améliorées, avec des améliorations significatives dans la compréhension, la génération, la logique et la mémoire, ainsi qu'une réduction des hallucinations et une amélioration des capacités de raisonnement logique et de codage."
  },
@@ -1446,7 +1458,7 @@
    "description": "GLM-4-0520 est la dernière version du modèle, conçue pour des tâches hautement complexes et diversifiées, avec des performances exceptionnelles."
  },
  "glm-4-9b-chat": {
-    "description": "GLM-4-9B-Chat affiche de bonnes performances dans divers domaines tels que la sémantique, les mathématiques, le raisonnement, le code et les connaissances. Il dispose également de fonctionnalités de navigation sur le web, d'exécution de code, d'appels d'outils personnalisés et de raisonnement sur de longs textes. Il prend en charge 26 langues, y compris le japonais, le coréen et l'allemand."
+    "description": "GLM-4-9B-Chat offre des performances élevées dans les domaines de la sémantique, des mathématiques, du raisonnement, du code et des connaissances. Il prend également en charge la navigation web, l'exécution de code, l'appel d'outils personnalisés et le raisonnement sur de longs textes. Prise en charge de 26 langues, dont le japonais, le coréen et l’allemand."
  },
  "glm-4-air": {
    "description": "GLM-4-Air est une version économique, offrant des performances proches de GLM-4, avec une rapidité et un prix abordable."
@@ -1529,6 +1541,9 @@
  "glm-zero-preview": {
    "description": "GLM-Zero-Preview possède de puissantes capacités de raisonnement complexe, se distinguant dans les domaines du raisonnement logique, des mathématiques et de la programmation."
  },
+  "glm4.6:355b": {
+    "description": "Le tout dernier modèle phare de Zhipu, le GLM-4.6 (355B), surpasse largement ses prédécesseurs en matière de codage avancé, de traitement de longs textes, de raisonnement et de capacités d'agents intelligents. Il est particulièrement performant en programmation, atteignant le niveau de Claude Sonnet 4, ce qui en fait l’un des meilleurs modèles de codage en Chine."
+  },
  "google/gemini-2.0-flash": {
    "description": "Gemini 2.0 Flash offre des fonctionnalités de nouvelle génération et des améliorations, incluant une vitesse exceptionnelle, l'utilisation d'outils intégrés, la génération multimodale et une fenêtre de contexte de 1 million de tokens."
  },
@@ -1730,14 +1745,23 @@
  "gpt-5-nano": {
    "description": "Version la plus rapide et la plus économique de GPT-5. Parfaitement adaptée aux scénarios nécessitant une réponse rapide et sensibles aux coûts."
  },
+  "gpt-5-pro": {
+    "description": "GPT-5 Pro utilise davantage de puissance de calcul pour approfondir sa réflexion et fournir des réponses toujours plus pertinentes."
+  },
  "gpt-audio": {
    "description": "GPT Audio est un modèle de chat universel orienté vers l'entrée et la sortie audio, supportant l'utilisation d'entrées/sorties audio dans l'API Chat Completions."
  },
  "gpt-image-1": {
    "description": "Modèle natif multimodal de génération d'images de ChatGPT."
  },
+  "gpt-image-1-mini": {
+    "description": "Une version plus économique de GPT Image 1, prenant en charge nativement les entrées texte et image, et générant des sorties visuelles."
+  },
  "gpt-oss-120b": {
-    "description": "GPT-OSS-120B MXFP4 : architecture Transformer quantifiée, offrant des performances solides même en ressources limitées."
+    "description": "Ce modèle nécessite une demande d'accès. GPT-OSS-120B est un modèle de langage open source à grande échelle lancé par OpenAI, doté de puissantes capacités de génération de texte."
+  },
+  "gpt-oss-20b": {
+    "description": "Ce modèle nécessite une demande d'accès. GPT-OSS-20B est un modèle de langage open source de taille moyenne lancé par OpenAI, offrant une génération de texte efficace."
  },
  "gpt-oss:120b": {
    "description": "GPT-OSS 120B est un grand modèle de langage open source publié par OpenAI, utilisant la technologie de quantification MXFP4, conçu comme un modèle phare. Il nécessite un environnement multi-GPU ou une station de travail haute performance, offrant des performances exceptionnelles en raisonnement complexe, génération de code et traitement multilingue, avec prise en charge avancée des appels de fonctions et de l'intégration d'outils."
@@ -1748,9 +1772,6 @@
  "gpt-realtime": {
    "description": "Modèle universel en temps réel, supportant les entrées et sorties textuelles et audio en temps réel, ainsi que les entrées d'images."
  },
-  "grok-2-1212": {
-    "description": "Ce modèle a été amélioré en termes de précision, de respect des instructions et de capacités multilingues."
-  },
  "grok-2-image-1212": {
    "description": "Notre dernier modèle de génération d'images peut créer des images vivantes et réalistes à partir d'invites textuelles. Il excelle dans la génération d'images pour le marketing, les réseaux sociaux et le divertissement."
  },
@@ -1760,15 +1781,9 @@
  "grok-3": {
    "description": "Modèle phare, expert en extraction de données, programmation et résumé de texte pour des applications d'entreprise, avec une connaissance approfondie des domaines financier, médical, juridique et scientifique."
  },
-  "grok-3-fast": {
-    "description": "Modèle phare, expert en extraction de données, programmation et résumé de texte pour des applications d'entreprise, avec une connaissance approfondie des domaines financier, médical, juridique et scientifique."
-  },
  "grok-3-mini": {
    "description": "Modèle léger, réfléchit avant de répondre. Rapide et intelligent, adapté aux tâches logiques ne nécessitant pas de connaissances approfondies, avec accès à la trace de pensée originale."
  },
-  "grok-3-mini-fast": {
-    "description": "Modèle léger, réfléchit avant de répondre. Rapide et intelligent, adapté aux tâches logiques ne nécessitant pas de connaissances approfondies, avec accès à la trace de pensée originale."
-  },
  "grok-4": {
    "description": "Notre tout dernier modèle phare, le plus puissant, excelle dans le traitement du langage naturel, le calcul mathématique et le raisonnement — un véritable champion polyvalent."
  },
@@ -1851,7 +1866,7 @@
    "description": "La dernière version du modèle de réflexion profonde multimodale t1-vision de Hunyuan, supportant une chaîne de pensée native multimodale, avec des améliorations globales par rapport à la version par défaut précédente."
  },
  "hunyuan-t1-vision-20250916": {
-    "description": "Modèle de réflexion profonde multimodal Hunyuan, supportant des chaînes de pensée natives multimodales longues, excellent pour divers scénarios de raisonnement d’images, avec une amélioration globale par rapport au modèle Quick Think pour les problèmes scientifiques."
+    "description": "La dernière version du modèle de raisonnement visuel profond Hunyuan t1-vision offre des améliorations significatives par rapport à la version précédente dans des tâches telles que les questions-réponses image-texte, la localisation visuelle, l'OCR, l'interprétation de graphiques, la résolution de problèmes à partir de photos et la création d’images. Les performances en anglais et en langues rares ont également été nettement optimisées."
  },
  "hunyuan-turbo": {
    "description": "Version préliminaire du nouveau modèle de langage de génération Hunyuan, utilisant une nouvelle structure de modèle d'experts mixtes (MoE), offrant une efficacité d'inférence plus rapide et de meilleures performances par rapport à Hunyuan-Pro."
@@ -1964,6 +1979,9 @@
  "kimi-k2-0905-preview": {
    "description": "Le modèle kimi-k2-0905-preview dispose d'une longueur de contexte de 256k, offrant une capacité renforcée de codage agentique, une meilleure esthétique et utilité du code front-end, ainsi qu'une compréhension contextuelle améliorée."
  },
+  "kimi-k2-instruct": {
+    "description": "Kimi K2 Instruct est un grand modèle linguistique développé par Moonshot AI, capable de traiter des contextes ultra-longs."
+  },
  "kimi-k2-turbo-preview": {
    "description": "kimi-k2 est un modèle de base à architecture MoE doté de capacités remarquables en programmation et en agents autonomes, avec 1T de paramètres au total et 32B de paramètres activés. Dans les principaux tests de référence couvrant le raisonnement général, la programmation, les mathématiques et les agents, le modèle K2 surpasse les autres modèles open source majeurs."
  },
@@ -1985,9 +2003,6 @@
  "lite": {
    "description": "Spark Lite est un modèle de langage léger, offrant une latence extrêmement faible et une capacité de traitement efficace, entièrement gratuit et ouvert, prenant en charge la recherche en temps réel. Sa capacité de réponse rapide le rend exceptionnel pour les applications d'inférence sur des appareils à faible puissance de calcul et pour le réglage des modèles, offrant aux utilisateurs un excellent rapport coût-efficacité et une expérience intelligente, en particulier dans les scénarios de questions-réponses, de génération de contenu et de recherche."
  },
-  "llama-2-7b-chat": {
-    "description": "Llama2 est une série de grands modèles de langage (LLM) développés et open-source par Meta. Elle comprend des modèles de génération de texte pré-entraînés et affinés, dont la taille varie de 7 milliards à 70 milliards de paramètres. Sur le plan architectural, Llama2 est un modèle de langage auto-régressif utilisant une architecture de transformateur optimisée. Les versions ajustées utilisent un affinage supervisé (SFT) et un apprentissage par renforcement avec feedback humain (RLHF) pour aligner les préférences d'utilité et de sécurité humaines. Llama2 offre de meilleures performances que la série Llama sur de nombreux jeux de données académiques, fournissant des idées pour la conception et le développement de nombreux autres modèles."
-  },
  "llama-3.1-70b-versatile": {
    "description": "Llama 3.1 70B offre une capacité de raisonnement AI plus puissante, adaptée aux applications complexes, prenant en charge un traitement de calcul intensif tout en garantissant efficacité et précision."
  },
@@ -2012,8 +2027,8 @@
  "llama-3.2-vision-instruct": {
    "description": "Le modèle Llama 3.2-Vision optimisé pour les instructions est spécialisé dans la reconnaissance visuelle, le raisonnement sur images, la description d'images et la réponse aux questions générales liées aux images."
  },
-  "llama-3.3-70b-instruct": {
-    "description": "Llama 3.3 est le modèle de langage open source multilingue le plus avancé de la série Llama, offrant des performances comparables à celles du modèle 405B à un coût très bas. Basé sur une architecture Transformer, il améliore son utilité et sa sécurité grâce à un ajustement supervisé (SFT) et un apprentissage par renforcement avec retour humain (RLHF). Sa version optimisée pour les instructions est spécialement conçue pour les dialogues multilingues et surpasse de nombreux modèles de chat open source et fermés sur plusieurs benchmarks industriels. La date limite des connaissances est décembre 2023."
+  "llama-3.3-70b": {
+    "description": "Llama 3.3 70B : un modèle Llama de taille moyenne à grande, équilibrant capacités de raisonnement et débit élevé."
  },
  "llama-3.3-70b-versatile": {
    "description": "Le modèle de langage multilingue Llama 3.3 de Meta (LLM) est un modèle génératif pré-entraîné et affiné par instructions avec 70B (entrée/sortie de texte). Le modèle Llama 3.3 affiné par instructions est optimisé pour les cas d'utilisation de dialogue multilingue et surpasse de nombreux modèles de chat open-source et fermés disponibles sur des benchmarks industriels courants."
@@ -2021,6 +2036,12 @@
  "llama-3.3-instruct": {
    "description": "Le modèle d'instructions affiné Llama 3.3 est optimisé pour les scénarios de dialogue, surpassant de nombreux modèles de chat open source existants dans les tests de référence courants de l'industrie."
  },
+  "llama-4-maverick-17b-128e-instruct": {
+    "description": "Llama 4 Maverick : un modèle haute performance de la série Llama, idéal pour le raisonnement avancé, la résolution de problèmes complexes et les tâches de suivi d'instructions."
+  },
+  "llama-4-scout-17b-16e-instruct": {
+    "description": "Llama 4 Scout : un modèle haute performance de la série Llama, conçu pour les scénarios nécessitant un haut débit et une faible latence."
+  },
  "llama3-70b-8192": {
    "description": "Meta Llama 3 70B offre une capacité de traitement de complexité inégalée, sur mesure pour des projets exigeants."
  },
@@ -2036,6 +2057,9 @@
  "llama3.1": {
    "description": "Llama 3.1 est le modèle de pointe lancé par Meta, prenant en charge jusqu'à 405B de paramètres, applicable dans les domaines des dialogues complexes, de la traduction multilingue et de l'analyse de données."
  },
+  "llama3.1-8b": {
+    "description": "Llama 3.1 8B : une variante légère et à faible latence de Llama, adaptée aux scénarios d'inférence en ligne et d'interaction légers."
+  },
  "llama3.1:405b": {
    "description": "Llama 3.1 est le modèle de pointe lancé par Meta, prenant en charge jusqu'à 405B de paramètres, applicable dans les domaines des dialogues complexes, de la traduction multilingue et de l'analyse de données."
  },
@@ -2067,7 +2091,7 @@
    "description": "Spark Max 32K est équipé d'une grande capacité de traitement de contexte, avec une compréhension contextuelle et des capacités de raisonnement logique renforcées, prenant en charge des entrées textuelles de 32K tokens, adapté à la lecture de documents longs, aux questions-réponses privées et à d'autres scénarios."
  },
  "megrez-3b-instruct": {
-    "description": "Megrez-3B-Instruct est un grand modèle de langage entièrement formé par Wúwèn Xīnqióng. Megrez-3B-Instruct vise à créer une solution d'intelligence embarquée rapide, compacte et facile à utiliser, en adoptant une approche intégrée logiciel-hardware."
+    "description": "Megrez 3B Instruct est un modèle efficace à faible nombre de paramètres lancé par Wuwen Xinqiong."
  },
  "meta-llama-3-70b-instruct": {
    "description": "Un puissant modèle de 70 milliards de paramètres excelling dans le raisonnement, le codage et les applications linguistiques larges."
@@ -2624,6 +2648,12 @@
  "pro-128k": {
    "description": "Spark Pro 128K est doté d'une capacité de traitement de contexte très étendue, capable de gérer jusqu'à 128K d'informations contextuelles, particulièrement adapté pour l'analyse complète et le traitement des relations logiques à long terme dans des contenus longs, offrant une logique fluide et cohérente ainsi qu'un soutien varié pour les références dans des communications textuelles complexes."
  },
+  "pro-deepseek-r1": {
+    "description": "Modèle dédié aux services d'entreprise, incluant les services en parallèle."
+  },
+  "pro-deepseek-v3": {
+    "description": "Modèle dédié aux services d'entreprise, incluant les services en parallèle."
+  },
  "qvq-72b-preview": {
    "description": "Le modèle QVQ est un modèle de recherche expérimental développé par l'équipe Qwen, axé sur l'amélioration des capacités de raisonnement visuel, en particulier dans le domaine du raisonnement mathématique."
  },
@@ -2633,6 +2663,12 @@
  "qvq-plus": {
    "description": "Modèle de raisonnement visuel. Prend en charge les entrées visuelles et les sorties en chaîne de pensée. Version plus avancée du modèle qvq-max, offrant une vitesse de raisonnement plus rapide et un meilleur équilibre entre performance et coût."
  },
+  "qwen-3-32b": {
+    "description": "Qwen 3 32B : un modèle de la série Qwen performant dans les tâches multilingues et de codage, adapté à une utilisation en production à échelle moyenne."
+  },
+  "qwen-3-coder-480b": {
+    "description": "Qwen 3 Coder 480B : un modèle à long contexte conçu pour la génération de code et les tâches de programmation complexes."
+  },
  "qwen-coder-plus": {
    "description": "Modèle de code Tongyi Qianwen."
  },
@@ -2753,12 +2789,6 @@
  "qwen2": {
    "description": "Qwen2 est le nouveau modèle de langage à grande échelle d'Alibaba, offrant d'excellentes performances pour des besoins d'application diversifiés."
  },
-  "qwen2-72b-instruct": {
-    "description": "Qwen2 est la nouvelle série de modèles de langage grand format développée par l'équipe Qwen. Elle repose sur l'architecture Transformer et intègre des fonctions d'activation SwiGLU, un biais d'attention QKV (attention QKV bias), une attention de requête de groupe (group query attention), un mélange d'attention à fenêtre glissante (mixture of sliding window attention) et une attention complète. De plus, l'équipe Qwen a amélioré le segmenteur pour mieux s'adapter à diverses langues naturelles et au code."
-  },
-  "qwen2-7b-instruct": {
-    "description": "Qwen2 est la nouvelle génération de modèles de langage grand format développée par l'équipe Qwen. Il repose sur l'architecture Transformer et utilise des fonctions d'activation SwiGLU, des biais QKV d'attention, de l'attention de requête de groupe, un mélange d'attention à fenêtre glissante et d'attention complète. De plus, l'équipe Qwen a amélioré le segmenteur pour s'adapter à de nombreuses langues naturelles et à des codes."
-  },
  "qwen2.5": {
    "description": "Qwen2.5 est le nouveau modèle de langage à grande échelle de Alibaba, offrant d'excellentes performances pour répondre à des besoins d'application diversifiés."
  },
@@ -2897,6 +2927,12 @@
  "qwen3-next-80b-a3b-thinking": {
    "description": "Modèle open source de nouvelle génération en mode réflexif basé sur Qwen3, avec une meilleure conformité aux instructions et des réponses plus concises dans les résumés par rapport à la version précédente (Tongyi Qianwen 3-235B-A22B-Thinking-2507)."
  },
+  "qwen3-vl-235b-a22b-instruct": {
+    "description": "Qwen3 VL 235B A22B Instruct est un modèle multimodal lancé par Tongyi Qianwen, prenant en charge la compréhension et le raisonnement visuels."
+  },
+  "qwen3-vl-235b-a22b-thinking": {
+    "description": "Qwen3 VL 235B A22B Thinking est un modèle de raisonnement multimodal lancé par Tongyi Qianwen, prenant en charge la compréhension et le raisonnement visuels."
+  },
  "qwen3-vl-plus": {
    "description": "Tongyi Qianwen VL est un modèle de génération de texte doté de capacités de compréhension visuelle (images). Il peut non seulement effectuer de l'OCR (reconnaissance de texte sur images), mais aussi résumer et raisonner davantage, par exemple extraire des attributs à partir de photos de produits ou résoudre des exercices à partir d'images."
  },
@@ -3014,6 +3050,9 @@
  "step-r1-v-mini": {
    "description": "Ce modèle est un grand modèle de raisonnement avec de puissantes capacités de compréhension d'image, capable de traiter des informations visuelles et textuelles, produisant du texte après une réflexion approfondie. Ce modèle se distingue dans le domaine du raisonnement visuel, tout en possédant des capacités de raisonnement mathématique, de code et de texte de premier plan. La longueur du contexte est de 100k."
  },
+  "step3": {
+    "description": "Step3 est un modèle multimodal développé par StepStar, doté de puissantes capacités de compréhension visuelle."
+  },
  "stepfun-ai/step3": {
    "description": "Step3 est un modèle de raisonnement multimodal de pointe publié par StepFun (阶跃星辰). Il est construit sur une architecture Mixture-of-Experts (MoE) comportant 321 milliards de paramètres au total et 38 milliards de paramètres d'activation. Le modèle adopte une conception bout en bout visant à minimiser le coût de décodage tout en offrant des performances de premier plan en raisonnement visuel et linguistique. Grâce à la conception synergique de l'attention par décomposition multi-matrice (MFA) et du découplage attention‑FFN (AFD), Step3 conserve une grande efficacité aussi bien sur des accélérateurs haut de gamme que sur des accélérateurs d'entrée de gamme. Lors de la pré‑entraînement, Step3 a traité plus de 20 000 milliards de tokens textuels et 4 000 milliards de tokens mixtes image‑texte, couvrant une dizaine de langues. Le modèle atteint des niveaux de référence parmi les meilleurs des modèles open source sur plusieurs benchmarks, notamment en mathématiques, en code et en multimodalité."
  },
@@ -3137,9 +3176,6 @@
  "xai/grok-4": {
    "description": "Le dernier et meilleur modèle phare de xAI, offrant des performances inégalées en langage naturel, mathématiques et raisonnement — un véritable modèle polyvalent."
  },
-  "yi-1.5-34b-chat": {
-    "description": "Yi-1.5 est une version améliorée de Yi. Il utilise un corpus de haute qualité de 500 milliards de tokens pour poursuivre l'entraînement préalable de Yi, et est affiné sur 3 millions d'exemples de fine-tuning variés."
-  },
  "yi-large": {
    "description": "Un modèle de nouvelle génération avec des milliards de paramètres, offrant des capacités de question-réponse et de génération de texte exceptionnelles."
  },
@@ -32,6 +32,9 @@
  "bfl": {
    "description": "Un laboratoire de recherche en intelligence artificielle à la pointe, construisant l'infrastructure visuelle de demain."
  },
+  "cerebras": {
+    "description": "Cerebras est une plateforme d'inférence IA basée sur son système dédié CS-3, conçue pour offrir les services de LLM les plus rapides au monde, avec une réponse en temps réel et un débit élevé. Elle est spécialement conçue pour éliminer la latence et accélérer les flux de travail IA complexes, tels que la génération de code en temps réel et les tâches d'agents."
+  },
  "cloudflare": {
    "description": "Exécutez des modèles d'apprentissage automatique alimentés par GPU sans serveur sur le réseau mondial de Cloudflare."
  },
@@ -294,6 +294,13 @@
    },
    "title": "Paramètres généraux"
  },
+  "settingImage": {
+    "defaultCount": {
+      "desc": "Définir le nombre d'images générées par défaut lors de la création d'une nouvelle tâche dans le panneau de génération d'images.",
+      "label": "Nombre d'images par défaut",
+      "title": "Paramètres de dessin IA"
+    }
+  },
  "settingModel": {
    "enableMaxTokens": {
      "title": "Activer la limite de tokens par réponse"
@@ -549,6 +556,7 @@
    "common": "Paramètres généraux",
    "experiment": "Expérience",
    "hotkey": "Raccourcis clavier",
+    "image": "Dessin IA",
    "llm": "Modèle de langue",
    "provider": "Fournisseur de services d'IA",
    "proxy": "Proxy réseau",
@@ -236,6 +236,7 @@
  },
  "information": "Comunità e informazioni",
  "installPWA": "Installa l'applicazione del browser",
+  "labs": "Laboratorio",
  "lang": {
    "ar": "Arabo",
    "bg-BG": "bulgaro",
@@ -7,6 +7,14 @@
    "desc": "Cancella i messaggi e i file caricati della conversazione attuale",
    "title": "Cancella messaggi della conversazione"
  },
+  "deleteAndRegenerateMessage": {
+    "desc": "Elimina l'ultimo messaggio e rigeneralo",
+    "title": "Elimina e rigenera"
+  },
+  "deleteLastMessage": {
+    "desc": "Elimina l'ultimo messaggio",
+    "title": "Elimina l'ultimo messaggio"
+  },
  "desktop": {
    "openSettings": {
      "desc": "Apri la pagina delle impostazioni dell'app",
@@ -0,0 +1,14 @@
+{
+  "desc": "Qui troverai aggiornamenti periodici sulle nuove funzionalità che stiamo esplorando. Sentiti libero di provarle!",
+  "features": {
+    "groupChat": {
+      "desc": "Abilita la capacità di coordinamento in chat di gruppo con più agenti intelligenti.",
+      "title": "Chat di gruppo (multi-agente)"
+    },
+    "inputMarkdown": {
+      "desc": "Rendering in tempo reale del Markdown nell'area di input (grassetto, blocchi di codice, tabelle, ecc.).",
+      "title": "Rendering Markdown nell'input"
+    }
+  },
+  "title": "Laboratorio"
+}
@@ -284,11 +284,19 @@
          "placeholder": "Inserisci l'ID del modello, ad esempio gpt-4o o claude-3.5-sonnet",
          "title": "ID del modello"
        },
+        "imageOutput": {
+          "extra": "Questa configurazione abilita solo la capacità del modello di generare immagini. L'efficacia dipende interamente dal modello stesso. Si consiglia di testare autonomamente se il modello supporta la generazione di immagini.",
+          "title": "Supporta la generazione di immagini"
+        },
        "modalTitle": "Configurazione modello personalizzato",
        "reasoning": {
          "extra": "Questa configurazione attiverà solo la capacità di pensiero profondo del modello; l'effetto specifico dipende interamente dal modello stesso. Si prega di testare autonomamente se il modello possiede una capacità di pensiero profondo utilizzabile.",
          "title": "Supporto per il pensiero profondo"
        },
+        "search": {
+          "extra": "Questa configurazione abilita solo la capacità del motore di ricerca integrato del modello di connettersi a Internet. La disponibilità di questa funzione dipende dal modello stesso. Si consiglia di testare autonomamente se il motore di ricerca integrato è utilizzabile.",
+          "title": "Supporta la ricerca online"
+        },
        "tokens": {
          "extra": "Imposta il numero massimo di token supportati dal modello",
          "title": "Finestra di contesto massima",
@@ -309,6 +317,10 @@
          "placeholder": "Seleziona il tipo di modello",
          "title": "Tipo di modello"
        },
+        "video": {
+          "extra": "Questa configurazione abilita solo la funzione di riconoscimento video nell'applicazione. La disponibilità del riconoscimento dipende interamente dal modello stesso. Si consiglia di testare autonomamente se il modello supporta il riconoscimento video.",
+          "title": "Supporta il riconoscimento video"
+        },
        "vision": {
          "extra": "Questa configurazione abiliterà solo la configurazione di caricamento immagini nell'app, la disponibilità di riconoscimento dipende interamente dal modello stesso, testare autonomamente la disponibilità di riconoscimento visivo di questo modello.",
          "title": "Supporto per riconoscimento visivo"
@@ -704,6 +704,9 @@
  "azure-DeepSeek-R1-0528": {
    "description": "Distribuito e fornito da Microsoft; il modello DeepSeek R1 ha subito un aggiornamento minore, la versione attuale è DeepSeek-R1-0528. Nell'ultimo aggiornamento, DeepSeek R1 ha migliorato significativamente la profondità di inferenza e la capacità di deduzione aumentando le risorse computazionali e introducendo meccanismi di ottimizzazione algoritmica nella fase post-allenamento. Questo modello eccelle in vari benchmark come matematica, programmazione e logica generale, con prestazioni complessive vicine a modelli leader come O3 e Gemini 2.5 Pro."
  },
+  "baichuan-m2-32b": {
+    "description": "Baichuan M2 32B è un modello esperto ibrido sviluppato da Baichuan Intelligence, dotato di potenti capacità di ragionamento."
+  },
  "baichuan/baichuan2-13b-chat": {
    "description": "Baichuan-13B è un modello di linguaggio open source sviluppato da Baichuan Intelligence, con 13 miliardi di parametri, che ha ottenuto i migliori risultati nella sua categoria in benchmark autorevoli sia in cinese che in inglese."
  },
@@ -728,12 +731,6 @@
  "charglm-4": {
    "description": "CharGLM-4 è progettato per il gioco di ruolo e la compagnia emotiva, supportando una memoria multi-turno ultra-lunga e dialoghi personalizzati, con ampie applicazioni."
  },
-  "chatglm3": {
-    "description": "ChatGLM3 è un modello a sorgente chiusa sviluppato da Zhipu AI e dal laboratorio KEG di Tsinghua. Dopo un pre-addestramento su una vasta quantità di identificatori cinesi e inglesi e un addestramento allineato alle preferenze umane, rispetto alla prima generazione di modelli, ha ottenuto miglioramenti del 16%, 36% e 280% rispettivamente in MMLU, C-Eval e GSM8K, e ha raggiunto il vertice della classifica C-Eval per compiti in cinese. È adatto a scenari che richiedono un alto livello di conoscenza, capacità di ragionamento e creatività, come la stesura di testi pubblicitari, la scrittura di romanzi, la composizione di testi informativi e la generazione di codice."
-  },
-  "chatglm3-6b-base": {
-    "description": "ChatGLM3-6b-base è il modello di base open source più recente della serie ChatGLM, sviluppato da Zhipu con una dimensione di 6 miliardi di parametri."
-  },
  "chatgpt-4o-latest": {
    "description": "ChatGPT-4o è un modello dinamico, aggiornato in tempo reale per mantenere la versione più recente. Combina una potente comprensione e generazione del linguaggio, adatta a scenari di applicazione su larga scala, inclusi servizi clienti, educazione e supporto tecnico."
  },
@@ -938,6 +935,9 @@
  "deepseek-ai/DeepSeek-V3.1-Terminus": {
    "description": "DeepSeek-V3.1-Terminus è una versione aggiornata del modello V3.1 rilasciata da DeepSeek, concepita come un modello linguistico di grandi dimensioni con agenti ibridi. Questo aggiornamento mantiene le capacità originali del modello, concentrandosi sulla risoluzione dei problemi segnalati dagli utenti e sul miglioramento della stabilità. Migliora significativamente la coerenza linguistica, riducendo l'uso misto di cinese e inglese e la presenza di caratteri anomali. Il modello integra la “Modalità di pensiero” (Thinking Mode) e la “Modalità non di pensiero” (Non-thinking Mode), permettendo agli utenti di passare agevolmente tra le modalità tramite template di chat per adattarsi a diversi compiti. Come ottimizzazione importante, V3.1-Terminus potenzia le prestazioni degli agenti di codice (Code Agent) e di ricerca (Search Agent), rendendoli più affidabili nell'invocazione di strumenti e nell'esecuzione di compiti complessi multi-step."
  },
+  "deepseek-ai/DeepSeek-V3.2-Exp": {
+    "description": "Il modello DeepSeek V3.2 Exp adotta un'architettura di ragionamento ibrida, supportando sia la modalità riflessiva che quella non riflessiva."
+  },
  "deepseek-ai/deepseek-llm-67b-chat": {
    "description": "DeepSeek 67B è un modello avanzato addestrato per dialoghi ad alta complessità."
  },
@@ -947,6 +947,9 @@
  "deepseek-ai/deepseek-v3.1": {
    "description": "DeepSeek V3.1: modello di inferenza di nuova generazione che migliora le capacità di ragionamento complesso e di pensiero a catena, adatto a compiti che richiedono analisi approfondite."
  },
+  "deepseek-ai/deepseek-v3.1-terminus": {
+    "description": "DeepSeek V3.1: un modello di ragionamento di nuova generazione, con capacità avanzate di analisi complessa e pensiero concatenato, ideale per compiti che richiedono un'analisi approfondita."
+  },
  "deepseek-ai/deepseek-vl2": {
    "description": "DeepSeek-VL2 è un modello linguistico visivo a esperti misti (MoE) sviluppato sulla base di DeepSeekMoE-27B, che utilizza un'architettura MoE con attivazione sparsa, raggiungendo prestazioni eccezionali attivando solo 4,5 miliardi di parametri. Questo modello eccelle in vari compiti, tra cui domande visive, riconoscimento ottico dei caratteri, comprensione di documenti/tabelle/grafici e localizzazione visiva."
  },
@@ -1028,6 +1031,9 @@
  "deepseek-v3.1": {
    "description": "DeepSeek-V3.1 è il nuovo modello di ragionamento ibrido lanciato da DeepSeek, che supporta due modalità di ragionamento: con pensiero e senza pensiero, con un'efficienza di pensiero superiore rispetto a DeepSeek-R1-0528. Ottimizzato tramite post-addestramento, l'uso degli strumenti Agent e le prestazioni nelle attività degli agenti sono notevolmente migliorate. Supporta una finestra contestuale di 128k e una lunghezza massima di output di 64k token."
  },
+  "deepseek-v3.1-terminus": {
+    "description": "DeepSeek-V3.1-Terminus è una versione ottimizzata per dispositivi terminali del modello linguistico di grandi dimensioni sviluppato da DeepSeek."
+  },
  "deepseek-v3.1:671b": {
    "description": "DeepSeek V3.1: modello di inferenza di nuova generazione che migliora le capacità di ragionamento complesso e di pensiero a catena, adatto a compiti che richiedono analisi approfondite."
  },
@@ -1190,6 +1196,12 @@
  "ernie-4.0-turbo-8k-preview": {
    "description": "Il modello di linguaggio di grandi dimensioni di punta sviluppato internamente da Baidu, con prestazioni complessive eccezionali, ampiamente applicabile a scenari di compiti complessi in vari campi; supporta l'integrazione automatica con il plugin di ricerca di Baidu, garantendo l'aggiornamento delle informazioni nelle risposte. Rispetto a ERNIE 4.0, offre prestazioni migliori."
  },
+  "ernie-4.5-21b-a3b": {
+    "description": "ERNIE 4.5 21B A3B è un modello esperto ibrido sviluppato da Wenxin di Baidu, con eccellenti capacità di ragionamento e supporto multilingue."
+  },
+  "ernie-4.5-300b-a47b": {
+    "description": "ERNIE 4.5 300B A47B è un modello esperto ibrido su larga scala sviluppato da Wenxin di Baidu, con prestazioni eccezionali nel ragionamento."
+  },
  "ernie-4.5-8k-preview": {
    "description": "Il modello di grandi dimensioni Wenxin 4.5 è una nuova generazione di modello di base multimodale sviluppato autonomamente da Baidu, realizzato attraverso la modellazione congiunta di più modalità per ottenere un'ottimizzazione collaborativa, con eccellenti capacità di comprensione multimodale; presenta capacità linguistiche più avanzate, con miglioramenti significativi nella comprensione, generazione, logica e memoria, riducendo le illusioni e migliorando il ragionamento logico e le capacità di codifica."
  },
@@ -1446,7 +1458,7 @@
    "description": "GLM-4-0520 è l'ultima versione del modello, progettata per compiti altamente complessi e diversificati, con prestazioni eccezionali."
  },
  "glm-4-9b-chat": {
-    "description": "GLM-4-9B-Chat mostra elevate prestazioni in vari aspetti come semantica, matematica, ragionamento, codice e conoscenza. Ha anche la capacità di navigare in rete, eseguire codice, chiamare strumenti personalizzati e inferire testi lunghi. Supporta 26 lingue, tra cui giapponese, coreano e tedesco."
+    "description": "GLM-4-9B-Chat offre prestazioni elevate in semantica, matematica, ragionamento, programmazione e conoscenza. Supporta anche la navigazione web, l'esecuzione di codice, l'invocazione di strumenti personalizzati e il ragionamento su testi lunghi. Supporta 26 lingue, tra cui giapponese, coreano e tedesco."
  },
  "glm-4-air": {
    "description": "GLM-4-Air è una versione economica, con prestazioni simili a GLM-4, che offre velocità elevate a un prezzo accessibile."
@@ -1529,6 +1541,9 @@
  "glm-zero-preview": {
    "description": "GLM-Zero-Preview possiede potenti capacità di ragionamento complesso, eccellendo nei campi del ragionamento logico, della matematica e della programmazione."
  },
+  "glm4.6:355b": {
+    "description": "Il nuovo modello di punta di Zhipu, GLM-4.6 (355B), supera ampiamente la generazione precedente in codifica avanzata, elaborazione di testi lunghi, capacità di ragionamento e agenti intelligenti. In particolare, le sue prestazioni nella programmazione sono allineate a Claude Sonnet 4, rendendolo uno dei migliori modelli di coding in Cina."
+  },
  "google/gemini-2.0-flash": {
    "description": "Gemini 2.0 Flash offre funzionalità di nuova generazione e miglioramenti, inclusa velocità eccezionale, uso integrato di strumenti, generazione multimodale e una finestra di contesto di 1 milione di token."
  },
@@ -1730,14 +1745,23 @@
  "gpt-5-nano": {
    "description": "Versione più veloce ed economica di GPT-5. Perfetta per scenari applicativi che richiedono risposte rapide e sono sensibili ai costi."
  },
+  "gpt-5-pro": {
+    "description": "GPT-5 Pro utilizza maggiori risorse computazionali per un'elaborazione più profonda, offrendo risposte sempre più accurate e pertinenti."
+  },
  "gpt-audio": {
    "description": "GPT Audio è un modello di chat universale per input e output audio, supporta l'uso di I/O audio nell'API Chat Completions."
  },
  "gpt-image-1": {
    "description": "Modello nativo multimodale di generazione immagini di ChatGPT"
  },
+  "gpt-image-1-mini": {
+    "description": "Una versione più economica di GPT Image 1, con supporto nativo per input testuali e visivi e generazione di output in formato immagine."
+  },
  "gpt-oss-120b": {
-    "description": "GPT-OSS-120B MXFP4: struttura Transformer quantizzata, mantiene prestazioni elevate anche con risorse limitate."
+    "description": "È necessaria una richiesta per accedere a questo modello. GPT-OSS-120B è un modello linguistico open source su larga scala sviluppato da OpenAI, con potenti capacità di generazione testuale."
+  },
+  "gpt-oss-20b": {
+    "description": "È necessaria una richiesta per accedere a questo modello. GPT-OSS-20B è un modello linguistico open source di medie dimensioni sviluppato da OpenAI, con capacità di generazione testuale efficienti."
  },
  "gpt-oss:120b": {
    "description": "GPT-OSS 120B è un modello linguistico open source di grandi dimensioni rilasciato da OpenAI, che utilizza la tecnologia di quantizzazione MXFP4, rappresentando un modello di punta. Richiede un ambiente con più GPU o una workstation ad alte prestazioni per l'esecuzione, offrendo prestazioni eccellenti in ragionamenti complessi, generazione di codice e gestione multilingue, supportando chiamate di funzione avanzate e integrazione di strumenti."
@@ -1748,9 +1772,6 @@
  "gpt-realtime": {
    "description": "Modello universale in tempo reale, supporta input e output testuali e audio in tempo reale, oltre a input di immagini."
  },
-  "grok-2-1212": {
-    "description": "Questo modello ha migliorato l'accuratezza, il rispetto delle istruzioni e le capacità multilingue."
-  },
  "grok-2-image-1212": {
    "description": "Il nostro ultimo modello di generazione immagini può creare immagini vivide e realistiche basate su prompt testuali. Eccelle nella generazione di immagini per marketing, social media e intrattenimento."
  },
@@ -1760,15 +1781,9 @@
  "grok-3": {
    "description": "Modello di punta, eccelle in estrazione dati, programmazione e sintesi testuale per applicazioni aziendali, con profonda conoscenza nei settori finanziario, medico, legale e scientifico."
  },
-  "grok-3-fast": {
-    "description": "Modello di punta, eccelle in estrazione dati, programmazione e sintesi testuale per applicazioni aziendali, con profonda conoscenza nei settori finanziario, medico, legale e scientifico."
-  },
  "grok-3-mini": {
    "description": "Modello leggero che riflette prima di rispondere. Veloce e intelligente, adatto a compiti logici che non richiedono conoscenze di dominio profonde, con tracciamento del processo di pensiero originale."
  },
-  "grok-3-mini-fast": {
-    "description": "Modello leggero che riflette prima di rispondere. Veloce e intelligente, adatto a compiti logici che non richiedono conoscenze di dominio profonde, con tracciamento del processo di pensiero originale."
-  },
  "grok-4": {
    "description": "Il nostro modello di punta più recente e potente, eccellente nell'elaborazione del linguaggio naturale, nel calcolo matematico e nel ragionamento — un vero campione versatile e completo."
  },
@@ -1851,7 +1866,7 @@
    "description": "L'ultima versione del modello di pensiero profondo multimodale t1-vision di Hunyuan, supporta catene di pensiero native multimodali e presenta miglioramenti completi rispetto alla versione predefinita della generazione precedente."
  },
  "hunyuan-t1-vision-20250916": {
-    "description": "Modello di comprensione multimodale Hunyuan con pensiero profondo, supporta catene di pensiero native multimodali estese, eccelle nel gestire vari scenari di ragionamento visivo e migliora significativamente rispetto al modello Quick Think nei problemi scientifici."
+    "description": "L'ultima versione del modello visivo Hunyuan t1-vision offre un pensiero visivo profondo. Rispetto alla versione precedente, presenta miglioramenti completi in compiti come domande e risposte generali su immagini e testi, localizzazione visiva, OCR, grafici, risoluzione di problemi da foto e creazione visiva, con un'ottimizzazione significativa per l'inglese e le lingue meno diffuse."
  },
  "hunyuan-turbo": {
    "description": "Anteprima della nuova generazione di modelli di linguaggio di Hunyuan, utilizza una nuova struttura di modello ibrido di esperti (MoE), con una maggiore efficienza di inferenza e prestazioni superiori rispetto a hunyuan-pro."
@@ -1964,6 +1979,9 @@
  "kimi-k2-0905-preview": {
    "description": "Il modello kimi-k2-0905-preview ha una lunghezza di contesto di 256k, con capacità di Agentic Coding più forti, una migliore estetica e praticità del codice frontend, e una migliore comprensione del contesto."
  },
+  "kimi-k2-instruct": {
+    "description": "Kimi K2 Instruct è un modello linguistico di grandi dimensioni sviluppato da Moonshot AI, con capacità avanzate di gestione di contesti molto estesi."
+  },
  "kimi-k2-turbo-preview": {
    "description": "kimi-k2 è un modello di base con architettura MoE che offre potenti capacità di programmazione e di agent, con 1T di parametri totali e 32B di parametri attivi. Nei benchmark delle principali categorie — ragionamento su conoscenze generali, programmazione, matematica e agent — il modello K2 supera gli altri modelli open source più diffusi."
  },
@@ -1985,9 +2003,6 @@
  "lite": {
    "description": "Spark Lite è un modello di linguaggio di grandi dimensioni leggero, con latenza estremamente bassa e capacità di elaborazione efficiente, completamente gratuito e aperto, supporta funzionalità di ricerca online in tempo reale. La sua caratteristica di risposta rapida lo rende eccellente per applicazioni di inferenza su dispositivi a bassa potenza e per il fine-tuning dei modelli, offrendo agli utenti un'ottima efficienza dei costi e un'esperienza intelligente, soprattutto nei contesti di domande e risposte, generazione di contenuti e ricerca."
  },
-  "llama-2-7b-chat": {
-    "description": "Llama2 è una serie di modelli linguistici di grandi dimensioni (LLM) sviluppati e resi open source da Meta. Questa serie comprende modelli generativi di testo pre-addestrati e finetunati, con dimensioni che variano da 7 miliardi a 70 miliardi di parametri. Sul piano architettonico, Llama2 è un modello linguistico autoregressivo che utilizza un'architettura di trasformatore ottimizzata. Le versioni aggiornate utilizzano il fine-tuning supervisionato (SFT) e l'apprendimento per rinforzo con feedback umano (RLHF) per allineare le preferenze di utilità e sicurezza umane. Llama2 supera Llama in diverse basi di dati accademiche, fornendo ispirazione per la progettazione e lo sviluppo di molti altri modelli."
-  },
  "llama-3.1-70b-versatile": {
    "description": "Llama 3.1 70B offre capacità di ragionamento AI più potenti, adatto per applicazioni complesse, supporta un'elaborazione computazionale elevata garantendo efficienza e precisione."
  },
@@ -2012,8 +2027,8 @@
  "llama-3.2-vision-instruct": {
    "description": "Il modello Llama 3.2-Vision istruito è ottimizzato per il riconoscimento visivo, l' inferenza di immagini, la descrizione di immagini e la risposta a domande comuni relative a immagini."
  },
-  "llama-3.3-70b-instruct": {
-    "description": "Llama 3.3 è il modello di linguaggio open source multilingue più avanzato della serie Llama, che offre prestazioni paragonabili a un modello da 405B a un costo estremamente ridotto. Basato su una struttura Transformer, migliora l'utilità e la sicurezza attraverso il fine-tuning supervisionato (SFT) e l'apprendimento per rinforzo con feedback umano (RLHF). La sua versione ottimizzata per le istruzioni è progettata per dialoghi multilingue e supera molti modelli di chat open source e chiusi in vari benchmark di settore. La data di scadenza delle conoscenze è dicembre 2023."
+  "llama-3.3-70b": {
+    "description": "Llama 3.3 70B: un modello Llama di medie-grandi dimensioni che bilancia capacità di ragionamento e throughput."
  },
  "llama-3.3-70b-versatile": {
    "description": "Meta Llama 3.3 è un modello linguistico di grandi dimensioni multilingue (LLM) da 70B (input/output testuale) con pre-addestramento e aggiustamento delle istruzioni. Il modello di testo puro di Llama 3.3 è ottimizzato per casi d'uso di dialogo multilingue e supera molti modelli di chat open-source e chiusi nei benchmark di settore comuni."
@@ -2021,6 +2036,12 @@
  "llama-3.3-instruct": {
    "description": "Il modello Llama 3.3 per l'addestramento di istruzioni è stato ottimizzato per scenari di conversazione, superando molti modelli di chat open source esistenti nelle comuni benchmark settoriali."
  },
+  "llama-4-maverick-17b-128e-instruct": {
+    "description": "Llama 4 Maverick: un modello ad alte prestazioni della serie Llama, ideale per ragionamento avanzato, risoluzione di problemi complessi e compiti di esecuzione di istruzioni."
+  },
+  "llama-4-scout-17b-16e-instruct": {
+    "description": "Llama 4 Scout: un modello ad alte prestazioni della serie Llama, adatto a scenari che richiedono elevato throughput e bassa latenza."
+  },
  "llama3-70b-8192": {
    "description": "Meta Llama 3 70B offre capacità di elaborazione della complessità senza pari, progettato su misura per progetti ad alta richiesta."
  },
@@ -2036,6 +2057,9 @@
  "llama3.1": {
    "description": "Llama 3.1 è il modello leader lanciato da Meta, supporta fino a 405B parametri, applicabile a dialoghi complessi, traduzione multilingue e analisi dei dati."
  },
+  "llama3.1-8b": {
+    "description": "Llama 3.1 8B: una variante Llama leggera e a bassa latenza, ideale per inferenza online leggera e scenari interattivi."
+  },
  "llama3.1:405b": {
    "description": "Llama 3.1 è il modello leader lanciato da Meta, supporta fino a 405B parametri, applicabile a dialoghi complessi, traduzione multilingue e analisi dei dati."
  },
@@ -2067,7 +2091,7 @@
    "description": "Spark Max 32K è dotato di una grande capacità di elaborazione del contesto, con una comprensione del contesto e capacità di ragionamento logico superiori, supporta input testuali fino a 32K token, adatto per la lettura di documenti lunghi, domande e risposte su conoscenze private e altri scenari."
  },
  "megrez-3b-instruct": {
-    "description": "Megrez-3B-Instruct è un modello di linguaggio grande completamente addestrato da Wuwen Xin Qiong. Megrez-3B-Instruct mira a creare una soluzione di intelligenza per dispositivi finali, rapida, compatta e facile da usare, attraverso il concetto di collaborazione hardware-software."
+    "description": "Megrez 3B Instruct è un modello efficiente a basso numero di parametri sviluppato da Wuwen Xinqiong."
  },
  "meta-llama-3-70b-instruct": {
    "description": "Un potente modello con 70 miliardi di parametri che eccelle nel ragionamento, nella codifica e nelle ampie applicazioni linguistiche."
@@ -2624,6 +2648,12 @@
  "pro-128k": {
    "description": "Spark Pro 128K è dotato di una capacità di elaborazione del contesto eccezionale, in grado di gestire fino a 128K di informazioni contestuali, particolarmente adatto per l'analisi completa e la gestione di associazioni logiche a lungo termine in contenuti lunghi, fornendo una logica fluida e coerente e un supporto variegato per le citazioni in comunicazioni testuali complesse."
  },
+  "pro-deepseek-r1": {
+    "description": "Modello dedicato ai servizi aziendali, con supporto per servizi concorrenti."
+  },
+  "pro-deepseek-v3": {
+    "description": "Modello dedicato ai servizi aziendali, con supporto per servizi concorrenti."
+  },
  "qvq-72b-preview": {
    "description": "Il modello QVQ è un modello di ricerca sperimentale sviluppato dal team Qwen, focalizzato sul miglioramento delle capacità di ragionamento visivo, in particolare nel campo del ragionamento matematico."
  },
@@ -2633,6 +2663,12 @@
  "qvq-plus": {
    "description": "Modello di ragionamento visivo. Supporta input visivi e output a catena di pensiero. Versione plus lanciata dopo il modello qvq-max, con velocità di ragionamento più elevata e un equilibrio migliore tra prestazioni ed efficienza rispetto a qvq-max."
  },
+  "qwen-3-32b": {
+    "description": "Qwen 3 32B: eccellente nelle attività multilingue e di codifica, adatto per applicazioni produttive di media scala."
+  },
+  "qwen-3-coder-480b": {
+    "description": "Qwen 3 Coder 480B: un modello con contesto esteso progettato per la generazione di codice e compiti di programmazione complessi."
+  },
  "qwen-coder-plus": {
    "description": "Modello di codice Tongyi Qianwen."
  },
@@ -2753,12 +2789,6 @@
  "qwen2": {
    "description": "Qwen2 è la nuova generazione di modelli di linguaggio su larga scala di Alibaba, supporta prestazioni eccellenti per esigenze applicative diversificate."
  },
-  "qwen2-72b-instruct": {
-    "description": "Qwen2 è la nuova serie di modelli linguistici di grande dimensione sviluppata dal team Qwen. Si basa sull'architettura Transformer e utilizza funzioni di attivazione SwiGLU, bias QKV dell'attenzione, attenzione a query di gruppo, una combinazione di attenzione a finestra scorrevole e attenzione completa. Inoltre, il team Qwen ha migliorato il tokenizzatore per adattarlo a diverse lingue naturali e codici."
-  },
-  "qwen2-7b-instruct": {
-    "description": "Qwen2 è la nuova serie di modelli linguistici di grandi dimensioni presentata dal team Qwen. Si basa sull'architettura Transformer e utilizza funzioni di attivazione SwiGLU, bias QKV dell'attenzione (attention QKV bias), attenzione a query di gruppo (group query attention), una combinazione di attenzione a finestra scorrevole (sliding window attention) e attenzione completa. Inoltre, il team Qwen ha migliorato il tokenizzatore per adattarlo a diverse lingue naturali e codici."
-  },
  "qwen2.5": {
    "description": "Qwen2.5 è la nuova generazione di modelli linguistici su larga scala di Alibaba, che supporta esigenze applicative diversificate con prestazioni eccellenti."
  },
@@ -2897,6 +2927,12 @@
  "qwen3-next-80b-a3b-thinking": {
    "description": "Modello open source di nuova generazione basato su Qwen3 in modalità riflessiva, con migliorata capacità di seguire le istruzioni rispetto alla versione precedente (Tongyi Qianwen 3-235B-A22B-Thinking-2507) e risposte di sintesi più concise."
  },
+  "qwen3-vl-235b-a22b-instruct": {
+    "description": "Qwen3 VL 235B A22B Instruct è un modello multimodale sviluppato da Tongyi Qianwen, che supporta la comprensione visiva e il ragionamento."
+  },
+  "qwen3-vl-235b-a22b-thinking": {
+    "description": "Qwen3 VL 235B A22B Thinking è un modello di ragionamento multimodale sviluppato da Tongyi Qianwen, che supporta la comprensione visiva e il ragionamento."
+  },
  "qwen3-vl-plus": {
    "description": "Tongyi Qianwen VL è un modello di generazione testuale con capacità di comprensione visiva (immagini). Non solo può eseguire OCR (riconoscimento del testo nelle immagini), ma anche riassumere e ragionare ulteriormente, ad esempio estraendo attributi da foto di prodotti o risolvendo problemi basati su immagini di esercizi."
  },
@@ -3014,6 +3050,9 @@
  "step-r1-v-mini": {
    "description": "Questo modello è un grande modello di inferenza con potenti capacità di comprensione delle immagini, in grado di gestire informazioni visive e testuali, producendo contenuti testuali dopo un profondo ragionamento. Questo modello si distingue nel campo del ragionamento visivo, mostrando anche capacità di ragionamento matematico, codice e testo di primo livello. La lunghezza del contesto è di 100k."
  },
+  "step3": {
+    "description": "Step3 è un modello multimodale sviluppato da StepStar, con potenti capacità di comprensione visiva."
+  },
  "stepfun-ai/step3": {
    "description": "Step3 è un modello di inferenza multimodale all'avanguardia rilasciato da StepFun (阶跃星辰). È costruito su un'architettura Mixture of Experts (MoE) con 321 miliardi di parametri totali e 38 miliardi di parametri di attivazione. Il modello adotta un design end-to-end, pensato per minimizzare i costi di decodifica e al contempo offrire prestazioni di primo livello nel ragionamento visivo-linguistico. Grazie al design sinergico che combina Multi-Matrix Factorized Attention (MFA) e il disaccoppiamento attenzione-FFN (AFD), Step3 mantiene un'elevata efficienza sia sui più potenti acceleratori flagship sia su quelli di fascia bassa. Durante la fase di pre-addestramento, Step3 ha elaborato oltre 20T di token testuali e 4T di token misti immagine-testo, coprendo più di dieci lingue. Il modello ha raggiunto livelli leader tra i modelli open source in numerosi benchmark, inclusi matematica, codice e scenari multimodali."
  },
@@ -3137,9 +3176,6 @@
  "xai/grok-4": {
    "description": "Il modello di punta più recente e migliore di xAI, che offre prestazioni senza pari in linguaggio naturale, matematica e ragionamento — il perfetto tuttofare."
  },
-  "yi-1.5-34b-chat": {
-    "description": "Yi-1.5 è una versione aggiornata di Yi. Utilizza un corpus di alta qualità di 500B token per il pre-addestramento continuo di Yi e viene finetunato su 3M campioni di micro-tuning diversificati."
-  },
  "yi-large": {
    "description": "Un nuovo modello con centinaia di miliardi di parametri, offre capacità eccezionali di risposta e generazione di testi."
  },
@@ -32,6 +32,9 @@
  "bfl": {
    "description": "Laboratorio di ricerca all'avanguardia nell'intelligenza artificiale, che costruisce l'infrastruttura visiva del domani."
  },
+  "cerebras": {
+    "description": "Cerebras è una piattaforma di inferenza AI basata sul suo sistema dedicato CS-3, progettata per offrire il servizio LLM più veloce al mondo, con risposta in tempo reale e un'elevata capacità di elaborazione. È pensata per eliminare la latenza e accelerare i flussi di lavoro AI complessi, come la generazione di codice in tempo reale e le attività degli agenti."
+  },
  "cloudflare": {
    "description": "Esegui modelli di machine learning alimentati da GPU serverless sulla rete globale di Cloudflare."
  },
--- a/Show More
+++ b/Show More