Merge dev branch by oobabooga · Pull Request #7567 · oobabooga/textgen

oobabooga · 2026-05-17T00:24:18Z

No description provided.

…ion)

This reverts commit 3c4a140.

When using the continue function, add_generation_prompt was set to False, bypassing any template logic that depends on it. This caused missing tokens or headers that some models inject at the start of an assistant turn. Gemma 4 was particularly sensitive to this, producing garbled output due to the missing thought channel header (<|channel>thought\n<channel|>). Fix by popping the last incomplete assistant message and re-rendering the prompt with add_generation_prompt=True, then appending the partial content afterward. This ensures the prompt is structurally identical to normal generation regardless of the model's template. GPT-OSS and Seed-OSS thinking block handling is preserved via the existing fake-message approach, as those models manage thinking content differently.

Copilot

Pull request overview

This PR merges development-branch updates across the desktop/Electron portable experience, chat/tool UI behavior, model/mmproj discovery, token display accounting, and documentation.

Changes:

Adds Electron-specific settings such as model directory browsing, spellcheck toggling, update checking, and preload packaging.
Refines chat/tool rendering, web search snippets, thinking/tool visibility, and token count display.
Updates mmproj discovery/loading, dependencies, portable workflows, and OpenAI API docs.

Reviewed changes

Copilot reviewed 33 out of 33 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`user_data/tools/web_search.py`	Returns snippets in web search tool results.
`user_data/tools/fetch_webpage.py`	Fetches page content without link extraction.
`server.py`	Persists Electron model directory settings.
`requirements/full/requirements.txt`	Updates selected dependency and wheel versions.
`README.md`	Updates installation wording.
`modules/web_search.py`	Adds snippets to search results and simplifies content fetching.
`modules/utils.py`	Adds mmproj helpers and expands mmproj discovery.
`modules/ui.py`	Includes Electron-only settings in saved interface state.
`modules/ui_session.py`	Adds Electron model directory UI and portable update checker.
`modules/ui_model_menu.py`	Updates mmproj dropdown text and hides Jinja controls in chat mode.
`modules/ui_chat.py`	Reorganizes chat controls and adds token display refresh.
`modules/text_generation.py`	Tracks prompt/completion token counts for HF generation.
`modules/tensorrt_llm.py`	Tracks token counts for TensorRT-LLM streaming.
`modules/shared.py`	Adds Electron detection and spellcheck setting.
`modules/models_settings.py`	Auto-detects sibling mmproj files for llama.cpp models.
`modules/llama_cpp_server.py`	Tracks completion tokens and resolves mmproj paths from model folders.
`modules/html_generator.py`	Adds structured web search result rendering and tool-call spinner markup.
`modules/exllamav3.py`	Tracks ExLlamaV3 completion token counts.
`modules/chat.py`	Updates thinking continuation handling, token display, and active chat tracking.
`js/main.js`	Adjusts chat-tab character menu placement and Electron spellcheck toggle.
`js/global_scope_js.js`	Preserves open/closed thinking block state during morphdom updates.
`docs/12 - OpenAI API.md`	Reorders and expands API examples, including tool calling.
`docs/01 - Chat Tab.md`	Renames dummy message/reply documentation.
`desktop/textgen.bat`	Enables UTF-8 mode for Windows launcher.
`desktop/preload.js`	Exposes Electron directory picker bridge.
`desktop/main.js`	Adds preload, spellcheck context menu, external-link handling, and directory IPC.
`css/main.css`	Adds styles for settings buttons, spinner, and web search cards.
`.github/workflows/build-portable-release.yml`	Includes preload script in portable build packaging.
`.github/workflows/build-portable-release-vulkan.yml`	Includes preload script in Vulkan portable packaging.
`.github/workflows/build-portable-release-rocm.yml`	Includes preload script in ROCm portable packaging.
`.github/workflows/build-portable-release-ik.yml`	Includes preload script in IK portable packaging.
`.github/workflows/build-portable-release-ik-cuda.yml`	Includes preload script in IK CUDA portable packaging.
`.github/workflows/build-portable-release-cuda.yml`	Includes preload script in CUDA portable packaging.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+def apply_model_dir(value):
+    shared.args.model_dir = value
+    if Path(value).is_dir():
+        shared.user_config = shared.load_user_config()


    output = input_ids[0]
+    shared.model.last_prompt_token_count = input_ids.shape[-1]
+    shared.model.last_completion_token_count = 0
    if state['auto_max_new_tokens']:
        generate_params['max_new_tokens'] = state['truncation_length'] - input_ids.shape[-1]


+        title = html.escape(r['title'])
+        url = html.escape(r['url'])
+        snippet = html.escape(r.get('snippet', ''))
+        cards.append(
+            f'<div class="web-search-result">'
+            f'<a class="web-search-title" href="{url}" target="_blank" rel="noopener noreferrer">{title}</a>'


oobabooga and others added 30 commits May 7, 2026 14:15

Switch flash-attn wheels from kingbri1 to mjun0812

95c0a26

Update README

f29e220

Update exllamav3 to 0.0.34

9a098a9

Add snippet support to the web_search tool (closes #7548)

f4f556a

Downgrade xformers to make exllamav3 0.0.34 work

b863696

Remove backend="duckduckgo" from ddgs (#7548 (comment))

79630e9

Make web_search tool call results pretty

13f5d37

Electron: Add right-click context menu for copying text

66f01d6

docs: Small API example change

2c254cb

Electron: Add a folder picker for the models directory

47fdee9

Add missing file

6a1a959

Treat negative ctx-size as auto

66c3c49

Detect mmproj files in the models folder (simplifies LM Studio migrat…

a711849

…ion)

Fix streaming output leaking across chats (closes #7555)

6301419

Also fix streaming leak across all other chat actions

58774c8

Small simplifications

1e8136d

fix(win): set PYTHONUTF8 for non-ASCII locale Windows compatibility (#…

646f10d

…7560)

Rename "Send dummy message/reply" to "Insert user/assistant message"

0f88365

Auto-select sibling mmproj when loading a model (closes #7564)

bed909d

Keep web search blocks closed when user closes them mid-stream

82d931e

Reorder right sidebar: Mode/Character/Chat style on top

6033be5

Hide reasoning and tools controls in chat mode

2a27cea

Polish character dropdown in chat tab

626b089

Fix Show controls text style in hover menu

5270943

Soften slider and checkbox label colors in light theme

3c4a140

Show live context size while generating

8e34c7b

Revert "Soften slider and checkbox label colors in light theme"

dbf9220

This reverts commit 3c4a140.

docs: reorder API examples by importance

ceade2e

Electron: Add "Check for updates" button in the Session tab

6378c5c

Improve the looks of the Session tab

f327da7

oobabooga and others added 4 commits May 16, 2026 10:23

Tighten spacing between dropdowns and refresh buttons

bf6c8cd

Electron: Add spellcheck toggle in the Session tab (closes #7550)

be7f3a2

Fix continue-mode regressions across template families

d803e29

oobabooga requested a review from Copilot May 17, 2026 00:24

Copilot started reviewing on behalf of oobabooga May 17, 2026 00:24 View session

Change dependabot to target the main branch

59c67f9

Copilot AI reviewed May 17, 2026

View reviewed changes

oobabooga added 3 commits May 16, 2026 17:34

Electron: Validate model_dir path before applying it

aa7d6bf

UI: Fix token count not being set in non-streaming mode

4c16b94

UI: Improve web search security by rejecting non-HTTP links

dcbb323

oobabooga merged commit 58e4406 into main May 17, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge dev branch#7567

Merge dev branch#7567
oobabooga merged 38 commits into
mainfrom
dev

oobabooga commented May 17, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

oobabooga commented May 17, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants