isparkclaw-webui

Author	SHA1	Message	Date
nesquena-hermes	9c69b646ff	feat(commands): /background, /btw slash commands + undo button + reasoning chip Rebased onto master after #931 (aux title routing) to resolve streaming.py conflict. All changes from both PRs are cleanly integrated. 2088 tests passing (2065 master + 23 from #931). Co-authored-by: bergeouss <bergeouss@gmail.com>	2026-04-24 01:24:51 +00:00
nesquena-hermes	14a1924796	fix(streaming): respect auxiliary.title_generation config for session titles - _aux_title_configured(): returns True when provider/model/base_url is set - _aux_title_timeout(): reads configured timeout, falls back to 15.0s default - _generate_llm_session_title_via_aux: use_agent_model kwarg preserves old behavior - Missing llm_invalid_aux fallback now triggers agent-model retry - 23 new tests in tests/test_title_aux_routing.py — all pass Co-authored-by: starship-s <starship-s@users.noreply.github.com>	2026-04-24 01:07:02 +00:00
nesquena-hermes	a3647570fb	fix: persist onboarding_completed for CLI-configured users on first chat_ready (#922 ) * fix: persist onboarding_completed for CLI-configured users on first chat_ready (v0.50.179, #921) Co-authored-by: bsgdigital * fix(onboarding): don't 500 the status endpoint if save_settings fails The #921 persist call `save_settings({"onboarding_completed": True})` in get_onboarding_status() raises if the settings.json write fails (read-only filesystem, disk full, permission error). That turns every /api/onboarding/status call into a 500 until the disk is writable, which is much worse UX than losing the persistence-across-restart guard. Wrapped in try/except so persistence becomes best-effort. The function still sets settings["onboarding_completed"] = True in memory on success, and `completed` reflects `config_auto_completed` on this request either way, so the user sees the right state even when the write fails — only the next-restart protection degrades. Added regression test that patches save_settings to raise OSError and asserts the endpoint still returns completed=True without raising. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> Co-authored-by: Nathan Esquenazi <nesquena@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 15:46:02 -07:00
nesquena-hermes	1011918d50	feat: add PWA support (manifest, service worker, install prompt) (#920 ) * feat: add PWA support (manifest, service worker, install prompt) (v0.50.178, #911) Co-authored-by: bsgdigital Closes #685 * fix(sw): await caches.match() before `\|\| fallback` so offline HTML actually shows The offline-navigation fallback was dead code: return caches.match('./') \|\| new Response('<html>...</html>', ...); `caches.match()` returns a Promise, and Promise objects are always truthy in a `\|\|` check — so the `new Response(...)` branch was never taken. On actual offline, `caches.match('./')` resolves to undefined (no cache hit for the root), the SW returns undefined, and the browser falls back to its own default offline page. The custom "Hermes requires a server connection" HTML was unreachable. Fix by threading the match through `.then()` so the resolved value (not the Promise object) feeds the `\|\|`: return caches.match('./').then((cached) => cached \|\| new Response(...)); Added 13 regression tests in tests/test_pwa_manifest_sw.py covering: - manifest.json validity + required PWA fields + icon existence - sw.js cache-version placeholder + API/stream bypass + correct offline pattern (explicitly rejects the broken `\|\| new Response` shape so it can't regress) - /manifest.json + /sw.js routes serve correct Content-Type, Cache-Control, Service-Worker-Allowed headers and inject WEBUI_VERSION - index.html links manifest, registers SW, has iOS PWA meta tags Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> Co-authored-by: Nathan Esquenazi <nesquena@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 15:14:21 -07:00
nesquena-hermes	1175ee363f	fix(models): duplicate dropdown entries, stale default model, lowercase injected label (#907 #908 #909 ) (#918 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-23 14:41:06 -07:00
nesquena-hermes	5b923a9502	fix: harden session persistence and per-session lock handling during streaming (v0.50.175, #910 ) (#910 ) Co-authored-by: starship-s Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-23 14:25:43 -07:00
nesquena-hermes	9dd6e3f338	fix(cancel): preserve partial streamed response on Stop Generation (#893 ) (#902 ) * fix(cancel): preserve partial streamed response on Stop Generation (#893) * docs(cancel): fix misleading comment — partial message is NOT _error=True The outer comment block claimed `_error=True so _sanitize_messages_for_api() strips it from future conversation history`, but the actual append call sets only `_partial=True` (correctly matching the inner comment six lines below and the PR description). Updated the outer comment to match reality so a future reader doesn't try to "fix" the code to match the wrong comment. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> Co-authored-by: Nathan Esquenazi <nesquena@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 11:16:59 -07:00
nesquena-hermes	4089972b09	fix(models): preserve @nous: prefix in settings + fix cross-namespace 404 for Nous (#895 #894 ) (#901 ) * fix(models): preserve @nous: prefix in settings + fix cross-namespace 404 for Nous (#895 #894) * fix(review): persist bare form for CLI compatibility + picker smart-match The PR persisted `@nous:anthropic/claude-opus-4.6` verbatim to config.yaml to make the Settings picker match its dropdown options (which carry the `@nous:` prefix after #885). That fixes the WebUI picker but introduces a cross-tool regression: hermes-agent's CLI reads `config.yaml -> model.default` directly and passes it to the provider API verbatim. For aggregator providers (Nous is one — see hermes_cli/model_normalize.py `_AGGREGATOR_PROVIDERS`), `normalize_model_for_provider` is skipped entirely (run_agent.py:887), so the literal `@nous:anthropic/...` string flows to the Nous API, which rejects it — breaking every user who runs `hermes` in the terminal right after saving via WebUI. Fix the tension at the picker rather than the persistence: the existing `_findModelInDropdown()` smart matcher already normalises both sides (lowercase, strip namespace prefix, dashes→dots) so a saved bare `anthropic/claude-opus-4.6` resolves to the `@nous:anthropic/claude-opus-4.6` option automatically. Applied this in panels.js via `_applyModelToDropdown()`. Changes: api/config.py revert the @-prefix preservation; persist the resolved bare/slash form (CLI-compatible) static/panels.js Settings picker uses _applyModelToDropdown() instead of raw `.value =` so saved bare forms still select the matching @nous: option tests test renamed + asserts bare persisted form; new test locks the smart-matcher contract This also improves behaviour for a dormant case not flagged in #895: a user who set their default via `hermes model X` and opens Settings for the first time used to see a blank picker (bare form vs prefixed options). Now the smart matcher finds the right option, so the "open Settings → save → bare form in config.yaml" round-trip is stable for both CLI- and WebUI-origin saves. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * chore: update CHANGELOG v0.50.171 — bare-form persistence + picker smart-match --------- Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> Co-authored-by: Nathan Esquenazi <nesquena@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 10:44:10 -07:00
bergeouss	cd01e4d5ba	feat(models): live-first model fetching for all OpenAI-compat providers (#892 ) * feat(models): live-first model fetching for all OpenAI-compat providers (#871) The WebUI model picker relied on hardcoded _PROVIDER_MODELS as primary source for providers like zai, minimax, mistralai, xai, openai-codex, deepseek, and gemini. These lists go stale — new models don't appear until someone manually updates the dict. Add an OpenAI-compat /v1/models fetch fallback in _handle_live_models() that fires when provider_model_ids() is unavailable or returns []. The resolution chain is now: 1. hermes_cli.provider_model_ids() (agent's live fetch) 2. Custom providers from config.yaml 3. Direct /v1/models fetch for known OpenAI-compat endpoints 4. Static _PROVIDER_MODELS as last-resort offline fallback Covers: zai, minimax, mistralai, xai, openai-codex, deepseek, gemini. Uses urllib (stdlib) — no new dependencies. Static lists remain as offline fallback so the UI always shows something. Closes #871 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * refactor(models): address review feedback on live fetch (#892) Five changes from nesquena-hermes review: 1. Move _OPENAI_COMPAT_ENDPOINTS to module level — avoid dict reconstruction per request 2. Document urllib blocking behavior — 8s timeout acceptable because server is threaded and frontend enriches in background 3. Add TODO comment for TTL-based caching follow-up 4. Remove openai-codex from endpoint map — same endpoint as base openai provider, already covered by provider_model_ids() 5. Restrict API key lookup to provider-scoped and model.api_key only — remove top-level api_key fallback to prevent cross-provider key leakage Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-23 09:45:46 -07:00
Pavol Biely	96c97c5e0e	fix: remove hardcoded chinese title heuristics (#887 ) * fix: remove hardcoded chinese title heuristics * fix: use english placeholder for non-latin fallback titles	2026-04-23 09:45:34 -07:00
nesquena-hermes	b82954ee70	feat(ui): session attention indicators — streaming spinner, unread dot, timestamps (#856 ) Closes #856. Co-authored-by: Frank Song <138988108+franksong2702@users.noreply.github.com> Reviewed-by: nesquena (709bd37 — test isolation fix also included)	2026-04-23 09:05:57 -07:00
nesquena-hermes	666d385c03	fix: Nous static models use @nous: prefix — v0.50.164 (#885 ) fix: Nous static models use @nous: prefix — v0.50.164 (#885) Follow-up to #854 / PR #870. The previous fix made Nous static IDs slash-prefixed and added a portal-guard branch to resolve_model_provider(). This tightens the static list to use the explicit @nous: prefix, matching the format of live-fetched models after ui.js's _fetchLiveModels() portal- prefix step. The @provider:model branch in resolve_model_provider() is more explicit and reliable than the portal-guard fallback. Both static and live-fetched paths now converge on the same resolver output — and as a side effect, the dedup check in _fetchLiveModels() now correctly identifies static entries as already present, eliminating duplicate entries in the dropdown for Nous users. Verified: all 29 Nous models in the browser dropdown carry @nous: prefix, routing confirmed correct via resolve_model_provider() for all 4 static IDs, 1941 tests passing. Closes #854.	2026-04-22 22:56:21 -07:00
nesquena-hermes	d39d30a213	fix: correct message ordering after task cancellation — v0.50.163 (#883 ) fix: correct message ordering after task cancellation — v0.50.163 (#883) Fixes the message-ordering glitch from #882: clicking Cancel while the agent is responding could cause a subsequent response to render above the "Task cancelled." marker. Root cause: the cancel handler pushed the marker only to local S.messages without persisting to the server. When the done event fired shortly after and replaced S.messages from server state, the marker disappeared from client state while the next response anchored to the server-authoritative position. Fix has three parts: - Server (cancel_stream): append Task cancelled. to session.messages with _error:True + timestamp, then save. _error ensures _sanitize_messages_for_api() strips it from conversation_history on the next agent turn, so the LLM never sees it as a prior assistant turn. Precedent: same flag used for the apperror marker at line 1343. - Client (SSE cancel handler): fetch /api/session instead of pushing locally (same pattern as the done handler). Falls back to local push if the fetch fails. - Tests: fix test window width for cancel handler (1200→dynamic); add two regression tests pinning _error flag and _sanitize invariant. 1941 tests passing. Co-authored-by: piliang <piliang1@jd.com>	2026-04-22 22:17:40 -07:00
Frank Song	62c56175b7	feat(workspaces): autocomplete trusted workspace paths — v0.50.162 (PR #880 by @franksong2702, closes #616 ) Adds GET /api/workspaces/suggest endpoint and autocomplete dropdown in the Spaces panel. Suggestions limited to trusted roots (home, saved workspaces, boot default). Keyboard nav, Tab completion, hidden dir support. Symlink-escape and dotdot-escape invariants locked by regression tests.	2026-04-23 02:35:58 +00:00
nesquena-hermes	0f1b232c12	fix(ci): eliminate test_set_key flakiness — v0.50.161 Root cause: test_profile_env_isolation.py and test_profile_path_security.py called sys.modules.pop() without restoring, poisoning subsequent tests. Fix: monkeypatch.delitem so pytest auto-restores. Also holds _ENV_LOCK for full I/O cycle in _write_env_file and creates .env at 0600 via os.open. Reviewed by Opus (no independent review needed — test/providers fix only).	2026-04-23 02:09:37 +00:00
nesquena-hermes	04b00065f9	feat: provider key management from Settings — v0.50.159 (PR #867 by @bergeouss, closes #586 ) New Providers tab in Settings lets users add/update/remove API keys without editing .env. Six review fixes applied. 18 tests.	2026-04-23 01:09:22 +00:00
nesquena-hermes	0a75b3f1d3	fix: Nous portal model IDs + portal provider routing guard — v0.50.157 (closes #854 ) Two bugs fixed: (1) _PROVIDER_MODELS["nous"] updated to slash-prefixed IDs that Nous API expects. (2) resolve_model_provider() now routes portal provider models through the portal (not OpenRouter) and preserves the full slash-prefixed model ID. 10 regression tests.	2026-04-22 23:05:27 +00:00
nesquena-hermes	3a63fe479e	fix(security): gate auto-install behind HERMES_WEBUI_AUTO_INSTALL=1 — v0.50.156 Breaking: auto_install_agent_deps() is now disabled by default. Set HERMES_WEBUI_AUTO_INSTALL=1 to re-enable. New _trusted_agent_dir() checks ownership and permission bits. Addresses #842 by @tomaioo.	2026-04-22 20:49:28 +00:00
nesquena-hermes	96cb880a12	fix: Honcho per-session uses stable session ID across WebUI turns — v0.50.155 (closes #855 ) Pass gateway_session_key=session_id to AIAgent from streaming.py so Honcho per-session strategy pins to stable WebUI session ID rather than creating a new Honcho session each turn.	2026-04-22 20:48:52 +00:00
nesquena-hermes	256b3fbbdf	fix: image_generate renders inline + auto-title strips thinking preamble — v0.50.152 (closes #853 , #857 ) MEDIA: restore renders all https:// URLs as img (closes #853). _strip_thinking_markup strips Qwen3 plain-text reasoning preambles (closes #857).	2026-04-22 20:20:01 +00:00
nesquena-hermes	5fa731ea4a	release: v0.50.151 — credential_pool provider detection + Ollama Cloud support (PR #820 by @starship-s) Surfaces providers added via credential_pool in the model dropdown. Ambient gh-cli tokens suppressed. _apply_provider_prefix helper extracted. Ollama Cloud display name + dynamic model list. looksLikeBareOllamaId heuristic tightened. Test isolation fixed. PR #820 by @starship-s.	2026-04-22 20:18:02 +00:00
Miguel Tavares	f42f1c69ca	fix: correct webui profile switching state — v0.50.150 (PR #849 by @migueltavares) Three related profile-switching fixes: - Always persist hermes_profile=default cookie when switching back to default (was being cleared with max-age=0, causing fallback to process-global profile) - Replace undefined updateWorkspaceChip() with syncTopbar() in the sessionInProgress branch of switchToProfile() - Make sidebar/dropdown active-profile rendering prefer S.activeProfile client state when available, with safe fallback Tests: 1854 passing.	2026-04-22 16:27:01 +00:00
Frank Song	418d77443c	fix: keep GET /api/session side-effect free for stale models — v0.50.149 (PR #848 by @franksong2702) Replace _normalize_session_model_in_place() on the GET /api/session read path with a read-only _resolve_effective_session_model_for_display() that returns the effective display model without writing it back to disk or the session index. Closes #845. Tests: 1856 passing.	2026-04-22 16:26:48 +00:00
Frank Song	13dbd818c9	fix: prune stale session index entries after session-id rotation — v0.50.148 (PR #847 by @franksong2702) Prune ghost _index.json rows whose backing session file no longer exists, on both incremental index writes and all_sessions() reads. Fixes duplicate session entries after session-id rotation (e.g. context compression). Also pre-snapshots in_memory_ids under a single LOCK acquisition in all_sessions() rather than one per row. Closes #846. Review additions: optimised lock pattern in all_sessions() (one LOCK acquisition instead of N). Tests: 1856 passing.	2026-04-22 16:26:38 +00:00
nesquena-hermes	1239129ae2	fix(models): stale cross-provider model no longer shows as unavailable in picker (closes #829 ) * fix(models): stale cross-provider model no longer shows as unavailable in picker Two bugs allowed an openai/gpt-5.4-mini stale session model to appear as '(unavailable)' under a custom provider group for users who never configured OpenAI (#829). Backend (api/routes.py): _resolve_compatible_session_model() had a blanket early-return for active_provider in {custom, openrouter} that skipped all normalization regardless of whether any catalog group could route the model's prefix. A custom_providers-only user with a stale openai/... session model was never corrected. Fixed: only skip normalization when the model prefix is actually routable (matches a catalog group provider_id, or an openrouter group is present that can route any provider/model). Frontend (static/ui.js): renderSession() injected a bare <option> (not in any <optgroup>) for models not found in the dropdown. renderModelDropdown() rendered bare options without emitting a group heading, so they visually inherited the last rendered provider heading — making the stale model appear to belong to the custom provider group. Fixed: silently reset to the first available model and fire a PATCH to persist the correction instead of injecting a misleading (unavailable) option. 5 new tests in test_provider_mismatch.py cover: - stale openai model cleared when custom_providers-only + no default_model - stale openai model cleared when custom_providers-only + default_model set - openrouter model preserved when openrouter group present - custom/ namespace always preserved - ui.js no longer injects model_unavailable option * fix(ui): declare modelSel locally in syncTopbar reset path; fix test assertion - Use const modelSel=$('modelSelect') instead of undeclared sel in the stale-model reset branch of syncTopbar() (caught in Opus review) - Fix test assertion: or → and for model_unavailable key absence check --------- Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-21 22:20:08 -07:00
nesquena-hermes	d4a3adb7b1	fix(sessions): surface gateway SSE failures and add polling fallback (#828 ) * fix(sessions): surface gateway SSE failures and add polling fallback - add a JSON probe mode for the gateway SSE endpoint - detect watcher-unavailable 503s from the browser - fall back to periodic session refresh with a toast - add probe payload tests and endpoint coverage Fixes #635 * fix(sessions): surface gateway SSE failures and add polling fallback (#826) Absorbed from PR #826 by @cloudyun888 (fixes #635). When the gateway watcher thread is not running, the browser now shows a toast notification and falls back to 30-second periodic polling for session sync. Previously the SSE failure was completely silent with no user feedback. Changes from original PR: - Deleted misplaced test_gateway_sse_probe_unit.py (was at repo root, not discovered by `pytest tests/`); unit tests moved into tests/test_gateway_sync.py - _gateway_sse_probe_payload now checks watcher._thread.is_alive() rather than just watcher is not None — a watcher instance with a dead poll thread now correctly reports unavailable and activates the polling fallback - probeGatewaySSEStatus catch(e) now starts the polling fallback on network error rather than silently swallowing the failure - Added 5 unit tests covering all watcher-alive/dead/missing/disabled branches Co-authored-by: cloudyun888 <269269188+86cloudyun-afk@users.noreply.github.com> * cleanup(gateway): public is_alive() + dedup probe/live watcher-alive check + changelog Three small cleanups on top of @cloudyun888's PR #826 absorption: 1. Add GatewayWatcher.is_alive() public accessor so routes.py doesn't reach into the private _thread attribute. The existing private- attribute check stays as a defensive fallback for any older in- memory instance or test double that doesn't implement the full API. 2. Dedupe the watcher_alive computation in _handle_gateway_sse_stream: the live-SSE path now calls _gateway_sse_probe_payload(...) and reads its watcher_running field instead of re-deriving the same logic inline. Keeps probe and SSE in sync automatically. 3. CHANGELOG trailer was (#826, fixes #635, @cloudyun888) — this PR is #828, so updated to (#828, absorbs PR #826 by @cloudyun888, fixes #635) matching the repo convention for absorbed PRs (see #805). Added two regression tests: - test_gateway_watcher_is_alive_public_method — covers the three lifecycle states (before start, while running, after stop). - test_probe_payload_prefers_public_is_alive — asserts the probe uses watcher.is_alive() rather than poking _thread when the public method exists. Full suite: 1735 passed, 0 new failures. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: cloudyun888 <269269188+86cloudyun-afk@users.noreply.github.com> Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> Co-authored-by: Nathan Esquenazi <nesquena@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-21 21:18:55 -07:00
nesquena-hermes	8f1f582caf	fix: BYOK/custom provider models missing from WebUI model dropdown (#815 ) Closes #815. Three root causes fixed: 1. Provider aliases (z.ai/x.ai/google/grok/claude/aws-bedrock/dashscope/~25 more) not normalized before _PROVIDER_MODELS lookup — provider fell to empty else-branch while TUI worked (it normalizes at startup). Fixed via _resolve_provider_alias() + inlined _PROVIDER_ALIASES table in api/config.py. 2. Silent ImportError in original normalization: 'from hermes_cli.models import _PROVIDER_ALIASES' inside try/except silently failed without hermes-agent on sys.path (CI, minimal installs). The inlined table fixes this — normalization now works regardless of whether hermes-agent is installed. 3. /api/models/live?provider=custom now falls back to custom_providers entries from config.yaml when provider_model_ids() returns empty. Also: provider_id on every group in /api/models response for deterministic JS optgroup matching (no substring false positives). 17 targeted tests, 1725/1725 full suite.	2026-04-21 17:24:54 -07:00
nesquena-hermes	a4d59b9e6c	fix: update banner — conflict recovery path + server self-restart after update (#816 ) * fix: update banner conflict recovery + server self-restart after update (#813 #814) * fix(update): restart must wait for in-flight update + reset force button on retry Two defects in the update banner flow found during review of PR #816: 1. Two-target race (webui + agent sequential) The client posts targets sequentially: webui succeeds and schedules a restart timer (2 s delay); client then posts agent; server begins agent fetch+pull; at T=2 s the restart timer fires os.execv mid-pull, killing the agent update and closing the client connection. User sees "Update failed (agent): Failed to fetch" even though webui did update, and the agent repo is in an unknown partial state. Fix: _schedule_restart() now blocks on _apply_lock before calling os.execv. If a second update is in flight when the timer fires, the restart thread waits until it completes. If nothing is in flight the lock acquire is instant, so no-op updates still restart immediately. 2. Stale force-update button across retries _showUpdateError sets btnForceUpdate to display:inline-block when res.conflict / res.diverged. Nothing resets it on the next retry, so a subsequent non-conflict error (e.g. network) leaves the stale force button visible pointing at the previous target. Fix: applyUpdates() now hides the force button and clears its data-target at the start of each attempt. Tests: - test_schedule_restart_waits_for_apply_lock: holds _apply_lock from a helper thread, verifies execv is delayed until the lock is released. - test_schedule_restart_still_fires_when_no_update_in_flight: sanity check that the common path still works with no contention. - test_apply_updates_resets_force_button_at_start: regression guard that the reset appears before the update loop begins. Full suite: 1683 passed, 0 failures. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(update): hold _apply_lock through execv + fix banner error layout Two fixes from Opus review: 1. TOCTOU gap in _schedule_restart (api/updates.py): the original pattern acquired _apply_lock, released it, then called os.execv — leaving a brief window where a new update could start between release and execv. Fixed by moving os.execv inside the 'with _apply_lock:' block so the process is replaced while still holding the lock; no new update can acquire it. 2. Banner CSS layout (static/index.html): #updateError was a direct flex child of .update-banner (display:flex row), so long error messages sat inline between #updateMsg and the buttons instead of below the message. Wrapped #updateMsg + #updateError in a flex-column container so errors stack vertically under the status line. * docs: add v0.50.134 CHANGELOG entry --------- Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> Co-authored-by: Nathan Esquenazi <nesquena@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-21 17:10:41 -07:00
nesquena-hermes	811424a87b	feat(reasoning): full /reasoning CLI parity — show\|hide + effort levels via config.yaml (#812 ) Closes #461 Adds full /reasoning CLI parity to the WebUI slash command system: - /reasoning show\|on → window._showThinking = true; writes display.show_reasoning to config.yaml (same key as CLI); mirrors to settings.json for boot.js - /reasoning hide\|off → same in reverse; re-renders immediately - /reasoning none\|minimal\|low\|medium\|high\|xhigh → POST /api/reasoning → writes agent.reasoning_effort to config.yaml; takes effect next turn (matching CLI semantics) - /reasoning (no args) → GET /api/reasoning → live status toast from config.yaml - Autocomplete shows all 8 options: show\|hide\|none\|minimal\|low\|medium\|high\|xhigh - Profile-isolated: _get_config_path() is thread-local so per-profile settings never bleed across - Boot hydration: window._showThinking initialised from settings.json show_thinking on page load - Inspect.signature guard in streaming.py so older hermes-agent builds don't TypeError 28 new tests, 1708/1708 total passing. Full browser QA on port 8789 with isolated state. CLI/config.yaml sync verified with hermes_constants.parse_reasoning_effort().	2026-04-21 15:26:52 -07:00
nesquena-hermes	f6e1612c7e	fix: periodic session checkpoint during streaming — v0.50.132 (#810 ) Closes #765. Supersedes #809 (@bergeouss). Co-authored-by: bergeouss <bergeouss@users.noreply.github.com>	2026-04-21 12:07:44 -07:00
nesquena-hermes	312a493a72	fix(sessions): new sessions appear immediately in sidebar (#806 ) Closes #789 Bug A. 60-second exemption in all_sessions() filter.	2026-04-21 17:08:52 +00:00
nesquena-hermes	3246b263d9	fix(profiles): complete profile isolation via cookie + thread-local (#805 ) Closes the gap left by #800. Full isolation via hermes_profile cookie + TLS. Co-authored-by: bergeouss <bergeouss@users.noreply.github.com>	2026-04-21 17:04:11 +00:00
nesquena-hermes	cbb4ba3f28	fix(profiles): profile isolation — new_session uses per-request profile, not process global (#800 ) Fixes the multi-client profile isolation bug (#798). - get_hermes_home_for_profile(): pure path resolver, validates name against _PROFILE_ID_RE (rejects path traversal), never mutates os.environ or globals - new_session() accepts explicit profile= param from POST body (S.activeProfile), short-circuits the process-level _active_profile global - streaming handler resolves HERMES_HOME from s.profile instead of the global - sessions.js sends profile: S.activeProfile in every new-session POST 10 tests in tests/test_issue798.py including concurrency and traversal coverage. Co-authored-by: nesquena <nesquena@users.noreply.github.com>	2026-04-21 16:16:51 +00:00
Dave Brown	77ab63361f	fix(onboarding): recognize credential_pool OAuth auth for openai-codex (#797 ) fix(onboarding): recognize credential_pool OAuth auth for openai-codex (#797) The onboarding readiness check in `api/onboarding.py` only looked at the legacy `providers[provider]` key in `auth.json`. Hermes runtime resolves OAuth tokens from `credential_pool[provider]` (device-code / OAuth flows), so WebUI could report "not ready" while the runtime chatted successfully. The check now covers both storage locations with a fail-closed helper. Adds three regression tests. Reported in #796, fixed by @davidsben. Co-authored-by: davidsben <davidsben@users.noreply.github.com>	2026-04-21 15:41:34 +00:00
nesquena-hermes	38e215e8f8	fix: dynamic version badge — read from git tag, never hardcoded (#790 ) * fix: dynamic version badge — read from git tag, never hardcoded The settings panel showed v0.50.87 and the HTTP Server: header said HermesWebUI/0.50.38 — both hardcoded strings that drift further behind with every release because there was no mechanism to keep them in sync. Changes: - api/updates.py: add _run_git() (moved before _detect_webui_version), _detect_webui_version(), and WEBUI_VERSION module constant resolved once at import time via 'git describe --tags --always --dirty'. Fallback chain: git → api/_version.py → 'unknown'. - api/routes.py: inject webui_version into GET /api/settings response so the frontend can read it without a separate API call. - static/panels.js: loadSettingsPanel() populates .settings-version-badge from settings.webui_version — one line after the existing api() call. - static/index.html: replace stale hardcoded 'v0.50.87' with '—' placeholder; JS overwrites it as soon as the settings panel opens. - server.py: replace hardcoded 'HermesWebUI/0.50.38' server_version with 'HermesWebUI/' + WEBUI_VERSION.lstrip('v') — stays in sync automatically. - Dockerfile: add ARG HERMES_VERSION=unknown and write api/_version.py so Docker images (where .git is excluded) still show the correct tag. - .github/workflows/release.yml: pass build-args: HERMES_VERSION=${{ github.ref_name }} to the Docker build step on tag pushes. - .gitignore: exclude api/_version.py (generated by Docker/CI, never committed). No manual 'update the version badge' step is required going forward. Tagging is sufficient — the badge and HTTP header update automatically. Tests: 18 new tests in tests/test_version_badge.py covering the full resolution chain, /api/settings injection, HTML placeholder, JS wiring, and server.py import. 1596 tests pass total. * fix: address review feedback on PR #790 - api/updates.py: replace exec() with regex parse for api/_version.py (no supply-chain risk from build artifact; exec unnecessary for one assignment) - api/updates.py: cap git describe timeout at 3s (was 10s — import-time stall on NFS/.git would block server startup unnecessarily) - server.py: lstrip('v') → removeprefix('v') (lstrip strips chars not prefix) - server.py: emit bare 'HermesWebUI' when version is 'unknown' rather than 'HermesWebUI/unknown' (log aggregators expect semver-ish suffix or none) - CHANGELOG.md: add v0.50.124 entry for this user-visible change - tests: rename exec-error test to reflect regex behaviour; add tests for removeprefix usage and unknown-version header guard (1598 tests total) --------- Co-authored-by: nesquena-hermes <hermes@nesquena.com>	2026-04-20 20:36:53 -07:00
Nathan Esquenazi	e91325db25	fix(config): invalidate model-list TTL cache on default-model change set_hermes_default_model() calls reload_config() which resyncs _cfg_mtime, so the mtime check inside get_available_models() never fires and the POST response returns the stale cached default. Explicitly drop the TTL cache after reload so the next read recomputes. Fixes the CI failure in test_default_model_updates_hermes_config which the prior teardown-only fix in this PR did not actually address. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 19:32:33 -07:00
nesquena-hermes	b6d335feaa	perf: TTL cache for model list + incremental session index (#780 ) Fixes AWS IMDS timeout on model dropdown. Incremental index writes. Co-authored-by: starship-s <starship-s@users.noreply.github.com>	2026-04-21 00:33:03 +00:00
nesquena-hermes	a7e8b1ab83	fix(streaming): eagerly release session lock in cancel_stream() (#778 ) cancel_stream() now pops STREAMS/CANCEL_FLAGS/AGENT_INSTANCES and clears session.active_stream_id immediately after signalling cancel. Fixes sessions permanently stuck at 409 when the agent thread is blocked in a bad tool call. Session cleanup runs outside STREAMS_LOCK to preserve lock ordering. Fixes #653 Co-authored-by: bergeouss <bergeouss@users.noreply.github.com>	2026-04-20 23:54:40 +00:00
nesquena-hermes	c34892be44	fix(streaming): guard newer AIAgent kwargs with inspect for hermes-agent compat (#775 ) Uses inspect.signature() to check which params AIAgent accepts. Fixes #772.	2026-04-20 23:23:19 +00:00
nesquena-hermes	98cd318413	fix(sessions): surface get_cli_sessions() failures via logger.warning (#769 ) Logs warnings instead of silently returning [] on DB errors. Fixes #634.	2026-04-20 23:13:54 +00:00
nesquena-hermes	765d8520d4	fix(streaming): quota error detection, error persistence, stream_end session_id fix (#767 ) - quota_exhausted error type: distinguishes credit exhaustion from rate limits - Streaming errors persisted to session file so they survive page reload - _error flag excludes persisted errors from subsequent LLM API calls - stream_end and title SSE events use original session_id (not s.session_id which rotates during context compaction) Fixes #739, #652, #653	2026-04-20 22:48:19 +00:00
nesquena-hermes	76e602af25	feat: remove bubble_layout setting end-to-end (#777 ) Removes the bubble_layout toggle from Settings, all persistence, CSS, i18n strings, and the UI docs demo. The CSS was already effectively dead. Users with a saved bubble_layout value in settings.json get a clean migration via _SETTINGS_LEGACY_DROP_KEYS. Credit: @aronprins (PR #760 / #777) Co-authored-by: aronprins <aronprins@users.noreply.github.com>	2026-04-20 22:34:45 +00:00
nesquena-hermes	63f9b719bb	fix(config): use Hermes config.yaml as single source of default model (#773 ) Removes split-brain where WebUI Settings persisted default_model separately from Hermes runtime config.yaml. New POST /api/default-model endpoint writes to config.yaml. Existing saved values migrated on first load. Fixes #761 Co-authored-by: aronprins <aronprins@users.noreply.github.com>	2026-04-20 22:12:01 +00:00
Frank Song	0dd5d6f21c	feat(ui): add sidebar density mode to session list (#764 ) Adds compact/detailed toggle for the session list sidebar. Compact is the default (no behavior change for existing users). Detailed mode shows message count and model; profile names only appear when mixing sessions across profiles. Fixes #673 Co-authored-by: franksong2702 <franksong2702@users.noreply.github.com>	2026-04-20 19:43:40 +00:00
nesquena-hermes	711d8bb6c0	fix(ui): hover-only footer chrome with timestamps for both user and assistant — v0.50.110 (fixes #680 ) (#758 ) Squash merge of PR #717 — rebased on behalf of @franksong2702. ## What it does Fixes #680. Footer chrome (timestamps, copy, edit, regenerate) is now hover-only for both user and assistant message rows, consistent throughout the conversation. The last assistant turn keeps cumulative usage visible at rest; timestamp and actions are revealed inline on hover in the same row. Key changes: - `static/ui.js`: new `_formatMessageFooterTimestamp()` (local timezone, cross-day fuller format); `timeHtml` no longer gated to user-only; last assistant usage moved from separate `.msg-usage` div to inline `.msg-usage-inline` span in the footer - `static/style.css`: `.msg-foot-with-usage` class + rules; assistant footer opacity changed from 0.45 to 0 (hover-only); `:focus-within` alongside `:hover` for keyboard users - `api/streaming.py`: `_restore_reasoning_metadata()` now preserves `_ts`/`timestamp` for unchanged historical messages - `tests/test_sprint49.py`: 8 new tests covering rendering contract, hover CSS, timestamp preservation Tests: 1518 passed. QA: 20/20. Browser verified. Reviewed and approved by @nesquena and @aronprins.	2026-04-20 00:53:19 -07:00
nesquena-hermes	69570ca77c	release: v0.50.102–v0.50.108 batch (code blocks, utf-8, image URLs, deletion warning, PermissionError, Docker docs, kimi-k2.5) (#755 ) ## Batch release: v0.50.102 – v0.50.108 Seven self-built PRs reviewed and approved by @nesquena, now consolidated into a single release branch. ### Included fixes \| Version \| PR \| What it fixes \| \|---\|---\|---\| \| v0.50.102 \| #746 \| Code blocks lose newlines when not preceded by blank line (fixes #745) \| \| v0.50.103 \| #743 \| `encoding='utf-8'` on `write_text()` in `api/profiles.py` — Windows `.env` detection (fixes #741) \| \| v0.50.104 \| #735 \| Agent `MEDIA:localhost:` image URLs rewritten to `document.baseURI` — remote users get working images (fixes #642) \| \| v0.50.105 \| #736 \| Profile deletion warning strengthened: "permanently deleted, cannot be undone" across all 6 locales (fixes #637) \| \| v0.50.106 \| #738 \| Catch `PermissionError` in `_signing_key()` — three-container Docker UID mismatch no longer crashes all HTTP requests \| \| v0.50.107 \| #737 \| Docs: three-container UID/GID alignment guide in README + `HERMES_UID`/`HERMES_GID` forwarded in compose (fixes #645) \| \| v0.50.108 \| #742 \| Add `kimi-k2.5` to Kimi/Moonshot provider model list (fixes #740) \| ### Testing - pytest: 1510 passed, 1 warning (1 pre-existing unrelated failure excluded) - QA harness: 20/20 passed (`~/WebUI/scripts/run-browser-tests.sh`) - Browser: layout, slash autocomplete width, edit button, image URL rewrite, profile deletion dialog all verified All PRs reviewed and approved by @nesquena. Ready to merge and tag v0.50.108*.	2026-04-20 00:26:55 -07:00
nesquena-hermes	78c4f1e425	fix: null/empty session model must not trigger index rebuild — v0.50.101 (#753 ) ## Summary Follow-up to #751/#752. Code review identified a case where `_normalize_session_model_in_place` could call `session.save()` (which triggers a full session index rebuild) for sessions with `model: null` or missing model field. Root cause: `_resolve_compatible_session_model(None)` returns `(default_model, True)` when a default exists — which was interpreted as "changed, needs save." But there's nothing to correct for a session with no model; the default is just a fallback for display purposes, not a cross-provider correction worth persisting. Fix: capture `original_model` before calling `_resolve_compatible_session_model`. Only call `session.save()` if `original_model` was non-empty and actually changed. Adds a test asserting `save_calls == []` when `session.model is None`. No behavior change for sessions with a real model (the primary use case of #751 is unaffected).	2026-04-19 23:44:46 -07:00
nesquena-hermes	81ba420716	fix: custom/unknown model prefixes must not be stripped on provider switch — v0.50.100 (#752 ) ## Summary Regression fix for #751. Models with custom or unrecognized prefixes (e.g. `custom-provider/my-model`, `test/import-model`) were being incorrectly replaced with the active provider default. Root cause: `_normalize_provider_id("custom-provider")` matched the `"custom"` prefix and returned `"custom"`, which ≠ `active_provider` → normalization fired. Two-part fix: 1. Add `"custom"` and `"openrouter"` to the `model_provider` exclusion set in `_resolve_compatible_session_model` (parallel to the existing `active_provider` guard) 2. Return `""` for unknown prefixes in `_normalize_provider_id` so the `if model_provider` truthiness check safely short-circuits Adds a regression test covering `custom-provider/`, `test/`, `my-local-llm/`, and `lmstudio-community/` prefixes. ## Tests 1499 passed, 0 failures (was 2 failures before this fix)	2026-04-19 23:27:24 -07:00
nesquena-hermes	7f16a41a31	fix: normalize stale session models after provider switch — v0.50.99 (#751 ) ## Summary Rebased-on-behalf of @likawa3b (originally PR #748 — stale base). Sessions can outlive provider changes. When an old session still points to a model from a previous provider (e.g. `gemini-3.1-pro-preview` after switching the agent to OpenAI Codex), starting a chat hits the wrong backend and fails silently. This PR adds a lightweight normalization pass: - `_normalize_provider_id()` maps common prefixes to canonical provider IDs - `_resolve_compatible_session_model()` checks the session model's provider against `active_provider` and returns the default model if they differ - `_normalize_session_model_in_place()` is called at GET `/api/session` — corrects and persists stale models once - Chat start also normalizes via `_resolve_compatible_session_model()` and returns `effective_model` in the response - `messages.js` applies `effective_model` back to the UI/localStorage/dropdown if set Closes #748 ## Tests 1498 passed (2 pre-existing ordering failures unrelated to this PR; 5 new tests added in `test_provider_mismatch.py`). Original author: @likawa3b	2026-04-19 23:22:26 -07:00
nesquena-hermes	067d96bb30	feat: add full Russian (ru-RU) localization — v0.50.95 (PR #713 ) Full Russian locale — 389/389 English keys, Slavic plural forms, native Cyrillic. Rebased from PR #605 with rebase artifacts fixed. Login page Russian added to api/routes.py. Credits: @DrMaks22 (translation), @renheqiang (PR #605 author). Co-authored-by: DrMaks22 <DrMaks22@users.noreply.github.com> Co-authored-by: renheqiang <renheqiang@users.noreply.github.com>	2026-04-19 06:47:24 +00:00

1 2 3 4 5

235 Commits