chore: add runtime and desktop regression coverage by lefarcen · Pull Request #941 · nexu-io/nexu

lefarcen · 2026-04-08T10:06:48Z

What

Add a focused batch of regression coverage for the highest-risk runtime and desktop flows that changed after v0.1.10, and optimize CI so local/default test runs stay fast while PR/CI still execute the right coverage for the files that changed.

Why

Recent post-release fixes touched restart prevention, BYOK model preservation, launchd disabled overrides, credit-guard error replacement, mac desktop layout, local dev stale-port recovery, and skill watcher reconciliation. Those fixes are valuable precisely because they protect against subtle regressions, so they need direct tests rather than relying on indirect coverage. At the same time, the test suite has grown enough that we should stop running the slowest desktop/e2e paths on every PR when the changed files do not justify them.

How

Add runtime stability regression tests for deterministic plugins.allow ordering and BYOK model preservation when a configured provider has an empty model allowlist.
Add controller runtime-plugin tests for nexu-credit-guard channel isolation and stale-cache eviction.
Add desktop launchd tests to verify disabled launchd overrides are cleared even when plist content is unchanged.
Add web desktop-platform tests for the mac sidebar/traffic-light layout split.
Add scripts/dev service tests covering stale web/controller port cleanup before startup.
Add desktop link tests covering host-bridge external-link/local-folder fallback behavior.
Harden SkillDirWatcher with a low-frequency reconciliation fallback so missed fs events do not leave the ledger stale.
Split Vitest into core and extended layers, add a PR-aware test-selection script, and gate Desktop E2E so expensive suites/coverage only run for relevant changes while push/main still keeps full coverage.
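
The first item above depends on `plugins.allow` coming out in the same order no matter how its inputs are merged. A minimal sketch of that property, assuming the list is assembled from several arrays of plugin ids; `buildAllowList` is a hypothetical helper for illustration, not the controller's actual function:

```typescript
// Hypothetical helper: merge plugin ids from several sources into one
// de-duplicated list whose order does not depend on the input order.
function buildAllowList(...sources: string[][]): string[] {
  const merged = sources.reduce<string[]>((all, ids) => all.concat(ids), []);
  const unique = Array.from(new Set(merged));
  // Plain code-unit comparison keeps the result locale-independent.
  return unique.sort((a, b) => (a < b ? -1 : a > b ? 1 : 0));
}
```

A regression test can then call the helper with the same ids in two different orders and assert that both results are identical.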

Affected areas

Checklist

pnpm typecheck passes
pnpm lint passes
pnpm test passes
pnpm generate-types run (if API routes/schemas changed)
No credentials or tokens in code or logs
No any types introduced (use unknown with narrowing)

Add regression tests for runtime stability fixes, launchd disabled-override recovery, credit-guard channel isolation, mac desktop sidebar platform layout, and scripts/dev stale port cleanup so these post-0.1.10 failures stay covered in one place.

sentry · 2026-04-08T10:09:40Z

Codecov Report

❌ Patch coverage is 93.75000% with 1 line in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
...troller/src/services/skillhub/skill-dir-watcher.ts	93.75%	1 Missing ⚠️


chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 9bf9d0a6c6

Update the new regression tests to assert the stable behavior that mainline now produces: plugin ordering remains deterministic even with Feishu/Weixin defaults present, and the mac desktop layout assertion targets the expanded header offset instead of a collapsed-only toggle.

chatgpt-codex-connector Bot reviewed Apr 8, 2026

Comment thread tests/controller/nexu-credit-guard.test.ts Outdated

Comment thread tests/controller/runtime-stability-regressions.test.ts

lefarcen added 4 commits April 8, 2026 20:53

Merge remote-tracking branch 'origin/main' into test/fill-runtime-desktop-gaps

6c6f168

chore: add desktop external link bridge coverage

9a743a8

Cover desktop-only external link behavior so browser and local-folder launches prefer the host bridge when available and safely fall back when the controller is unavailable.
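
The preference order described in this commit can be sketched as follows. This is an illustration under stated assumptions, not the desktop app's code: the `HostBridge` interface, `openExternalLink`, and the idea that the bridge throws when the host is unreachable are all hypothetical, and the call is kept synchronous for brevity even though a real bridge call is likely asynchronous.

```typescript
// Hypothetical bridge shape; assumed to throw when the host cannot be reached.
interface HostBridge {
  openExternal(url: string): void;
}

// Prefer the host bridge when one is present; otherwise, or if the bridge
// call fails, hand the URL to the fallback opener.
function openExternalLink(
  url: string,
  bridge: HostBridge | null,
  fallbackOpen: (url: string) => void,
): "bridge" | "fallback" {
  if (bridge !== null) {
    try {
      bridge.openExternal(url);
      return "bridge";
    } catch {
      // Bridge unavailable: fall through to the fallback path.
    }
  }
  fallbackOpen(url);
  return "fallback";
}
```

Returning which path was taken makes both branches easy to assert on in a test.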

chore: tighten regression assertions in controller coverage

835c4ee

Make the credit-guard test prove cross-channel isolation before cache consumption, and add a competing runtime model to the BYOK empty-allowlist fixture so the fallback-prevention path is actually exercised.
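
For readers unfamiliar with the property under test, here is a rough sketch of channel isolation combined with stale-entry eviction. It assumes the guard keeps one pending value per channel and reads it at most once; `ChannelScopedCache` and its method names are hypothetical and do not come from the plugin.

```typescript
// Hypothetical one-shot cache keyed by channel id, with time-based eviction.
class ChannelScopedCache<T> {
  private readonly entries = new Map<string, { value: T; storedAt: number }>();
  private readonly ttlMs: number;
  private readonly now: () => number;

  // `now` is injectable so tests can control the clock.
  constructor(ttlMs: number, now: () => number = () => Date.now()) {
    this.ttlMs = ttlMs;
    this.now = now;
  }

  set(channelId: string, value: T): void {
    this.evictStale();
    this.entries.set(channelId, { value, storedAt: this.now() });
  }

  // Returns the value stored for this channel only, then removes it.
  consume(channelId: string): T | undefined {
    this.evictStale();
    const entry = this.entries.get(channelId);
    if (entry === undefined) return undefined;
    this.entries.delete(channelId);
    return entry.value;
  }

  // Drops every entry older than the configured time-to-live.
  private evictStale(): void {
    const cutoff = this.now() - this.ttlMs;
    this.entries.forEach((entry, key) => {
      if (entry.storedAt < cutoff) this.entries.delete(key);
    });
  }
}
```

Checking that a second channel sees nothing before the first channel consumes its entry is what distinguishes real isolation from an entry that merely happened to be gone already.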

fix(controller): harden skill dir watcher reconciliation

266590c

Keep the skill ledger in sync even when fs.watch misses new skill directory events by adding a low-frequency reconciliation fallback and covering it in the desktop watcher regression test.
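
The reconciliation idea can be illustrated with a small diff step. This is a sketch only: it assumes the ledger can be reduced to a list of skill directory names, and `reconcileSkillDirs` and `ReconcileResult` are hypothetical names rather than members of `SkillDirWatcher`.

```typescript
// Hypothetical result of comparing the directories on disk with the ledger.
interface ReconcileResult {
  missingFromLedger: string[]; // present on disk, never recorded
  staleInLedger: string[]; // recorded, but no longer on disk
}

// Pure comparison step; a watcher could run it on a slow timer so that a
// missed fs.watch event is eventually corrected.
function reconcileSkillDirs(
  onDisk: readonly string[],
  ledger: readonly string[],
): ReconcileResult {
  const diskSet = new Set(onDisk);
  const ledgerSet = new Set(ledger);
  return {
    missingFromLedger: onDisk.filter((dir) => !ledgerSet.has(dir)).sort(),
    staleInLedger: ledger.filter((dir) => !diskSet.has(dir)).sort(),
  };
}
```

Keeping the comparison free of file-system calls lets a test cover the fallback without waiting on real watch events.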

mrcfps approved these changes Apr 8, 2026

lefarcen added 5 commits April 9, 2026 10:50

chore(ci): split core and extended tests

bf68d8a

Run core tests by default, keep extended suites available for full CI, and use a dynamic PR test selector plus desktop E2E gating so expensive jobs only run when the changed files justify them.
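
A rough sketch of how changed files might map to test layers. The layer names, the prefix-based rules, and `selectTestLayers` are assumptions for illustration; the selector script in this PR may use different inputs and matching logic.

```typescript
type TestLayer = "core" | "extended" | "desktop-e2e";

// Hypothetical rule: a changed file under `prefix` pulls in `layer`.
interface LayerRule {
  prefix: string;
  layer: TestLayer;
}

function selectTestLayers(
  event: "push" | "pull_request",
  changedFiles: readonly string[],
  rules: readonly LayerRule[],
): TestLayer[] {
  const all: TestLayer[] = ["core", "extended", "desktop-e2e"];
  // Pushes keep full coverage regardless of what changed.
  if (event === "push") return all;
  // Pull requests always run core, plus any layer a changed file triggers.
  const selected = new Set<TestLayer>(["core"]);
  for (const file of changedFiles) {
    for (const rule of rules) {
      if (file.startsWith(rule.prefix)) selected.add(rule.layer);
    }
  }
  return all.filter((layer) => selected.has(layer));
}
```

Filtering the fixed `all` array keeps the output order stable, which makes the result easier to consume in a CI matrix.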

fix(ci): fetch PR base ref for test selection

6046e37

Make the PR test selector fetch the base branch ref on demand before computing merge-base so shallow GitHub Actions checkouts do not fail on missing origin/main in test and desktop CI jobs.
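
The order of operations matters here: the base ref has to exist locally before `git merge-base` can resolve it. A minimal sketch, assuming an injected runner that executes `git` with the given arguments and returns trimmed stdout; `GitRunner` and `listChangedFiles` are hypothetical names, not the script's real API.

```typescript
// Hypothetical runner: executes `git <args>` and returns trimmed stdout.
type GitRunner = (args: string[]) => string;

function listChangedFiles(run: GitRunner, baseBranch: string): string[] {
  const remoteRef = `origin/${baseBranch}`;
  // Fetch the base branch explicitly; a shallow CI checkout may not have it.
  run(["fetch", "origin", `${baseBranch}:refs/remotes/origin/${baseBranch}`]);
  // Find the commit where the PR branch diverged from the base branch.
  const mergeBase = run(["merge-base", remoteRef, "HEAD"]);
  // List the files changed on the PR branch since that commit.
  const diff = run(["diff", "--name-only", `${mergeBase}..HEAD`]);
  return diff.split("\n").filter((line) => line.length > 0);
}
```

Fetching the ref alone does not guarantee enough history for `merge-base` on a shallow clone, which is consistent with the later commits in this PR moving to full checkout history.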

fix(ci): fetch full history for PR test planning

56be306

Use full checkout history in PR jobs that rely on merge-base so the dynamic test selector can compare against the base branch reliably in GitHub Actions instead of failing on shallow clones.

fix(ci): use full checkout in test job

5cfd22d

Fetch full git history in the CI test job so merge-base driven PR test selection can compare against the base branch reliably, matching the desktop dev workflow behavior.

lefarcen wants to merge 10 commits into main from test/fill-runtime-desktop-gaps
