yunoadmin (yuno admin)

yunoadmin synced commits to github-repo-stats at yunoadmin/bambuddy from mirror

5022e1a0e4 ghrs: report 05-25-2329-25F9 for maziggy/bambuddy
3900ca0014 ghrs: stars and forks 05-25-2329-25F9 for maziggy/bambuddy
200a87e2fb ghrs: vc agg 05-25-2329-25F9 for maziggy/bambuddy
f5545bc959 ghrs: snap 05-25-2329-25F9 for maziggy/bambuddy
View comparison for these 4 commits »

4 hours ago

yunoadmin synced commits to v0.2.5b1-daily.20260525 at yunoadmin/bambuddy from mirror

12 hours ago

yunoadmin synced new reference v0.2.5b1-daily.20260525 to yunoadmin/bambuddy from mirror

12 hours ago

yunoadmin synced commits to bug-report-assets at yunoadmin/bambuddy from mirror

df4e8676b2 Bug report logs c9f84e2cff9647c1896da10ef4a8d6f9.log
c1e19b0397 Bug report screenshot adecb7ef71e244cdbc734a5da6c01c5b.png
View comparison for these 2 commits »

12 hours ago

yunoadmin synced and deleted reference v0.2.5b1-daily.20260524 at yunoadmin/bambuddy from mirror

12 hours ago

yunoadmin synced commits to dev at yunoadmin/bambuddy from mirror

e34958c3fa Post work PR #1501
052e928107 feat: add system theme detection (prefers-color-scheme) (#1501) feat: add system theme detection (prefers-color-scheme)
554a73070f fix(maintenance): paused prints no longer accumulate runtime hours (#1521) PAUSE counted toward runtime_seconds equally with RUNNING, inflating hours-based maintenance thresholds (rod lube, belt check, nozzle clean) by however long overnight or extended pauses lasted. Maintenance items track mechanical wear, which is zero while paused, so the predicate now excludes PAUSE. Field-comment and docstring trail across main.py / models/printer.py / maintenance.py updated to match. Existing runtime_seconds values cannot be retroactively split — only future accumulation is fixed. Adds 3 regression tests pinning PAUSE non-accumulation, RUNNING accumulation, and the FINISH state's last_runtime_update clear (prevents idle-time back-bill when the printer next goes RUNNING).
4cce575bcc fix(stats): cancelled bucket icon now uses semantic warning token (orange) (#1390 follow-up) Reporter flagged that the new Cancelled row's Ban icon rendered colourless while Successful and Failed used green/red icons in the same widget. Switch the Cancelled icon to text-status-warning (amber-500) so all three rows now use semantic status tokens consistently, and the colour matches the orange Archives + notification badges already use for cancelled status.
View comparison for these 4 commits »

20 hours ago

yunoadmin synced commits to bug-report-assets at yunoadmin/bambuddy from mirror

1e011b0c6e Bug report logs d2a5a501c1c24c0a9fb226653244183f.log
1f524ef4b5 Bug report logs 4e51e2251f8946539ad539aa0fcb6b8a.log
355e3f9a57 Bug report screenshot 4e4ac38b030843909b1583e10444d1ed.png
View comparison for these 3 commits »

20 hours ago

yunoadmin synced commits to github-repo-stats at yunoadmin/bambuddy from mirror

4cae6c2c1a ghrs: report 05-24-2326-F8AF for maziggy/bambuddy
cfdf59bd8d ghrs: stars and forks 05-24-2326-F8AF for maziggy/bambuddy
4d80a67e80 ghrs: vc agg 05-24-2326-F8AF for maziggy/bambuddy
869c14f540 ghrs: snap 05-24-2326-F8AF for maziggy/bambuddy
View comparison for these 4 commits »

1 day ago

yunoadmin synced commits to main at yunoadmin/bambuddy from mirror

5d248e16ba chore(triage): tighten bug-report template + add Area dropdown to cut invalid-issue load 170 issues have been closed with the `invalid` label (61 of them in the last 30 days alone — ~1 in 5 of all closed issues), almost always because the reporter hadn't run the in-app Connection Diagnostic or checked the documented troubleshooting page. The Connection Diagnostic shipped weeks ago but the bug-report form let people skip it: the "I ran it" checkbox was `required: false` and the Support Package field was optional. Tighten both. Form changes (.github/ISSUE_TEMPLATE/bug_report.yml): - Connection Diagnostic checkbox: required: false → true - Support Package field: required: false → true ("drag the .zip or explain why you cannot attach one") - New required textarea "Troubleshooting steps already taken" — forces the reporter to type WHAT they tried and WHICH wiki pages they checked before submitting. Empty answers can't submit. - Pre-form intro spells out the search → wiki → diagnostic → support package sequence and cites the 1-in-5 stat - Final-checks list grew from one to three required confirmations (searched issues + checked troubleshooting wiki + ran Connection Diagnostic for any connection/printing/camera issue) Bug categorization (the gap that motivated this): - Old `Component` dropdown was Bambuddy / SpoolBuddy / Both — no area triage signal - Replaced with two required dropdowns: - Product: Bambuddy / SpoolBuddy - Area: 15 options covering the actual feature surface + Other / not sure - Auto-label workflow (.github/workflows/auto-label-area.yml) reads the Area dropdown from the rendered issue body on open/edit and applies the matching area:* label. Tolerant of CRLF and the _No response_ placeholder, won't re-add on edit re-fires, warns on unknown Area values Maintainer hand-off — labels must exist BEFORE the workflow runs, since github-script's addLabels throws on missing labels. Create the 16 labels (15 area:* + 1 area:unsorted) once via the `gh label create` commands captured in CHANGELOG / commit context. OS dropdown left untouched (Docker stays — per Martin). Printer Model dropdown verified against backend/app/utils/printer_models.py PRINTER_MODEL_MAP: all 13 current models present (X1 Carbon, X1, X1E, X2D, P1S, P1P, P2S, A1, A1 Mini, H2D, H2D Pro, H2C, H2S).

1 day ago

yunoadmin synced commits to github-repo-stats at yunoadmin/bambuddy from mirror

7216c21989 ghrs: report 05-24-1515-2425 for maziggy/bambuddy
e9f58aadda ghrs: stars and forks 05-24-1515-2425 for maziggy/bambuddy
d853c9d88b ghrs: vc agg 05-24-1515-2425 for maziggy/bambuddy
346dd8674a ghrs: snap 05-24-1515-2425 for maziggy/bambuddy
View comparison for these 4 commits »

1 day ago

yunoadmin synced commits to dev at yunoadmin/bambuddy from mirror

e5ebab7ab8 chore(triage): tighten bug-report template + add Area dropdown to cut invalid-issue load 170 issues have been closed with the `invalid` label (61 of them in the last 30 days alone — ~1 in 5 of all closed issues), almost always because the reporter hadn't run the in-app Connection Diagnostic or checked the documented troubleshooting page. The Connection Diagnostic shipped weeks ago but the bug-report form let people skip it: the "I ran it" checkbox was `required: false` and the Support Package field was optional. Tighten both. Form changes (.github/ISSUE_TEMPLATE/bug_report.yml): - Connection Diagnostic checkbox: required: false → true - Support Package field: required: false → true ("drag the .zip or explain why you cannot attach one") - New required textarea "Troubleshooting steps already taken" — forces the reporter to type WHAT they tried and WHICH wiki pages they checked before submitting. Empty answers can't submit. - Pre-form intro spells out the search → wiki → diagnostic → support package sequence and cites the 1-in-5 stat - Final-checks list grew from one to three required confirmations (searched issues + checked troubleshooting wiki + ran Connection Diagnostic for any connection/printing/camera issue) Bug categorization (the gap that motivated this): - Old `Component` dropdown was Bambuddy / SpoolBuddy / Both — no area triage signal - Replaced with two required dropdowns: - Product: Bambuddy / SpoolBuddy - Area: 15 options covering the actual feature surface + Other / not sure - Auto-label workflow (.github/workflows/auto-label-area.yml) reads the Area dropdown from the rendered issue body on open/edit and applies the matching area:* label. Tolerant of CRLF and the _No response_ placeholder, won't re-add on edit re-fires, warns on unknown Area values Maintainer hand-off — labels must exist BEFORE the workflow runs, since github-script's addLabels throws on missing labels. Create the 16 labels (15 area:* + 1 area:unsorted) once via the `gh label create` commands captured in CHANGELOG / commit context. OS dropdown left untouched (Docker stays — per Martin). Printer Model dropdown verified against backend/app/utils/printer_models.py PRINTER_MODEL_MAP: all 13 current models present (X1 Carbon, X1, X1E, X2D, P1S, P1P, P2S, A1, A1 Mini, H2D, H2D Pro, H2C, H2S).

1 day ago

yunoadmin synced commits to bug-report-assets at yunoadmin/bambuddy from mirror

7319518b75 Bug report logs ab77b5273a014227ad28051767c6ca6f.log
078c01baec Bug report screenshot c6b1fc95cd174fc2a26879d358de35ab.png
dd10af2e7c Bug report logs dc2e311307544f288f9a4e79e88f996e.log
View comparison for these 3 commits »

1 day ago

yunoadmin synced commits to v0.2.5b1-daily.20260524 at yunoadmin/bambuddy from mirror

1 day ago

yunoadmin synced new reference v0.2.5b1-daily.20260524 to yunoadmin/bambuddy from mirror

1 day ago

yunoadmin synced commits to v0.2.4.3 at yunoadmin/bambuddy from mirror

1 day ago

yunoadmin synced new reference v0.2.4.3 to yunoadmin/bambuddy from mirror

1 day ago

yunoadmin synced commits to main at yunoadmin/bambuddy from mirror

c0129ef93f chore(docker): silence Trivy DS-0026 on Dockerfile.test via HEALTHCHECK NONE Trivy raised DS-0026 ("No HEALTHCHECK defined") against Dockerfile.test on every run of the security workflow. The test image is a one-shot pytest runner — there's no service to probe, so any HEALTHCHECK we invented would be cargo-cult noise that fires once and means nothing. HEALTHCHECK NONE is the documented Docker directive to explicitly opt out of any inherited HEALTHCHECK and is the way Trivy itself expects projects to signal "this image is intentionally not a long-running service." Adding it closes code-scanning alert #813 cleanly. Note: the perl-base CVE-2026-8376 alert (#811) is left open for now and dismissed in the GitHub UI as "Won't fix - no upstream patch" because Debian Trixie has not yet shipped a fixed perl-base; the patched build will land automatically on the next base-image refresh.
b3b37e8f08 ci(docker): full backend suite in Docker, 4-way matrix shard, GHA cache backend Earlier patch trimmed the duplicate unit-test re-run from docker-test to drop a 5-10 min job that wasn't adding coverage. But "wasn't adding coverage" only holds for pure-logic tests — system-touching tests (ffmpeg version probes, ftp clients, subprocess shell-outs, locale/ timezone-sensitive assertions, paths) genuinely can pass on the GHA host and fail in python:3.13-slim. Curation via a `docker_env` marker is fragile (new tests get forgotten); gating on `main` only defers the cost without removing it. Instead, run the full backend suite IN Docker on every PR but make it fast: - New docker-backend-tests job runs the same 4-way pytest-split matrix as the host backend-tests, just inside the test image. - docker/setup-buildx-action + docker/build-push-action@v5 with cache-from/cache-to: type=gha,scope=backend-test persist the BuildKit cache (pip-install layer included) across CI runs and across the 4 sibling shards. Cold build is ~150s/shard; warm build drops to ~10s/shard. - fail-fast: false so a single failing shard surfaces the rest's output too. Total CI wall-clock for a PR push is now gated by docker-test (the image-build + integration HTTP smoke + integration test suite job) at ~3 min, not by the unit-test re-run anymore. The earlier ci.yml step that ran `docker compose run --rm backend-test` synchronously in the docker-test job stays removed — the new docker-backend-tests matrix covers the same ground and is much faster.
ed905d8406 ci(docker): stop re-running unit tests inside the test image The "Docker Build" job in ci.yml was running the same 5287 backend tests + 2022 frontend tests inside the bambuddy-backend-test / bambuddy-frontend-test images that the host-side backend-tests and frontend-tests jobs had already run. Same test code, same Python version (env.PYTHON_VERSION), same requirements.txt the test image installs. On 2-vCPU GHA runners that re-run added 5-10 min of wall-clock for zero new coverage — and "frontend tests in Docker" added another 2-3 min for the same reason. Drop both steps from the CI job. Keep everything that validates the Docker IMAGE specifically: production image build, backend module import verification, static-files-copied check, integration container bring-up + health/API/static HTTP smoke checks, and the integration test suite (which IS genuinely Docker-specific — it runs against the live container via BAMBUDDY_TEST_URL). test_docker.sh keeps the unit-test reruns because devs running it locally don't have a separate host-side pytest job to compare against. Combined with the earlier 4-way pytest-split shard on the host backend-tests job, expected PR-push wall-clock drops from ~10-12 min to ~3 min, gated on max(backend-tests shard, frontend tests, docker-image-build+integration).
39a075918a ci(docker): drop -v, -n auto instead of -n 30, pip cache mount Three things were making the Docker test runs noisier and slower than they needed to be: 1. -v was hardcoded in Dockerfile.test:35 CMD and in docker-compose. test.yml's integration-test-runner command. The ci.yml change to drop -v from the bare pytest call missed both — Docker runs use the image's CMD, not the workflow's. 2. -n 30 was hardcoded as the xdist worker count. On a 2-vCPU CI box that's 30 Python processes fighting over 2 cores — mostly IPC and import-thrash overhead. -n auto adapts to the host: 2 on CI, 30 on a 30-core dev box. Same final-result throughput on the dev box, much better on small runners. 3. pip install had --no-cache-dir and no BuildKit cache mount, so every Docker build re-fetched ~50 packages from PyPI (~60-90s on a cold pip cache). Adding `RUN --mount=type=cache,target= /root/.cache/pip` (with the `# syntax=docker/dockerfile:1.7` directive that enables it) makes subsequent builds re-use the download cache so they only do install work, ~5s instead of ~90s. DOCKER_BUILDKIT=1 is already exported in test_docker.sh and is the GHA default since runner image 2023, so the cache mount is always honoured. Verified locally: Docker build is 19s warm (was ~90s cold each time), test run is 102s with 5287 passed / 1 skipped (the by-design spoolbuddy importorskip) — clean output, no [gwN] worker spam, no "created: 30/30 workers" startup line. GHA-side per-run cold-build slowness still happens because GHA runners are ephemeral; a follow-up using docker/build-push-action with type=gha cache backend would persist the BuildKit cache across CI runs but that's a bigger workflow change.
4fac9ff12c fix(test): stop sys.modules-deleting backend.app.main in test_code_quality + ci: shard backend tests 4-way + drop -v for ~3.5x wall-clock speedup Root cause of the 4 CI failures on PR #1514 (all in test_print_start_assigns_printer_id_to_vp_archive.py + test_timelapse_baseline_restart_recovery.py): test_all_modules_importable in test_code_quality.py was deleting backend.app.main from sys.modules and re-importing it via importlib.import_module. That created NEW module-level dicts (_timelapse_baselines, _expected_prints, _active_prints, …) and re-ran root_logger.addHandler — hence the duplicate log lines at the same microsecond in captured stderr. Any sibling test that bound those names via "from backend.app.main import _timelapse_baselines" before the reimport now held a reference to the OLD dict; production code (reached via "from backend.app.main import on_print_start") resolved the symbol through the NEW module instance. Production mutated the new dict, the test read the old one, the assertion saw None / un-mutated mock_archive. Locally with -n 30, xdist load-balanced test_code_quality.py to a different worker process so the collision never happened (which is why the suite was green for me). CI's -n auto = -n 2 on ubuntu-latest made the collision deterministic. Fix: drop the "del sys.modules[name]" step. importlib.import_module already returns the cached module if cached, or runs the import machinery if not — either way, any import-time error surfaces. The "fresh import" framing was theatre; in practice every module in the list is already imported by other tests/fixtures before this test runs, so we were never actually getting a fresh import anyway — just destruction. CI workflow tightening (separate concern, same PR since both touch the test infrastructure): - Dropped -v from the pytest invocation. 5300+ "PASSED foo::bar" lines per worker were eating ~30-60s of stdout I/O on 2-vCPU runners. --tb=short is sufficient for failure context. - Sharded backend-tests into a 4-way matrix via pytest-split (new dev dep). Each shard runs ~1326 tests in ~95s on a 2-vCPU runner; all 4 run in parallel so wall-clock drops from 362s -> ~100s. - fail-fast: false on the matrix so a single failing shard doesn't hide failures in the other three — PRs see the complete failure picture in one push.
View comparison for these 82 commits »

1 day ago

yunoadmin synced commits to dev at yunoadmin/bambuddy from mirror

1e734fb7c6 fix(stats): cancelled prints get their own bucket; gauge denominator excludes them (#1390 follow-up) Reporter (@IndividualGhost1905) saw Total: 20 / Success: 18 / Failed: 1 and asked where the 20th print went. The Quick Stats endpoint counted status == "completed" → Successful and status == "failed" → Failed, but used a raw count(*) for Total Prints, so the four other PrintLogEntry statuses (aborted, stopped, cancelled, skipped) silently inflated the total without showing up in any breakdown row. The earlier #1390 round had committed a test locking in this exact behaviour ("uses total_prints as denominator so cancelled/stopped events count"), which was wrong: it conflated user intent with print quality. Three-bucket classification, applied across the whole stats surface and matching how the rest of the codebase already groups statuses (main.py:430, 1729; failure_analysis status filter): successful = completed failed = failed + aborted (printer-detected quality failures) cancelled = stopped + cancelled + skipped (user/queue stopped) Quick Stats endpoint returns the new cancelled_prints field; ArchiveStats.cancelled_prints defaults to 0 so older fixtures still parse. SuccessRateWidget gauge now divides by successful + failed only — a cancelled roll no longer drags the gauge down — and a Cancelled row appears in the breakdown so the missing prints don't silently vanish from Total Prints. Failure Analysis service applies the same denominator fix to both the headline failure_rate and the per-week trend, so a week with several cancellations and zero failures reads as 0% rather than a misleading "failed / total". i18n: new stats.cancelled key in all 9 locales with real translations (no English fallback), parity script clean. Tests: the existing 'uses total_prints as denominator' assertion is inverted to assert the new behaviour (40 / 20 / 35 → 67% gauge, Cancelled: 35 visible). The unchanged-display path (140 / 10 / 0 → 93%) still holds since 140 / (140 + 10) = 93.33% rounds the same. 33 StatsPage tests + 6 backend stats/failure tests green.
b9b06a7351 chore(docker): silence Trivy DS-0026 on Dockerfile.test via HEALTHCHECK NONE Trivy raised DS-0026 ("No HEALTHCHECK defined") against Dockerfile.test on every run of the security workflow. The test image is a one-shot pytest runner — there's no service to probe, so any HEALTHCHECK we invented would be cargo-cult noise that fires once and means nothing. HEALTHCHECK NONE is the documented Docker directive to explicitly opt out of any inherited HEALTHCHECK and is the way Trivy itself expects projects to signal "this image is intentionally not a long-running service." Adding it closes code-scanning alert #813 cleanly. Note: the perl-base CVE-2026-8376 alert (#811) is left open for now and dismissed in the GitHub UI as "Won't fix - no upstream patch" because Debian Trixie has not yet shipped a fixed perl-base; the patched build will land automatically on the next base-image refresh.
12d344cbfc ci(docker): full backend suite in Docker, 4-way matrix shard, GHA cache backend Earlier patch trimmed the duplicate unit-test re-run from docker-test to drop a 5-10 min job that wasn't adding coverage. But "wasn't adding coverage" only holds for pure-logic tests — system-touching tests (ffmpeg version probes, ftp clients, subprocess shell-outs, locale/ timezone-sensitive assertions, paths) genuinely can pass on the GHA host and fail in python:3.13-slim. Curation via a `docker_env` marker is fragile (new tests get forgotten); gating on `main` only defers the cost without removing it. Instead, run the full backend suite IN Docker on every PR but make it fast: - New docker-backend-tests job runs the same 4-way pytest-split matrix as the host backend-tests, just inside the test image. - docker/setup-buildx-action + docker/build-push-action@v5 with cache-from/cache-to: type=gha,scope=backend-test persist the BuildKit cache (pip-install layer included) across CI runs and across the 4 sibling shards. Cold build is ~150s/shard; warm build drops to ~10s/shard. - fail-fast: false so a single failing shard surfaces the rest's output too. Total CI wall-clock for a PR push is now gated by docker-test (the image-build + integration HTTP smoke + integration test suite job) at ~3 min, not by the unit-test re-run anymore. The earlier ci.yml step that ran `docker compose run --rm backend-test` synchronously in the docker-test job stays removed — the new docker-backend-tests matrix covers the same ground and is much faster.
a4afc9c073 ci(docker): stop re-running unit tests inside the test image The "Docker Build" job in ci.yml was running the same 5287 backend tests + 2022 frontend tests inside the bambuddy-backend-test / bambuddy-frontend-test images that the host-side backend-tests and frontend-tests jobs had already run. Same test code, same Python version (env.PYTHON_VERSION), same requirements.txt the test image installs. On 2-vCPU GHA runners that re-run added 5-10 min of wall-clock for zero new coverage — and "frontend tests in Docker" added another 2-3 min for the same reason. Drop both steps from the CI job. Keep everything that validates the Docker IMAGE specifically: production image build, backend module import verification, static-files-copied check, integration container bring-up + health/API/static HTTP smoke checks, and the integration test suite (which IS genuinely Docker-specific — it runs against the live container via BAMBUDDY_TEST_URL). test_docker.sh keeps the unit-test reruns because devs running it locally don't have a separate host-side pytest job to compare against. Combined with the earlier 4-way pytest-split shard on the host backend-tests job, expected PR-push wall-clock drops from ~10-12 min to ~3 min, gated on max(backend-tests shard, frontend tests, docker-image-build+integration).
0aadce1a8a ci(docker): drop -v, -n auto instead of -n 30, pip cache mount Three things were making the Docker test runs noisier and slower than they needed to be: 1. -v was hardcoded in Dockerfile.test:35 CMD and in docker-compose. test.yml's integration-test-runner command. The ci.yml change to drop -v from the bare pytest call missed both — Docker runs use the image's CMD, not the workflow's. 2. -n 30 was hardcoded as the xdist worker count. On a 2-vCPU CI box that's 30 Python processes fighting over 2 cores — mostly IPC and import-thrash overhead. -n auto adapts to the host: 2 on CI, 30 on a 30-core dev box. Same final-result throughput on the dev box, much better on small runners. 3. pip install had --no-cache-dir and no BuildKit cache mount, so every Docker build re-fetched ~50 packages from PyPI (~60-90s on a cold pip cache). Adding `RUN --mount=type=cache,target= /root/.cache/pip` (with the `# syntax=docker/dockerfile:1.7` directive that enables it) makes subsequent builds re-use the download cache so they only do install work, ~5s instead of ~90s. DOCKER_BUILDKIT=1 is already exported in test_docker.sh and is the GHA default since runner image 2023, so the cache mount is always honoured. Verified locally: Docker build is 19s warm (was ~90s cold each time), test run is 102s with 5287 passed / 1 skipped (the by-design spoolbuddy importorskip) — clean output, no [gwN] worker spam, no "created: 30/30 workers" startup line. GHA-side per-run cold-build slowness still happens because GHA runners are ephemeral; a follow-up using docker/build-push-action with type=gha cache backend would persist the BuildKit cache across CI runs but that's a bigger workflow change.
View comparison for these 15 commits »

1 day ago

yunoadmin synced commits to bug-report-assets at yunoadmin/bambuddy from mirror

b8c8af2e24 Bug report logs 9d2b699570d5483eb17066011861281a.log
c32123cc0d Bug report logs 4d44cf55241248499056f46fd6db70f0.log
View comparison for these 2 commits »

1 day ago

yunoadmin synced commits to 0.2.4.3 at yunoadmin/bambuddy from mirror

66fea70564 ci(docker): full backend suite in Docker, 4-way matrix shard, GHA cache backend Earlier patch trimmed the duplicate unit-test re-run from docker-test to drop a 5-10 min job that wasn't adding coverage. But "wasn't adding coverage" only holds for pure-logic tests — system-touching tests (ffmpeg version probes, ftp clients, subprocess shell-outs, locale/ timezone-sensitive assertions, paths) genuinely can pass on the GHA host and fail in python:3.13-slim. Curation via a `docker_env` marker is fragile (new tests get forgotten); gating on `main` only defers the cost without removing it. Instead, run the full backend suite IN Docker on every PR but make it fast: - New docker-backend-tests job runs the same 4-way pytest-split matrix as the host backend-tests, just inside the test image. - docker/setup-buildx-action + docker/build-push-action@v5 with cache-from/cache-to: type=gha,scope=backend-test persist the BuildKit cache (pip-install layer included) across CI runs and across the 4 sibling shards. Cold build is ~150s/shard; warm build drops to ~10s/shard. - fail-fast: false so a single failing shard surfaces the rest's output too. Total CI wall-clock for a PR push is now gated by docker-test (the image-build + integration HTTP smoke + integration test suite job) at ~3 min, not by the unit-test re-run anymore. The earlier ci.yml step that ran `docker compose run --rm backend-test` synchronously in the docker-test job stays removed — the new docker-backend-tests matrix covers the same ground and is much faster.
1b271a8ead ci(docker): stop re-running unit tests inside the test image The "Docker Build" job in ci.yml was running the same 5287 backend tests + 2022 frontend tests inside the bambuddy-backend-test / bambuddy-frontend-test images that the host-side backend-tests and frontend-tests jobs had already run. Same test code, same Python version (env.PYTHON_VERSION), same requirements.txt the test image installs. On 2-vCPU GHA runners that re-run added 5-10 min of wall-clock for zero new coverage — and "frontend tests in Docker" added another 2-3 min for the same reason. Drop both steps from the CI job. Keep everything that validates the Docker IMAGE specifically: production image build, backend module import verification, static-files-copied check, integration container bring-up + health/API/static HTTP smoke checks, and the integration test suite (which IS genuinely Docker-specific — it runs against the live container via BAMBUDDY_TEST_URL). test_docker.sh keeps the unit-test reruns because devs running it locally don't have a separate host-side pytest job to compare against. Combined with the earlier 4-way pytest-split shard on the host backend-tests job, expected PR-push wall-clock drops from ~10-12 min to ~3 min, gated on max(backend-tests shard, frontend tests, docker-image-build+integration).
e377b4aaa9 ci(docker): drop -v, -n auto instead of -n 30, pip cache mount Three things were making the Docker test runs noisier and slower than they needed to be: 1. -v was hardcoded in Dockerfile.test:35 CMD and in docker-compose. test.yml's integration-test-runner command. The ci.yml change to drop -v from the bare pytest call missed both — Docker runs use the image's CMD, not the workflow's. 2. -n 30 was hardcoded as the xdist worker count. On a 2-vCPU CI box that's 30 Python processes fighting over 2 cores — mostly IPC and import-thrash overhead. -n auto adapts to the host: 2 on CI, 30 on a 30-core dev box. Same final-result throughput on the dev box, much better on small runners. 3. pip install had --no-cache-dir and no BuildKit cache mount, so every Docker build re-fetched ~50 packages from PyPI (~60-90s on a cold pip cache). Adding `RUN --mount=type=cache,target= /root/.cache/pip` (with the `# syntax=docker/dockerfile:1.7` directive that enables it) makes subsequent builds re-use the download cache so they only do install work, ~5s instead of ~90s. DOCKER_BUILDKIT=1 is already exported in test_docker.sh and is the GHA default since runner image 2023, so the cache mount is always honoured. Verified locally: Docker build is 19s warm (was ~90s cold each time), test run is 102s with 5287 passed / 1 skipped (the by-design spoolbuddy importorskip) — clean output, no [gwN] worker spam, no "created: 30/30 workers" startup line. GHA-side per-run cold-build slowness still happens because GHA runners are ephemeral; a follow-up using docker/build-push-action with type=gha cache backend would persist the BuildKit cache across CI runs but that's a bigger workflow change.
fdf818e54c fix(test): stop sys.modules-deleting backend.app.main in test_code_quality + ci: shard backend tests 4-way + drop -v for ~3.5x wall-clock speedup Root cause of the 4 CI failures on PR #1514 (all in test_print_start_assigns_printer_id_to_vp_archive.py + test_timelapse_baseline_restart_recovery.py): test_all_modules_importable in test_code_quality.py was deleting backend.app.main from sys.modules and re-importing it via importlib.import_module. That created NEW module-level dicts (_timelapse_baselines, _expected_prints, _active_prints, …) and re-ran root_logger.addHandler — hence the duplicate log lines at the same microsecond in captured stderr. Any sibling test that bound those names via "from backend.app.main import _timelapse_baselines" before the reimport now held a reference to the OLD dict; production code (reached via "from backend.app.main import on_print_start") resolved the symbol through the NEW module instance. Production mutated the new dict, the test read the old one, the assertion saw None / un-mutated mock_archive. Locally with -n 30, xdist load-balanced test_code_quality.py to a different worker process so the collision never happened (which is why the suite was green for me). CI's -n auto = -n 2 on ubuntu-latest made the collision deterministic. Fix: drop the "del sys.modules[name]" step. importlib.import_module already returns the cached module if cached, or runs the import machinery if not — either way, any import-time error surfaces. The "fresh import" framing was theatre; in practice every module in the list is already imported by other tests/fixtures before this test runs, so we were never actually getting a fresh import anyway — just destruction. CI workflow tightening (separate concern, same PR since both touch the test infrastructure): - Dropped -v from the pytest invocation. 5300+ "PASSED foo::bar" lines per worker were eating ~30-60s of stdout I/O on 2-vCPU runners. --tb=short is sufficient for failure context. - Sharded backend-tests into a 4-way matrix via pytest-split (new dev dep). Each shard runs ~1326 tests in ~95s on a 2-vCPU runner; all 4 run in parallel so wall-clock drops from 362s -> ~100s. - fail-fast: false on the matrix so a single failing shard doesn't hide failures in the other three — PRs see the complete failure picture in one push.
b92bdb7d09 fix(test): snapshot _timelapse_baselines inside the patch context to dodge CI race test_running_observed_captures_baseline_on_restart_recovery was reading _timelapse_baselines.get(1) after the patch() with-block exited. Locally and under low parallelism this works fine — the dict still holds what _capture_timelapse_baseline_at_start wrote. CI under xdist's default load-balancing scheduling intermittently saw the dict empty by the time the top-level assert ran, even though the production code logged "Baseline at print start: 3 video files for printer 1" right before returning. The duplicate log line at the same microsecond in the captured stderr is the tell — module state is being re-touched between the handler completing and the test asserting, almost certainly via the session-scoped event_loop fixture in conftest.py interacting badly with the per-file autouse _clear_baselines teardown of a sibling test on the same worker. The test is verifying the handler captured the baseline at the moment it returned, so capture the relevant value at exactly that point — inside the with-block, immediately after the await. That's immune to whatever happens to the module-level dict afterward.
View comparison for these 10 commits »

1 day ago