Removes the separate config.json file bind mount from both compose
files. The data directory mount already covers it, and the Go server
searches /app/data/config.json via LoadConfig.
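For context, a minimal sketch of the lookup this relies on; only the /app/data/config.json path is confirmed by this change, while the helper name, fallback path, and error text are illustrative assumptions:

```go
// Hedged sketch: resolve the config path the way LoadConfig is described to,
// preferring the data directory. The fallback entry is an assumption.
func findConfig() (string, error) {
    candidates := []string{
        "/app/data/config.json", // covered by the data directory bind mount
        "config.json",           // local-dev fallback (assumption)
    }
    for _, p := range candidates {
        if _, err := os.Stat(p); err == nil {
            return p, nil // first existing candidate wins
        }
    }
    return "", fmt.Errorf("config.json not found in any candidate path")
}
```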
- Entrypoint symlinks /app/data/config.json for ingestor compatibility
- manage.sh setup creates config in data dir, prompts admin if missing
- manage.sh start checks config exists before starting, offers to create
- deploy.yml simplified — no more sudo rm or directory cleanup
- Backup/restore updated to use data dir path
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
- Add 1/2/3-byte selector to Hash Issues analytics page
- 1-byte and 2-byte modes show 16×16 matrix with stat cards (nodes
tracked, using N-byte ID, prefix space used, prefix collisions)
- 3-byte mode shows summary stat cards instead of unrenderable grid
- Fix "Nodes tracked" to always show total node count across all modes
- Use CSS variable colours for matrix cells (light/dark mode compatible)
- Replace native title tooltips with custom styled popovers
- Hide collision risk card when 3-byte mode is selected
- Fix double-tooltip bug on mode switch via _matrixTipInit guard
- Fix tooltip persisting outside matrix grid on mouseleave
https://dev.ve7kod.ca/#/analytics (Hash Issues)
---------
Co-authored-by: Jesse <your@email.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Fixes #276
## Root cause
TRACE packets store hop IDs in the payload (bytes 9+) rather than in the
header path field. The header path field is overloaded in TRACE packets
to carry RSSI values instead of repeater IDs (as noted in the issue
comments). This meant `Path.Hops` was always empty for TRACE packets —
the raw bytes ended up as an opaque `PathData` hex string with no
structure.
The hashSize encoded in the header path byte (bits 6–7) is still valid
for TRACE and is used to split the payload path bytes into individual
hop prefixes.
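As a hedged illustration (not the actual decoder code), extracting that two-bit field could look like the following; how the field value maps to a 1, 2, or 3 byte prefix width is left to the real decoder:

```go
// Hedged sketch: the hash-size field sits in bits 6-7 of the header path byte.
// The mapping from this two-bit value to a prefix width in bytes is not
// assumed here.
func hashSizeField(pathByte byte) byte {
    return (pathByte >> 6) & 0x03
}
```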
## Fix
After decoding a TRACE payload, if `PathData` is non-empty, parse it
into individual hops using `path.HashSize`:
```go
if header.PayloadType == PayloadTRACE && payload.PathData != "" {
    pathBytes, err := hex.DecodeString(payload.PathData)
    if err == nil && path.HashSize > 0 {
        for i := 0; i+path.HashSize <= len(pathBytes); i += path.HashSize {
            // each slice of HashSize bytes is one hop prefix
            path.Hops = append(path.Hops, ...)
        }
    }
}
```
Applied to both `cmd/ingestor/decoder.go` and `cmd/server/decoder.go`.
## Verification
Packet from the issue: `260001807dca00000000007d547d`
| Field | Before | After |
|---|---|---|
| `Path.Hops` | `[]` | `["7D", "54", "7D"]` |
| `Path.HashCount` | `0` | `3` |
New test `TestDecodeTracePathParsing` covers this exact packet.
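For illustration, a hedged sketch of the kind of assertion such a test makes; `DecodePacket` and the exact field layout are assumptions, not the project's real API:

```go
func TestDecodeTracePathParsing(t *testing.T) {
    raw, err := hex.DecodeString("260001807dca00000000007d547d")
    if err != nil {
        t.Fatal(err)
    }
    pkt, err := DecodePacket(raw) // hypothetical decoder entry point
    if err != nil {
        t.Fatal(err)
    }
    if want := []string{"7D", "54", "7D"}; !reflect.DeepEqual(pkt.Path.Hops, want) {
        t.Fatalf("Path.Hops = %v, want %v", pkt.Path.Hops, want)
    }
    if pkt.Path.HashCount != 3 {
        t.Fatalf("Path.HashCount = %d, want 3", pkt.Path.HashCount)
    }
}
```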
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
The in-memory `PacketStore` had **no eviction or aging** — it grew
unbounded until OOM killed the process. At ~3K packets/hour and ~5KB per
packet (not the 450 bytes previously estimated), an 8GB VM would OOM in
a few days.
## Changes
### Time-based eviction
- Configurable via `config.json`: `"packetStore": { "retentionHours": 24
}`
- Packets older than the retention window are evicted from the head of
the sorted slice
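A rough sketch of the time-based pass, assuming the store keeps transmissions oldest-first; struct and field names here are assumptions, not the actual implementation:

```go
// Hedged sketch: drop everything older than the retention window from the
// head of the oldest-first slice, then clean up the per-packet indexes.
func (s *PacketStore) evictExpired(retention time.Duration) {
    if retention <= 0 {
        return // 0 means unlimited, matching the config default
    }
    cutoff := time.Now().Add(-retention)
    n := 0
    for n < len(s.txs) && s.txs[n].ReceivedAt.Before(cutoff) {
        n++
    }
    for _, tx := range s.txs[:n] {
        s.removeFromIndexes(tx) // hypothetical helper, see index cleanup
    }
    s.txs = s.txs[n:]
    s.evicted += uint64(n)
}
```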
### Memory-based cap
- Configurable via `"packetStore": { "maxMemoryMB": 1024 }`
- Hard ceiling — evicts oldest packets when estimated memory exceeds the
cap
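A matching sketch for the memory cap, again with assumed names; the size estimate mirrors the stats fix below rather than the real accounting:

```go
// Hedged sketch: evict oldest transmissions until the estimated footprint
// drops under the configured ceiling.
func (s *PacketStore) evictOverMemory(maxMemoryMB int) {
    if maxMemoryMB <= 0 {
        return // 0 means no cap
    }
    limit := int64(maxMemoryMB) * 1024 * 1024
    for len(s.txs) > 0 && s.estimatedBytes() > limit {
        tx := s.txs[0] // oldest entry
        s.removeFromIndexes(tx)
        s.txs = s.txs[1:]
        s.evicted++
    }
}
```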
### Index cleanup
When a `StoreTx` is evicted, ALL associated data is removed from:
- `byHash`, `byTxID`, `byObsID`, `byObserver`, `byNode`, `byPayloadType`
- `nodeHashes`, `distHops`, `distPaths`, `spIndex`
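A hedged sketch of what that cleanup amounts to; the key fields used for each map are assumptions:

```go
// Hedged sketch: remove every index entry that points at the evicted StoreTx.
func (s *PacketStore) removeFromIndexes(tx *StoreTx) {
    delete(s.byHash, tx.Hash)
    delete(s.byTxID, tx.TxID)
    for _, obs := range tx.Observations {
        delete(s.byObsID, obs.ID)
    }
    // byObserver, byNode, byPayloadType, nodeHashes, distHops, distPaths and
    // spIndex hold slices or counters keyed by other values, so entries that
    // reference tx are filtered out rather than deleted wholesale.
}
```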
### Periodic execution
- Background ticker runs eviction every 60 seconds
- Analytics caches and hash size cache are invalidated after eviction
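Roughly, with method names assumed for illustration:

```go
// Hedged sketch of the background loop; shutdown handling is omitted.
go func() {
    ticker := time.NewTicker(60 * time.Second)
    defer ticker.Stop()
    for range ticker.C {
        store.runEviction()         // time-based + memory-based passes
        store.invalidateAnalytics() // drop analytics and hash size caches
    }
}()
```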
### Stats fixes
- `estimatedMB` now uses ~5KB/packet + ~500B/observation (was 430B +
200B)
- `evicted` counter reflects actual evictions (was hardcoded to 0)
- Removed fake `maxPackets: 2386092` and `maxMB: 1024` from stats
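In other words, the estimate now works out to roughly the following (exact constants are an assumption):

```go
// Hedged sketch of the revised estimate: ~5KB per packet, ~500B per observation.
estimatedBytes := int64(numPackets)*5*1024 + int64(numObservations)*500
estimatedMB := float64(estimatedBytes) / (1024 * 1024)
```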
### Config example
```json
{
  "packetStore": {
    "retentionHours": 24,
    "maxMemoryMB": 1024
  }
}
```
Both values default to 0 (unlimited) for backward compatibility.
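One way the config could be modeled on the Go side, as a sketch; the struct name is an assumption, only the JSON keys come from the example above:

```go
// Hedged sketch: zero values fall through to "unlimited" behaviour.
type PacketStoreConfig struct {
    RetentionHours int `json:"retentionHours"` // 0 = keep packets forever
    MaxMemoryMB    int `json:"maxMemoryMB"`    // 0 = no memory ceiling
}
```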
## Tests
- 7 new tests in `eviction_test.go` covering time-based, memory-based,
index cleanup, thread safety, config parsing, and no-op when disabled
- All existing tests pass unchanged
Co-authored-by: Kpa-clawbot <kpabap+clawdbot@gmail.com>
## Problem
The RF analytics `packetsPerHour` chart was counting **observations**
instead of **unique transmissions** per hour. With ~34 observations per
transmission on average, the chart showed ~5,645 packets/hr instead of
the correct ~163/hr.
**Evidence from prod API:**
- `packetsPerHour` total: 1,580,620 (sum of all hourly counts)
- `totalPackets`: 45,764
- That's a ~34× inflation — exactly the observations-per-transmission
ratio
## Root Cause
In `store.go`, the `hourBuckets[hr]++` counter was inside the
observations loop (both regional and non-regional paths). Other counters
like `packetSizes` and `typeBuckets` already deduplicate by hash —
`hourBuckets` was the only one that didn't.
## Fix
Added a `seenHourHash` map (keyed by `hash|hour`) to deduplicate. Each
unique transmission is counted once per hour bucket, matching how packet
sizes and payload types already work.
Both the regional observer path and the non-regional path are fixed. The
legacy path (transmissions without observations) was already correct
since it iterates per-transmission.
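As a hedged sketch of the dedup (field names and the hour-bucket key format are assumptions; only `seenHourHash` and the `hash|hour` key come from the change):

```go
// Hedged sketch: seenHourHash lives for the whole aggregation pass so each
// transmission is counted at most once per hour bucket.
seenHourHash := make(map[string]bool)
for _, tx := range transmissions {
    for _, obs := range tx.Observations {
        hr := obs.ReceivedAt.Truncate(time.Hour).Format("2006-01-02T15") // assumed key format
        key := tx.Hash + "|" + hr
        if seenHourHash[key] {
            continue
        }
        seenHourHash[key] = true
        hourBuckets[hr]++
    }
}
```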
Co-authored-by: Kpa-clawbot <kpabap+clawdbot@gmail.com>
Root causes from CI logs:
1. 'read /app/config.json: is a directory' — Docker creates a directory
   when bind-mounting a non-existent file. The entrypoint now detects a
   directory at config.json, removes it, and falls back to the example config.
2. 'unable to open database file: out of memory (14)' — old container
(3GB) not fully exited when new one starts. Deploy now uses
'docker compose down' with timeout and waits for memory reclaim.
3. Supervisor gave up after 3 fast retries (FATAL in ~6s). Increased
startretries to 10 and startsecs to 2 for server and ingestor.
Additional:
- Deploy step ensures staging config.json exists before starting
- Healthcheck: added start_period=60s, increased timeout and retries
- No longer uses manage.sh (CI working dir != repo checkout dir)
Root cause: on the 8GB VM, both prod (~2.5GB) and staging (~2GB) containers
run simultaneously. During deploy, manage.sh would rm the old staging container
and immediately start a new one. The old container's memory wasn't reclaimed
yet, so the new one got 'unable to open database file: out of memory (14)'
from SQLite and both corescope-server and corescope-ingestor entered FATAL.
Fix:
- manage.sh restart staging: wait up to 15s for old container to fully exit,
plus 3s for OS memory reclamation before starting new container
- manage.sh restart staging: verify config.json exists before starting
- docker-compose.staging.yml: add deploy.resources.limits.memory=3g to
prevent staging from consuming unbounded memory
## Summary
Adds `distributionByRepeaters` to the `/api/analytics/hash-sizes`
endpoint in the **Go server**.
### Problem
PR #263 implemented this feature in the deprecated Node.js server
(server.js). All backend changes should go in the Go server at
`cmd/server/`.
### Solution
- For each hash size (1, 2, 3), count how many unique repeaters (nodes)
advertise packets with that hash size
- Uses the existing `byNode` map already computed in
`computeAnalyticsHashSizes()`
- Added to both the live response and the empty/fallback response in
routes.go
- Frontend changes from PR #263 (`public/analytics.js`) already render
this field — no frontend changes needed
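A hedged sketch of that counting; `byNode`'s exact value type isn't shown here, so it is assumed to map each node to the set of hash sizes it has been seen with:

```go
// Hedged sketch: count unique repeaters per hash size from the assumed
// byNode map (node ID -> set of observed hash sizes).
distributionByRepeaters := map[string]int{"1": 0, "2": 0, "3": 0}
for _, sizes := range byNode {
    for size := range sizes {
        if size >= 1 && size <= 3 {
            distributionByRepeaters[strconv.Itoa(size)]++
        }
    }
}
```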
### Response shape
```json
{
"distributionByRepeaters": { "1": 42, "2": 7, "3": 2 },
...existing fields...
}
```
### Testing
- All Go server tests pass
- Replaces PR #263 (which modified the wrong server)
Closes #263
---------
Co-authored-by: you <you@example.com>
`docker compose down` tears down the entire compose project, including prod.
`docker compose rm -sf` stops and removes just the named service.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Version/Commit/BuildTime now populated from package.json, git, and
date. Exported as env vars so docker compose build picks them up.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
## Summary
Complete CI pipeline restructure. Sequential fail-fast chain, E2E tests
against Go server with real staging data, all deprecated Node.js server
tests removed.
### Pipeline (PR):
1. **Go unit tests** — fail-fast, coverage + badges
2. **Playwright E2E** — against Go server with fixture DB, frontend
coverage, fail-fast on first failure
3. **Docker build** — verify containers build
### Pipeline (master merge):
Same chain + deploy to staging + badge publishing
### Removed:
- All Node.js server-side unit tests (deprecated JS server)
- `npm ci` / `npm run test` steps
- JS server coverage collection (`COVERAGE=1 node server.js`)
- Changed-files detection logic
- Docs-only CI skip logic
- Cancel-workflow API hacks
### Added:
- `test-fixtures/e2e-fixture.db` — real data from staging (200 nodes, 31
observers, 500 packets)
- `scripts/capture-fixture.sh` — refresh fixture from staging API
- Go server launches with `-port 13581 -db test-fixtures/e2e-fixture.db
-public public-instrumented`
---------
Co-authored-by: Kpa-clawbot <kpabap+clawdbot@gmail.com>
Co-authored-by: you <you@example.com>
Three optimizations to reduce wall-clock time:
1. Reduce safeClick timeout from 3000ms to 500ms
- Elements either exist immediately after navigation or don't exist at all
- ~75 safeClick calls; if ~30 miss, saves ~75s of dead wait time
2. Replace 18 page.goto() calls with SPA hash navigation
- After initial page load, the SPA shell is already in the DOM
- page.goto() reloads the entire page (network round-trip + parse)
- Hash navigation via location.hash triggers the SPA router instantly
- Only 3 page.goto() remain: initial load + 2 home page loads after localStorage.clear()
3. Remove redundant final route sweep
- All 10 routes were already visited during the page-specific sections
- The sweep just re-navigated to pages that had already been exercised
- Saves ~2s of redundant navigation
Also:
- Reduce inter-route wait from 200ms to 50ms (SPA router is synchronous)
- Merge utility function + packet filter exercises into single evaluate() call
- Use navHash() helper for consistent hash navigation with 150ms settle time
The test 'Node perf page should NOT show Go Runtime section' asserts
Node.js-specific behavior, but E2E tests now run against the Go server
(per this PR), so Go Runtime info is correctly present. Remove the
now-irrelevant assertion.
The Playwright E2E tests were starting `node server.js` (the deprecated
JS server) instead of the Go server, meaning E2E tests weren't testing
the production backend at all.
Changes:
- Add Go 1.22 setup and build steps to the node-test job
- Build the Go server binary before E2E tests run
- Replace `node server.js` with `./corescope-server` in both the
instrumented (coverage) and quick (no-coverage) E2E server starts
- Use `-port 13581` and `-public` flags to configure the Go server
- For coverage runs, serve from `public-instrumented/` directory
The Go server serves the same static files and exposes compatible
/api/* routes (stats, packets, health, perf) that the E2E tests hit.