meshcore-analyzer

mirror of https://github.com/Kpa-clawbot/meshcore-analyzer.git synced 2026-07-23 10:00:58 +00:00

Author	SHA1	Message	Date
Kpa-clawbot	dae08a0ebb	ci: update go-ingestor-coverage.json [skip ci]	2026-05-19 15:28:15 +00:00
Kpa-clawbot	bb9b4ba6cb	ci: update frontend-tests.json [skip ci]	2026-05-19 15:28:14 +00:00
Kpa-clawbot	1017f422c2	ci: update frontend-coverage.json [skip ci]	2026-05-19 15:28:12 +00:00
Kpa-clawbot	3f8698e8e4	ci: update e2e-tests.json [skip ci]	2026-05-19 15:28:11 +00:00
Kpa-clawbot GitHub bot	749fdc114f	feat(decoder+ui): close remaining P2 items from #1279 — payloadTypeNames, legend, TransportCodes, Feat1/2, RAW_CUSTOM, sensor docs (#1291 ) RED commit: `dc4c0800` — CI: https://github.com/Kpa-clawbot/CoreScope/actions?query=branch%3Afix%2Fissue-1279-p2 Closes the remaining six 🟢 P2 items in umbrella #1279 (PR #1280 shipped P0+P1, PR #1276 shipped ACK/RESPONSE/PATH legend rows). ### Item-by-item \| # \| Item \| Where \| Test \| \|---\|---\|---\|---\| \| 1 \| `payloadTypeNames` parity \| `cmd/server/store.go` \| `cmd/server/issue1279_p2_test.go::TestPayloadTypeNamesAll13` \| \| 2 \| Legend rows: Anon Req / Grp Data / Multipart / Control / Raw Custom \| `public/live.js` \| `test-issue-1279-legend-p2-e2e.js` (Playwright) \| \| 3 \| TransportCodes detail-row + `code1=` / `code2=` filter grammar \| `public/packets.js`, `public/packet-filter.js` \| `test-issue-1279-p2-code-filter.js` (6 cases) \| \| 4 \| Multibyte capability badge on node detail/list rows \| `public/nodes.js::renderNodeBadges` \| `n.hash_size >= 2` (observable Feat1/Feat2 proxy; firmware `AdvertDataHelpers.h:14-16`) \| \| 5 \| RAW_CUSTOM (0x0F) `{rawLength, firstByteTag}` decode + detail-row \| `cmd/server/decoder.go`, `cmd/ingestor/decoder.go`, `public/packets.js` \| `TestDecodeRawCustomExposesLengthAndTag` × 2 + updated `TestDecodePayloadRAWCustom` \| \| 6 \| Sensor advert telemetry firmware-derivation comments \| `cmd/ingestor/decoder.go:363-380` \| pure comments — exempt per AGENTS \| ### Firmware refs cited inline - `firmware/src/Packet.h:19-32` — PAYLOAD_TYPE_* constants - `firmware/src/Packet.h:46` — TransportCodes wire layout - `firmware/src/Mesh.cpp:577` — `createRawData` - `firmware/src/helpers/SensorMesh.{h,cpp}` — sensor advert telemetry derivation - `firmware/src/helpers/AdvertDataHelpers.h:14-16` — Feat1/Feat2 ### TDD Red `dc4c0800` proves the assertions gate behavior: - `payloadTypeNames` had only 12 entries (no 0x0F). - RAW_CUSTOM decoded as `UNKNOWN` with no envelope fields. Green `<HEAD>` makes both green; per-item tests included. ### Cross-stack note Cross-stack: justified — items 1/5 add decoder output fields; items 2/3/4/5 surface those fields in the UI in the same PR per #1279 acceptance. ### Out of scope Item 4 surfaces the observable multibyte capability via the persisted `hash_size` (Feat1/Feat2 wire bits are only on transient adverts and not stored per-node today); persisting raw Feat1/Feat2 per-node is left for a follow-up. Fixes #1279 --------- Co-authored-by: bot <bot@corescope>	2026-05-19 08:08:28 -07:00
Kpa-clawbot	9ff6732e04	ci: update go-server-coverage.json [skip ci]	2026-05-19 08:38:50 +00:00
Kpa-clawbot	43d10ac2ef	ci: update go-ingestor-coverage.json [skip ci]	2026-05-19 08:38:48 +00:00
Kpa-clawbot	ea531e0e65	ci: update frontend-tests.json [skip ci]	2026-05-19 08:38:47 +00:00
Kpa-clawbot	7fd1722986	ci: update frontend-coverage.json [skip ci]	2026-05-19 08:38:46 +00:00
Kpa-clawbot	9fe4db7cc6	ci: update e2e-tests.json [skip ci]	2026-05-19 08:38:45 +00:00
Kpa-clawbot GitHub corescope-bot	467b01a1b3	fix(#1285 ): exclude RTC-reset outliers from clock-skew hash median + recent bad count (#1288 ) Red commit: `97c9a22a55` (CI: https://github.com/Kpa-clawbot/CoreScope/commit/97c9a22a55b07d1576c579aa9d23b290dad33eb6/checks) Fixes #1285. ## What was broken Bug A — outlier-dominated hash-evidence median. On the per-hash evidence panel a single observer reporting an RTC-reset advert (firmware emitting factory timestamp, ~700d off) dragged the displayed median to "median corrected: -704d 18h" even when every other observer of that hash saw a normal value. Bug B — false "N of last K had nonsense timestamps" warning. `recentBadSampleCount` lumped RTC-reset adverts in with "bimodal-bad" samples. On the repro node every recent skew was -16…-22s (healthy), but the lone RTC-reset advert that landed inside the recent window was counted as bad → "3 of last 5 adverts had nonsense timestamps" fired and the node was misclassified `bimodal_clock`. Root cause of B: the recent-window split (`cmd/server/clock_skew.go` ~L575) classified anything `\|corrected skew\| > 1h` as "bad". That conflates true bimodal RTC oscillation (1h…24h) with factory-timestamp resets (>24h, already surfaced via the RTC-reset badge). ## Fix - New `rtcResetOutlierThresholdSec = 24h`. Rationale: real µC drift is sub-second/advert; real bimodal RTC misbehaves in the hours range; anything >1d is not a drift signal. - Recent-window split puts `\|skew\| > 24h` in a third bucket excluded from both `recentSampleCount` and `recentBadCount`. - New `hashEvidenceMedian()` filters outliers before computing the per-hash median. UI labels the hash "insufficient data (N RTC-reset outliers excluded)" when every observer saw a reset-shaped advert. - Three pre-existing #845 tests used -50M-sec "bad" samples (RTC-reset range) — re-pointed to -7200s (true bimodal range), what `bimodal_clock` actually models. ## Preflight overrides - check-branch-clean: cross-stack: justified — backend computes counts/median; frontend renders the new label. ## Browser verification Confirmed staging node `c0dedad…` repro matches the test fixture. No new CSS vars. ## E2E assertion added `cmd/server/clock_skew_issue1285_test.go:81` and `:103`. --------- Co-authored-by: corescope-bot <bot@corescope.local>	2026-05-19 01:17:12 -07:00
Kpa-clawbot	2fff0e6d7b	ci: update go-server-coverage.json [skip ci]	2026-05-19 07:10:13 +00:00
Kpa-clawbot	487300460c	ci: update go-ingestor-coverage.json [skip ci]	2026-05-19 07:10:12 +00:00
Kpa-clawbot	cbcc33f636	ci: update frontend-tests.json [skip ci]	2026-05-19 07:10:11 +00:00
Kpa-clawbot	d5f8b9ba22	ci: update frontend-coverage.json [skip ci]	2026-05-19 07:10:09 +00:00
Kpa-clawbot	4ad2068ff2	ci: update e2e-tests.json [skip ci]	2026-05-19 07:10:08 +00:00
Kpa-clawbot GitHub MeshCore Bot	1da2034341	refactor(db): move all writes from server to ingestor; server truly read-only (fixes #1283 ) (#1286 ) Red commit: `f6290b63` — CI run will appear at https://github.com/Kpa-clawbot/CoreScope/actions Fixes #1283. ## What Moves all four DB write operations out of `cmd/server/` into `cmd/ingestor/`, making the server truly read-only and eliminating the SQLITE_BUSY VACUUM bug at its root: the server can no longer race the ingestor for the write lock because the server has no write path. ## The four operations \| # \| Was in \| Now in \| \|---\|--------\|--------\| \| 1 \| `cmd/server/vacuum.go` (`checkAutoVacuum`, full VACUUM + `auto_vacuum=INCREMENTAL` migration) \| `cmd/ingestor/db.go` `Store.CheckAutoVacuum` (already existed; ingestor runs it at startup before the MQTT subscriber starts → no contention) \| \| 2 \| `cmd/server/db.go` `PruneOldPackets` (`DELETE FROM transmissions`) \| `cmd/ingestor/maintenance.go` `Store.PruneOldPackets` (new) + 24h ticker in `cmd/ingestor/main.go` \| \| 3 \| `cmd/server/db.go` `PruneOldMetrics` (`DELETE FROM observer_metrics`) \| `cmd/ingestor/db.go` `Store.PruneOldMetrics` (already existed) \| \| 4 \| `cmd/server/db.go` `RemoveStaleObservers` (`UPDATE observers SET inactive=1`) \| `cmd/ingestor/db.go` `Store.RemoveStaleObservers` (already existed) \| ## HTTP surface - Removed: `POST /api/admin/prune` (`handleAdminPrune`, route, openapi entry). Operators trigger an ad-hoc prune by restarting the ingestor. - Kept: `GET /api/backup` — uses `VACUUM INTO` which writes to a separate file, not the live DB; read-only-safe. ## Tests - `cmd/server/readonly_invariant_test.go` (RED gate) — reflect-asserts `PruneOldPackets`/`PruneOldMetrics`/`RemoveStaleObservers` are NOT methods on the server's `DB`. Fails on master, passes after this PR. - `cmd/ingestor/issue1283_test.go` — exercises `Store.PruneOldPackets` and the auto_vacuum=NONE → INCREMENTAL migration through `Store.CheckAutoVacuum` with `vacuumOnStartup=true`. ## Why the bug is gone The SQLITE_BUSY VACUUM failure happened because supervisord launched both ingestor + server in one container; the ingestor took the write lock for INSERTs and the server's `checkAutoVacuum` then failed to acquire it within `busy_timeout=5000`. After this PR, only the ingestor ever opens a writable connection, and it runs `CheckAutoVacuum` before* spawning the MQTT subscriber → no contention possible. ## Scope notes - `cachedRW()` still has three pre-existing callers in `cmd/server/` (`neighbor_persist.go`, `ensure_indexes.go`, `from_pubkey_migration.go`). These pre-date #1283 and are not in the issue's four-operation list. Leaving them for follow-up keeps this PR honest about scope; AGENTS.md documents the invariant so new write paths can't sneak in. - PII preflight reports false positives on the Go method name `requireAPIKey` in `routes.go` diff context — no real PII. - Server-side neighbor-edge prune (`PruneNeighborEdges`) intentionally left in place — out of scope of #1283. --------- Co-authored-by: MeshCore Bot <bot@meshcore.local>	2026-05-18 23:52:27 -07:00
Kpa-clawbot GitHub bot	e2d320449b	fix(#1281 ): hide empty Location row + theme map link via --accent (#1284 ) ## Summary Minimal fix for #1281 — two surgical changes to the packet detail pane: 1. Hide the `Location` row when transmitter GPS is unavailable. Only ADVERT packets carry unencrypted GPS in their payload, so ~90% of packet types (TXT_MSG, GRP_TXT, ACK, REQ, MULTIPART, …) were rendering `<dt>Location</dt><dd>—</dd>` for nothing. We now skip the `<dt>/<dd>` pair entirely when `locationHtml` is empty. ADVERT rendering is unchanged. 2. Fix the `📍map` link contrast in dark mode. The trailing link had only `style="font-size:0.85em"` and inherited the UA-default `<a>` blue (`rgb(0,0,238)`) → unreadable against `--card-bg` in dark theme. Replaced inline style with `class="loc-map-link"` and added a small CSS rule that pulls color from `var(--accent)`. ### Out of scope (per operator direction) The original issue also proposed adding an `Rx:` observer-GPS line and distance-from-observer. Not in this PR — operator decided the existing observer IATA pill already conveys that, so adding more rows here is unnecessary. Bullets 1–2 of the issue's "Acceptance" list are covered; the multi-line `Tx:`/`Rx:` reformat is intentionally not done. ## TDD - Red `d465cf84` — `test-issue-1281-location-row-e2e.js` asserting: - Non-ADVERT detail must NOT contain `<dt>Location</dt>` - ADVERT detail STILL contains `<dt>Location</dt>` with GPS coords - `.loc-map-link` computed `color` equals `var(--accent)` (not UA blue) Verified to fail on master (`1 passed, 2 failed`) — see commit body. - Green `8c9bd8cb` — implementation. All three assertions pass. - CI wiring `9571b4f4` — added the test to `deploy.yml`'s E2E block. ## Files changed - `public/packets.js` — empty-string default for `locationHtml`, conditional `<dt>/<dd>` render, three sites swap inline style → class. - `public/style.css` — new `.loc-map-link { color: var(--accent); … }` rule next to `.detail-meta dd`. - `test-issue-1281-location-row-e2e.js` — new Playwright E2E. - `.github/workflows/deploy.yml` — one-line CI hook. ## Acceptance verification (against fixture DB) ``` === #1281 Location row + map link contrast E2E against http://localhost:13581 === ✓ Non-ADVERT packet detail does NOT render <dt>Location</dt> ✓ ADVERT packet detail STILL renders <dt>Location</dt> with GPS coords link.color=rgb(74, 158, 255) --accent→rgb(74, 158, 255) ✓ 📍map link uses class="loc-map-link" with color = var(--accent) 3 passed, 0 failed ``` Fixes #1281 --------- Co-authored-by: bot <bot@local>	2026-05-18 23:37:04 -07:00
Kpa-clawbot GitHub corescope-bot	d667dc0a74	fix(#1278 ): /api/nodes/{pk}/paths uses canonical persisted resolved_path (drop anchor-bias inconsistency) (#1282 ) First failing (RED) commit: `c994c5a7` — CI: https://github.com/Kpa-clawbot/CoreScope/actions Fixes #1278. ## Root cause `handleNodePaths` (`cmd/server/routes.go`) anchored the disambiguator with the queried node as `hopContext` (`hopContext := []string{lowerPK}`). For ambiguous short-prefix hops (e.g. two nodes sharing the 1-byte prefix `C0`), tier-1/2 hop-context resolution then biased the resolver to pick the queried node — even though the CANONICAL persisted `resolved_path` (what `/api/packets/{hash}` shows via `fetchResolvedPathForTxBest`) had picked the OTHER colliding node at ingest time. The `containsTarget` gate accepted those packets and rendered the queried node into the displayed hop, while the packets page (reading the canonical resolved_path) showed a different node. The two pages disagreed. Confirmed on staging: `/api/nodes/c0dedad…/paths` returned `sampleHash 6c4af39ee4b7e202`; `/api/packets/6c4af39ee4b7e202.resolved_path[3]` = `c0ffeec7…`, not `c0dedad…`. ## Option chosen — A For each candidate tx, read the canonical persisted `resolved_path` via `fetchResolvedPathForTxBest`. When present, use it for BOTH: - the `containsTarget` membership decision (queried pubkey must appear in the canonical resolved hops), and - the displayed hop names (zipped parallel to `tx.PathJSON`). When absent (older data / async backfill not yet complete) the legacy biased re-resolve is kept as a fallback — there's no canonical answer to be consistent with, and dropping the bias unconditionally would regress #1197. ## Why not B / C - B (drop bias only for membership): still re-resolves display with bias → display vs packets page can still diverge for hop names. Option A fixes both. - C (drop `hopContext` entirely): regresses #1197 / breaks the `resolve_context_callsites_test.go` gate. ## Performance Same O(N) walk over candidates; one extra `fetchResolvedPathForTxBest` per candidate, LRU-cached, worst case a single SQL row. ## Tests - RED: `cmd/server/paths_anchor_bias_test.go` — seeds two `c0…` nodes + a tx whose best-obs resolved_path picks the GPS node; asserts the no-GPS node's `/paths` excludes the tx and the GPS node's includes it. Mutation-verified (fails on master). - All existing tests green (including #1197 callsite gate and #929 prefix-collision exclusion). --------- Co-authored-by: corescope-bot <bot@corescope>	2026-05-18 23:19:30 -07:00
Kpa-clawbot GitHub OpenClaw Bot	e6c30e1a7e	feat(decoder): GRP_DATA + MULTIPART + advertRole fix + CONTROL flags (#1279 P0+P1) (#1280 ) Addresses the four P0+P1 firmware reconciliation gaps from the umbrella audit (issue #1279). RED commit: `0a4c084e` (asserts on stub returns; all 13 assertions fail). GREEN commit: `13867681`. ## What's in this PR ### P0 — silently dropped data - #1 GRP_DATA (0x06) decoder. Outer envelope is the same shape as GRP_TXT (`channel_hash(1)+MAC(2)+ciphertext`) per `firmware/src/helpers/BaseChatMesh.cpp:476,500`. Factored `decryptChannelBlock(...)` helper used by both 5 and 6. When a channel key matches, the inner is parsed per `firmware/src/helpers/BaseChatMesh.cpp:382-385` as `data_type(uint16 LE) + data_len(1) + blob(data_len)`. Surfaces `{channelHash, MAC, dataType, dataLen, decryptedBlob}` on decrypt or `{channelHash, MAC, encryptedData}` otherwise. Server-side decoder surfaces envelope only (no key store). - #2 MULTIPART (0x0A) decoder. Per `firmware/src/Mesh.cpp:289`, byte0 = `(remaining<<4) \| inner_type`. When `inner_type == PAYLOAD_TYPE_ACK (0x03)`, next 4 bytes are the LE ack_crc per `firmware/src/Mesh.cpp:292-307`. Surfaces `{remaining, innerType, innerTypeName, innerAckCrc \| innerPayload}`. ### P1 — mis-classified / opaque - #3 `advertRole()` raw-type fix. Per `firmware/src/helpers/AdvertDataHelpers.h:7-12`, ADV_TYPE_NONE = 0 and 5-15 are FUTURE. The previous boolean fallback collapsed both into `"companion"`, silently relabelling unknown/reserved types. New behaviour: type 0 → `none`, 1 → `companion`, 2-4 → `repeater`/`room`/`sensor`, 5-15 → `type-N`. `ValidateAdvert` accepts the new labels. - #4 CONTROL (0x0B) byte0 flags + length. Per `firmware/src/Mesh.cpp:69` + `createControlData` at `Mesh.cpp:609`, byte0 high-bit marks the zero-hop direct subset. Surfaces `{ctrlFlags, ctrlZeroHop, ctrlLength}`. ### Drift fix - `cmd/server/store.go` `payloadTypeNames` now includes `6: GRP_DATA` and `10: MULTIPART` (previously omitted; canonical decoder map already had them). ## Lockstep & TDD Both `cmd/ingestor/decoder.go` and `cmd/server/decoder.go` updated in the same commits — same wire-vector tests live in both packages (`cmd/{ingestor,server}/issue1279_test.go`). Per-item RED→GREEN visible in `git log`. \| Item \| Tests \| RED proof \| \|---\|---\|---\| \| #1 GRP_DATA \| ingestor: NoKey + DecryptedInner; server: Envelope \| 6 assertions failed pre-impl \| \| #2 MULTIPART \| ingestor + server: Ack + NonAck \| 8 assertions failed pre-impl \| \| #3 advertRole \| ingestor + server: 7-row table \| 3 assertions failed pre-impl \| \| #4 CONTROL \| ingestor + server: ZeroHop + MultiHop \| 6 assertions failed pre-impl \| ## What's NOT in this PR The umbrella issue lists P2 items that ship in follow-up PRs: - Live + compare legend entries for the long tail of newly-named types (#1274 + others). - TransportCodes UI surface + filter grammar. - feat1/feat2 capability badges. - `payloadTypeNames` consolidation across server/ingestor (drift-prevention). Leave the umbrella open after this merges. Refs #1279 --------- Co-authored-by: OpenClaw Bot <bot@openclaw.local>	2026-05-18 23:19:27 -07:00
Kpa-clawbot	6b2bc62fc3	ci: update go-server-coverage.json [skip ci]	2026-05-19 06:08:58 +00:00
Kpa-clawbot	3d0b3ea551	ci: update go-ingestor-coverage.json [skip ci]	2026-05-19 06:08:57 +00:00
Kpa-clawbot	0ad6dd2c6d	ci: update frontend-tests.json [skip ci]	2026-05-19 06:08:56 +00:00
Kpa-clawbot	b98f59475f	ci: update frontend-coverage.json [skip ci]	2026-05-19 06:08:55 +00:00
Kpa-clawbot	9d76a91718	ci: update e2e-tests.json [skip ci]	2026-05-19 06:08:54 +00:00
Kpa-clawbot GitHub openclaw-bot	c1d94f7db5	fix(#1273 ): collapse QR overlay wrap to content height (#1277 ) ## Summary Fixes #1273 — `.node-top-row .node-qr-wrap` was 2-3× taller than the QR canvas inside it, leaving empty translucent space below the QR. ## Root cause Three compounding issues: 1. SVG intrinsic height not constrained. `qrcode-generator` emits an SVG with fixed `width`/`height` attributes (e.g. 147×147). The CSS rule `.node-qr svg { max-width: 100px }` (and 72px mobile) constrains width only, so the svg's intrinsic height (147px) is preserved and the wrap is sized to that. 2. Flex stretch. `.node-top-row` is `display:flex` with default `align-items:stretch`, so the QR column was forced to match the map column's height (~280px) on desktop. 3. Excess padding/margin added another ~24px above and below the visible QR. ## Fix Three small CSS changes in `public/style.css`: \| change \| effect \| \|---\|---\| \| `.node-qr svg { height: auto; }` \| svg height scales with constrained width \| \| `.node-top-row .node-qr-wrap { align-self: flex-start; }` \| wrap sizes to content, not column \| \| `.node-top-row .node-qr-wrap { padding: 8px; }` + zero inner `.node-qr` margin-top \| tight hug \| ## Measurements (real-data fixture, full node detail page) \| viewport \| wrap.height before \| wrap.height after \| QR canvas \| \|---\|---\|---\|---\| \| 375×800 (mobile overlay) \| 165px \| 82px \| 72×72 \| \| 1280×800 (desktop side-by-side) \| 217px \| 154px \| 100×100 (+ 28px caption) \| Overlay remains `position:absolute` top-right on mobile; the original #1243 behavior is preserved. ## TDD - RED: `test-issue-1273-qr-overlay-height-e2e.js` asserts wrap height ≤ visible QR + caption + 32px at 375×800 and 1280×800. Failed on master with deltas of 93px (mobile) and 89px (desktop). - GREEN: both viewports pass after the CSS fix. Wired into the deploy workflow alongside the other `test-issue-*-e2e.js` runs. ## Acceptance checklist - [x] Container height ≈ QR canvas height + 16-24px padding total - [x] No empty translucent space below the QR - [x] E2E asserts at 375×800 and 1280×800 - [x] Desktop layout unchanged (overlay position preserved; column no longer stretches but the QR card is the same width) - [x] All colors via CSS variables - [x] #1243 overlay behavior preserved (still top-right on mobile, still rendered) ## Commits - `e9d75c92` test(#1273): RED - `13899270` fix(#1273): collapse QR overlay wrap --------- Co-authored-by: openclaw-bot <bot@openclaw.local>	2026-05-18 22:51:29 -07:00
Kpa-clawbot GitHub corescope-bot	21b6eb0d63	fix(live legend): document ACK/RESPONSE/PATH + white-ring repeater convention (#1274 ) (#1276 ) RED commit `ac1fb4c3` (Playwright E2E asserts legend rows for ACK / RESPONSE / PATH text + "ring" + "repeater" — fails on master). CI: https://github.com/Kpa-clawbot/CoreScope/actions?query=branch%3Afix%2Fissue-1274 ## What The Live legend rendered five packet-type rows but the codebase defines eight `TYPE_COLORS`. The three gray-area types (ACK, RESPONSE, PATH) had no swatch in the legend, leaving operators guessing what gray dots meant — they're either ACKs or unknown payload types. Separately, the L.circleMarker styling block uses a brighter white ring to mark repeaters vs. all other roles; that convention was nowhere on screen. ## Changes - `public/live.js` legend HTML — adds rows for RESPONSE, PATH and a combined Ack / Other row (covering both ACK and the unknown-type fallback that share `#6b7280`). Adds a new MARKER STYLES subsection below NODE ROLES with two entries: bright white ring = repeater, faded ring = other. - `public/live.css` — adds `.live-ring` / `.live-ring--repeater` / `.live-ring--other` swatches. Background uses `var(--text-muted)`; only the white border + opacity differ between the two, matching the actual circleMarker weights (1.5 / 0.5) and opacities (0.6 / 0.3). - `test-issue-1274-legend-coverage-e2e.js` — Playwright E2E (desktop + mobile attached-DOM) asserting all four new pieces. ## Notes - All colors via `TYPE_COLORS` — no hardcoded hex in HTML. - Legend is `display:none` at ≤640px (existing #279 behavior), so no mobile CSS tweak required for the longer list. - Does not touch the legend toggle (#1219), mobile single-row header (#1234), or VCR visibility (#1269). Fixes #1274. --------- Co-authored-by: corescope-bot <bot@meshcore.local>	2026-05-18 22:51:26 -07:00
Kpa-clawbot GitHub clawbot	8bf7709970	feat(repeater): usefulness score — bridge axis (#672 axis 2 of 4) (#1275 ) RED test commit: `fd661569` — CI will fail on this (stub returns empty map; assertions fail by design). GREEN: `bf4b8592`. ## What Implements axis 2 of 4 for the repeater usefulness score per #672 ([status comment](https://github.com/Kpa-clawbot/CoreScope/issues/672#issuecomment-4484635378)). The Bridge axis measures structural importance: how many shortest paths between other nodes route through this one. A high-traffic redundant node and a low-traffic critical bridge will no longer look identical. ## Algorithm Brandes' weighted betweenness centrality with Dijkstra for shortest paths (`cmd/server/bridge_score.go`). - Nodes: pubkeys in the `neighbor_edges` graph - Edge weight: `Score(now) * Confidence()` — per the convention from #1235 (count + recency decay scaled by observer-diversity confidence). Geo-rejected edges already excluded at graph build time (#1230) so we don't re-filter here. - Dijkstra distance: `1 / max(epsilon, weight)` — high affinity = cheap cost. - Normalize: divide by max observed centrality so output is in `[0, 1]`. Cost: `O(V · (E + V log V))`. Staging-scale (~600 nodes / ~2 000 edges) ≈ ~4.8M ops, completes in milliseconds. ## Where it lives - `cmd/server/bridge_score.go` — pure algorithm, no locks - `cmd/server/bridge_recomputer.go` — background recomputer (mirrors #1240/#1262 pattern), 5-min default interval, initial sync prewarm, snapshot stored in `s.bridgeScoreMap atomic.Pointer[map[string]float64]` - `cmd/server/routes.go` — `handleNodes` adds `node["bridge_score"]` on repeater/room rows; node-detail handler adds it on the single-node path - `public/nodes.js` — separate Bridge row in the node detail panel, alongside the existing Usefulness (Traffic) row. Distinct colour-coded bar. ## What's NOT in this PR (still pending for #672) - Coverage axis (axis 3) — unique observer-pair connectivity - Redundancy axis (axis 4) — simulated node-removal impact - Composite — once all 4 axes ship, swap the `usefulness_score` formula from "traffic-only" to the weighted composite `Refs #672` (not `Fixes` — issue stays open until all 4 axes + composite ship). ## Tests - `TestComputeBridgeScores_LineGraph` — 4-node line: middles non-zero, leaves zero, max normalized to 1.0 - `TestComputeBridgeScores_TriangleNoBridge` — clique has zero bridges - `TestComputeBridgeScores_Empty` — defensive nil-safety - `TestComputeBridgeScores_WeightSensitive` — mutation guard: revert the `1/w` inversion and this test fails - `TestBridgeScore_HandleNodesSurface` — integration: `/api/nodes` returns `bridge_score` on repeater rows; middle nodes > 0, ends == 0 --------- Co-authored-by: clawbot <bot@meshcore.local>	2026-05-18 22:51:23 -07:00
Kpa-clawbot	c09fec56ff	ci: update go-server-coverage.json [skip ci]	2026-05-19 01:42:02 +00:00
Kpa-clawbot	6dbfd331a6	ci: update go-ingestor-coverage.json [skip ci]	2026-05-19 01:42:01 +00:00
Kpa-clawbot	a00e1c0e18	ci: update frontend-tests.json [skip ci]	2026-05-19 01:42:00 +00:00
Kpa-clawbot	763d4f707c	ci: update frontend-coverage.json [skip ci]	2026-05-19 01:41:59 +00:00
Kpa-clawbot	ad467daeeb	ci: update e2e-tests.json [skip ci]	2026-05-19 01:41:57 +00:00
Kpa-clawbotandGitHub	46ce9590f1	fix(#1270 ): Prefix Tool Network Overview shows configured-hash-size counts, not math-only slices (#1271 ) Red commit: `6b68080c24106301b6bfc25f8a05484f07d0612d` (test added that fails on master). CI: see Checks tab on this PR. Fixes #1270. ## Problem Two analytics surfaces told contradictory stories about prefix usage: - Prefix Tool → Network Overview showed e.g. `168 / 65,536` for the 2-byte tier — a pure math fact: every repeater pubkey sliced to 2 bytes yields N distinct values. Because collisions are rare, this number always equals (or nearly equals) the repeater count, making it look like the whole network uses 2-byte hashing. - Hash Stats → By Repeaters showed configured-hash-size counts straight from `/api/analytics/hash-sizes` `distributionByRepeaters` — usually a minority on 2-byte and near-zero on 3-byte. The Prefix Tool was presenting a math fact as if it were operational truth. ## Fix `renderPrefixTool` now also fetches `/api/analytics/hash-sizes` and restructures each tier card into three labeled stats with explicit hierarchy: 1. Primary — `X of Y repeaters configured` (from `distributionByRepeaters`). Same source the Hash Stats tab uses, so the two pages agree exactly. 2. Operational collisions — colliding slices among repeaters configured for this hash size only (matches Hash Issues semantics). 3. Theoretical (secondary, smaller, dashed-rule footnote) — `X unique N-byte slices across all repeater pubkeys (of Y possible)`. The math fact is preserved as educational info, no longer impersonating operational truth. The "Total repeaters" card now also notes how many have a known configured hash size. The "About these numbers" footer was rewritten to explain the three numbers and link to both Hash Stats and Hash Issues. The prefix collision detector (Check / Generate panels) is unchanged — it still scans every repeater pubkey because that is its job. ## Test Added `#1270 Prefix Tool primary counts match Hash Stats By Repeaters` to `test-e2e-playwright.js`. It fetches `/api/analytics/hash-sizes` for the ground-truth `distributionByRepeaters`, then visits `#/analytics?tab=prefix-tool`, opens Network Overview, and scrapes the primary count via a new `data-pt-configured="<bytes>"` `data-value="<count>"` marker on each tier card, asserting exact equality for 1/2/3-byte. - Red commit `6b68080c` (test only): fails on master with `NO data-pt-configured marker`. - Green commit `12ed2789` (fix): test passes; full E2E suite `123/126 passed, 3 skipped`. ## Acceptance - [x] Prefix Tool Network Overview shows configured-hash-size repeater counts as the primary number - [x] "Unique slices" math is shown as secondary/educational - [x] Two pages tell the same story (E2E asserts byte-equal match) - [x] E2E asserts the configured-count matches what Hash-Sizes tab shows at the same point in time	2026-05-18 18:20:29 -07:00
Kpa-clawbot	0022c8fd1f	ci: update go-server-coverage.json [skip ci]	2026-05-18 22:43:47 +00:00
Kpa-clawbot	7827c8e778	ci: update go-ingestor-coverage.json [skip ci]	2026-05-18 22:43:46 +00:00
Kpa-clawbot	cb0218fc4d	ci: update frontend-tests.json [skip ci]	2026-05-18 22:43:46 +00:00
Kpa-clawbot	385f49b3d8	ci: update frontend-coverage.json [skip ci]	2026-05-18 22:43:45 +00:00
Kpa-clawbot	f4ecc96ccc	ci: update e2e-tests.json [skip ci]	2026-05-18 22:43:44 +00:00
Kpa-clawbot GitHub openclaw-bot	78b666c248	fix(#1267 ): mobile VCR bar invisible — JS height clobbered bottom-nav reserve (#1269 ) ## Summary Mobile-only regression: on the Live page at ≤768px viewports the VCR bar was rendered behind the fixed bottom-nav and never visible to the user. iOS Safari screenshot at 375x812 showed: top header strip, full-height map, bottom-nav — no VCR row at all. Fixes #1267. ## Root cause `public/live.js` `initResizeHandler` (the existing JS height override) was setting `page.style.height = window.innerHeight + 'px'`, which clobbered the CSS rule that already subtracts `--bottom-nav-reserve` from the live-page height. Because `.live-page` then spanned the full viewport, the VCR bar (`position:absolute; bottom:0; z-index:1000`) was painted underneath `.bottom-nav` (`position:fixed; z-index:1200`). The VCR bar element WAS in the DOM, WAS `display: flex`, and HAD `height: 53px` — it just sat at y=758..812 underneath the bottom-nav at y=754..812. CSS-only checks for `display:none` would never catch this; the test asserts the bar's bottom edge is at or above the bottom-nav's top edge. ## Fix One-liner in spirit: subtract the bottom-nav height before applying `page.style.height`. The implementation measures the rendered `.bottom-nav` (with a fallback to a hidden probe that resolves the `--bottom-nav-reserve` token), so it survives safe-area inset and the bottom-nav's 1px border. ```js const reserve = /* measure .bottom-nav, fall back to --bottom-nav-reserve token /; const h = Math.max(0, window.innerHeight - reserve); ``` Desktop is unchanged: `.bottom-nav` is `display: none`, the probe resolves to 0, and `h === window.innerHeight` exactly as before. ## TDD - RED* (commit 1): `test-e2e-1267-mobile-vcr.js` — Playwright at iPhone 375x812 asserts `.vcr-bar` has `display !== 'none'`, `visibility !== 'hidden'`, `height > 0`, `top < viewport.height`, and (the key check) `bottom <= bottom-nav.top`. Fails on `master` with: "VCR bar bottom 812 overlaps bottom-nav top 754". - GREEN (commit 2): the fix above. Test passes: "VCR bar bottom 754 ≤ bottom-nav top 754". ## Verification - ✅ Mobile (375x812) repro reproduced against `master` (bar at y=758..812, behind bottom-nav) - ✅ Mobile (375x812) E2E green after fix (bar at y=700..754, flush above bottom-nav) - ✅ Desktop (1440x900) unaffected — bottom-nav hidden, page height = viewport height as before, VCR bar at viewport bottom - ✅ #1234 (top-nav hidden on /live), #1246 (single-row VCR), #1206/#1213 (VCR/feed clearance) unchanged — none touched ## Files - `public/live.js` — single function (`initResizeHandler`) modified - `test-e2e-1267-mobile-vcr.js` — new mobile-viewport Playwright regression test Run: `BASE_URL=http://localhost:13581 node test-e2e-1267-mobile-vcr.js` --------- Co-authored-by: openclaw-bot <bot@openclaw.local>	2026-05-18 15:27:05 -07:00
Kpa-clawbot	d4d569278d	ci: update go-server-coverage.json [skip ci]	2026-05-18 19:46:11 +00:00
Kpa-clawbot	ee21eafa66	ci: update go-ingestor-coverage.json [skip ci]	2026-05-18 19:46:10 +00:00
Kpa-clawbot	c6a90e9896	ci: update frontend-tests.json [skip ci]	2026-05-18 19:46:09 +00:00
Kpa-clawbot	92518ab234	ci: update frontend-coverage.json [skip ci]	2026-05-18 19:46:08 +00:00
Kpa-clawbot	cd84f51f8a	ci: update e2e-tests.json [skip ci]	2026-05-18 19:46:08 +00:00
Kpa-clawbot GitHub CoreScope Bot	4cd8445233	perf(#1265 ): wire /api/observers/clock-skew + /api/nodes/clock-skew into analytics recomputer (#1266 ) RED: `97f49a0c` · CI: https://github.com/Kpa-clawbot/CoreScope/actions/runs/26046530920 Fixes #1265. ## Problem On staging two clock-skew endpoints serve compute-on-request: - `/api/observers/clock-skew` — 3.3s - `/api/nodes/clock-skew` — 8.9s Both drive a full `clockSkew.Recompute` over 100k+ adverts while holding `s.mu.RLock`, blocking under concurrent reader load. ## Fix Wire both endpoints into the established `analytics_recomputer.go` pattern (PRs #1248 / #1259 / #1263). Two new slots: - `recompObserversClockSkew` — wraps `computeObserverCalibrations()` - `recompNodesClockSkew` — wraps `computeFleetClockSkew()` Accessors `GetObserverCalibrations` / `GetFleetClockSkew` now prefer the atomic-pointer snapshot; on-request compute is fallback-only for the brief window before initial sync compute lands (and for tests that skip the recomputer). Default interval 300s, overridable via: ```json "analytics": { "recomputeIntervalSeconds": { "observersClockSkew": 300, "nodesClockSkew": 300 } } ``` `config.example.json` + the `_comment_analytics` doc updated. ## TDD - RED `97f49a0c` — `TestClockSkewRecomputersRegistered` + `TestClockSkewHandlersSteadyStateLatency` (8 concurrent readers × 25 reqs per endpoint, p99 < 100ms gate). Fails on master: recomputer slots nil. - GREEN `19599375` — wire + accessor switch. p99 well under 5ms on the test fixture. ## Verification ``` cd cmd/server && go test ./... -count=1 # ok 42s bash ~/.openclaw/skills/pr-preflight/scripts/run-all.sh origin/master # all gates pass ``` --------- Co-authored-by: CoreScope Bot <bot@corescope.local>	2026-05-18 12:27:44 -07:00
Kpa-clawbot GitHub corescope-bot	ae17a2be12	perf(#1262 ): /api/nodes?limit=2000 cold-miss 15.7s → <100ms — prewarm repeater enrichment cache (#1263 ) RED commit: `22ce5736066142583017cad7303fa48d9e00ccf0` — CI on red: https://github.com/Kpa-clawbot/CoreScope/actions?query=branch%3Afix%2Fissue-1262 ## Problem After #1260 added a 15s-TTL bulk cache for repeater enrichment in `handleNodes`, `/api/nodes` (default limit) dropped to ~500ms. But `/api/nodes?limit=2000` — called by `public/live.js` at SPA startup for hop resolution — still took 15.7s cold on staging (75k tx, 600 nodes). Warm hits were ~40ms. Root cause: the bulk cache was lazily populated on the first request after TTL expiry. The rebuild ran on the request-serving goroutine. Every cold SPA load triggered the rebuild and ate 15s. ## Fix Add `StartRepeaterEnrichmentRecomputer` — a steady-state background recomputer that mirrors the `analytics_recomputer.go` pattern from #1240: - Prewarm: initial synchronous compute on Start so the first request hits a populated cache. - Steady-state: ticker refreshes the snapshot every 5min (configurable via the existing analytics recompute interval knob). - Panic-safe + idempotent Start. Wired into `main.go` right after `StartAnalyticsRecomputers`, using `cfg.GetHealthThresholds().RelayActiveHours` as the window. ## Test `TestHandleNodesLimit2000ColdMiss` — seeds 600 nodes + 150k non-advert tx with repeaters indexed under a shared 1-byte hop prefix (matches production hop-prefix collisions), starts the recomputer, then issues `/api/nodes?limit=2000` with no HTTP warmup. \| State \| Latency \| \|---\|---\| \| Before (master, on-thread rebuild) \| 3.37s \| \| After (prewarm + steady-state) \| 56ms \| \| Budget \| 2s \| Staging end-to-end: 15.7s → expected sub-100ms on the same call path. Red commit (`22ce5736066142583017cad7303fa48d9e00ccf0`) compiles with a no-op stub of the new method so the test fails on the latency assertion, not a missing symbol. Fixes #1262 --------- Co-authored-by: corescope-bot <bot@corescope.local>	2026-05-18 09:22:27 -07:00
Kpa-clawbot GitHub corescope-bot	094a96bd6c	perf(#1258 ): /#/perf — parallel health fetch, sort endpoints, pause refresh while hidden (#1261 ) Fixes #1258 — Perf dashboard (/#/perf) was slow because of three frontend issues; backend APIs were never the problem. ## Findings 1. `/api/health` fetched sequentially after `Promise.all` in `refresh()` — added a full RTT (~50-200ms) on every 5s tick on top of the parallel batch. 2. Endpoints table not actually sorted despite the heading "sorted by total time". JSON shape is `map[string]EndpointStatsResp` (no defined order); frontend rendered map iteration order. Visible correctness bug surfaced during investigation. 3. `setInterval(refresh, 5000)` kept firing while tab was hidden, rebuilding the entire ~10-section `innerHTML` (cards + 3 tables) in the background. On tab return the user saw a backlog thrash + felt the page was "slow to render". ## Fix (`public/perf.js`) - Move `/api/health` into the same `Promise.all` as the other 4 endpoints — saves one RTT per refresh. - Sort `Object.entries(server.endpoints)` by `count * avgMs` DESC client-side. - Add `document.hidden` guard in the interval tick + `visibilitychange` listener that refreshes once on return; `destroy()` removes the listener. ## Tests `test-perf-render-1258.js` (new): - All 5 initial fetches issued in parallel (including `/api/health`) - Refresh suppressed while `document.hidden` - Endpoints table sorted by total time DESC, regardless of input map order RED commit first (`6b54f9e8`, 0/3 pass) → GREEN commit (`be81303b`, 3/3 pass). Existing `test-perf-go-runtime.js` (13/13) and `test-perf-disk-io-1120.js` (15/15) still green. ## Investigation exemption No Playwright timing test — sandbox can't run a real browser. Static analysis + render-shape unit tests cover the three identified bottlenecks. Documented per AGENTS "investigation surfaces" exemption. ## Measurement Before: refresh = parallel batch (~max(server-side)) + sequential `/api/health` (~50ms) + full innerHTML rebuild every 5s including hidden tabs. After: refresh = single parallel batch, runs only while visible. Expected improvement on tab-return ≈ -1 RTT per refresh + zero background work. --------- Co-authored-by: corescope-bot <bot@corescope.local>	2026-05-18 08:02:27 -07:00
Kpa-clawbot GitHub corescope-bot	1efe93d7f6	perf(#1257 ): bulk-cache repeater enrichment in /api/nodes — 32s → <500ms (#1260 ) RED commit `a2879e12` — perf regression test; CI run: see Actions tab. Fixes #1257. ## Root cause `handleNodes` looped over the response page and called `store.GetRepeaterRelayInfo(pk, win)` + `store.GetRepeaterUsefulnessScore(pk)` for every repeater/room. Each call: - grabbed its own `s.mu.RLock`, - walked `byPathHop[pk]` (+ the matching 1-byte raw-prefix bucket, which on busy networks fans out to nearly the entire non-advert tx set), - and re-parsed every `tx.FirstSeen` with `parseRelayTS`. Default page is the 50 most-recently-seen nodes — almost all hot repeaters — so the request did O(50) lock acquisitions and hundreds of thousands of timestamp parses on the same set of txs. That's the classic load-then-paginate / per-row N+1 shape called out in the issue (same family as #1226). The `?limit=2000` variant looks faster relatively only because per-node enrichment dwarfs serialization; on staging both still bottleneck on the same loop. ## Fix Two new bulk methods on `PacketStore`: - `GetRepeaterRelayInfoMap(windowHours)` → `pubkey → RepeaterRelayInfo` - `GetRepeaterUsefulnessScoreMap()` → `pubkey → 0..1` Both snapshot `byPathHop` under a single `RLock`, pre-parse each `FirstSeen` exactly once (a tx that appears in N hop buckets used to be parsed N times), and emit one entry per hop key. Cached 15s — same TTL as `GetNodeHashSizeInfo` / `GetMultiByteCapMap`, same status-column freshness budget. `handleNodes` is one map-lookup per node; behavior, output schema, and `RelayActive` / `RelayCount{1h,24h}` / `LastRelayed` / `usefulness_score` semantics are preserved. ## Why no `limit` default change The issue mentioned a default-limit knob. Investigated: `queryInt(r, "limit", 50)` already defaults to 50 — frontends calling `/api/nodes` (no limit) get a 50-row page today. Capping further would change behavior (live.js already passes `?limit=2000` when it wants more); the cost was per-repeater enrichment, not page size. Fixing the N+1 is the correct lever and preserves backward compat. ## Perf Regression test `TestHandleNodesPerfLargeFleet` (600 nodes, 150k non-advert tx, repeaters indexed under `byPathHop`): \| \| elapsed \| vs 2s budget \| \|---\|---\|---\| \| before (master) \| 4.72s \| ✗ \| \| after \| ~4ms \| ✓ (~1000×) \| ## TDD - RED: `a2879e12` — test fails at 4.72s on master. - GREEN: `c529d29a` — fix; full `cmd/server` + `cmd/ingestor` suites green. --------- Co-authored-by: corescope-bot <bot@corescope>	2026-05-18 07:36:33 -07:00
Kpa-clawbot GitHub openclaw-bot	f81ed5b3cf	perf(#1256 ): wire /api/analytics/roles into steady-state recomputer (#1259 ) RED commit: `0190466d` — failing CI: https://github.com/Kpa-clawbot/CoreScope/actions (will populate after PR creation) ## Problem On staging (commit `d69d9fb`, 78k tx, 2.3M obs), `curl http://localhost/api/analytics/roles` times out at 60s with 0 bytes — the Roles tab is unusable. Issue #1256. PR #1248's steady-state recomputer fan-out (topology / rf / distance / channels / hash-collisions / hash-sizes) didn't include roles. The legacy handler: 1. Holds `s.mu.RLock` for the entire compute. 2. Calls `GetFleetClockSkew()`, which drives `clockSkew.Recompute(s)` over all ADVERT transmissions — O(78k) per request. 3. Concurrent ingest writers compound the latency through writer-starvation. Result: every request hits the cold path; the response never comes back inside the 60 s HTTP budget. ## Fix Add `roles` as the 7th endpoint in the recomputer fan-out — same pattern as #1248: - `PacketStore.recompRoles` slot, registered in `StartAnalyticsRecomputers` with default 5-min interval. - `PacketStore.GetAnalyticsRoles()` → atomic-pointer load from the snapshot (sub-ms), with a `computeAnalyticsRoles()` fallback only for the brief startup window before the initial sync compute completes. - Handler is now a thin wrapper — no lock-held work on the request path. - New optional `roles` key under `analytics.recomputeIntervalSeconds` in config; `config.example.json` and `_comment_analytics` updated. ## Latency (unit-scope benchmark) - Worst-of-50 handler latency: <100 ms (test budget; well under the 2 s p99 acceptance). - Compute itself is bounded by the existing 5-min recompute window — it runs once in the background, never on the request path. ## Tests - RED `0190466d`: asserts `recompRoles` is registered and the handler returns under the latency budget. Fails on master with `recompRoles not registered`. - GREEN `d7784f76`: registers the recomputer + snapshot accessor — both tests pass. Fixes #1256 --------- Co-authored-by: openclaw-bot <bot@openclaw.local>	2026-05-18 07:36:28 -07:00

1 2 3 4 5 ...