meshcore-analyzer

dandri/meshcore-analyzer


mirror of https://github.com/Kpa-clawbot/meshcore-analyzer.git synced 2026-07-29 17:39:27 +00:00


fb744d895f fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

Fixes #1143.

## Summary

Replaces the structurally unsound `decoded_json LIKE '%pubkey%'` (and
`OR LIKE '%name%'`) attribution path with an exact-match lookup on a
dedicated, indexed `transmissions.from_pubkey` column.

This closes both holes documented in #1143 (Hole 2 has two variants) and removes the full-table scan:
- **Hole 1** — same-name false positives via `OR LIKE '%name%'`
- **Hole 2a** — adversarial spoofing: a malicious node names itself with
another node's pubkey and gets attributed to the victim
- **Hole 2b** — accidental false positive when any free-text field (path
elements, channel names, message bodies) contains a 64-char hex
substring matching a real pubkey
- **Perf** — query now uses an index instead of a full-table scan
against `LIKE '%substring%'`
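The difference between the two predicates can be illustrated without a database. The following is a minimal sketch, not the project's code: the `row` type, the helper names, and the fixture values are hypothetical, and `strings.Contains` only approximates SQLite's `LIKE '%...%'` (it ignores `LIKE`'s default ASCII case-insensitivity).

```go
package main

import "strings"

// row is a hypothetical, trimmed-down stand-in for a transmissions row.
type row struct {
	ID          int
	DecodedJSON string
	FromPubkey  string // empty string stands in for SQL NULL
}

// matchLike approximates the old `decoded_json LIKE '%pubkey%'` predicate.
func matchLike(rows []row, pubkey string) []int {
	var ids []int
	for _, r := range rows {
		if strings.Contains(r.DecodedJSON, pubkey) {
			ids = append(ids, r.ID)
		}
	}
	return ids
}

// matchExact approximates the new `from_pubkey = ?` predicate.
// NULL (empty) values never match, mirroring SQL comparison semantics.
func matchExact(rows []row, pubkey string) []int {
	var ids []int
	for _, r := range rows {
		if r.FromPubkey != "" && r.FromPubkey == pubkey {
			ids = append(ids, r.ID)
		}
	}
	return ids
}

// fixture builds one genuine ADVERT plus two adversarial rows.
func fixture() (rows []row, victim string) {
	victim = strings.Repeat("ab", 32)    // 64 hex chars, made up
	attacker := strings.Repeat("cd", 32) // 64 hex chars, made up
	rows = []row{
		// Genuine ADVERT from the victim.
		{1, `{"pubKey":"` + victim + `","name":"alice"}`, victim},
		// Hole 2a: attacker names itself with the victim's pubkey.
		{2, `{"pubKey":"` + attacker + `","name":"` + victim + `"}`, attacker},
		// Hole 2b: free text that happens to contain the victim's pubkey.
		{3, `{"text":"see node ` + victim + `"}`, ""},
	}
	return rows, victim
}
```

With this fixture the substring predicate attributes all three rows to the victim, while the exact-match predicate returns only the genuine ADVERT.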

## TDD

Two-commit history shows red-then-green:

| Commit | Status | Purpose |
|---|---|---|
| `7f0f08e` | RED: tests assertion-fail on master behaviour | Adversarial fixtures + spec |
| `59327db` | GREEN: schema + ingestor + server + migration | Implementation |

The red commit's test schema includes the new column so the file
compiles, but the production code still uses LIKE — the assertions fail
because the malicious / same-name / free-text rows are returned. The
green commit changes the query plus adds the migration/ingest path.

## Changes

### Schema
- new column `transmissions.from_pubkey TEXT`
- new index `idx_transmissions_from_pubkey`

### Ingestor (`cmd/ingestor/`)
- `PacketData.FromPubkey` populated from decoded ADVERT `pubKey` at
write time. Cheap — already parsing `decoded_json`. Non-ADVERTs stay
NULL.
- `stmtInsertTransmission` writes the column.
- Migration `from_pubkey_v1` ALTERs legacy DBs to add the column +
index.
- Bonus: rewrote the recipe in the gated one-shot
`advert_count_unique_v1` migration to use `from_pubkey` (already marked
done on existing DBs; kept correct for fresh installs).
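As a rough illustration of the write-time rule (ADVERTs get a pubkey, everything else stays NULL), the sketch below derives the column value from a decoded payload. The function name `fromPubkey` and its signature are hypothetical and are not the ingestor's `BuildPacketData`; the ADVERT payload type value 4 is taken from the backfill filter `payload_type = 4` described in this PR.

```go
package main

import "encoding/json"

// payloadTypeAdvert is assumed from the backfill filter `payload_type = 4`.
const payloadTypeAdvert = 4

// fromPubkey returns the value to store in transmissions.from_pubkey.
// A nil result stands for SQL NULL: non-ADVERT packets, undecodable JSON,
// and ADVERTs without a pubKey field all stay NULL.
func fromPubkey(payloadType int, decodedJSON string) *string {
	if payloadType != payloadTypeAdvert {
		return nil
	}
	var advert struct {
		PubKey string `json:"pubKey"`
	}
	if err := json.Unmarshal([]byte(decodedJSON), &advert); err != nil {
		return nil
	}
	if advert.PubKey == "" {
		return nil
	}
	return &advert.PubKey
}
```

Returning a pointer keeps the NULL case explicit, so a caller cannot confuse "no sender" with an empty string when binding the insert parameter.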

### Server (`cmd/server/`)
- `ensureFromPubkeyColumn` mirrors the ingestor migration so the server
can boot against a DB the ingestor has never touched (e2e fixture, fresh
installs).
- `backfillFromPubkeyAsync` runs **after** HTTP starts. Scans `WHERE
from_pubkey IS NULL AND payload_type = 4` in 5000-row chunks with a
100ms yield between chunks. Cannot block boot even on prod-sized DBs
(100K+ transmissions). Queries handle NULL gracefully (return empty for
that pubkey, same as today's unknown-pubkey path).
- All in-scope LIKE call sites switched to exact match:

| Site | Before | After |
|---|---|---|
| `buildPacketWhere` (was db.go:582) | `decoded_json LIKE '%pubkey%'` | `from_pubkey = ?` |
| `buildTransmissionWhere` (was db.go:626) | `t.decoded_json LIKE '%pubkey%'` | `t.from_pubkey = ?` |
| `GetRecentTransmissionsForNode` (was db.go:910) | `LIKE '%pubkey%' OR LIKE '%name%'` | `t.from_pubkey = ?` |
| `QueryMultiNodePackets` (was db.go:1785) | `decoded_json LIKE '%pubkey%' OR ...` | `t.from_pubkey IN (?, ?, ...)` |
| `advert_count_unique_v1` (was ingestor/db.go:257) | `decoded_json LIKE '%' \|\| nodes.public_key \|\| '%'` | `t.from_pubkey = nodes.public_key` |

`GetRecentTransmissionsForNode` signature simplifies: the `name`
parameter is gone (it was only ever used for the legacy `OR LIKE
'%name%'` fallback). Sole caller in `routes.go:1243` updated.
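For the multi-node case, the `IN (?, ?, ...)` form needs one placeholder per pubkey. The helper below is a hypothetical sketch of how such a clause could be assembled with bound parameters; it is not the actual body of `QueryMultiNodePackets`, and the empty-input behaviour (a clause that matches nothing) is an assumption.

```go
package main

import "strings"

// fromPubkeyInClause returns a WHERE fragment of the form
// `t.from_pubkey IN (?, ?, ...)` plus the matching argument list.
// For an empty input it returns a fragment that matches no rows,
// since `IN ()` is not portable SQL.
func fromPubkeyInClause(pubkeys []string) (string, []any) {
	if len(pubkeys) == 0 {
		return "1 = 0", nil
	}
	// One "?" per pubkey, comma separated.
	placeholders := strings.Repeat("?, ", len(pubkeys)-1) + "?"
	args := make([]any, len(pubkeys))
	for i, pk := range pubkeys {
		args[i] = pk
	}
	return "t.from_pubkey IN (" + placeholders + ")", args
}
```

Binding the pubkeys as parameters, rather than concatenating them into the SQL string, keeps the query safe against injection and lets the index lookup run once per value.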

### Tests
- `cmd/server/from_pubkey_attribution_test.go` — adversarial fixtures +
Hole 1/2a/2b/QueryMultiNodePackets exact-match assertions, EXPLAIN QUERY
PLAN index check, migration backfill correctness.
- `cmd/ingestor/from_pubkey_test.go` — write-time correctness
(BuildPacketData populates FromPubkey for ADVERT only;
InsertTransmission persists it; non-ADVERTs stay NULL).
- Existing test schemas (server v2, server v3, coverage) get the new
column **plus a SQLite trigger** that auto-populates `from_pubkey` from
`decoded_json` on ADVERT inserts. This means existing fixtures (which
only seed `decoded_json`) keep attributing correctly without per-test
edits.
- `seedTestData`'s ADVERTs explicitly set `from_pubkey`.

## Performance — index is used

```
$ EXPLAIN QUERY PLAN SELECT id FROM transmissions WHERE from_pubkey = ?
SEARCH transmissions USING INDEX idx_transmissions_from_pubkey (from_pubkey=?)
```

Asserted in `TestFromPubkeyIndexUsed`.

## Migration approach

- **Sync at boot**: `ALTER TABLE transmissions ADD COLUMN from_pubkey
TEXT` is a metadata-only operation in SQLite — microseconds regardless
of table size. `CREATE INDEX IF NOT EXISTS
idx_transmissions_from_pubkey` is **not** metadata-only: it scans the
table once. Empirically a few hundred ms on a 100K-row table; expect a
few seconds on a 10M-row table (one-time cost, blocking boot during that
window). Subsequent boots no-op via `IF NOT EXISTS`. If this boot delay
becomes an operational concern at prod scale we can defer the `CREATE
INDEX` to a goroutine — for now a few-second one-time delay is
acceptable.
- **Async**: row-level backfill of legacy NULL ADVERTs (chunked 5000 /
100ms yield). On a 100K-ADVERT prod DB, this completes in seconds in the
background; HTTP is fully available throughout.
- **Safety**: queries handle NULL gracefully — a node whose ADVERTs
haven't backfilled yet returns empty, identical to today's behaviour for
unknown pubkeys. No half-state regression.

## Out of scope (intentionally)

The free-text `LIKE` paths the issue explicitly leaves alone (e.g.
user-typed packet search) are untouched. Only the pubkey-attribution
sites get the column treatment.



## Cycle-3 review fixes

| Finding | Status | Commit |
|---|---|---|
| **M1c**: async-contract test was tautological (test's own `go`, not production's) | Fixed | `23ace71` (red) → `a05b50c` (green) |
| **m1c**: package-global atomic resets unsafe under `t.Parallel()` | Fixed (`// DO NOT t.Parallel` comment + `Reset()` helper) | rolled into `23ace71` / `241ec69` |
| **m2c**: `/api/healthz` read 3 atomics non-atomically (torn snapshot) | Fixed (single RWMutex-guarded snapshot + race test) | `241ec69` |
| **n3c.m1**: vestigial OR-scaffolding in `QueryMultiNodePackets` | Fixed (cleanup) | `5a53ceb` |
| **n3c.m2**: verify PR body language about `ALTER` vs `CREATE INDEX` | Verified accurate (already corrected in cycle 2) | (no change) |
| **n3c.m3**: `json.Unmarshal` per row in backfill → could use SQL `json_extract` | **Deferred as known followup**: pure perf optimization (current per-row Unmarshal is correct, just slower); SQL rewrite would unwind the chunked-yield architecture and is non-trivial. Acceptable for one-time backfill at boot on legacy DBs. | |

### M1c implementation detail

`startFromPubkeyBackfill(dbPath, chunkSize, yieldDuration)` is now the
single production entry point used by `main.go`. It internally does `go
backfillFromPubkeyAsync(...)`. The test calls `startFromPubkeyBackfill`
(no `go` prefix) and asserts the dispatch returns within 50ms — so if
anyone removes the `go` keyword inside the wrapper, the test fails.
**Manually verified**: removing the `go` keyword causes
`TestBackfillFromPubkey_DoesNotBlockBoot` to fail with "backfill
dispatch took ~1s (>50ms): not async — would block boot."

### m2c implementation detail

`fromPubkeyBackfillTotal/Processed/Done` are now plain `int64`/`bool`
package globals guarded by a single `sync.RWMutex`.
`fromPubkeyBackfillSnapshot()` returns all three under one RLock.
`TestHealthzFromPubkeyBackfillConsistentSnapshot` races a writer
(lock-step total/processed updates with periodic done flips) against 8
readers hammering `/api/healthz`, asserting `processed<=total` and
`(done => processed==total)` on every response. Verified the test
catches torn reads (manually injected a 3-RLock implementation; test
failed within milliseconds with "processed>total" and "done=true but
processed!=total" errors).

---------

Co-authored-by: openclaw-bot <bot@openclaw.local>
Co-authored-by: openclaw-bot <bot@openclaw.dev>

2026-05-06 23:50:44 -07:00

testdata/golden

feat(#847 ): dedupe Top Longest Hops by pair + add obs count and SNR cues (#848 )

2026-04-21 09:09:39 -07:00

advert_pubkey_test.go

perf: track advert pubkeys incrementally, eliminate per-request JSON parsing (#360 ) (#544 )

2026-04-03 13:51:13 -07:00

apikey_security_test.go

fix: reject weak/default API keys + startup warning (#532 ) (#628 )

2026-04-05 14:50:40 -07:00

backfill_async_test.go

perf: async chunked backfill — HTTP serves within 2 minutes (#612 ) (#614 )

2026-04-05 09:49:39 -07:00

backup_test.go

feat: /api/backup — one-click SQLite database export (#474 ) (#1022 )

2026-05-03 17:56:42 -07:00

backup.go

feat: /api/backup — one-click SQLite database export (#474 ) (#1022 )

2026-05-03 17:56:42 -07:00

bounded_load_test.go

fix(store): apply retentionHours cutoff in Load() to prevent OOM on cold start (#917 )

2026-05-01 06:47:55 +00:00

cache_invalidation_test.go

fix: cache invalidation tuning — 7% → 50-80% hit rate (#721 )

2026-04-12 18:09:23 -07:00

channel_analytics_test.go

feat(analytics): selectable timeframes via ?window/?from/?to (#842 ) (#1018 )

2026-05-03 17:41:22 -07:00

channel_filter_test.go

Fix channel filter on Packets page (UI + API) — #812 (#816 )

2026-04-20 21:46:34 -07:00

clock_skew_test.go

feat(#690 ): expose observer skew + per-hash evidence in clock UI (#906 )

2026-05-02 10:30:54 -07:00

clock_skew.go

feat(#690 ): expose observer skew + per-hash evidence in clock UI (#906 )

2026-05-02 10:30:54 -07:00

collision_details_test.go

feat: show collision details in Hash Usage Matrix for all hash sizes (#758 )

2026-04-16 00:18:25 -07:00

config_knobs_test.go

perf: async chunked backfill — HTTP serves within 2 minutes (#612 ) (#614 )

2026-04-05 09:49:39 -07:00

config_test.go

feat: add observer retention — remove stale observers after configurable days (#764 )

2026-04-17 09:24:40 -07:00

config.go

feat(node-battery): voltage trend chart + /api/nodes/{pubkey}/battery (#663 ) (#1082 )

2026-05-05 01:41:00 -07:00

cors_test.go

feat(server): explicit CORS policy with configurable origin allowlist (#883 ) (#971 )

2026-05-02 12:04:37 -07:00

cors.go

feat(server): explicit CORS policy with configurable origin allowlist (#883 ) (#971 )

2026-05-02 12:04:37 -07:00

coverage_test.go

fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

2026-05-06 23:50:44 -07:00

db_test.go

fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

2026-05-06 23:50:44 -07:00

db_vacuum_test.go

fix: enable SQLite incremental auto-vacuum so DB shrinks after retention (#919 ) (#920 )

2026-04-30 23:45:00 -07:00

db.go

fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

2026-05-06 23:50:44 -07:00

decoder_test.go

feat: parse and display per-hop SNR values for TRACE packets (#1007 )

2026-05-03 11:17:25 -07:00

decoder.go

feat: parse and display per-hop SNR values for TRACE packets (#1007 )

2026-05-03 11:17:25 -07:00

discovered_channels_test.go

fix(#688 ): auto-discover hashtag channels from message text (#1071 )

2026-05-05 01:16:57 -07:00

discovered_channels.go

fix(#688 ): auto-discover hashtag channels from message text (#1071 )

2026-05-05 01:16:57 -07:00

encrypted_channels_test.go

fix: channel query performance — add channel_hash column, SQL-level filtering (#762 ) (#763 )

2026-04-16 00:09:36 -07:00

eviction_test.go

perf(#800 ): remove per-StoreTx ResolvedPath, replace with membership index + on-demand decode (#806 )

2026-04-20 19:55:00 -07:00

foreign_advert_test.go

feat(#730 ): foreign-advert detection — flag instead of silent drop (#1084 )

2026-05-05 01:58:52 -07:00

from_pubkey_attribution_test.go

fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

2026-05-06 23:50:44 -07:00

from_pubkey_migration.go

fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

2026-05-06 23:50:44 -07:00

geo_filter.go

feat: geo_filter enforcement, DB pruning, geofilter-builder tool, HB column (#215 )

2026-03-31 01:10:56 -07:00

go.mod

fix: cache RW SQLite connection + dedup DBConfig (closes #921 ) (#982 )

2026-05-02 20:15:30 -07:00

go.sum

feat: add Go web server (cmd/server/) — full API + WebSocket + static files

2026-03-27 01:16:59 -07:00

hash_migrate_test.go

fix: use payload type bits only in content hash (not full header byte) (#787 )

2026-04-18 11:52:22 -07:00

hash_migrate.go

fix: use payload type bits only in content hash (not full header byte) (#787 )

2026-04-18 11:52:22 -07:00

healthz_test.go

fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

2026-05-06 23:50:44 -07:00

healthz.go

fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

2026-05-06 23:50:44 -07:00

helpers_test.go

perf: replace O(n²) selection sort with sort.Slice (#354 ) (#542 )

2026-04-03 13:11:59 -07:00

issue673_test.go

fix(#673 ): replace raw JSON text search with byNode index for node packet queries (#803 )

2026-04-20 22:15:02 -07:00

issue804_repeater_region_test.go

fix(#804 ): attribute analytics by repeater home region, not observer (#1025 )

2026-05-03 20:10:02 -07:00

issue810_repro_test.go

fix(#810 ): /health.recentPackets resolved_path falls back to longest sibling obs (#821 )

2026-04-21 04:51:24 +00:00

issue871_test.go

fix: drop/filter packets with null hash or timestamp (closes #871 ) (#993 )

2026-05-02 20:35:15 -07:00

main.go

fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

2026-05-06 23:50:44 -07:00

memlimit_test.go

feat(memlimit): GOMEMLIMIT support, derive from packetStore.maxMemoryMB (#836 ) (#1077 )

2026-05-05 01:33:23 -07:00

memlimit.go

feat(memlimit): GOMEMLIMIT support, derive from packetStore.maxMemoryMB (#836 ) (#1077 )

2026-05-05 01:33:23 -07:00

memory.go

obs: surface real RSS alongside tracked store bytes in /api/stats (#832 ) (#835 )

2026-04-20 23:10:33 -07:00

multibyte_capability_test.go

feat: validate advert signatures on ingest, reject corrupt packets (#794 )

2026-04-18 11:39:13 -07:00

multibyte_enrich_test.go

feat: show multi-byte hash support indicator on map markers (#1002 )

2026-05-03 08:56:09 -07:00

multibyte_region_filter_test.go

fix(analytics): multiByteCapability missing under region filter → all rows 'unknown' (#1049 )

2026-05-05 06:42:58 +00:00

neighbor_api_test.go

feat: observer graph representation (M1+M2) (#774 )

2026-04-16 21:35:14 -07:00

neighbor_api.go

feat: add nodeBlacklist config to hide abusive/troll nodes (#742 )

2026-04-17 23:43:05 +00:00

neighbor_debug_test.go

feat: affinity debugging tools (#482 ) — milestone 6 (#521 )

2026-04-02 23:45:03 -07:00

neighbor_debug.go

feat: affinity debugging tools (#482 ) — milestone 6 (#521 )

2026-04-02 23:45:03 -07:00

neighbor_dedup_test.go

fix: exclude non-repeater nodes from path-hop resolution (#935 ) (#936 )

2026-04-30 09:25:51 -07:00

neighbor_graph_test.go

fix: exclude non-repeater nodes from path-hop resolution (#935 ) (#936 )

2026-04-30 09:25:51 -07:00

neighbor_graph.go

feat: Clock skew detection — backend computation (M1) (#746 )

2026-04-14 23:22:35 -07:00

neighbor_persist_test.go

feat: separate "Last Status Update" from "Last Packet Observation" for observers (v3 rebase) (#969 )

2026-05-02 12:03:42 -07:00

neighbor_persist.go

feat(#730 ): foreign-advert detection — flag instead of silent drop (#1084 )

2026-05-05 01:58:52 -07:00

node_battery_test.go

feat(node-battery): voltage trend chart + /api/nodes/{pubkey}/battery (#663 ) (#1082 )

2026-05-05 01:41:00 -07:00

node_battery.go

feat(node-battery): voltage trend chart + /api/nodes/{pubkey}/battery (#663 ) (#1082 )

2026-05-05 01:41:00 -07:00

node_blacklist_test.go

feat: add nodeBlacklist config to hide abusive/troll nodes (#742 )

2026-04-17 23:43:05 +00:00

obs_dedup_test.go

perf: replace O(n²) observation dedup with map-based O(n) (#355 ) (#543 )

2026-04-03 13:33:26 -07:00

observer_blacklist_test.go

feat(ingestor + server): observerBlacklist config (#962 ) (#963 )

2026-05-01 23:11:27 -07:00

openapi_test.go

feat: auto-generated OpenAPI 3.0 spec endpoint + Swagger UI (#530 ) (#632 )

2026-04-05 15:05:20 -07:00

openapi.go

feat: /api/backup — one-click SQLite database export (#474 ) (#1022 )

2026-05-03 17:56:42 -07:00

parity_test.go

feat: implement packet store eviction/aging to prevent OOM (#273 )

2026-03-30 03:42:11 +00:00

path_inspect_test.go

feat: path-prefix candidate inspector with map view (#944 ) (#945 )

2026-04-30 23:28:16 -07:00

path_inspect.go

feat: path-prefix candidate inspector with map view (#944 ) (#945 )

2026-04-30 23:28:16 -07:00

paths_through_test.go

fix(paths): exclude false-positive paths from short-prefix collisions (#930 )

2026-05-02 11:15:25 -07:00

perf_io_test.go

feat(perf): per-component disk I/O + write source metrics on Perf page (#1120 ) (#1123 )

2026-05-05 17:56:56 -07:00

perf_io.go

feat(perf): per-component disk I/O + write source metrics on Perf page (#1120 ) (#1123 )

2026-05-05 17:56:56 -07:00

perfstats_race_test.go

fix: add mutex synchronization to PerfStats to eliminate data races (#469 )

2026-04-01 19:26:11 -07:00

prefix_map_role_test.go

fix: exclude non-repeater nodes from path-hop resolution (#935 ) (#936 )

2026-04-30 09:25:51 -07:00

region_filter_test.go

fix(#770 ): treat region 'All' as no-filter + document region behavior (#1026 )

2026-05-03 19:50:01 -07:00

repeater_liveness_test.go

fix(#662 ): GetRepeaterRelayInfo also looks up byPathHop by 1-byte prefix (#1086 )

2026-05-05 02:33:27 -07:00

repeater_liveness.go

fix(#662 ): GetRepeaterRelayInfo also looks up byPathHop by 1-byte prefix (#1086 )

2026-05-05 02:33:27 -07:00

repeater_usefulness_test.go

feat(repeater): usefulness score — traffic axis (#672 ) (#1079 )

2026-05-05 01:34:08 -07:00

repeater_usefulness.go

feat(repeater): usefulness score — traffic axis (#672 ) (#1079 )

2026-05-05 01:34:08 -07:00

resolve_context_test.go

fix: exclude non-repeater nodes from path-hop resolution (#935 ) (#936 )

2026-04-30 09:25:51 -07:00

resolved_index_test.go

perf(#800 ): remove per-StoreTx ResolvedPath, replace with membership index + on-demand decode (#806 )

2026-04-20 19:55:00 -07:00

resolved_index.go

fix(#810 ): /health.recentPackets resolved_path falls back to longest sibling obs (#821 )

2026-04-21 04:51:24 +00:00

role_analytics_test.go

feat(roles): /#/roles page + /api/analytics/roles endpoint (Fixes #818 ) (#1023 )

2026-05-03 17:56:12 -07:00

role_analytics.go

feat(roles): /#/roles page + /api/analytics/roles endpoint (Fixes #818 ) (#1023 )

2026-05-03 17:56:12 -07:00

routes_test.go

fix(#827 ): /api/packets/{hash} falls back to DB when in-memory store misses (#831 )

2026-04-20 22:50:01 -07:00

routes.go

fix(#1143 ): structural pubkey attribution via from_pubkey column (#1152 )

2026-05-06 23:50:44 -07:00

rw_cache_test.go

fix: cache RW SQLite connection + dedup DBConfig (closes #921 ) (#982 )

2026-05-02 20:15:30 -07:00

rw_cache.go

fix: cache RW SQLite connection + dedup DBConfig (closes #921 ) (#982 )

2026-05-02 20:15:30 -07:00

short_url_test.go

feat(#772 ): short pubkey-prefix URLs for mesh sharing (#1016 )

2026-05-03 17:40:54 -07:00

stats_memory_test.go

obs: surface real RSS alongside tracked store bytes in /api/stats (#832 ) (#835 )

2026-04-20 23:10:33 -07:00

store_tophops_test.go

feat(#847 ): dedupe Top Longest Hops by pair + add obs count and SNR cues (#848 )

2026-04-21 09:09:39 -07:00

store.go

fix(#688 ): auto-discover hashtag channels from message text (#1071 )

2026-05-05 01:16:57 -07:00

time_window_test.go

feat(analytics): selectable timeframes via ?window/?from/?to (#842 ) (#1018 )

2026-05-03 17:41:22 -07:00

time_window.go

feat(analytics): selectable timeframes via ?window/?from/?to (#842 ) (#1018 )

2026-05-03 17:41:22 -07:00

topology_dedup_test.go

feat(analytics): selectable timeframes via ?window/?from/?to (#842 ) (#1018 )

2026-05-03 17:41:22 -07:00

touch_last_seen_test.go

perf(#800 ): remove per-StoreTx ResolvedPath, replace with membership index + on-demand decode (#806 )

2026-04-20 19:55:00 -07:00

tracked_bytes_test.go

perf(#800 ): remove per-StoreTx ResolvedPath, replace with membership index + on-demand decode (#806 )

2026-04-20 19:55:00 -07:00

types.go

feat: separate "Last Status Update" from "Last Packet Observation" for observers (v3 rebase) (#969 )

2026-05-02 12:03:42 -07:00

vacuum.go

fix: cache RW SQLite connection + dedup DBConfig (closes #921 ) (#982 )

2026-05-02 20:15:30 -07:00

websocket_test.go

feat: implement packet store eviction/aging to prevent OOM (#273 )

2026-03-30 03:42:11 +00:00

websocket.go

fix: graceful container shutdown for reliable deployments (#453 )

2026-04-01 12:19:20 -07:00