Phase 75-6: define SSOT policy to avoid baseline drift

2025-12-18 10:22:24 +09:00
parent 9123a8f12b
commit d5c1113b4c
2 changed files with 67 additions and 0 deletions
--- a/CURRENT_TASK.md
+++ b/CURRENT_TASK.md
@ -41,6 +41,7 @@
 - 変更は必ず **箱 + 境界1箇所 + ENVで戻せる** で積む（Fail-fast、最小可視化）。
 - A/B は **同一バイナリでENVトグル**が原則（別バイナリ比較は layout が混ざる）。
 - SSOT運用（ころころ防止）: `docs/analysis/PHASE75_6_SSOT_POLICY_FAST_PGO_VS_STANDARD.md`
 - “削除して速い” は封印（link-out/大削除は layout tax で符号反転しやすい）→ **compile-out** を優先。
  - 診断: `scripts/box/layout_tax_forensics_box.sh` / `docs/analysis/PHASE67A_LAYOUT_TAX_FORENSICS_SSOT.md`
--- a/docs/analysis/PHASE75_6_SSOT_POLICY_FAST_PGO_VS_STANDARD.md
+++ b/docs/analysis/PHASE75_6_SSOT_POLICY_FAST_PGO_VS_STANDARD.md
@ -0,0 +1,66 @@
 # Phase 75-6: SSOT Policy — FAST PGO vs Standard (stop “ころころ” drift)
 ## Problem statement
 After Phase 75, we observed:
 - Phase 75 win is **real** (C5/C6 inline slots improve D vs A in both Standard and FAST PGO).
 - Absolute “baseline” numbers **move** across commits/builds (especially with PGO), causing SSOT confusion (“ころころ変わる”).
 This document defines a stable SSOT policy that keeps Box Theory iteration reliable.
 ## Definitions
 ### Standard binary
 - `./bench_random_mixed_hakmem`
 - Used for: correctness, production-like behavior, “stable across code refactors”
 ### FAST PGO binary
 - `./bench_random_mixed_hakmem_minimal_pgo`
 - Used for: competitive speed tracking vs mimalloc (best-case tuned build)
 - Caveat: more sensitive to build/layout drift than Standard
 ### SSOT harness
 - `scripts/run_mixed_10_cleanenv.sh`
 - Must pin the binary explicitly via `BENCH_BIN=...` when comparing Standard vs FAST.
 ## SSOT policy (two-track)
 ### Track A (Decision SSOT): same-binary A/B
 For accepting a feature (GO/NEUTRAL/NO-GO), the primary truth is:
 - **same binary**, **ENV toggle only**
 - Example: Phase 75 4-point matrix within the same binary.
 This avoids layout tax from “different binaries” and is aligned with prior learnings:
 - link-out / large pruning can flip signs due to layout.
 ### Track B (Competitive SSOT): FAST PGO ratio vs mimalloc
 For “how close to mimalloc”, use FAST PGO:
 - `BENCH_BIN=./bench_random_mixed_hakmem_minimal_pgo`
 - mimalloc is still a separate binary reference (layout differs), so treat ratio as “headline”, not proof of a micro-change.
 ## Practical rules to prevent SSOT drift
 1. **Never mix Standard numbers into FAST ratio tables**
   - Standard A/B results are valid, but not directly comparable to FAST baseline.
 2. **When reporting a result, always include:**
   - binary (`bench_random_mixed_hakmem` vs `bench_random_mixed_hakmem_minimal_pgo`)
   - workload (`ITERS`, `WS`, `RUNS`)
   - key ENV knobs (`WARM_POOL_SIZE`, `C5/C6 inline`, etc.)
 3. **If FAST PGO baseline changes across commits**
   - treat it as “baseline rebase event”, not automatically “regression”
   - confirm using `scripts/box/layout_tax_forensics_box.sh` + perf stat deltas (IPC/branch/cache)
 4. **Do not demote FAST PGO SSOT solely from one episode**
   - use Track A (same-binary A/B) to validate the optimization first
   - then decide whether FAST PGO is “worth maintaining” based on ongoing ROI
 ## Recommended next action after Phase 75-5
 - Keep Phase 75 (C5/C6) promoted for Standard and for FAST builds.
 - Treat Phase 69’s 62.63M as historical reference, not guaranteed to reproduce on later commits.
 - Proceed with Phase 76 using Track A for GO decisions, and Track B for periodic headline updates.