Major Features: - Debug counter infrastructure for Refill Stage tracking - Free Pipeline counters (ss_local, ss_remote, tls_sll) - Diagnostic counters for early return analysis - Unified larson.sh benchmark runner with profiles - Phase 6-3 regression analysis documentation Bug Fixes: - Fix SuperSlab disabled by default (HAKMEM_TINY_USE_SUPERSLAB) - Fix profile variable naming consistency - Add .gitignore patterns for large files Performance: - Phase 6-3: 4.79 M ops/s (has OOM risk) - With SuperSlab: 3.13 M ops/s (+19% improvement) This is a clean repository without large log files. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
698 B
698 B
2025-10-22 hakmem(best) Mid 2–32KB (2s)
ENV:
HAKMEM_WRAP_L2=1 HAKMEM_WRAP_L25=1 \
HAKMEM_LEARN=1 HAKMEM_DYN1_AUTO=1 HAKMEM_DYN2_AUTO=1 HAKMEM_HIST_SAMPLE=7 \
HAKMEM_WMAX_LEARN=1 HAKMEM_WMAX_DWELL_SEC=2 \
HAKMEM_PROF=1 HAKMEM_PROF_SAMPLE=7
結果:
- 1T: 1,264,425 ops/s
- 4T: 917,424 ops/s
注: 本設定はラッパー内L1を許可し学習を同時に回すため、短時間ではウォームアップが不足し、既定(学習OFF/WRAP OFF)より低い数値。 当面は既定構成での比較(docs/benchmarks/2025-10-22_COMPARE_MID_2-32KB.md)を採用し、 "best"系はウォームアップ・CAP初期値・最小バンドル等の整地後に再計測する。