|
|
859027e06c
|
Perf Analysis: Registry 線形スキャンがボトルネック (28.51% CPU)
- perf record で superslab_refill が 28.51% CPU を消費していることを特定
- Root cause: 262,144 エントリの Registry を線形スキャン
- Hot instructions: ループ比較 (32.36%), カウンタ++ (16.78%), ポインタ進める (16.29%)
- 解決策: per-class registry (8 classes × 4096 entries) に変更
- 期待効果: +200-300% (2.59M → 7.8-10.4M ops/s)
詳細: PERF_ANALYSIS_2025_11_05.md
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
|
2025-11-05 16:44:43 +09:00 |
|
|
|
52386401b3
|
Debug Counters Implementation - Clean History
Major Features:
- Debug counter infrastructure for Refill Stage tracking
- Free Pipeline counters (ss_local, ss_remote, tls_sll)
- Diagnostic counters for early return analysis
- Unified larson.sh benchmark runner with profiles
- Phase 6-3 regression analysis documentation
Bug Fixes:
- Fix SuperSlab disabled by default (HAKMEM_TINY_USE_SUPERSLAB)
- Fix profile variable naming consistency
- Add .gitignore patterns for large files
Performance:
- Phase 6-3: 4.79 M ops/s (has OOM risk)
- With SuperSlab: 3.13 M ops/s (+19% improvement)
This is a clean repository without large log files.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
|
2025-11-05 12:31:14 +09:00 |
|