Major Features:
- Debug counter infrastructure for Refill Stage tracking
- Free Pipeline counters (ss_local, ss_remote, tls_sll)
- Diagnostic counters for early return analysis
- Unified larson.sh benchmark runner with profiles
- Phase 6-3 regression analysis documentation

Bug Fixes:
- Fix SuperSlab disabled by default (HAKMEM_TINY_USE_SUPERSLAB)
- Fix profile variable naming consistency
- Add .gitignore patterns for large files

Performance:
- Phase 6-3: 4.79 M ops/s (has OOM risk)
- With SuperSlab: 3.13 M ops/s (+19% improvement)

This is a clean repository without large log files.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-22 Comparison (larson, 2–32KB, 2s)
Environment:
- Runner: mimalloc-bench/bench/larson/larson
- Args: 2 2048 32768 10000 1 12345 <threads>
- Threads: 1, 4
- Host libs: system malloc (glibc), libmimalloc.so.2, hakmem (LD_PRELOAD)
- hakmem env: default (learning OFF, WRAP L1 OFF, threshold = 2 MiB)
Results (ops/s)
| Allocator | 1T | 4T |
|---|---|---|
| system | 4,779,287 | 3,659,717 |
| mimalloc | 13,893,235 | 18,756,738 |
| hakmem | 3,947,671 | 10,884,943 |
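Notes:
- At 4T, hakmem (default) scales far better than system malloc (10.88M vs. 3.66M ops/s), but at 1T it trails both system malloc and mimalloc (3.95M vs. 4.78M and 13.89M ops/s).
- For the WRAP L1 ON + leveling (minimal bundle, learning ON) configuration, see docs/benchmarks/2025-10-22_SWEEP_NOTES.md (still being stabilized).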
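The scaling behaviour noted above can be checked directly from the table. The sketch below computes the ratio of 4T to 1T throughput for each allocator; the helper name `scaling` is hypothetical and introduced only for illustration, and the inputs are the table values with thousands separators removed.

```shell
# Ratio of 4-thread to 1-thread throughput.
# A value above 1 means throughput grows with threads; below 1 means it drops.
# Usage: scaling <ops_per_sec_1T> <ops_per_sec_4T>
scaling() {
  awk -v one="$1" -v four="$2" 'BEGIN { printf "%.2f\n", four / one }'
}

scaling 4779287 3659717     # system
scaling 13893235 18756738   # mimalloc
scaling 3947671 10884943    # hakmem
```

With the numbers in the table this gives roughly 0.77 for system, 1.35 for mimalloc, and 2.76 for hakmem, so hakmem has the steepest 1T to 4T scaling even though its absolute 4T throughput is still below mimalloc.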
Reproduction
# system
larson 2 2048 32768 10000 1 12345 1
larson 2 2048 32768 10000 1 12345 4
# mimalloc
LD_PRELOAD=/lib/x86_64-linux-gnu/libmimalloc.so.2 \
larson 2 2048 32768 10000 1 12345 1
LD_PRELOAD=/lib/x86_64-linux-gnu/libmimalloc.so.2 \
larson 2 2048 32768 10000 1 12345 4
# hakmem (default)
LD_PRELOAD=$(readlink -f ./libhakmem.so) \
larson 2 2048 32768 10000 1 12345 1
LD_PRELOAD=$(readlink -f ./libhakmem.so) \
larson 2 2048 32768 10000 1 12345 4
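The six commands above follow one pattern (allocator × thread count), so they can be generated by a small loop. The following is a minimal sketch, not part of the repository: the function names `preload_for`, `bench_cmd`, and `run_matrix` and the `RUN` switch are hypothetical, and it assumes `larson` is on `PATH` and that the library paths match the ones used above.

```shell
# Fixed larson arguments from this note; the thread count is appended per run.
LARSON_ARGS="2 2048 32768 10000 1 12345"
MIMALLOC_SO="/lib/x86_64-linux-gnu/libmimalloc.so.2"
# Relative path for illustration; the commands above resolve it with readlink -f.
HAKMEM_SO="./libhakmem.so"

# Map an allocator name to its LD_PRELOAD value (empty for system malloc).
preload_for() {
  case "$1" in
    system)   echo "" ;;
    mimalloc) echo "$MIMALLOC_SO" ;;
    hakmem)   echo "$HAKMEM_SO" ;;
    *)        return 1 ;;
  esac
}

# Print the command line for one allocator / thread-count pair.
bench_cmd() {
  local alloc="$1" threads="$2" preload
  preload="$(preload_for "$alloc")" || return 1
  if [ -n "$preload" ]; then
    echo "LD_PRELOAD=$preload larson $LARSON_ARGS $threads"
  else
    echo "larson $LARSON_ARGS $threads"
  fi
}

# Walk the full matrix. Dry-run by default: commands are printed, not executed.
# Set RUN=1 to actually run each benchmark.
run_matrix() {
  local alloc threads cmd
  for alloc in system mimalloc hakmem; do
    for threads in 1 4; do
      cmd="$(bench_cmd "$alloc" "$threads")"
      echo "# $alloc ${threads}T"
      if [ "${RUN:-0}" = "1" ]; then
        eval "$cmd"
      else
        echo "$cmd"
      fi
    done
  done
}
```

Calling `run_matrix` prints the six command lines for review; `RUN=1 run_matrix` executes them in the same order as the table columns. Keeping the dry run as the default makes it easy to confirm the `LD_PRELOAD` paths before spending benchmark time.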