Major Features:
- Debug counter infrastructure for Refill Stage tracking
- Free Pipeline counters (ss_local, ss_remote, tls_sll)
- Diagnostic counters for early return analysis
- Unified larson.sh benchmark runner with profiles
- Phase 6-3 regression analysis documentation

Bug Fixes:
- Fix SuperSlab disabled by default (HAKMEM_TINY_USE_SUPERSLAB)
- Fix profile variable naming consistency
- Add .gitignore patterns for large files

Performance:
- Phase 6-3: 4.79 M ops/s (has OOM risk)
- With SuperSlab: 3.13 M ops/s (+19% improvement)

This is a clean repository without large log files.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

# 2025-10-22 Comparison (larson, 2–32KB, 2s)

Environment:
- Runner: mimalloc-bench/bench/larson/larson
- Args: `2 2048 32768 10000 1 12345 <threads>`
- Threads: 1, 4
- Host libs: system malloc (glibc), libmimalloc.so.2, hakmem (LD_PRELOAD)
- hakmem env: default (learning OFF / WRAP L1 OFF, threshold = 2 MiB)

## Results (ops/s)

| Allocator | 1T         | 4T         |
|-----------|------------|------------|
| system    | 4,779,287  | 3,659,717  |
| mimalloc  | 13,893,235 | 18,756,738 |
| hakmem    | 3,947,671  | 10,884,943 |

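Notes:
- With default settings, hakmem scales far better than system malloc at 4T, but at 1T it trails both system malloc and mimalloc.
- For the WRAP L1 ON + leveling (minimum bundle / learning ON) configuration, see docs/benchmarks/2025-10-22_SWEEP_NOTES.md (still being stabilized).
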
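The scaling claims in the notes can be checked directly from the results table. The following is a minimal sketch that only does arithmetic on the numbers reported above; the helper names `scaling` and `relative_to_system` are illustrative and not part of hakmem or mimalloc-bench.

```python
# Throughput in ops/s, copied from the results table (threads -> ops/s).
RESULTS = {
    "system":   {1: 4_779_287, 4: 3_659_717},
    "mimalloc": {1: 13_893_235, 4: 18_756_738},
    "hakmem":   {1: 3_947_671, 4: 10_884_943},
}


def scaling(name):
    """Return 4T throughput divided by 1T throughput for one allocator."""
    row = RESULTS[name]
    return row[4] / row[1]


def relative_to_system(name, threads):
    """Return an allocator's throughput as a multiple of system malloc's."""
    return RESULTS[name][threads] / RESULTS["system"][threads]


if __name__ == "__main__":
    for name in RESULTS:
        print(
            f"{name:9s} 4T/1T = {scaling(name):.2f}x  "
            f"vs system: 1T {relative_to_system(name, 1):.2f}x, "
            f"4T {relative_to_system(name, 4):.2f}x"
        )
```

On these figures hakmem gains roughly 2.76x going from 1T to 4T, while system malloc drops to roughly 0.77x of its 1T throughput. At 1T hakmem reaches about 0.83x of system malloc.
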
## Reproduction

```
# system
larson 2 2048 32768 10000 1 12345 1
larson 2 2048 32768 10000 1 12345 4

# mimalloc
LD_PRELOAD=/lib/x86_64-linux-gnu/libmimalloc.so.2 \
larson 2 2048 32768 10000 1 12345 1
LD_PRELOAD=/lib/x86_64-linux-gnu/libmimalloc.so.2 \
larson 2 2048 32768 10000 1 12345 4

# hakmem (default)
LD_PRELOAD=$(readlink -f ./libhakmem.so) \
larson 2 2048 32768 10000 1 12345 1
LD_PRELOAD=$(readlink -f ./libhakmem.so) \
larson 2 2048 32768 10000 1 12345 4
```
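
The six runs above form a small allocator × thread-count matrix, which could be driven from a script instead of being typed by hand. This is a sketch under assumptions: `larson` is taken to be on `PATH`, `./libhakmem.so` is resolved against the current directory (mirroring `readlink -f`), and `build_runs` and `run_all` are hypothetical helper names, not part of the repository.

```python
import os
import subprocess

# Fixed larson arguments from the commands above; the thread count is appended last.
LARSON_ARGS = ["2", "2048", "32768", "10000", "1", "12345"]
THREADS = [1, 4]

# Library to preload for each allocator; None means plain system malloc.
PRELOADS = {
    "system": None,
    "mimalloc": "/lib/x86_64-linux-gnu/libmimalloc.so.2",
    "hakmem": os.path.realpath("./libhakmem.so"),
}


def build_runs(larson="larson"):
    """Return (allocator, threads, extra_env, argv) for every run in the matrix."""
    runs = []
    for name, lib in PRELOADS.items():
        for threads in THREADS:
            extra_env = {"LD_PRELOAD": lib} if lib else {}
            argv = [larson, *LARSON_ARGS, str(threads)]
            runs.append((name, threads, extra_env, argv))
    return runs


def run_all(larson="larson"):
    """Execute each run in order, printing a label before larson's own output."""
    for name, threads, extra_env, argv in build_runs(larson):
        print(f"# {name} {threads}T", flush=True)
        subprocess.run(argv, env={**os.environ, **extra_env}, check=True)
```

Calling `run_all()` executes the runs sequentially in the same order as the listing. `build_runs()` can be inspected on its own to confirm the exact command lines before anything is launched.
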