Files
hakmem/core
Moe Charm (CI) 437df708ed Phase 3c: L1D Prefetch Optimization (+10.4% throughput)
Added software prefetch directives to reduce L1D cache miss penalty.

Changes:
- Refill path: Prefetch SuperSlab hot fields (slab_bitmap, total_active_blocks)
- Refill path: Prefetch SlabMeta freelist and next freelist entry
- Alloc path: Early prefetch of TLS cache head/count
- Alloc path: Prefetch next pointer after SLL pop

Results (Random Mixed 256B, 1M ops):
- Throughput: 22.7M → 25.05M ops/s (+10.4%)
- Cycles: 189.7M → 182.6M (-3.7%)
- Instructions: 285.0M → 280.4M (-1.6%)
- IPC: 1.50 → 1.54 (+2.7%)
- L1-dcache loads: 116.0M → 109.9M (-5.3%)

Files:
- core/hakmem_tiny_refill_p0.inc.h: 3 prefetch sites
- core/tiny_alloc_fast.inc.h: 3 prefetch sites

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-19 23:11:27 +09:00
..
2025-11-11 21:49:05 +09:00