hakmem/scripts/run_larson_dev.sh
Moe Charm (CI) 1da8754d45 CRITICAL FIX: fully resolve the 4T SEGV caused by uninitialized TLS
**Problem:**
- Larson 4T hit a SEGV 100% of the time (1T ran to completion at 2.09M ops/s)
- System/mimalloc ran normally with 4T at 33.52M ops/s
- 4T still hit a SEGV with SS OFF + Remote OFF

**Root cause (findings of the Task agent ultrathink investigation):**
```
CRASH: mov (%r15),%r13
R15 = 0x6261  ← ASCII "ba" (garbage value, uninitialized TLS)
```

The worker threads' TLS variables were uninitialized:
- `__thread void* g_tls_sll_head[TINY_NUM_CLASSES];`  ← no initializer
- Not zero-initialized in threads created with pthread_create()
- The NULL check passes (0x6261 != NULL) → dereference → SEGV

**Fix:**
Added an explicit `= {0}` initializer to every TLS array:

1. **core/hakmem_tiny.c:**
   - `g_tls_sll_head[TINY_NUM_CLASSES] = {0}`
   - `g_tls_sll_count[TINY_NUM_CLASSES] = {0}`
   - `g_tls_live_ss[TINY_NUM_CLASSES] = {0}`
   - `g_tls_bcur[TINY_NUM_CLASSES] = {0}`
   - `g_tls_bend[TINY_NUM_CLASSES] = {0}`

2. **core/tiny_fastcache.c:**
   - `g_tiny_fast_cache[TINY_FAST_CLASS_COUNT] = {0}`
   - `g_tiny_fast_count[TINY_FAST_CLASS_COUNT] = {0}`
   - `g_tiny_fast_free_head[TINY_FAST_CLASS_COUNT] = {0}`
   - `g_tiny_fast_free_count[TINY_FAST_CLASS_COUNT] = {0}`

3. **core/hakmem_tiny_magazine.c:**
   - `g_tls_mags[TINY_NUM_CLASSES] = {0}`

4. **core/tiny_sticky.c:**
   - `g_tls_sticky_ss[TINY_NUM_CLASSES][TINY_STICKY_RING] = {0}`
   - `g_tls_sticky_idx[TINY_NUM_CLASSES][TINY_STICKY_RING] = {0}`
   - `g_tls_sticky_pos[TINY_NUM_CLASSES] = {0}`

**Result:**
```
Before: 1T: 2.09M   |  4T: SEGV 💀
After:  1T: 2.41M   |  4T: 4.19M   (+15% 1T, SEGV resolved)
```
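
The "+15% 1T" figure follows from the two 1T throughput numbers above. As a minimal sketch of that arithmetic (`pct_change` is a hypothetical helper introduced here for illustration, not part of the hakmem scripts, and it assumes `awk` is available):

```bash
# pct_change is a hypothetical helper, not part of the hakmem repository.
# Usage: pct_change BEFORE AFTER
# Prints the relative change from BEFORE to AFTER as a percentage
# with one decimal place.
pct_change() {
  awk -v before="$1" -v after="$2" \
    'BEGIN { printf "%.1f\n", (after - before) / before * 100 }'
}

# 1T throughput in M ops/s, taken from the Before/After figures above.
pct_change 2.09 2.41
```

The 4T case has no percentage because the Before run never completed, so there is no baseline to divide by.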

**Tests:**
```bash
# 1 thread: runs to completion
./larson_hakmem 2 8 128 1024 1 12345 1
→ Throughput = 2,407,597 ops/s 

# 4 threads: runs to completion (previously SEGV)
./larson_hakmem 2 8 128 1024 1 12345 4
→ Throughput = 4,192,155 ops/s 
```
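
The two runs above can also be checked mechanically instead of by eye. The sketch below assumes Linux and bash, where a child killed by SIGSEGV is reported with exit status 139 (128 + signal number 11). `classify_run` is a hypothetical helper name used only for illustration; it is not part of the hakmem scripts.

```bash
# classify_run is a hypothetical helper, not part of the hakmem repository.
# It runs the given command with output discarded and prints:
#   ok        exit status 0
#   segv      exit status 139 (128 + SIGSEGV, assuming Linux signal numbering)
#   fail(N)   any other non-zero exit status N
classify_run() {
  local status=0
  "$@" >/dev/null 2>&1 || status=$?
  if (( status == 0 )); then
    echo "ok"
  elif (( status == 139 )); then
    echo "segv"
  else
    echo "fail($status)"
  fi
}

# Possible usage with the commands from the test section above:
#   classify_run ./larson_hakmem 2 8 128 1024 1 12345 1
#   classify_run ./larson_hakmem 2 8 128 1024 1 12345 4
```

Checking the exit status directly matters when the benchmark output is piped through a filter with `|| true`, because in that case a crash would otherwise only show up as a missing Throughput line.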

**Investigation credit:** Root cause pinpointed by the Task agent (ultrathink mode)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-07 01:27:04 +09:00

50 lines
1.5 KiB
Bash
Executable File

#!/usr/bin/env bash
set -euo pipefail
# run_larson_dev.sh — deterministic run wrapper (avoids perf warm-up issues)
#
# Usage:
# scripts/run_larson_dev.sh tput 10 4
# scripts/run_larson_dev.sh pf 10 4
#
# Notes:
# - Runs ./larson_hakmem directly and prints the Throughput line.
# - Keeps logging quiet and avoids perf warm-ups that sometimes SEGV under A/B.
MODE=${1:-tput}
DUR=${2:-10}
THR=${3:-4}
ROOT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")"/.. && pwd)"
cd "$ROOT_DIR"
[[ -x ./larson_hakmem ]] || ./scripts/build_larson_dev.sh
export HAKMEM_QUIET=1
export HAKMEM_TINY_SUKESUKE=0
export HAKMEM_TINY_TRACE_RING=0
export HAKMEM_DISABLE_BATCH=1
export HAKMEM_WRAP_TINY=1
export HAKMEM_LARSON_TINY_ONLY=1
export HAKMEM_TINY_META_ALLOC=1
export HAKMEM_TINY_META_FREE=1
export HAKMEM_TINY_USE_SUPERSLAB=1
if [[ "$MODE" == "tput" ]]; then
  export HAKMEM_TINY_FREE_TO_SS=0
  export HAKMEM_TINY_MUST_ADOPT=0
  export HAKMEM_TINY_REG_SCAN_MAX=${HAKMEM_TINY_REG_SCAN_MAX:-64}
  export HAKMEM_SFC_ENABLE=${HAKMEM_SFC_ENABLE:-1}
  export HAKMEM_TINY_TLS_LIST=${HAKMEM_TINY_TLS_LIST:-1}
  export HAKMEM_TINY_TLS_SLL=${HAKMEM_TINY_TLS_SLL:-1}
else
  export HAKMEM_TINY_FREE_TO_SS=1
  export HAKMEM_TINY_MUST_ADOPT=1
  export HAKMEM_TINY_SS_ADOPT_COOLDOWN=${HAKMEM_TINY_SS_ADOPT_COOLDOWN:-64}
  export HAKMEM_TINY_REG_SCAN_MAX=${HAKMEM_TINY_REG_SCAN_MAX:-32}
fi
echo "[run_dev] mode=$MODE dur=$DUR thr=$THR"
./larson_hakmem "$DUR" 8 128 1024 1 12345 "$THR" | rg "Throughput" -n || true