CRITICAL FIX: TLS 未初期化による 4T SEGV を完全解消

**問題:**
- Larson 4T で 100% SEGV (1T は 2.09M ops/s で完走)
- System/mimalloc は 4T で 33.52M ops/s 正常動作
- SS OFF + Remote OFF でも 4T で SEGV

**根本原因: (Task agent ultrathink 調査結果)**
```
CRASH: mov (%r15),%r13
R15 = 0x6261  ← ASCII "ba" (ゴミ値、未初期化TLS)
```

Worker スレッドの TLS 変数が未初期化:
- `__thread void* g_tls_sll_head[TINY_NUM_CLASSES];`  ← 初期化なし
- pthread_create() で生成されたスレッドでゼロ初期化されない
- NULL チェックが通過 (0x6261 != NULL) → dereference → SEGV

**修正内容:**
全 TLS 配列に明示的初期化子 `= {0}` を追加:

1. **core/hakmem_tiny.c:**
   - `g_tls_sll_head[TINY_NUM_CLASSES] = {0}`
   - `g_tls_sll_count[TINY_NUM_CLASSES] = {0}`
   - `g_tls_live_ss[TINY_NUM_CLASSES] = {0}`
   - `g_tls_bcur[TINY_NUM_CLASSES] = {0}`
   - `g_tls_bend[TINY_NUM_CLASSES] = {0}`

2. **core/tiny_fastcache.c:**
   - `g_tiny_fast_cache[TINY_FAST_CLASS_COUNT] = {0}`
   - `g_tiny_fast_count[TINY_FAST_CLASS_COUNT] = {0}`
   - `g_tiny_fast_free_head[TINY_FAST_CLASS_COUNT] = {0}`
   - `g_tiny_fast_free_count[TINY_FAST_CLASS_COUNT] = {0}`

3. **core/hakmem_tiny_magazine.c:**
   - `g_tls_mags[TINY_NUM_CLASSES] = {0}`

4. **core/tiny_sticky.c:**
   - `g_tls_sticky_ss[TINY_NUM_CLASSES][TINY_STICKY_RING] = {0}`
   - `g_tls_sticky_idx[TINY_NUM_CLASSES][TINY_STICKY_RING] = {0}`
   - `g_tls_sticky_pos[TINY_NUM_CLASSES] = {0}`

**効果:**
```
Before: 1T: 2.09M   |  4T: SEGV 💀
After:  1T: 2.41M   |  4T: 4.19M   (+15% 1T, SEGV解消)
```

**テスト:**
```bash
# 1 thread: 完走
./larson_hakmem 2 8 128 1024 1 12345 1
→ Throughput = 2,407,597 ops/s 

# 4 threads: 完走(以前は SEGV)
./larson_hakmem 2 8 128 1024 1 12345 4
→ Throughput = 4,192,155 ops/s 
```

**調査協力:** Task agent (ultrathink mode) による完璧な根本原因特定

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
Moe Charm (CI)
2025-11-07 01:27:04 +09:00
parent f454d35ea4
commit 1da8754d45
110 changed files with 17703 additions and 1693 deletions

View File

@ -0,0 +1,68 @@
#!/usr/bin/env bash
set -euo pipefail
# Larson triad (system/mimalloc/HAKMEM), CSV保存
# Usage: benchmarks/scripts/run_larson_matrix.sh [dur_csv] [threads_csv] [reps]
# dur_csv default: 2,10 threads_csv default: 1,4 reps default: 5
ROOT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")"/../.. && pwd)"
cd "$ROOT_DIR"
dur_csv=${1:-"2,10"}
thr_csv=${2:-"1,4"}
reps=${3:-5}
MIN=8; MAX=128; CHUNKS=1024; ROUNDS=1; SEED=12345
MI_LIB_DEFAULT="mimalloc-bench/extern/mi/out/release/libmimalloc.so"
MI_LIB="${MIMALLOC_SO:-$MI_LIB_DEFAULT}"
[[ -x ./larson_system ]] || make -s larson_system >/dev/null
if [[ -f "$MI_LIB" ]]; then
[[ -x ./larson_mi ]] || make -s larson_mi >/dev/null
HAVE_MI=1
else
HAVE_MI=0
fi
[[ -x ./larson_hakmem ]] || make -s larson_hakmem >/dev/null
TS=$(date +%Y%m%d_%H%M%S)
OUTDIR="bench_results/auto/larson_${TS}"
mkdir -p "$OUTDIR"
CSV="$OUTDIR/results.csv"
echo "ts,scenario,dur_s,threads,allocator,env,rep,throughput_ops_s" >"$CSV"
IFS=',' read -ra DLIST <<<"$dur_csv"
IFS=',' read -ra TLIST <<<"$thr_csv"
extract_ops_s() {
awk '/Throughput =/{print $3}' | tail -n1
}
run_case() {
local dur="$1"; shift
local thr="$1"; shift
local alloc="$1"; shift
local envstr="$1"; shift
local rep="$2"; shift
local ts=$(date +%H%M%S)
local out
out=$($envstr ./larson_${alloc} "$dur" "$MIN" "$MAX" "$CHUNKS" "$ROUNDS" "$SEED" "$thr" 2>/dev/null || true)
local tput=$(echo "$out" | extract_ops_s)
if [[ -n "${tput:-}" ]]; then
echo "$ts,larson,$dur,$thr,$alloc,$(echo "$envstr" | sed 's/,/;/g'),$rep,$tput" >>"$CSV"
fi
}
echo "[info] writing CSV to $CSV"
for d in "${DLIST[@]}"; do
for t in "${TLIST[@]}"; do
for ((i=1;i<=reps;i++)); do run_case "$d" "$t" system "env -i" "$i"; done
if (( HAVE_MI == 1 )); then
for ((i=1;i<=reps;i++)); do run_case "$d" "$t" mi "env -i LD_LIBRARY_PATH=$(dirname "$MI_LIB")" "$i"; done
fi
for ((i=1;i<=reps;i++)); do run_case "$d" "$t" hakmem "env -i HAKMEM_WRAP_TINY=1" "$i"; done
done
done
echo "[done] $CSV"

View File

@ -0,0 +1,46 @@
#!/usr/bin/env bash
set -euo pipefail
# Mid/Large MT (832KiB) matrix runner
# Usage: benchmarks/scripts/run_mid_large_mt_matrix.sh [threads_csv] [cycles] [ws] [reps]
ROOT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")"/../.. && pwd)"
cd "$ROOT_DIR"
threads_csv=${1:-"1,4"}
cycles=${2:-40000}
ws=${3:-2048}
reps=${4:-5}
outdir="bench_results/auto/mid_large_mt_$(date +%Y%m%d_%H%M%S)"
mkdir -p "$outdir"
csv="$outdir/results.csv"
echo "ts,scenario,threads,allocator,env,cycles,ws,rep,throughput_ops_s" >"$csv"
IFS=',' read -ra TLIST <<<"$threads_csv"
run_case() {
local thr="$1"; shift
local alloc="$1"; shift
local envstr="$1"; shift
local bin="$1"; shift
for ((i=1;i<=reps;i++)); do
local ts=$(date +%H%M%S)
local out
out=$($envstr "$bin" "$thr" "$cycles" "$ws" 42 2>/dev/null || true)
local tput=$(echo "$out" | awk '/Throughput =/{print $3; exit}')
if [[ -n "${tput:-}" ]]; then
echo "$ts,mid_large_mt,$thr,$alloc,$(echo "$envstr" | sed 's/,/;/g'),$cycles,$ws,$i,$tput" >>"$csv"
fi
done
}
[[ -x ./bench_mid_large_mt_system ]] || make -s bench_mid_large_mt_system >/dev/null
[[ -x ./bench_mid_large_mt_hakmem ]] || make -s bench_mid_large_mt_hakmem >/dev/null
echo "[info] writing CSV to $csv"
for t in "${TLIST[@]}"; do
run_case "$t" "system" "env -i" ./bench_mid_large_mt_system
run_case "$t" "hakmem" "env -i HAKMEM_WRAP_TINY=1" ./bench_mid_large_mt_hakmem
done
echo "[done] $csv"

View File

@ -0,0 +1,40 @@
#!/usr/bin/env bash
set -euo pipefail
# Random mixed (161024B) matrix runner
# Usage: benchmarks/scripts/run_random_mixed_matrix.sh [cycles] [ws] [reps]
ROOT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")"/../.. && pwd)"
cd "$ROOT_DIR"
cycles=${1:-1000000}
ws=${2:-8192}
reps=${3:-5}
outdir="bench_results/auto/random_mixed_$(date +%Y%m%d_%H%M%S)"
mkdir -p "$outdir"
csv="$outdir/results.csv"
echo "ts,scenario,allocator,env,cycles,ws,rep,throughput_ops_s" >"$csv"
run_case() {
local alloc="$1"; shift
local envstr="$1"; shift
local bin="$1"; shift
for ((i=1;i<=reps;i++)); do
local ts=$(date +%H%M%S)
local out
out=$($envstr "$bin" "$cycles" "$ws" 123 2>/dev/null || true)
local tput=$(echo "$out" | awk '/Throughput =/{print $3; exit}')
if [[ -n "${tput:-}" ]]; then
echo "$ts,random_mixed,$alloc,$(echo "$envstr" | sed 's/,/;/g'),$cycles,$ws,$i,$tput" >>"$csv"
fi
done
}
[[ -x ./bench_random_mixed_system ]] || make -s bench_random_mixed_system >/dev/null
[[ -x ./bench_random_mixed_hakmem ]] || make -s bench_random_mixed_hakmem >/dev/null
echo "[info] writing CSV to $csv"
run_case "system" "env -i" ./bench_random_mixed_system
run_case "hakmem" "env -i HAKMEM_WRAP_TINY=1" ./bench_random_mixed_hakmem
echo "[done] $csv"

View File

@ -0,0 +1,71 @@
#!/usr/bin/env bash
set -euo pipefail
# Redis-style allocator benchmark triad (System/mimalloc/HAKMEM via LD_PRELOAD)
# Usage: benchmarks/scripts/run_redis_matrix.sh [threads] [cycles] [ops] [reps]
ROOT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")"/../.. && pwd)"
cd "$ROOT_DIR"
THREADS=${1:-1}
CYCLES=${2:-100}
OPS=${3:-1000}
REPS=${4:-5}
BENCH="./benchmarks/redis/workload_bench_system"
MI_LIB_DEFAULT="mimalloc-bench/extern/mi/out/release/libmimalloc.so"
MI_LIB="${MIMALLOC_SO:-$MI_LIB_DEFAULT}"
if [[ ! -x "$BENCH" ]]; then
echo "[error] $BENCH not found or not executable" >&2
exit 1
fi
if [[ ! -f "$MI_LIB" ]]; then
echo "[warn] mimalloc .so not found at $MI_LIB (set MIMALLOC_SO) — skipping mi runs" >&2
HAVE_MI=0
else
HAVE_MI=1
fi
# Ensure shared lib exists for HAKMEM LD_PRELOAD
[[ -f ./libhakmem.so ]] || make -s shared >/dev/null || true
TS=$(date +%Y%m%d_%H%M%S)
OUTDIR="bench_results/auto/redis_${TS}"
mkdir -p "$OUTDIR"
CSV="$OUTDIR/results.csv"
echo "ts,scenario,allocator,env,threads,cycles,ops,rep,throughput_ops_s" >"$CSV"
extract_ops_s() {
# workload_bench prints: "Throughput: 28.97 M ops/sec"
# return ops/s as integer
awk '/Throughput:/ {print int($2*1000000)}' | tail -n1
}
run_case() {
local alloc="$1"; shift
local envstr="$1"; shift
local rep="$2"; shift
local ts=$(date +%H%M%S)
local out
out=$($envstr "$BENCH" -t "$THREADS" -c "$CYCLES" -o "$OPS" 2>/dev/null || true)
local tput=$(echo "$out" | extract_ops_s)
if [[ -n "${tput:-}" ]]; then
echo "$ts,redis,$alloc,$(echo "$envstr" | sed 's/,/;/g'),$THREADS,$CYCLES,$OPS,$rep,$tput" >>"$CSV"
fi
}
echo "[info] writing CSV to $CSV"
# System
for ((i=1;i<=REPS;i++)); do run_case system "env -i" "$i"; done
# mimalloc
if (( HAVE_MI == 1 )); then
for ((i=1;i<=REPS;i++)); do run_case mimalloc "env -i LD_PRELOAD=$MI_LIB" "$i"; done
fi
# HAKMEM (safer LD flags for tiny-only)
for ((i=1;i<=REPS;i++)); do \
run_case hakmem "env -i LD_PRELOAD=./libhakmem.so HAKMEM_WRAP_TINY=1 HAKMEM_LD_SAFE=1 HAKMEM_TINY_SUPERSLAB=0 HAKMEM_TINY_TRACE_RING=0 HAKMEM_SAFE_FREE=0" "$i"; done
echo "[done] $CSV"

View File

@ -0,0 +1,48 @@
#!/usr/bin/env bash
set -euo pipefail
# Run VM-mixed (512KB<2MB) bench across allocators and L25 A/B, save CSV.
# Usage: benchmarks/scripts/run_vm_mixed_matrix.sh [cycles] [ws] [reps]
ROOT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")"/../.. && pwd)"
cd "$ROOT_DIR"
cycles=${1:-20000}
ws=${2:-256}
reps=${3:-5}
outdir="bench_results/auto/vm_mixed_$(date +%Y%m%d_%H%M%S)"
mkdir -p "$outdir"
csv="$outdir/results.csv"
echo "ts,scenario,allocator,env,cycles,ws,rep,throughput_ops_s" >"$csv"
run_case() {
local scenario="$1"; shift
local alloc="$1"; shift
local envstr="$1"; shift
local bin="$1"; shift
for ((i=1;i<=reps;i++)); do
local ts=$(date +%H%M%S)
local out
out=$($envstr "$bin" "$cycles" "$ws" 4242 2>/dev/null || true)
local tput=$(echo "$out" | awk '/Throughput =/{print $3; exit}')
if [[ -n "${tput:-}" ]]; then
echo "$ts,$scenario,$alloc,$(echo "$envstr" | sed 's/,/;/g'),$cycles,$ws,$i,$tput" >>"$csv"
fi
done
}
# Build benches if needed
[[ -x ./bench_vm_mixed_system ]] || make -s bench_vm_mixed_system >/dev/null
[[ -x ./bench_vm_mixed_hakmem ]] || make -s bench_vm_mixed_hakmem >/dev/null
echo "[info] writing CSV to $csv"
# system
run_case "vm_mixed" "system" "env -i" ./bench_vm_mixed_system
# HAKMEM L25 OFF/ON
run_case "vm_mixed" "hakmem(l25=0)" "env -i HAKMEM_BIGCACHE_L25=0 HAKMEM_WRAP_TINY=1" ./bench_vm_mixed_hakmem
run_case "vm_mixed" "hakmem(l25=1)" "env -i HAKMEM_BIGCACHE_L25=1 HAKMEM_WRAP_TINY=1" ./bench_vm_mixed_hakmem
echo "[done] $csv"