2025-12-02 16:16:51 +09:00
|
|
|
// wrapper_env_box.h - Environment variable cache for malloc/free wrappers
|
|
|
|
|
// Eliminates getenv() calls from malloc/free hot paths
|
|
|
|
|
#pragma once
|
|
|
|
|
|
|
|
|
|
#include <stdatomic.h>
|
|
|
|
|
|
|
|
|
|
typedef struct {
|
|
|
|
|
int inited;
|
|
|
|
|
int step_trace; // HAKMEM_STEP_TRACE (default: 0)
|
|
|
|
|
int ld_safe_mode; // HAKMEM_LD_SAFE (default: 1)
|
|
|
|
|
int free_wrap_trace; // HAKMEM_FREE_WRAP_TRACE (default: 0)
|
feat(Phase 2-1): Lane Classification + Fallback Reduction
## Phase 2-1: Lane Classification Box (Single Source of Truth)
### New Module: hak_lane_classify.inc.h
- Centralized size-to-lane mapping with unified boundary definitions
- Lane architecture:
- LANE_TINY: [0, 1024B] SuperSlab (unchanged)
- LANE_POOL: [1025, 52KB] Pool per-thread (extended!)
- LANE_ACE: [52KB, 2MB] ACE learning
- LANE_HUGE: [2MB+] mmap direct
- Key invariant: POOL_MIN = TINY_MAX + 1 (no gaps)
### Fixed: Tiny/Pool Boundary Mismatch
- Before: TINY_MAX_SIZE=1024 vs tiny_get_max_size()=2047 (inconsistent!)
- After: Both reference LANE_TINY_MAX=1024 (authoritative)
- Impact: Eliminates 1025-2047B "unmanaged zone" causing libc fragmentation
### Updated Files
- core/hakmem_tiny.h: Use LANE_TINY_MAX, fix sizes[7]=1024 (was 2047)
- core/hakmem_pool.h: Use POOL_MIN_REQUEST_SIZE=1025 (was 2048)
- core/box/hak_alloc_api.inc.h: Lane-based routing (HAK_LANE_IS_*)
## jemalloc Block Bug Fix
### Root Cause
- g_jemalloc_loaded initialized to -1 (unknown)
- Condition `if (block && g_jemalloc_loaded)` treated -1 as true
- Result: ALL allocations fallback to libc (even when jemalloc not loaded!)
### Fix
- Change condition to `g_jemalloc_loaded > 0`
- Only fallback when jemalloc is ACTUALLY loaded
- Applied to: malloc/free/calloc/realloc
### Impact
- Before: 100% libc fallback (jemalloc block false positive)
- After: Only genuine cases fallback (init_wait, lockdepth, etc.)
## Fallback Diagnostics (ChatGPT contribution)
### New Feature: HAKMEM_WRAP_DIAG
- ENV flag to enable fallback logging
- Reason-specific counters (init_wait, jemalloc_block, lockdepth, etc.)
- First 4 occurrences logged per reason
- Helps identify unwanted fallback paths
### Implementation
- core/box/wrapper_env_box.{c,h}: ENV cache + DIAG flag
- core/box/hak_wrappers.inc.h: wrapper_record_fallback() calls
## Verification
### Fallback Reduction
- Before fix: [wrap] libc malloc: jemalloc block (100% fallback)
- After fix: Only init_wait + lockdepth (expected, minimal)
### Known Issue
- Tiny allocator OOM (size=8) still crashes
- This is a pre-existing bug, unrelated to Phase 2-1
- Was hidden by jemalloc block false positive
- Will be investigated separately
## Performance Impact
### sh8bench 8 threads
- Phase 1-1: 15秒
- Phase 2-1: 14秒 (~7% improvement)
### Note
- True hakmem performance now measurable (no more 100% fallback)
- Tiny OOM prevents full benchmark completion
- Next: Fix Tiny allocator for complete evaluation
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
Co-Authored-By: ChatGPT <chatgpt@openai.com>
2025-12-02 19:13:28 +09:00
|
|
|
int wrap_diag; // HAKMEM_WRAP_DIAG (default: 0) - log first few libc fallbacks
|
2025-12-13 16:46:18 +09:00
|
|
|
int wrap_shape; // HAKMEM_WRAP_SHAPE (default: 0) - Phase 2 B4: malloc/free hot/cold split
|
2025-12-02 16:16:51 +09:00
|
|
|
} wrapper_env_cfg_t;
|
|
|
|
|
|
|
|
|
|
extern wrapper_env_cfg_t g_wrapper_env;
|
|
|
|
|
|
|
|
|
|
void wrapper_env_init_once(void);
|
|
|
|
|
|
|
|
|
|
static inline const wrapper_env_cfg_t* wrapper_env_cfg(void) {
|
|
|
|
|
// Constructor ensures init at library load time
|
|
|
|
|
// This check prevents repeated initialization in multi-threaded context
|
|
|
|
|
if (__builtin_expect(!g_wrapper_env.inited, 0)) {
|
|
|
|
|
wrapper_env_init_once();
|
|
|
|
|
}
|
|
|
|
|
return &g_wrapper_env;
|
|
|
|
|
}
|