hakmem

Author	SHA1	Message	Date
Moe Charm (CI)	3b4449d773	Phase v4-mid-1: C6-only v4 route + page_meta_of() Fail-Fast validation Implementation: - SMALL_SEGMENT_V4_* constants (SIZE=2MiB, PAGE_SIZE=64KiB, MAGIC=0xDEADBEEF) - smallsegment_v4_page_meta_of(): O(1) mask+shift lookup with magic validation - Computes segment base: addr & ~(2MiB - 1) - Verifies SmallSegment magic number - Calculates page_idx: (addr - seg_base) >> PAGE_SHIFT (16) - Returns non-NULL sentinel for now (full page_meta[] in Phase v4-mid-2) Stubs for C6-only phase: - small_heap_alloc_fast_v4(): C6 returns NULL → pool v1 fallback - small_heap_free_fast_v4(): C6 calls page_meta_of() for Fail-Fast, then pool v1 fallback Documentation: - ENV_PROFILE_PRESETS.md: Add "C6_ONLY_SMALLOBJECT_V4" research profile - HAKMEM_SMALL_HEAP_V4_ENABLED=1, HAKMEM_SMALL_HEAP_V4_CLASSES=0x40 - Expected: Throughput ≈ 28–29M ops/s (same as v1) Build: - ビルド成功（警告のみ） - Backward compatible, alloc/free stubs fall back to pool v1 Sanity: - C6-heavy with v4 opt-in: segv/assert なし - page_meta_of() lookup working correctly - Performance unchanged (expected for stub phase) Status: - C6-only v4 route now available via ENV opt-in - Phase v4-mid-2: SmallHeapCtx v4 full implementation with A/B 🤖 Generated with Claude Code Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com>	2025-12-10 23:37:45 +09:00
Moe Charm (CI)	2a13478dc7	Optimize C6 heavy and C7 ultra performance analysis with refined design refinements - Update environment profile presets and visibility analysis - Enhance small object and tiny segment v4 box implementations - Refine C7 ultra and C6 heavy allocation strategies - Add comprehensive performance metrics and design documentation 🤖 Generated with Claude Code Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com>	2025-12-10 22:57:26 +09:00
Moe Charm (CI)	d576116484	Document current Mixed baseline throughput and ENV profile	2025-12-10 14:12:13 +09:00
Moe Charm (CI)	406a2f4d26	Incremental improvements: mid_desc cache, pool hotpath optimization, and doc updates Changes: - core/box/pool_api.inc.h: Code organization and micro-optimizations - CURRENT_TASK.md: Updated Phase MD1 (mid_desc TLS cache: +3.2% for C6-heavy) - docs/analysis files: Various analysis and documentation updates - AGENTS.md: Agent role clarifications - TINY_FRONT_V3_FLATTENING_GUIDE.md: Flattening strategy documentation Verification: - random_mixed_hakmem: 44.8M ops/s (1M iterations, 400 working set) - No segfaults or assertions across all benchmark variants - Stable performance across multiple runs 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com>	2025-12-10 14:00:57 +09:00
Moe Charm (CI)	acc64f2438	Phase ML1: Pool v1 memset 89.73% overhead 軽量化 (+15.34% improvement) ## Summary - ChatGPT により bench_profile.h の setenv segfault を修正（RTLD_NEXT 経由に切り替え） - core/box/pool_zero_mode_box.h 新設：ENV キャッシュ経由で ZERO_MODE を統一管理 - core/hakmem_pool.c で zero mode に応じた memset 制御（FULL/header/off） - A/B テスト結果：ZERO_MODE=header で +15.34% improvement（1M iterations, C6-heavy） ## Files Modified - core/box/pool_api.inc.h: pool_zero_mode_box.h include - core/bench_profile.h: glibc setenv → malloc+putenv（segfault 回避） - core/hakmem_pool.c: zero mode 参照・制御ロジック - core/box/pool_zero_mode_box.h (新設): enum/getter - CURRENT_TASK.md: Phase ML1 結果記載 ## Test Results \| Iterations \| ZERO_MODE=full \| ZERO_MODE=header \| Improvement \| \|-----------\|----------------\|-----------------\|------------\| \| 10K \| 3.06 M ops/s \| 3.17 M ops/s \| +3.65% \| \| 1M \| 23.71 M ops/s \| 27.34 M ops/s \| +15.34% \| 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com>	2025-12-10 09:08:18 +09:00

5 Commits