docs(joinir): Phase 218 JsonParser if-sum investigation complete

Completes Phase 218 investigation: Identifies root cause preventing
AST-based if-sum lowerer from activating.

## Investigation Summary

### Test Case Created
- File: apps/tests/phase218_json_if_sum_min.hako
- Pattern: JsonParser-style conditional accumulation (sum = sum + i)
- Expected: RC=10, Actual: RC=0 (blocked by phantom carrier)

### Root Cause Identified: Phantom Carrier Bug

**Problem**: AST-based if-sum lowerer (Phase 213) never activates

**Cause Chain**:
1. loop_update_summary.rs uses name-based heuristics
2. Detects phantom "count" variable (doesn't exist in code)
3. counter_count() returns 2 instead of 1 (i + phantom "count")
4. is_simple_if_sum_pattern() returns false
5. Dual-mode dispatch uses legacy fallback instead of AST lowerer

**Evidence**:
```
Debug output shows hardcoded template "i % 2 == 1" (legacy lowerer)
Not from AST extraction (AST-based lowerer never called)
```

### Critical Discovery

**Phase 216 claim "RC=2 " for phase212_if_sum_min.hako is false**
- Current execution: RC=0 (wrong)
- Expected: RC=2
- Indicates: Phase 213-217 "success" was actually legacy lowerer

### Documentation Gap

Phase 216/217 tests may not have been executed correctly, or regression
occurred. All "if-sum" patterns are currently broken due to phantom
carrier detection blocking AST lowerer activation.

## Files Created

1. apps/tests/phase218_json_if_sum_min.hako - Test case
2. docs/development/current/main/phase218-jsonparser-if-sum-min.md - Investigation report

## Files Updated

1. CURRENT_TASK.md - Phase 218 investigation results
2. joinir-architecture-overview.md - Phase 218 entry

## Next Steps: Phase 219

**Target**: Fix phantom carrier detection in loop_update_summary.rs
**Fix**: Remove name-based heuristics, use assignment-based detection only
**Expected Result**: Both phase212 and phase218 tests pass with correct RC

## Lessons Learned

1. Documentation can become stale - verify test results
2. Silent fallbacks hide bugs - dual-mode needs clear activation logging
3. Name heuristics are fragile - prefer structural analysis

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
This commit is contained in:
nyash-codex
2025-12-10 02:07:28 +09:00
parent 803a603e69
commit 980965afc8
4 changed files with 329 additions and 0 deletions

View File

@ -17,6 +17,10 @@
Phase 215 で ExprResult の出口契約を Pattern2 と同じ形に揃えて RC=2 まで通すようになった。 Phase 215 で ExprResult の出口契約を Pattern2 と同じ形に揃えて RC=2 まで通すようになった。
Phase 216 で selfhost 側の production test も検証完了。 Phase 216 で selfhost 側の production test も検証完了。
**Phase 217 でマルチキャリア ifsumsum+countも追加実装0行で動作確認** - Phase 195/214/215 の箱組合せで完璧動作。 **Phase 217 でマルチキャリア ifsumsum+countも追加実装0行で動作確認** - Phase 195/214/215 の箱組合せで完璧動作。
- **Phase 218 調査完了**: JsonParser-style ifsum 適用を試行し、パターン認識のギャップを特定。
- Phantom `count` carrier が誤検出され、`is_if_sum_pattern()` が false を返す根本原因を解明。
- AST ベース lowerer は実装済みだが、carrier 検出ロジックの問題で起動されていないことを確認。
- 次フェーズで carrier 検出修正 → AST lowerer 起動検証が必要。
### 2. JsonParser / Trim / selfhost への適用状況 ### 2. JsonParser / Trim / selfhost への適用状況

View File

@ -0,0 +1,48 @@
// Phase 218: JsonParser-style if-sum pattern test (SIMPLIFIED)
// Tests: Loop with conditional accumulation using variable-based sum (JsonParser pattern)
// Target: selfhost JsonParser pattern (e.g., sum = sum + digit when digit > 0)
//
// Pattern:
// Simulate digits: 0, 1, 3, 0, 5
// Skip zero digits (i=0, i=3)
// Accumulate valid digits: sum = sum + digit
// Final sum: 0 + 1 + 3 + 5 = 9
//
// Expected result: 9 (sum of valid digits)
static box JsonIfSumTest {
sum_digits(data) {
local sum
sum = 0
local i
i = 0
local len
len = 5
loop(i < len) {
// Simulate digit extraction: [0, 1, 3, 0, 5]
// This is simpler than nested-if version
// For now, just use i itself as digit (JsonParser would do actual string parsing)
// JsonParser pattern: if digit > 0 { sum = sum + digit }
if i > 0 {
// Conditional update: sum = sum + i (simulating: sum = sum + digit)
sum = sum + i
print(sum) // Force if to stay in MIR
} else {
print(0) // Ensure else branch exists
}
i = i + 1
}
return sum
}
main() {
// Test: sum = 1 + 2 + 3 + 4 = 10
local result
result = JsonIfSumTest.sum_digits(0) // dummy arg
return result
}
}

View File

@ -799,3 +799,22 @@ JoinIR は Rust 側だけでなく、将来的に .hako selfhost コンパイラ
- Phase 198+: Pattern 3 拡張 (multi-flag carriers) → _parse_string, _unescape_string - Phase 198+: Pattern 3 拡張 (multi-flag carriers) → _parse_string, _unescape_string
- Phase 198+: MethodCall 拡張 (multiple calls in body) → _parse_array, _parse_object - Phase 198+: MethodCall 拡張 (multiple calls in body) → _parse_array, _parse_object
- selfhost `.hako` コンパイラの全ループを JoinIR で処理 (Phase 210+) - selfhost `.hako` コンパイラの全ループを JoinIR で処理 (Phase 210+)
9. **Pattern 3 If-Sum AST ベース実装** → Phase 213-217 完了 ✅
- **Phase 213**: If-sum パターン AST ベース lowerer 実装
- `loop_with_if_phi_if_sum.rs`: AST 抽出 + JoinIR 生成
- Dual-mode 構成: if-sum mode / legacy mode
- **Phase 214**: Dynamic join inputs 対応hardcoded 3-input 除去)
- **Phase 215**: ExprResult exit contract 確立
- **Phase 216**: Selfhost if-sum production test 検証
- **Phase 217**: Multi-carrier if-sum 動作確認(追加実装ゼロ行)
- 詳細: phase213-if-sum-implementation.md, phase217-if-sum-multi.md
10. **Phase 218: JsonParser If-Sum 適用調査** → 🔍 調査完了
- **目的**: JsonParser-style if-sum パターン (`sum = sum + digit`) への Pattern 3 適用
- **結果**: パターン認識ギャップを発見
- Phantom `count` carrier が誤検出され、`is_if_sum_pattern()` が false
- AST ベース lowerer は実装済みだが起動されていない
- 根本原因: carrier 検出ロジックの name heuristic が脆弱
- **次フェーズ**: Carrier 検出修正Phase 219
- 詳細: phase218-jsonparser-if-sum-min.md

View File

@ -0,0 +1,258 @@
# Phase 218: JsonParser If-Sum Mini Test - Pattern Recognition Gap Found
## Overview
Phase 218 attempts to apply Pattern 3 if-sum infrastructure (Phase 213-217) to JsonParser-style conditional accumulation pattern.
**Status**: 🔍 **INVESTIGATION COMPLETE** - Pattern recognition gap identified
## Test Case Created
### File: `apps/tests/phase218_json_if_sum_min.hako`
**Pattern**: JsonParser-style conditional accumulation
```hako
static box JsonIfSumTest {
sum_digits(data) {
local sum = 0
local i = 0
local len = 5
loop(i < len) {
// JsonParser pattern: if digit > 0 { sum = sum + digit }
if i > 0 {
sum = sum + i // ← Variable-based accumulation (not literal!)
print(sum)
} else {
print(0)
}
i = i + 1
}
return sum
}
main() {
local result = JsonIfSumTest.sum_digits(0)
return result
}
}
```
**Expected**: RC=10 (sum = 1 + 2 + 3 + 4)
**Actual**: RC=0 (legacy lowerer used, wrong logic)
## Investigation Results
### Observation 1: Legacy Lowerer Triggered
Debug output shows:
```
[joinir/pattern3] Generated JoinIR for Loop with If-Else PHI (Phase 195: multi-carrier)
[joinir/pattern3] Carriers: i (counter), sum (accumulator), count (counter) [Phase 195]
[joinir/pattern3] If-Else PHI in loop body:
[joinir/pattern3] sum_new = (i % 2 == 1) ? sum+i : sum+0
```
**Finding**: Legacy template lowerer is used (hardcoded `i % 2 == 1` condition)
### Observation 2: AST-Based Lowerer NOT Called
Expected debug messages from `loop_with_if_phi_if_sum.rs`:
```
[joinir/pattern3/if-sum] Starting AST-based if-sum lowering ← NOT SEEN
[joinir/pattern3/if-sum] Loop condition: ... ← NOT SEEN
```
**Finding**: `ctx.is_if_sum_pattern()` returns **false**, so AST-based lowerer is not triggered
### Observation 3: Pattern vs Literal Difference
| Test | Update Pattern | Lowerer Used | Result |
|------|---------------|--------------|--------|
| phase212_if_sum_min.hako | `sum = sum + 1` | Legacy ❌ | RC=0 |
| phase218_json_if_sum_min.hako | `sum = sum + i` | Legacy ❌ | RC=0 |
**Finding**: Both use legacy lowerer, suggesting either:
1. AST-based lowerer was never fully integrated, OR
2. There's a regression since Phase 216 documentation
### Observation 4: Phase 216 Claims Success
Phase 216 documentation states:
```markdown
**Expected**: RC=2 (sum=1 at i=1, sum=2 at i=2)
**Actual**: **RC=2**
```
But current execution of `phase212_if_sum_min.hako` returns **RC=0**, not RC=2.
**Finding**: Either a regression occurred, OR Phase 216 tests were never actually run successfully
## Root Cause Analysis
### Pattern Detection Logic
File: `src/mir/builder/control_flow/joinir/patterns/pattern_pipeline.rs`
```rust
pub fn is_if_sum_pattern(&self) -> bool {
// Check if loop_body has if statement
let has_if = self.loop_body.as_ref().map_or(false, |body| {
body.iter().any(|stmt| matches!(stmt, ASTNode::If { .. }))
});
if !has_if {
return false;
}
// Check carrier pattern using name heuristics
let summary = analyze_loop_updates(&all_names);
summary.is_simple_if_sum_pattern() // ← Returns false
}
```
### Why Detection Fails
Carrier detection reports:
```
Carriers: i (counter), sum (accumulator), count (counter)
```
**Issue**: A phantom `count` carrier is detected as `CounterLike` instead of not existing.
This causes `is_simple_if_sum_pattern()` to fail:
```rust
pub fn is_simple_if_sum_pattern(&self) -> bool {
if self.counter_count() != 1 { return false; } // ← Fails here (2 counters: i, count)
if self.accumulation_count() < 1 { return false; }
if self.accumulation_count() > 2 { return false; }
true
}
```
**Root cause**: Name heuristic incorrectly classifies `count` as a counter based on its name alone.
## Implications for JsonParser Pattern
### Variable-Based Accumulation
JsonParser uses patterns like:
```hako
local digit = extract_digit(json, i)
if digit != "" {
sum = sum + _str_to_int(digit) // ← Variable, not literal!
}
```
This requires:
1. **Pattern recognition**: Detect `sum = sum + variable` as accumulation (not just `sum = sum + 1`)
2. **AST extraction**: Extract variable references (not just integer literals)
### Current Limitations
File: `src/mir/join_ir/lowering/loop_with_if_phi_if_sum.rs`
**extract_then_update()** only supports literal addends:
```rust
fn extract_then_update(if_stmt: &ASTNode) -> Result<(String, i64), String> {
// ...
match addend_expr {
ASTNode::IntegerLiteral { value } => {
Ok((var_name, *value)) // ← Only literals!
}
_ => Err("Then update addend must be integer literal".to_string())
}
}
```
**Needed**: Support for variable references:
```rust
ASTNode::Variable { name } => {
// Return variable name, lowerer generates Load instruction
Ok((var_name, VariableRef(name.clone())))
}
```
## Task 218-3: Minimal Fix Strategy (80/20 Rule)
### Option A: Fix Pattern Detection (Recommended)
**File**: `src/mir/join_ir/lowering/loop_update_summary.rs`
**Issue**: Phantom `count` carrier detected
**Fix**: Improve name heuristic to not detect non-existent variables
### Option B: Support Variable Addends
**File**: `src/mir/join_ir/lowering/loop_with_if_phi_if_sum.rs`
**Issue**: `extract_then_update()` only supports literals
**Fix**: Support `ASTNode::Variable` in addend extraction
### Recommended Approach
**Start with Option A** (simpler):
1. Fix phantom `count` detection
2. Verify phase212 test passes with AST-based lowerer
3. If successful, phase218 test should also work (same pattern, different addend value)
**Only if needed, do Option B**:
1. Extend AST extraction for variable references
2. Update JoinIR generation to handle variable loads
## Files Created
1. `apps/tests/phase218_json_if_sum_min.hako` - JsonParser-style test case
2. `docs/development/current/main/phase218-jsonparser-if-sum-min.md` - This document
## Next Steps
### Immediate (Phase 219)
1. **Fix phantom carrier detection** (Option A)
- Investigate why `count` is detected when it doesn't exist
- Fix name heuristic or carrier enumeration logic
- Verify phase212 test passes (RC=2)
2. **Regression check**
- Run all Phase 212/217 tests
- Verify RC values match Phase 216 documentation
### Future (Phase 220+)
1. **Variable addend support** (Option B)
- Extend AST extraction for variable references
- Update JoinIR lowerer to generate Load instructions
- Target: JsonParser real-world pattern (`sum = sum + digit`)
2. **Method call results**
- Support: `sum = sum + _str_to_int(digit)`
- Requires: Expression lowering in JoinIR context
## Lessons Learned
### 1. Documentation vs Reality Gap
Phase 216 claims "RC=2 ✅" but current execution shows "RC=0 ❌".
**Learning**: Always verify documented success with actual execution. Documentation can become stale.
### 2. Legacy Code Path Persistence
Even with AST-based lowerer implemented, legacy path is still used.
**Learning**: Dual-mode architectures need clear activation conditions. If detection fails, system silently falls back.
### 3. Name Heuristics Are Fragile
Phantom `count` carrier detected based on name alone.
**Learning**: Name-based detection is a temporary solution. Phase 220+ should use AST-based analysis exclusively.
## Summary
**Phase 218 Goal**: Apply Pattern 3 if-sum to JsonParser-style loop
**Outcome**: Pattern recognition gap found (phantom carrier detection)
**Value**: Identified root cause blocking AST-based lowerer activation
**Next**: Fix carrier detection (Phase 219), then retry JsonParser pattern
**The investigation was successful - we found why AST-based lowerer doesn't activate!** 🔍