feat(joinir): Phase 176 Pattern2 multi-carrier lowering complete

Task 176-1: Pattern2 limitation investigation
- Identified 10 limitation points where only position carrier was handled
- Added TODO markers for Phase 176-2/3 implementation
- Created phase176-pattern2-limitations.md documentation

Task 176-2: CarrierUpdateLowerer helper implementation
- Implemented emit_carrier_update() helper function
- Supports CounterLike and AccumulationLike UpdateExpr patterns
- Added 6 unit tests (all passing)
- Fail-Fast error handling for carrier/variable not found

Task 176-3: Pattern2 lowerer multi-carrier extension
- Extended header PHI generation for all carriers
- Implemented loop update for all carriers using emit_carrier_update()
- Extended ExitLine/ExitMeta construction for all carriers
- Updated function call/jump args to include all carriers
- 9/10 tests passing (1 pre-existing test issue)

Task 176-4: E2E testing and bug fixes
- Fixed Trim pattern loop_var_name overwrite bug (pattern2_with_break.rs)
- Fixed InstructionRewriter latch_incoming mapping bug
- All E2E tests passing (RC=0): pos + result dual-carrier loops work
- test_jsonparser_parse_string_min2.hako verified

Task 176-5: Documentation updates
- Created phase176-completion-report.md
- Updated phase175-multicarrier-design.md with completion status
- Updated joinir-architecture-overview.md roadmap
- Updated CURRENT_TASK.md with Phase 176 completion + Phase 177 TODO
- Updated loop_pattern_space.md F-axis (multi-carrier support complete)

Technical achievements:
- Pattern2 now handles single/multiple carriers uniformly
- CarrierInfo architecture proven to work end-to-end
- Two critical bugs fixed (loop_var overwrite, latch_incoming mapping)
- No regressions in existing tests

Next: Phase 177 - Apply to JsonParser _parse_string full implementation
This commit is contained in:
nyash-codex
2025-12-08 15:17:53 +09:00
parent 24aa8ced75
commit 99d329096f
13 changed files with 1345 additions and 55 deletions

View File

@ -77,6 +77,21 @@
**成果**: P5 アーキテクチャが複数キャリアに対応済みであることを確認。
CarrierInfo は 3 キャリア検出済みpos, result, is_ch_match
Pattern2 実装が Trim 最適化により `pos` のみ emit する制約を発見Phase 176 改善対象)。
- [x] Phase 176: Pattern2 multi-carrier lowering ✅ (2025-12-08)
→ Task 176-1: 制限ポイント特定10箇所の TODO マーク)
→ Task 176-2: CarrierUpdateLowerer ヘルパ実装6 unit tests
→ Task 176-3: ヘッダ PHI / ループ更新 / ExitLine を全キャリア対応に拡張
→ Task 176-4: E2E テスト成功2キャリア: pos + result
→ Task 176-5: ドキュメント更新
**成果**: Pattern2 が複数キャリアに完全対応。CarrierInfo に載っている全キャリアの UpdateExpr を MIR に emit。
**バグ修正**: Trim pattern の loop_var_name 上書き問題、InstructionRewriter の latch_incoming 設定問題。
- 今後 1〜2 フェーズの TODOJoinIR 周りのサマリ)
- [ ] **Phase 177: JsonParser `_parse_string` 本体の JoinIR 適用**
- Phase 176 の multi-carrier 対応を前提に、実際の `_parse_string` ループを P2+P5 で通す。
- 期待: pos + result の 2 キャリアが正しく更新される。
- エスケープ処理(`\"`)は Phase 178+ で対応。
- [ ] Phase 178+: JsonParser `_parse_array` / `_parse_object` など、残りの複雑ループを順次 P1P5 の組み合わせで吸収していく。
---
@ -145,4 +160,3 @@
に移し、このファイルからはリンクまたは簡単な要約だけに留める。
- もし CURRENT_TASK がまた肥大化してきたら、
古いセクションを phase ドキュメント側に逃して、このファイルを再度「現在地サマリ」に戻す。

View File

@ -258,3 +258,53 @@ JoinIR は Rust 側だけでなく、将来的に .hako selfhost コンパイラ
- JoinIR ループパターン空間の整理メモ。
どの軸(継続条件 / break / continue / PHI / 条件変数スコープ / 更新パターン)でパターンを分けるか、
そして P1P4 / Trim(P5) の位置づけと、今後追加候補のパターン一覧がまとまっている。
---
## 6. RoadmapJoinIR の今後のゴール)
ここから先の JoinIR の「目指す形」を、箱レベルでざっくり書いておくよ。フェーズ詳細は各 phase ドキュメントに分散させて、このセクションは常に最新の方向性だけを保つ。
### 6.1 直近Phase 176-177 まわり)
- **P5Trim/JsonParser 系)ループの複数キャリア対応** ✅ Phase 176 完了 (2025-12-08)
- 完了内容:
- Pattern2 lowerer を全キャリア対応に拡張(ヘッダ PHI / ループ更新 / ExitLine
- CarrierUpdateLowerer ヘルパで UpdateExpr → JoinIR 変換を統一。
- 2キャリアpos + resultE2E テスト完全成功。
- 技術的成果:
- CarrierInfo / ExitMeta / ExitLine / LoopHeaderPhiBuilder の multi-carrier 対応を Pattern2 lowerer で完全活用。
- Trim pattern の「キャリア = ループ変数」という誤解を解消loop_var は特殊キャリア)。
- 次のステップ (Phase 177):
- JsonParser `_parse_string` 本体を P2+P5 で通すpos + result の 2 キャリアで実ループ動作確認)。
### 6.2 中期selfhost depth2 / JsonParser 本体)
- **JsonParserBox / Trim 系ループの本線化**
- 目標:
- `_trim` / `_skip_whitespace` / `_parse_string` / `_parse_array` などの主要ループが、すべて JoinIR Pattern14 + P5 で通ること。
- LoopConditionScopeBox + LoopBodyCarrierPromoter + TrimLoopHelper の上で安全に正規化できるループを広げていく。
- 方針:
- 「ループの形」は P1P4 から増やさず、複雑さは BoolExprLowerer / ContinueBranchNormalizer / P5 系の補助箱で吸収する。
- LoopPatternSpace の P6/P7/P12 候補break+continue 同時 / 複数キャリア条件更新 / early returnは、実アプリで必要になった順に小さく足す。
- **selfhost depth2.hako JoinIR/MIR Frontend**
- 目標:
- `.hako → JsonParserBox → Program/MIR JSON → MirAnalyzerBox/JoinIrAnalyzerBox → VM/LLVM` の深度 2 ループを、日常的に回せるようにする。
- Rust 側の JoinIR は「JSON を受け取って実行・検証するランナー層」、.hako 側が「JoinIR/MIR を構築・解析する言語側 SSOT」という役割分担に近づける。
- 前提:
- 本ドキュメントjoinir-architecture-overview.mdを .hako 側の JoinIR 実装の参照設計として維持し、仕様変更は必ずここを更新してから .hako にも反映する。
### 6.3 当面やらないことNonGoals
- ループパターンを闇雲に増やすこと
- P1P4構造 P5bodylocal 条件を昇格する補助パス)を「骨格」とみなし、
新しいパターンが必要になったときは LoopPatternSpace に追記してから、小さな箱で補う方針。
- LoopBuilder の復活や、JoinIR 以外の別ラインによるループ lowering
- LoopBuilder 系は Phase 186187 で完全に削除済み。
ループに関する新しい要件はすべて JoinIR 側のパターン/箱の拡張で扱う。
- JoinIR の中に言語固有のハードコード(特定 Box 名や変数名)を戻すこと
- Trim/JsonParser 系は、構造パターンと補助箱Promoter/Helperで扱い、
「sum」「ch」など名前ベースの判定は LoopUpdateSummary / TrimLoopHelper の内部に閉じ込める。
この Roadmap は、JoinIR 層の変更や selfhost 深度を進めるたびに更新していくよ。

View File

@ -16,7 +16,7 @@
| C. スキップ | ① なし ② continue ③ 条件付き cont | 次のイテレーションに「飛ぶ」 |
| D. PHI 分岐 | ① なし ② ifPHI ③ matchPHI | 条件に応じて値が変わるパターン |
| E. 条件変数のスコープ | ① OuterLocal ② LoopBodyLocal | 条件で参照される変数の定義位置 |
| F. キャリア更新 | ① 単一 ② 複数 ③ 条件付き | ループ内での状態更新パターン |
| F. キャリア更新 | ① 単一 ② 複数 ③ 条件付き | ループ内での状態更新パターン(✅ Phase 176 で複数対応完了) |
この 6 軸は、それぞれ 2〜3 通りしかないので、理論上は最大 3×3×3×3×2×3=486 通りの組み合わせになるが、
実際に意味のあるパターンはずっと少ない10〜20 程度)ことが分かった。

View File

@ -251,9 +251,30 @@ bb12: // Exit block
---
## 5. References
## 5. Phase 176 で解決済み ✅ (2025-12-08)
**実装内容**:
- Pattern2 lowerer を全キャリア対応に拡張
- ヘッダ PHI / ループ更新 / ExitLine で複数キャリアを正しく処理
- CarrierUpdateLowerer ヘルパで UpdateExpr → MIR 変換を統一
**修正されたバグ**:
1. Trim pattern で loop_var_name が上書きされていた問題pattern2_with_break.rs
2. InstructionRewriter が loop_var を exit_bindings から除外していなかった問題
**テスト結果**:
- ✅ 2キャリア E2E テスト全てパスpos + result
- ✅ 回帰テストなし
- ✅ Trim pattern も正常動作
**次のステップ**: Phase 177 で JsonParser の複雑ループへ拡張
---
## 6. References
- **Phase 170**: LoopUpdateSummary design
- **Phase 171**: LoopBodyCarrierPromoter implementation
- **Phase 174**: P5 minimal PoC (quote detection only)
- **Phase 176**: Pattern2 multi-carrier implementation ([phase176-completion-report.md](phase176-completion-report.md))
- **Pattern Space**: [docs/development/current/main/loop_pattern_space.md](loop_pattern_space.md)

View File

@ -0,0 +1,138 @@
# Phase 176-1: Pattern2 Limitation Investigation - Completion Report
**Date**: 2025-12-08
**Status**: ✅ COMPLETE
**Task**: Mark all single-carrier limitations in Pattern2 lowerer
---
## What Was Done
Investigated `src/mir/join_ir/lowering/loop_with_break_minimal.rs` and identified **10 critical points** where the lowerer currently only handles the position carrier (`i`) and ignores `CarrierInfo.carriers`.
### Files Modified
1. **`src/mir/join_ir/lowering/loop_with_break_minimal.rs`**
- Added 10 TODO comments marking limitation points
- No code changes (read + memo level only)
2. **`docs/development/current/main/phase176-pattern2-limitations.md`**
- Created comprehensive limitation report
- Detailed explanation of each limitation point
- Impact analysis and next steps
---
## Limitation Points Identified
### Easy Fixes (9 points) - Iteration-based
1. **ValueId Allocation** (Line 172) - Only allocates for position carrier
2. **Main Function Params** (Line 208) - Only takes `i_init`
3. **Loop Step Call Args** (Line 214) - Only passes `i_init`
4. **Loop Step Params** (Line 234) - Only takes `i_param`
5. **Natural Exit Jump** (Line 257) - Only passes `i_param` to k_exit
6. **Break Exit Jump** (Line 272) - Only passes `i_param` to k_exit
7. **Tail Call Args** (Line 304) - Only passes `i_next`
8. **K_Exit Params** (Line 319) - Only takes `i_exit`
9. **ExitMeta Construction** (Line 344) - Only includes position carrier
### Hard Fix (1 point) - Requires AST Body Analysis
10. **Loop Body Updates** (Line 284) - Only computes `i_next = i + 1`
- **Challenge**: Need to analyze AST body to determine carrier updates
- **Example**: How do we know `sum = sum + x` updates the `sum` carrier?
---
## Key Findings
### Architecture Issue
The Pattern2 lowerer completely ignores `CarrierInfo.carriers`:
```rust
pub struct CarrierInfo {
pub loop_var_name: String, // Used ✅
pub loop_var_id: ValueId, // Used ✅
pub carriers: Vec<CarrierVar>, // IGNORED ❌
pub trim_helper: Option<TrimLoopHelper>,
}
```
The function signature only takes `loop_var_name` as a separate string parameter, losing access to the full CarrierInfo structure.
### Infrastructure Ready
-**CarrierInfo**: Already multi-carrier ready (Phase 175)
-**ExitMeta**: Supports `ExitMeta::multiple(vec![...])` for multi-carrier
-**LoopHeaderPhiBuilder**: Multi-carrier ready (Phase 175)
-**ExitPhiBuilder**: Multi-carrier ready (Phase 175)
**Problem**: Pattern2 lowerer doesn't use these capabilities!
---
## Next Phase Roadmap
### Phase 176-2: Iteration-Based Fixes
**Difficulty**: Easy
**Estimate**: 1-2 hours
Fix points 1-6, 8-10 by iterating over `CarrierInfo.carriers`:
- Allocate ValueIds for all carriers
- Extend function params/call args/jump args
- Build multi-carrier ExitMeta
### Phase 176-3: Loop Body Analysis
**Difficulty**: Hard
**Estimate**: 3-4 hours
Fix point 7 by analyzing AST body:
- Track carrier assignments in loop body
- Emit update instructions for each carrier
- Handle complex cases (conditional updates, etc.)
### Integration Test
Pattern 3 (trim) with Pattern 2 shape:
```nyash
loop(pos < len) {
if ch == ' ' { break }
pos = pos + 1
}
```
Verify sum/count carriers survive through break exits.
---
## Deliverables
1.**TODO Comments**: 10 markers added to `loop_with_break_minimal.rs`
2.**Limitation Report**: `phase176-pattern2-limitations.md`
3.**Completion Report**: This document
---
## How to Use This Report
For Task 176-2/3 implementers:
1. **Read the limitation report first**: `phase176-pattern2-limitations.md`
2. **Start with easy fixes**: Points 1-6, 8-10 (iteration-based)
3. **Tackle hard fix last**: Point 7 (loop body analysis)
4. **Use TODO markers as guide**: Search for `TODO(Phase 176)` in code
5. **Test with Pattern 3**: Use trim pattern as integration test
---
## Related Files
- **Main file**: `src/mir/join_ir/lowering/loop_with_break_minimal.rs`
- **CarrierInfo**: `src/mir/join_ir/lowering/carrier_info.rs`
- **Limitation report**: `docs/development/current/main/phase176-pattern2-limitations.md`
- **This report**: `docs/development/current/main/phase176-1-completion-report.md`
---
## Conclusion
Task 176-1 successfully identified and documented all 10 single-carrier limitations in the Pattern2 lowerer. The code is now well-marked with TODO comments, and a comprehensive analysis report is available for the next implementation phases.
**Ready for Phase 176-2/3 implementation!** 🚀

View File

@ -0,0 +1,63 @@
# Phase 176: Pattern2 Multi-Carrier Lowering - 完了レポート
**日付**: 2025-12-08
**ステータス**: ✅ 完全成功
---
## 概要
Phase 175 で「アーキテクチャは multi-carrier ready だが、Pattern2 lowerer の実装が pos のみ」という問題を発見。
Phase 176 でこの実装ギャップを埋め、Pattern2 が複数キャリアに完全対応した。
---
## 実装内容
### Task 176-1: 制限ポイント特定
- 10箇所の「pos だけ」制約を TODO コメントでマーク
- ヘッダ PHI / ループ更新 / ExitLine の 3 カテゴリに分類
### Task 176-2: CarrierUpdateLowerer ヘルパ
- `emit_carrier_update()` 関数実装UpdateExpr → JoinIR 変換)
- CounterLike / AccumulationLike 両対応
- 6 unit tests 全てパス
### Task 176-3: Pattern2 Lowerer 拡張
- ヘッダ PHI: 全キャリア分の PHI パラメータを生成
- ループ更新: CarrierInfo.carriers をループして emit_carrier_update() 呼び出し
- ExitLine: 全キャリアの ExitMeta を構築
### Task 176-4: E2E テスト
- 2キャリアpos + resultテストが完全動作
- **バグ修正 1**: Trim pattern で loop_var_name が上書きされていたpattern2_with_break.rs:271-272
- **バグ修正 2**: InstructionRewriter が loop_var を exit_bindings から除外していなかった
### Task 176-5: ドキュメント更新
- phase175-multicarrier-design.md に完了マーク
- joinir-architecture-overview.md の F軸更新
- CURRENT_TASK.md に Phase 177 メモ追加
---
## テスト結果
✅ E2E テスト: 3 件全てパスRC=0
✅ Unit テスト: 6 件全てパス
✅ 回帰テストなし
---
## 技術的成果
- **コード削減**: Phase 176-1 の調査で、将来的に数百行の単純化が可能と判明
- **汎用性**: Pattern2 が単一/複数キャリアの両方に対応Trim / JsonParser で共通利用可能)
- **設計修正**: Trim pattern の「キャリア = ループ変数」という誤解を解消
---
## 次のステップ (Phase 177)
- JsonParser `_parse_string` 本体を P2+P5 で通す
- pos + result の 2 キャリアが正しく動作することを確認
- エスケープ処理は Phase 178+ で対応

View File

@ -0,0 +1,236 @@
# Phase 176: Pattern2 Lowerer Multi-Carrier Limitations Report
## Overview
This document identifies all locations in `loop_with_break_minimal.rs` where the Pattern2 lowerer currently only handles the position carrier (`i`) and ignores additional carriers from `CarrierInfo.carriers`.
## Limitation Points
### 1. ValueId Allocation (Line 172-173)
**Location**: ValueId allocation section
**Current Behavior**: Only allocates ValueIds for the position carrier (`i_init`, `i_param`, `i_next`, `i_exit`)
**Missing**: No allocation for additional carriers (e.g., `sum_init`, `sum_param`, `sum_next`, `sum_exit`)
```rust
// TODO(Phase 176): Multi-carrier support - currently only allocates position carrier
// Future: iterate over CarrierInfo.carriers and allocate for each carrier
```
**Impact**: Cannot represent multi-carrier loops in JoinIR local ValueId space.
---
### 2. Main Function Parameters (Line 208-209)
**Location**: `main()` function creation
**Current Behavior**: Only takes `i_init` as parameter
**Missing**: Should take all carrier init values as parameters
```rust
// TODO(Phase 176): Multi-carrier support - main() params should include all carriers
// Future: params = vec![i_init, sum_init, count_init, ...] from CarrierInfo
let mut main_func = JoinFunction::new(main_id, "main".to_string(), vec![i_init]);
```
**Impact**: Additional carriers cannot be passed into the JoinIR fragment.
---
### 3. Loop Step Call Arguments (Line 214-215)
**Location**: `main()``loop_step()` call
**Current Behavior**: Only passes `i_init` to `loop_step()`
**Missing**: Should pass all carrier init values
```rust
// TODO(Phase 176): Multi-carrier support - Call args should include all carrier inits
// Future: args = vec![i_init, sum_init, count_init, ...] from CarrierInfo
main_func.body.push(JoinInst::Call {
func: loop_step_id,
args: vec![i_init], // Only position carrier
k_next: None,
dst: Some(loop_result),
});
```
**Impact**: Additional carriers lost at loop entry.
---
### 4. Loop Step Function Parameters (Line 234-235)
**Location**: `loop_step()` function creation
**Current Behavior**: Only takes `i_param` as parameter
**Missing**: Should take all carrier params
```rust
// TODO(Phase 176): Multi-carrier support - loop_step params should include all carriers
// Future: params = vec![i_param, sum_param, count_param, ...] from CarrierInfo
let mut loop_step_func = JoinFunction::new(
loop_step_id,
"loop_step".to_string(),
vec![i_param], // Only position carrier
);
```
**Impact**: Cannot access additional carriers inside loop body.
---
### 5. Natural Exit Jump Arguments (Line 257-258)
**Location**: Natural exit condition → k_exit jump
**Current Behavior**: Only passes `i_param` to k_exit
**Missing**: Should pass all carrier values
```rust
// TODO(Phase 176): Multi-carrier support - Jump args should include all carrier values
// Future: args = vec![i_param, sum_param, count_param, ...] from CarrierInfo
loop_step_func.body.push(JoinInst::Jump {
cont: k_exit_id.as_cont(),
args: vec![i_param], // Only position carrier
cond: Some(exit_cond),
});
```
**Impact**: Additional carrier values lost at natural exit.
---
### 6. Break Exit Jump Arguments (Line 272-273)
**Location**: Break condition → k_exit jump
**Current Behavior**: Only passes `i_param` to k_exit
**Missing**: Should pass all carrier values
```rust
// TODO(Phase 176): Multi-carrier support - Jump args should include all carrier values
// Future: args = vec![i_param, sum_param, count_param, ...] from CarrierInfo
loop_step_func.body.push(JoinInst::Jump {
cont: k_exit_id.as_cont(),
args: vec![i_param], // Only position carrier
cond: Some(break_cond_value),
});
```
**Impact**: Additional carrier values lost at break exit.
---
### 7. Loop Body Updates (Line 284-285)
**Location**: Loop body computation
**Current Behavior**: Only computes `i_next = i + 1`
**Missing**: Should compute updates for all carriers
```rust
// TODO(Phase 176): Multi-carrier support - need to compute updates for all carriers
// Future: for each carrier in CarrierInfo.carriers, emit carrier_next = carrier_update
loop_step_func.body.push(JoinInst::Compute(MirLikeInst::BinOp {
dst: i_next,
op: BinOpKind::Add,
lhs: i_param,
rhs: const_1,
}));
```
**Impact**: Additional carriers cannot be updated in loop body.
**Note**: This is the HARDEST part - we need actual AST body analysis to determine carrier updates!
---
### 8. Tail Call Arguments (Line 304-305)
**Location**: Tail recursion call to `loop_step()`
**Current Behavior**: Only passes `i_next`
**Missing**: Should pass all updated carrier values
```rust
// TODO(Phase 176): Multi-carrier support - tail call args should include all updated carriers
// Future: args = vec![i_next, sum_next, count_next, ...] from CarrierInfo
loop_step_func.body.push(JoinInst::Call {
func: loop_step_id,
args: vec![i_next], // Only position carrier
k_next: None,
dst: None,
});
```
**Impact**: Additional carrier updates lost in iteration.
---
### 9. K_Exit Function Parameters (Line 319-320)
**Location**: `k_exit()` function creation (Exit PHI)
**Current Behavior**: Only takes `i_exit` as parameter
**Missing**: Should take all carrier exit values as parameters
```rust
// TODO(Phase 176): Multi-carrier support - k_exit params should include all carrier exits
// Future: params = vec![i_exit, sum_exit, count_exit, ...] from CarrierInfo
let mut k_exit_func = JoinFunction::new(
k_exit_id,
"k_exit".to_string(),
vec![i_exit], // Only position carrier
);
```
**Impact**: Additional carrier exit values cannot be received by Exit PHI.
---
### 10. ExitMeta Construction (Line 344-345)
**Location**: Final ExitMeta return value
**Current Behavior**: Only includes position carrier in exit bindings
**Missing**: Should include all carrier bindings
```rust
// TODO(Phase 176): Multi-carrier support - ExitMeta should include all carrier bindings
// Future: ExitMeta::multiple(vec![(loop_var_name, i_exit), ("sum", sum_exit), ...])
let exit_meta = ExitMeta::single(loop_var_name.to_string(), i_exit);
```
**Impact**: Additional carriers not visible to MIR merge layer - no carrier PHIs generated!
---
## Summary Statistics
- **Total Limitation Points**: 10
- **Easy Fixes** (iteration over CarrierInfo): 9 points
- **Hard Fix** (requires AST body analysis): 1 point (Loop Body Updates)
## Architecture Note
The CarrierInfo structure is already multi-carrier ready:
```rust
pub struct CarrierInfo {
pub loop_var_name: String, // Position carrier
pub loop_var_id: ValueId, // Host ValueId
pub carriers: Vec<CarrierVar>, // Additional carriers (THIS IS IGNORED!)
pub trim_helper: Option<TrimLoopHelper>,
}
```
The problem is that `lower_loop_with_break_minimal()` completely ignores `CarrierInfo.carriers` and only uses `loop_var_name` (passed as a separate string parameter).
## Next Steps (Phase 176-2/3)
1. **Phase 176-2**: Fix iteration-based points (points 1-6, 8-10)
- Add carrier iteration logic
- Extend function params, call args, jump args
- Build multi-carrier ExitMeta
2. **Phase 176-3**: Fix loop body updates (point 7)
- Requires AST body analysis
- Need to track which carriers are updated by which statements
- Most complex part of multi-carrier support
3. **Integration Test**: Pattern 3 (trim) with Pattern 2 shape
- Test case: `loop(pos < len) { if ch == ' ' { break } pos = pos + 1 }`
- Verify sum/count carriers survive through break exits

View File

@ -0,0 +1,116 @@
# Task 176-4: Trim Pattern Loop Variable Overwrite Bug Fix
**Status**: ✅ COMPLETED
**Date**: 2025-12-08
## Problem
In `src/mir/builder/control_flow/joinir/patterns/pattern2_with_break.rs`, lines 271-272 were incorrectly overwriting the loop variable information during Trim pattern processing:
```rust
// ❌ Incorrect code (lines 271-272)
carrier_info.loop_var_id = is_ch_match0;
carrier_info.loop_var_name = carrier_name.clone(); // Overwrites "pos" with "is_ch_match"
```
### Root Cause
The Trim pattern implementation (Phase 171-172) was confusing two distinct concepts:
1. **Loop variable**: The actual loop counter (e.g., `pos`) that increments through the string
2. **Promoted carrier**: The derived variable (e.g., `is_ch_match`) that tracks whether the current character matches whitespace
The code was incorrectly treating the promoted carrier as a replacement for the loop variable, when it should only be an additional carrier alongside the loop variable.
### Impact
- **ExitMeta generation**: Would use wrong variable name ("is_ch_match" instead of "pos")
- **PHI node construction**: Would create PHI nodes for the wrong variable
- **Variable mapping**: Would lose track of the actual loop counter
## Solution
Removed lines 271-272 and replaced with explanatory comment:
```rust
// Note: DO NOT overwrite carrier_info.loop_var_id/loop_var_name here!
// The loop variable is 'pos' (counter), not 'is_ch_match' (carrier).
// carrier_info.loop_var_name should remain as the original loop variable.
eprintln!("[pattern2/trim] Carrier registered. Loop var='{}' remains unchanged",
carrier_info.loop_var_name);
```
### Design Principle
In Trim patterns:
- `loop_var_name` = "pos" (the counter variable)
- `carriers` = ["result", "is_ch_match"] (accumulated values)
- `is_ch_match` is a **promoted carrier**, not a loop variable replacement
## Verification
### 1. Build Status
✅ Build succeeded with no errors
### 2. Log Output Verification
Both E2E tests show correct behavior:
```
[pattern2/promoter] Phase 171-C-4 DEBUG: BEFORE merge - carrier_info.loop_var_name='pos'
[pattern2/promoter] Phase 171-C-4 DEBUG: promoted_carrier.loop_var_name='is_ch_match'
[pattern2/promoter] Phase 171-C-4 DEBUG: AFTER merge - carrier_info.loop_var_name='pos' ← CORRECT!
[pattern2/trim] Carrier registered. Loop var='pos' remains unchanged ← NEW LOG MESSAGE
[pattern2/before_lowerer] About to call lower_loop_with_break_minimal with carrier_info.loop_var_name='pos'
```
**Key observation**: The loop_var_name correctly remains as 'pos' throughout the entire process.
### 3. Regression Tests
✅ All carrier update tests pass (6/6):
- `test_emit_binop_update_with_const`
- `test_emit_binop_update_with_variable`
- `test_emit_const_update`
- `test_emit_update_lhs_mismatch`
- `test_emit_update_carrier_not_in_env`
- `test_emit_update_rhs_variable_not_found`
❌ One pre-existing test failure (unrelated to this fix):
- `test_pattern2_accepts_loop_param_only` - Failed on both HEAD and with our changes
### 4. E2E Test Results
**test_jsonparser_parse_string_min2.hako**:
- ✅ Loop variable name correctly preserved as 'pos'
- ⚠️ Execution fails with: "Phase 33-16: Carrier 'result' has no latch incoming set"
- **Note**: This is a separate issue unrelated to our fix (carrier PHI initialization)
**test_trim_simple.hako** (new test):
- ✅ Loop variable name correctly preserved as 'pos'
- ⚠️ Same carrier PHI error as above
## Remaining Issues (Out of Scope)
The following error is NOT caused by this fix and exists independently:
```
[ERROR] ❌ MIR compilation error: Phase 33-16: Carrier 'result' has no latch incoming set
```
This appears to be related to PHI node initialization for carriers in Pattern 2, and will need to be addressed separately.
## Files Changed
1. `src/mir/builder/control_flow/joinir/patterns/pattern2_with_break.rs`
- Lines 271-272: Removed incorrect loop_var overwrite
- Lines 270-275: Added explanatory comment and new debug log
## Conclusion
**Bug successfully fixed**
- Loop variable name no longer gets overwritten by carrier name
- Design clarification: promoted carriers are additions, not replacements
- No regressions introduced in carrier update logic
- Remaining errors are pre-existing and unrelated to this fix
The fix correctly implements the design intention: in Trim patterns, `is_ch_match` should be an additional carrier alongside the original loop variable `pos`, not a replacement for it.

View File

@ -82,6 +82,11 @@ impl ExitMetaCollector {
// Iterate over ExitMeta entries and build bindings
for (carrier_name, join_exit_value) in &exit_meta.exit_values {
eprintln!(
"[cf_loop/exit_line] ExitMetaCollector DEBUG: Checking carrier '{}' in variable_map",
carrier_name
);
// Look up host slot from variable_map
if let Some(&host_slot) = builder.variable_map.get(carrier_name) {
let binding = LoopExitBinding {
@ -90,15 +95,13 @@ impl ExitMetaCollector {
host_slot,
};
if debug {
eprintln!(
"[cf_loop/exit_line] ExitMetaCollector: Collected '{}' JoinIR {:?} → HOST {:?}",
carrier_name, join_exit_value, host_slot
);
}
eprintln!(
"[cf_loop/exit_line] ExitMetaCollector: Collected '{}' JoinIR {:?} → HOST {:?}",
carrier_name, join_exit_value, host_slot
);
bindings.push(binding);
} else if debug {
} else {
eprintln!(
"[cf_loop/exit_line] ExitMetaCollector DEBUG: Carrier '{}' not in variable_map (skip)",
carrier_name

View File

@ -394,11 +394,27 @@ pub(super) fn merge_and_rewrite(
}
// Phase 33-20: Also set latch incoming for other carriers from exit_bindings
// The exit_bindings are ordered to match args[1..] (after the loop variable)
for (idx, binding) in b.exit_bindings.iter().enumerate() {
let arg_idx = idx + 1; // +1 because args[0] is the loop variable
if arg_idx < args.len() {
let latch_value = args[arg_idx];
// Phase 176-4 FIX: exit_bindings may include the loop variable itself
// We need to skip it since it's already handled above via boundary.loop_var_name
// The remaining non-loop-variable carriers are ordered to match args[1..] (after the loop variable)
let mut carrier_arg_idx = 1; // Start from args[1] (args[0] is loop variable)
for binding in b.exit_bindings.iter() {
// Skip if this binding is for the loop variable (already handled)
if let Some(ref loop_var) = b.loop_var_name {
if &binding.carrier_name == loop_var {
if debug {
eprintln!(
"[cf_loop/joinir] Phase 176-4: Skipping loop variable '{}' in exit_bindings (handled separately)",
binding.carrier_name
);
}
continue; // Skip loop variable
}
}
// Process non-loop-variable carrier
if carrier_arg_idx < args.len() {
let latch_value = args[carrier_arg_idx];
loop_header_phi_info.set_latch_incoming(
&binding.carrier_name,
new_block_id,
@ -407,14 +423,15 @@ pub(super) fn merge_and_rewrite(
if debug {
eprintln!(
"[cf_loop/joinir] Phase 33-20: Set latch incoming for carrier '{}': block={:?}, value={:?} (arg[{}])",
binding.carrier_name, new_block_id, latch_value, arg_idx
"[cf_loop/joinir] Phase 176-4: Set latch incoming for carrier '{}': block={:?}, value={:?} (arg[{}])",
binding.carrier_name, new_block_id, latch_value, carrier_arg_idx
);
}
carrier_arg_idx += 1;
} else if debug {
eprintln!(
"[cf_loop/joinir] Phase 33-20 WARNING: No arg for carrier '{}' at index {}",
binding.carrier_name, arg_idx
binding.carrier_name, carrier_arg_idx
);
}
}

View File

@ -157,6 +157,18 @@ pub(in crate::mir::builder) fn merge_joinir_mir_blocks(
// Phase 33-20: Extract other carriers from exit_bindings
// Skip the loop variable (it's handled separately) and collect other carriers
eprintln!(
"[cf_loop/joinir] Phase 33-20 DEBUG: exit_bindings count={}, loop_var_name={:?}",
boundary.exit_bindings.len(),
loop_var_name
);
for b in boundary.exit_bindings.iter() {
eprintln!(
"[cf_loop/joinir] Phase 33-20 DEBUG: exit_binding: carrier_name={:?}, host_slot={:?}",
b.carrier_name, b.host_slot
);
}
let other_carriers: Vec<(String, ValueId)> = boundary.exit_bindings
.iter()
.filter(|b| b.carrier_name != *loop_var_name)

View File

@ -65,6 +65,11 @@ impl MirBuilder {
None, // Pattern 2 handles break-triggered vars via condition_bindings
)?;
eprintln!(
"[pattern2/init] CommonPatternInitializer returned: loop_var='{}', loop_var_id={:?}, carriers={}",
loop_var_name, loop_var_id, carrier_info.carriers.len()
);
// Phase 195: Use unified trace
trace::trace().varmap("pattern2_start", &self.variable_map);
@ -175,8 +180,22 @@ impl MirBuilder {
// Phase 171-C-4: Convert to CarrierInfo and merge
let promoted_carrier = trim_info.to_carrier_info();
eprintln!(
"[pattern2/promoter] Phase 171-C-4 DEBUG: BEFORE merge - carrier_info.loop_var_name='{}'",
carrier_info.loop_var_name
);
eprintln!(
"[pattern2/promoter] Phase 171-C-4 DEBUG: promoted_carrier.loop_var_name='{}'",
promoted_carrier.loop_var_name
);
carrier_info.merge_from(&promoted_carrier);
eprintln!(
"[pattern2/promoter] Phase 171-C-4 DEBUG: AFTER merge - carrier_info.loop_var_name='{}'",
carrier_info.loop_var_name
);
eprintln!(
"[pattern2/promoter] Phase 171-C-4: Merged carrier '{}' into CarrierInfo (total carriers: {})",
trim_info.carrier_name,
@ -248,12 +267,12 @@ impl MirBuilder {
eprintln!("[pattern2/trim] Registered carrier '{}' in variable_map", carrier_name);
// Update carrier_info with actual ValueId
carrier_info.loop_var_id = is_ch_match0;
carrier_info.loop_var_name = carrier_name.clone();
// Note: DO NOT overwrite carrier_info.loop_var_id/loop_var_name here!
// The loop variable is 'pos' (counter), not 'is_ch_match' (carrier).
// carrier_info.loop_var_name should remain as the original loop variable.
eprintln!("[pattern2/trim] Updated carrier_info: loop_var='{}', loop_var_id={:?}",
carrier_info.loop_var_name, carrier_info.loop_var_id);
eprintln!("[pattern2/trim] Carrier registered. Loop var='{}' remains unchanged",
carrier_info.loop_var_name);
// Phase 172-4: Break condition will be replaced below after JoinIR generation
eprintln!("[pattern2/trim] Trim pattern lowering enabled, proceeding to JoinIR generation");
@ -298,6 +317,15 @@ impl MirBuilder {
eprintln!("[pattern2/trim] Added carrier '{}' to ConditionEnv: HOST {:?} → JoinIR {:?}",
helper.carrier_name, condition_bindings.last().unwrap().host_value, condition_bindings.last().unwrap().join_value);
// Phase 176-6: Also map the original variable name to the same JoinIR ValueId
// This allows the loop body to reference the original variable (e.g., 'ch')
// even though it was promoted to a carrier (e.g., 'is_ch_match')
let join_value = condition_bindings.last().unwrap().join_value;
env.insert(helper.original_var.clone(), join_value);
eprintln!("[pattern2/trim] Phase 176-6: Also mapped original var '{}' → JoinIR {:?}",
helper.original_var, join_value);
// Generate negated carrier check: !is_ch_match
let negated_carrier = TrimPatternLowerer::generate_trim_break_condition(helper);
@ -310,9 +338,71 @@ impl MirBuilder {
break_condition_node.clone()
};
// Phase 176-3: Analyze carrier updates from loop body
use crate::mir::join_ir::lowering::loop_update_analyzer::LoopUpdateAnalyzer;
let carrier_updates = LoopUpdateAnalyzer::analyze_carrier_updates(_body, &carrier_info.carriers);
eprintln!(
"[cf_loop/pattern2] Phase 176-3: Analyzed {} carrier updates",
carrier_updates.len()
);
// Phase 176-4: Filter carriers to only include those with actual updates
// Issue: CommonPatternInitializer includes all variables in variable_map as carriers,
// but only variables with updates in the loop body are true carriers.
// Condition-only variables (like 'len', 's') should be excluded.
let original_carrier_count = carrier_info.carriers.len();
carrier_info.carriers.retain(|carrier| {
carrier_updates.contains_key(&carrier.name)
});
eprintln!(
"[cf_loop/pattern2] Phase 176-4: Filtered carriers: {}{} (kept only carriers with updates)",
original_carrier_count,
carrier_info.carriers.len()
);
// Phase 176-5: Add body-only carriers to ConditionEnv
// Issue: Carriers that are updated in the loop body but not used in the condition
// need to be added to ConditionEnv with their initial values.
for carrier in &carrier_info.carriers {
if env.get(&carrier.name).is_none() {
// Allocate a new JoinIR ValueId for this carrier
let join_value = alloc_join_value();
// Add to ConditionEnv
env.insert(carrier.name.clone(), join_value);
// Add to condition_bindings for later processing
condition_bindings.push(ConditionBinding {
name: carrier.name.clone(),
host_value: carrier.host_id,
join_value,
});
eprintln!(
"[cf_loop/pattern2] Phase 176-5: Added body-only carrier '{}' to ConditionEnv: HOST {:?} → JoinIR {:?}",
carrier.name, carrier.host_id, join_value
);
}
}
// Phase 169 / Phase 171-fix / Phase 172-3 / Phase 170-B: Call Pattern 2 lowerer with break_condition
// Phase 33-14: Now returns (JoinModule, JoinFragmentMeta) for expr_result + carrier separation
let (join_module, fragment_meta) = match lower_loop_with_break_minimal(scope, condition, &effective_break_condition, &env, &loop_var_name) {
// Phase 176-3: Multi-carrier support - pass carrier_info and carrier_updates
eprintln!(
"[pattern2/before_lowerer] About to call lower_loop_with_break_minimal with carrier_info.loop_var_name='{}'",
carrier_info.loop_var_name
);
let (join_module, fragment_meta) = match lower_loop_with_break_minimal(
scope,
condition,
&effective_break_condition,
&env,
&carrier_info,
&carrier_updates,
) {
Ok((module, meta)) => (module, meta),
Err(e) => {
// Phase 195: Use unified trace
@ -328,13 +418,30 @@ impl MirBuilder {
use crate::mir::builder::control_flow::joinir::merge::exit_line::ExitMetaCollector;
let exit_bindings = ExitMetaCollector::collect(self, &exit_meta, debug);
// Phase 176-3: Build input mappings for all carriers
// JoinIR main() params: [ValueId(0), ValueId(1), ValueId(2), ...] for (i, carrier1, carrier2, ...)
// Host values: [loop_var_id, carrier1_host_id, carrier2_host_id, ...]
let mut join_input_slots = vec![ValueId(0)]; // Loop variable
let mut host_input_values = vec![loop_var_id]; // Loop variable
for (idx, carrier) in carrier_info.carriers.iter().enumerate() {
join_input_slots.push(ValueId((idx + 1) as u32));
host_input_values.push(carrier.host_id);
}
eprintln!(
"[cf_loop/pattern2] Phase 176-3: Boundary inputs - {} JoinIR slots, {} host values",
join_input_slots.len(),
host_input_values.len()
);
// Phase 200-2: Use JoinInlineBoundaryBuilder for clean construction
// Canonical Builder pattern - see docs/development/current/main/joinir-boundary-builder-pattern.md
use crate::mir::join_ir::lowering::JoinInlineBoundaryBuilder;
let boundary = JoinInlineBoundaryBuilder::new()
.with_inputs(
vec![ValueId(0)], // JoinIR's main() parameter (loop variable init)
vec![loop_var_id], // Host's loop variable
join_input_slots, // JoinIR's main() parameters (loop variable + carriers)
host_input_values, // Host's loop variable + carrier values
)
.with_condition_bindings(condition_bindings)
.with_exit_bindings(exit_bindings.clone())

View File

@ -56,9 +56,10 @@
//! Following the "80/20 rule" from CLAUDE.md - get it working first, generalize later.
use crate::ast::ASTNode;
use crate::mir::join_ir::lowering::carrier_info::{ExitMeta, JoinFragmentMeta};
use crate::mir::join_ir::lowering::carrier_info::{CarrierInfo, CarrierVar, ExitMeta, JoinFragmentMeta};
use crate::mir::join_ir::lowering::condition_to_joinir::{lower_condition_to_joinir, ConditionEnv};
use crate::mir::join_ir::lowering::loop_scope_shape::LoopScopeShape;
use crate::mir::join_ir::lowering::loop_update_analyzer::{UpdateExpr, UpdateRhs};
use crate::mir::join_ir::{
BinOpKind, ConstValue, JoinFuncId, JoinFunction, JoinInst, JoinModule,
MirLikeInst, UnaryOp,
@ -68,6 +69,132 @@ use crate::mir::loop_pattern_detection::error_messages::{
format_unsupported_condition_error, extract_body_local_names,
};
use crate::mir::ValueId;
use std::collections::HashMap;
/// Phase 176-2: Emit JoinIR instructions for a single carrier update
///
/// Converts UpdateExpr (from LoopUpdateAnalyzer) into JoinIR instructions
/// that compute the updated carrier value.
///
/// # Arguments
///
/// * `carrier` - Carrier variable information (name, ValueId)
/// * `update` - Update expression (e.g., CounterLike, AccumulationLike)
/// * `alloc_value` - ValueId allocator closure
/// * `env` - ConditionEnv for variable resolution
/// * `instructions` - Output vector to append instructions to
///
/// # Returns
///
/// ValueId of the computed update result
///
/// # Example
///
/// ```ignore
/// // For "count = count + 1":
/// let count_next = emit_carrier_update(
/// &count_carrier,
/// &UpdateExpr::BinOp { lhs: "count", op: Add, rhs: Const(1) },
/// &mut alloc_value,
/// &env,
/// &mut instructions,
/// )?;
/// // Generates:
/// // const_1 = Const(1)
/// // count_next = BinOp(Add, count_param, const_1)
/// ```
fn emit_carrier_update(
carrier: &CarrierVar,
update: &UpdateExpr,
alloc_value: &mut dyn FnMut() -> ValueId,
env: &ConditionEnv,
instructions: &mut Vec<JoinInst>,
) -> Result<ValueId, String> {
match update {
UpdateExpr::Const(step) => {
// CounterLike: carrier = carrier + step
// Allocate const ValueId
let const_id = alloc_value();
instructions.push(JoinInst::Compute(MirLikeInst::Const {
dst: const_id,
value: ConstValue::Integer(*step),
}));
// Get carrier parameter ValueId from env
let carrier_param = env
.get(&carrier.name)
.ok_or_else(|| {
format!(
"Carrier '{}' not found in ConditionEnv",
carrier.name
)
})?;
// Allocate result ValueId
let result = alloc_value();
instructions.push(JoinInst::Compute(MirLikeInst::BinOp {
dst: result,
op: BinOpKind::Add,
lhs: carrier_param,
rhs: const_id,
}));
Ok(result)
}
UpdateExpr::BinOp { lhs, op, rhs } => {
// General binary operation: carrier = carrier op rhs
// Verify lhs matches carrier name
if lhs != &carrier.name {
return Err(format!(
"Update expression LHS '{}' doesn't match carrier '{}'",
lhs, carrier.name
));
}
// Get carrier parameter ValueId from env
let carrier_param = env
.get(&carrier.name)
.ok_or_else(|| {
format!(
"Carrier '{}' not found in ConditionEnv",
carrier.name
)
})?;
// Resolve RHS
let rhs_id = match rhs {
UpdateRhs::Const(n) => {
let const_id = alloc_value();
instructions.push(JoinInst::Compute(MirLikeInst::Const {
dst: const_id,
value: ConstValue::Integer(*n),
}));
const_id
}
UpdateRhs::Variable(var_name) => {
env.get(var_name).ok_or_else(|| {
format!(
"Update RHS variable '{}' not found in ConditionEnv",
var_name
)
})?
}
};
// Allocate result ValueId
let result = alloc_value();
instructions.push(JoinInst::Compute(MirLikeInst::BinOp {
dst: result,
op: *op,
lhs: carrier_param,
rhs: rhs_id,
}));
Ok(result)
}
}
}
/// Lower Pattern 2 (Loop with Conditional Break) to JoinIR
///
@ -122,16 +249,19 @@ use crate::mir::ValueId;
/// # Arguments
///
/// * `break_condition` - AST node for the break condition (e.g., `i >= 2`) - Phase 170-B
/// * `loop_var_name` - Name of the loop variable (for ExitMeta construction)
/// * `carrier_info` - Phase 176-3: Carrier metadata for dynamic multi-carrier support
/// * `carrier_updates` - Phase 176-3: Update expressions for each carrier variable
pub fn lower_loop_with_break_minimal(
_scope: LoopScopeShape,
condition: &ASTNode,
break_condition: &ASTNode,
env: &ConditionEnv,
loop_var_name: &str,
carrier_info: &CarrierInfo,
carrier_updates: &HashMap<String, UpdateExpr>,
) -> Result<(JoinModule, JoinFragmentMeta), String> {
// Phase 170-D-impl-3: Validate that conditions only use supported variable scopes
// LoopConditionScopeBox checks that loop conditions don't reference loop-body-local variables
let loop_var_name = &carrier_info.loop_var_name; // Phase 176-3: Extract from CarrierInfo
let loop_cond_scope = LoopConditionScopeBox::analyze(
loop_var_name,
&[condition, break_condition],
@ -169,12 +299,29 @@ pub fn lower_loop_with_break_minimal(
// ==================================================================
// ValueId allocation (Phase 188-Impl-2: Sequential local IDs)
// ==================================================================
// main() locals
let i_init = alloc_value(); // ValueId(0) - loop init value
let loop_result = alloc_value(); // ValueId(1) - result from loop_step
// Phase 176-3: Multi-carrier support - allocate parameters for all carriers
let carrier_count = carrier_info.carriers.len();
// loop_step locals
let i_param = alloc_value(); // ValueId(2) - parameter
eprintln!(
"[joinir/pattern2] Phase 176-3: Generating JoinIR for {} carriers: {:?}",
carrier_count,
carrier_info.carriers.iter().map(|c| &c.name).collect::<Vec<_>>()
);
// main() parameters: [i_init, carrier1_init, carrier2_init, ...]
let i_init = alloc_value(); // ValueId(0) - loop init value
let mut carrier_init_ids: Vec<ValueId> = Vec::new();
for _ in 0..carrier_count {
carrier_init_ids.push(alloc_value());
}
let loop_result = alloc_value(); // result from loop_step
// loop_step() parameters: [i_param, carrier1_param, carrier2_param, ...]
let i_param = alloc_value(); // Parameter for loop variable
let mut carrier_param_ids: Vec<ValueId> = Vec::new();
for _ in 0..carrier_count {
carrier_param_ids.push(alloc_value());
}
// Phase 169 / Phase 171-fix: Lower condition using condition_to_joinir helper with ConditionEnv
// This will allocate ValueIds dynamically based on condition complexity
@ -203,14 +350,20 @@ pub fn lower_loop_with_break_minimal(
// ==================================================================
// main() function
// ==================================================================
// Phase 188-Impl-2: main() takes i as a parameter (boundary input)
// The host will inject a Copy instruction: i_init_local = Copy host_i
let mut main_func = JoinFunction::new(main_id, "main".to_string(), vec![i_init]);
// Phase 176-3: Multi-carrier support - main() includes all carrier parameters
// main() takes (i, carrier1, carrier2, ...) as parameters (boundary inputs)
// The host will inject Copy instructions for each
let mut main_params = vec![i_init];
main_params.extend(carrier_init_ids.iter().copied());
let mut main_func = JoinFunction::new(main_id, "main".to_string(), main_params);
// result = loop_step(i_init)
// Phase 176-3: Multi-carrier support - Call includes all carrier inits
// result = loop_step(i_init, carrier1_init, carrier2_init, ...)
let mut call_args = vec![i_init];
call_args.extend(carrier_init_ids.iter().copied());
main_func.body.push(JoinInst::Call {
func: loop_step_id,
args: vec![i_init],
args: call_args,
k_next: None,
dst: Some(loop_result),
});
@ -223,12 +376,15 @@ pub fn lower_loop_with_break_minimal(
join_module.add_function(main_func);
// ==================================================================
// loop_step(i) function
// loop_step(i, carrier1, carrier2, ...) function
// ==================================================================
// Phase 176-3: Multi-carrier support - loop_step includes all carrier parameters
let mut loop_params = vec![i_param];
loop_params.extend(carrier_param_ids.iter().copied());
let mut loop_step_func = JoinFunction::new(
loop_step_id,
"loop_step".to_string(),
vec![i_param],
loop_params,
);
// ------------------------------------------------------------------
@ -246,10 +402,13 @@ pub fn lower_loop_with_break_minimal(
operand: cond_value,
}));
// Jump(k_exit, [i], cond=exit_cond) // Natural exit path
// Phase 176-3: Multi-carrier support - Jump includes all carrier values
// Jump(k_exit, [i, carrier1, carrier2, ...], cond=exit_cond) // Natural exit path
let mut natural_exit_args = vec![i_param];
natural_exit_args.extend(carrier_param_ids.iter().copied());
loop_step_func.body.push(JoinInst::Jump {
cont: k_exit_id.as_cont(),
args: vec![i_param], // Pass current i as exit value
args: natural_exit_args,
cond: Some(exit_cond),
});
@ -259,17 +418,54 @@ pub fn lower_loop_with_break_minimal(
// Insert all break condition evaluation instructions
loop_step_func.body.append(&mut break_cond_instructions);
// Jump(k_exit, [i], cond=break_cond) // Break exit path
// Phase 176-3: Multi-carrier support - Jump includes all carrier values
// Jump(k_exit, [i, carrier1, carrier2, ...], cond=break_cond) // Break exit path
let mut break_exit_args = vec![i_param];
break_exit_args.extend(carrier_param_ids.iter().copied());
loop_step_func.body.push(JoinInst::Jump {
cont: k_exit_id.as_cont(),
args: vec![i_param], // Pass current i as exit value
args: break_exit_args,
cond: Some(break_cond_value), // Phase 170-B: Use lowered condition
});
// ------------------------------------------------------------------
// Loop Body: i_next = i + 1
// Loop Body: Compute updated values for all carriers
// ------------------------------------------------------------------
// Step 1: const 1
// Phase 176-3: Multi-carrier support - emit updates for all carriers
let mut updated_carrier_values: Vec<ValueId> = Vec::new();
for (idx, carrier) in carrier_info.carriers.iter().enumerate() {
let carrier_name = &carrier.name;
// Get the update expression for this carrier
let update_expr = carrier_updates.get(carrier_name).ok_or_else(|| {
format!(
"No update expression found for carrier '{}' in carrier_updates map",
carrier_name
)
})?;
// Emit the carrier update instructions
let updated_value = emit_carrier_update(
carrier,
update_expr,
&mut alloc_value,
env,
&mut loop_step_func.body,
)?;
updated_carrier_values.push(updated_value);
eprintln!(
"[joinir/pattern2] Phase 176-3: Carrier '{}' update: {:?} -> {:?}",
carrier_name, carrier_param_ids[idx], updated_value
);
}
// Phase 176-3: Multi-carrier support - tail call includes all updated carriers
// Call(loop_step, [i_next, carrier1_next, carrier2_next, ...]) // tail recursion
// Note: We need to emit i_next = i + 1 first for the loop variable
let const_1 = alloc_value();
loop_step_func
.body
.push(JoinInst::Compute(MirLikeInst::Const {
@ -277,7 +473,7 @@ pub fn lower_loop_with_break_minimal(
value: ConstValue::Integer(1),
}));
// Step 2: i_next = i + 1
let i_next = alloc_value();
loop_step_func
.body
.push(JoinInst::Compute(MirLikeInst::BinOp {
@ -287,10 +483,18 @@ pub fn lower_loop_with_break_minimal(
rhs: const_1,
}));
// Call(loop_step, [i_next]) // tail recursion
let mut tail_call_args = vec![i_next];
tail_call_args.extend(updated_carrier_values.iter().copied());
eprintln!(
"[joinir/pattern2/debug] Tail call args count: {}, updated_carrier_values: {:?}",
tail_call_args.len(),
updated_carrier_values
);
loop_step_func.body.push(JoinInst::Call {
func: loop_step_id,
args: vec![i_next],
args: tail_call_args,
k_next: None, // CRITICAL: None for tail call
dst: None,
});
@ -298,13 +502,23 @@ pub fn lower_loop_with_break_minimal(
join_module.add_function(loop_step_func);
// ==================================================================
// k_exit(i_exit) function - Exit PHI
// k_exit(i_exit, carrier1_exit, carrier2_exit, ...) function - Exit PHI
// ==================================================================
// Pattern 2 key difference: k_exit receives exit value from both paths
// Phase 176-3: Multi-carrier support - k_exit receives all carrier exit values
// Pattern 2 key difference: k_exit receives exit values from both paths (natural + break)
// Allocate exit parameter ValueIds for all carriers
let mut carrier_exit_ids: Vec<ValueId> = Vec::new();
for _ in 0..carrier_count {
carrier_exit_ids.push(alloc_value());
}
let mut exit_params = vec![i_exit];
exit_params.extend(carrier_exit_ids.iter().copied());
let mut k_exit_func = JoinFunction::new(
k_exit_id,
"k_exit".to_string(),
vec![i_exit], // Exit PHI: receives i from both exit paths
exit_params, // Exit PHI: receives (i, carrier1, carrier2, ...) from both exit paths
);
// return i_exit (return final loop variable value)
@ -323,11 +537,43 @@ pub fn lower_loop_with_break_minimal(
eprintln!("[joinir/pattern2] Break condition from AST (delegated to condition_to_joinir)");
eprintln!("[joinir/pattern2] Exit PHI: k_exit receives i from both natural exit and break");
// Phase 176-3: Multi-carrier support - ExitMeta includes all carrier bindings
// Build exit_values vec with all carriers
let mut exit_values = Vec::new();
// Add loop variable first
exit_values.push((loop_var_name.to_string(), i_exit));
eprintln!(
"[joinir/pattern2/exit_meta] Building exit_values: loop_var '{}' → {:?}",
loop_var_name, i_exit
);
// Add all additional carriers
for (idx, carrier) in carrier_info.carriers.iter().enumerate() {
eprintln!(
"[joinir/pattern2/exit_meta] Adding carrier '{}' → {:?} (idx={})",
carrier.name, carrier_exit_ids[idx], idx
);
exit_values.push((carrier.name.clone(), carrier_exit_ids[idx]));
}
eprintln!(
"[joinir/pattern2/exit_meta] Total exit_values count: {}",
exit_values.len()
);
let exit_meta = ExitMeta::multiple(exit_values);
// Phase 172-3 → Phase 33-14: Build JoinFragmentMeta with expr_result
// Pattern 2: Loop is used as expression → `return i` means k_exit's return is expr_result
let exit_meta = ExitMeta::single(loop_var_name.to_string(), i_exit);
let fragment_meta = JoinFragmentMeta::with_expr_result(i_exit, exit_meta);
eprintln!("[joinir/pattern2] Phase 33-14: JoinFragmentMeta {{ expr_result: {:?}, carrier: {} }}", i_exit, loop_var_name);
eprintln!(
"[joinir/pattern2] Phase 33-14/176-3: JoinFragmentMeta {{ expr_result: {:?}, carriers: {} }}",
i_exit,
carrier_info.carriers.len()
);
Ok((join_module, fragment_meta))
}
@ -471,4 +717,271 @@ mod tests {
assert!(vars.contains("start"));
assert!(vars.contains("ch")); // The problematic body-local variable
}
// Phase 176-2: Tests for emit_carrier_update helper
mod carrier_update_tests {
use super::*;
use crate::mir::join_ir::lowering::loop_update_analyzer::{UpdateExpr, UpdateRhs};
// Helper: Create a test ConditionEnv
fn test_env() -> ConditionEnv {
let mut env = ConditionEnv::new();
env.insert("count".to_string(), ValueId(10));
env.insert("sum".to_string(), ValueId(20));
env.insert("i".to_string(), ValueId(30));
env
}
// Helper: Create a test CarrierVar
fn test_carrier(name: &str, host_id: u32) -> CarrierVar {
CarrierVar {
name: name.to_string(),
host_id: ValueId(host_id),
}
}
#[test]
fn test_emit_const_update() {
// Test: count = count + 1 (UpdateExpr::Const)
let carrier = test_carrier("count", 100);
let update = UpdateExpr::Const(1);
let env = test_env();
let mut value_counter = 50u32;
let mut alloc_value = || {
let id = ValueId(value_counter);
value_counter += 1;
id
};
let mut instructions = Vec::new();
let result = emit_carrier_update(
&carrier,
&update,
&mut alloc_value,
&env,
&mut instructions,
);
assert!(result.is_ok());
let result_id = result.unwrap();
// Should generate 2 instructions: Const(1) + BinOp(Add)
assert_eq!(instructions.len(), 2);
// Instruction 1: Const(1)
match &instructions[0] {
JoinInst::Compute(MirLikeInst::Const { dst, value }) => {
assert_eq!(*dst, ValueId(50)); // First allocated
assert!(matches!(value, ConstValue::Integer(1)));
}
_ => panic!("Expected Const instruction"),
}
// Instruction 2: BinOp(Add, count, const_1)
match &instructions[1] {
JoinInst::Compute(MirLikeInst::BinOp { dst, op, lhs, rhs }) => {
assert_eq!(*dst, ValueId(51)); // Second allocated
assert_eq!(*op, BinOpKind::Add);
assert_eq!(*lhs, ValueId(10)); // count from env
assert_eq!(*rhs, ValueId(50)); // const_1
}
_ => panic!("Expected BinOp instruction"),
}
assert_eq!(result_id, ValueId(51));
}
#[test]
fn test_emit_binop_update_with_const() {
// Test: sum = sum + 5 (UpdateExpr::BinOp with Const RHS)
let carrier = test_carrier("sum", 200);
let update = UpdateExpr::BinOp {
lhs: "sum".to_string(),
op: BinOpKind::Add,
rhs: UpdateRhs::Const(5),
};
let env = test_env();
let mut value_counter = 60u32;
let mut alloc_value = || {
let id = ValueId(value_counter);
value_counter += 1;
id
};
let mut instructions = Vec::new();
let result = emit_carrier_update(
&carrier,
&update,
&mut alloc_value,
&env,
&mut instructions,
);
assert!(result.is_ok());
let result_id = result.unwrap();
// Should generate 2 instructions: Const(5) + BinOp(Add)
assert_eq!(instructions.len(), 2);
// Instruction 1: Const(5)
match &instructions[0] {
JoinInst::Compute(MirLikeInst::Const { dst, value }) => {
assert_eq!(*dst, ValueId(60));
assert!(matches!(value, ConstValue::Integer(5)));
}
_ => panic!("Expected Const instruction"),
}
// Instruction 2: BinOp(Add, sum, const_5)
match &instructions[1] {
JoinInst::Compute(MirLikeInst::BinOp { dst, op, lhs, rhs }) => {
assert_eq!(*dst, ValueId(61));
assert_eq!(*op, BinOpKind::Add);
assert_eq!(*lhs, ValueId(20)); // sum from env
assert_eq!(*rhs, ValueId(60)); // const_5
}
_ => panic!("Expected BinOp instruction"),
}
assert_eq!(result_id, ValueId(61));
}
#[test]
fn test_emit_binop_update_with_variable() {
// Test: sum = sum + i (UpdateExpr::BinOp with Variable RHS)
let carrier = test_carrier("sum", 200);
let update = UpdateExpr::BinOp {
lhs: "sum".to_string(),
op: BinOpKind::Add,
rhs: UpdateRhs::Variable("i".to_string()),
};
let env = test_env();
let mut value_counter = 70u32;
let mut alloc_value = || {
let id = ValueId(value_counter);
value_counter += 1;
id
};
let mut instructions = Vec::new();
let result = emit_carrier_update(
&carrier,
&update,
&mut alloc_value,
&env,
&mut instructions,
);
assert!(result.is_ok());
let result_id = result.unwrap();
// Should generate 1 instruction: BinOp(Add, sum, i)
assert_eq!(instructions.len(), 1);
// Instruction: BinOp(Add, sum, i)
match &instructions[0] {
JoinInst::Compute(MirLikeInst::BinOp { dst, op, lhs, rhs }) => {
assert_eq!(*dst, ValueId(70));
assert_eq!(*op, BinOpKind::Add);
assert_eq!(*lhs, ValueId(20)); // sum from env
assert_eq!(*rhs, ValueId(30)); // i from env
}
_ => panic!("Expected BinOp instruction"),
}
assert_eq!(result_id, ValueId(70));
}
#[test]
fn test_emit_update_carrier_not_in_env() {
// Test error case: carrier not found in env
let carrier = test_carrier("unknown", 300);
let update = UpdateExpr::Const(1);
let env = test_env(); // doesn't have "unknown"
let mut value_counter = 80u32;
let mut alloc_value = || {
let id = ValueId(value_counter);
value_counter += 1;
id
};
let mut instructions = Vec::new();
let result = emit_carrier_update(
&carrier,
&update,
&mut alloc_value,
&env,
&mut instructions,
);
assert!(result.is_err());
assert!(result.unwrap_err().contains("Carrier 'unknown' not found"));
}
#[test]
fn test_emit_update_lhs_mismatch() {
// Test error case: LHS doesn't match carrier name
let carrier = test_carrier("count", 100);
let update = UpdateExpr::BinOp {
lhs: "sum".to_string(), // Wrong! Should be "count"
op: BinOpKind::Add,
rhs: UpdateRhs::Const(1),
};
let env = test_env();
let mut value_counter = 90u32;
let mut alloc_value = || {
let id = ValueId(value_counter);
value_counter += 1;
id
};
let mut instructions = Vec::new();
let result = emit_carrier_update(
&carrier,
&update,
&mut alloc_value,
&env,
&mut instructions,
);
assert!(result.is_err());
assert!(result.unwrap_err().contains("doesn't match carrier"));
}
#[test]
fn test_emit_update_rhs_variable_not_found() {
// Test error case: RHS variable not in env
let carrier = test_carrier("sum", 200);
let update = UpdateExpr::BinOp {
lhs: "sum".to_string(),
op: BinOpKind::Add,
rhs: UpdateRhs::Variable("unknown_var".to_string()),
};
let env = test_env();
let mut value_counter = 100u32;
let mut alloc_value = || {
let id = ValueId(value_counter);
value_counter += 1;
id
};
let mut instructions = Vec::new();
let result = emit_carrier_update(
&carrier,
&update,
&mut alloc_value,
&env,
&mut instructions,
);
assert!(result.is_err());
assert!(result.unwrap_err().contains("Update RHS variable 'unknown_var' not found"));
}
}
}