Files

nyash-codex d4f90976da refactor(joinir): Phase 244 - ConditionLoweringBox trait unification

Unify condition lowering logic across Pattern 2/4 with trait-based API.

New infrastructure:
- condition_lowering_box.rs: ConditionLoweringBox trait + ConditionContext (293 lines)
- ExprLowerer implements ConditionLoweringBox trait (+51 lines)

Pattern migrations:
- Pattern 2 (loop_with_break_minimal.rs): Use trait API
- Pattern 4 (loop_with_continue_minimal.rs): Use trait API

Benefits:
- Unified condition lowering interface
- Extensible for future lowering strategies
- Clean API boundary between patterns and lowering logic
- Zero code duplication

Test results: 911/911 PASS (+2 new tests)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

2025-12-11 02:35:31 +09:00

10 KiB

Raw Blame History

Phase 154: Implementation Summary - MIR CFG Integration & Dead Block Detection

Overview

Successfully implemented HC020 Unreachable Basic Block Detection rule using MIR CFG information. This provides block-level dead code analysis complementing the existing method-level HC019 rule from Phase 153.

Status: Core infrastructure complete, CFG data bridge pending (see Known Limitations)

Completed Deliverables

1. CFG Extractor (`src/mir/cfg_extractor.rs`)

Purpose: Extract CFG information from MIR modules for analysis tools.

Features:

Extracts block-level reachability information
Exports successor relationships
Identifies terminator types (Branch/Jump/Return)
Deterministic output (sorted by block ID)

API:

pub fn extract_cfg_info(module: &MirModule) -> serde_json::Value

Output Format:

{
  "functions": [
    {
      "name": "Main.main/0",
      "entry_block": 0,
      "blocks": [
        {
          "id": 0,
          "reachable": true,
          "successors": [1, 2],
          "terminator": "Branch"
        }
      ]
    }
  ]
}

Testing: Includes unit tests for simple CFG and unreachable blocks.

2. DeadBlockAnalyzerBox (`tools/hako_check/rules/rule_dead_blocks.hako`)

Purpose: HC020 rule implementation for unreachable basic block detection.

Features:

Scans CFG information from Analysis IR
Reports unreachable blocks with function and block ID
Infers reasons for unreachability (early return, dead branch, etc.)
Gracefully skips if CFG info unavailable

API:

static box DeadBlockAnalyzerBox {
    method apply_ir(ir, path, out) {
        // Analyze CFG and report HC020 diagnostics
    }
}

Output Format:

[HC020] Unreachable basic block: fn=Main.test bb=5 (after early return) :: test.hako

3. CLI Integration (`tools/hako_check/cli.hako`)

New Flag: --dead-blocks

Usage:

# Run HC020 dead block detection
./tools/hako_check.sh --dead-blocks program.hako

# Combined with other modes
./tools/hako_check.sh --dead-code --dead-blocks program.hako

# Or use rules filter
./tools/hako_check.sh --rules dead_blocks program.hako

Integration Points:

Added DeadBlockAnalyzerBox import
Added --dead-blocks flag parsing
Added HC020 rule execution after HC019
Added debug logging for HC020

4. Test Cases

Created 4 comprehensive test cases:

test_dead_blocks_early_return.hako
- Pattern: Early return creates unreachable code
- Expected: HC020 for block after return
test_dead_blocks_always_false.hako
- Pattern: Constant false condition (if 0)
- Expected: HC020 for dead then-branch
test_dead_blocks_infinite_loop.hako
- Pattern: loop(1) never exits
- Expected: HC020 for code after loop
test_dead_blocks_after_break.hako
- Pattern: Unconditional break in loop
- Expected: HC020 for code after break

5. Smoke Test Script

File: tools/hako_check_deadblocks_smoke.sh

Features:

Tests all 4 test cases
Checks for HC020 output
Gracefully handles CFG info unavailability (MVP limitation)
Non-failing for incomplete CFG integration

Known Limitations & Next Steps

Current State: Core Infrastructure Complete ✅

What Works:

✅ CFG extractor implemented and tested
✅ DeadBlockAnalyzerBox implemented
✅ CLI integration complete
✅ Test cases created
✅ Smoke test script ready

Outstanding: CFG Data Bridge 🔄

The Gap: Currently, analysis_consumer.hako builds Analysis IR by text scanning, not from MIR. The CFG information exists in Rust's MirModule but isn't exposed to the .hako side yet.

Solution Path (Phase 155+):

Option A: Extend analysis_consumer with MIR access (Recommended)

// In analysis_consumer.hako
static box HakoAnalysisBuilderBox {
    build_from_source_flags(text, path, no_ast) {
        local ir = new MapBox()
        // ... existing text scanning ...

        // NEW: Request CFG from MIR if available
        local cfg = me._extract_cfg_from_mir(text, path)
        if cfg != null {
            ir.set("cfg", cfg)
        }

        return ir
    }

    _extract_cfg_from_mir(text, path) {
        // Call Rust function that:
        // 1. Compiles text to MIR
        // 2. Calls extract_cfg_info()
        // 3. Returns JSON value
    }
}

Option B: Add MIR compilation step to hako_check pipeline

# In tools/hako_check.sh
# 1. Compile to MIR JSON
hakorune --emit-mir-json /tmp/mir.json program.hako

# 2. Extract CFG
hakorune --extract-cfg /tmp/mir.json > /tmp/cfg.json

# 3. Pass to analyzer
hakorune --backend vm tools/hako_check/cli.hako \
    --source-file program.hako "$(cat program.hako)" \
    --cfg-file /tmp/cfg.json

Recommended: Option A (cleaner integration, single pass)

Implementation Roadmap (Phase 155)

Add Rust-side function to compile .hako to MIR and extract CFG
Expose to VM as builtin function (e.g., extract_mir_cfg(text, path))
Update analysis_consumer.hako to call this function
Test end-to-end with all 4 test cases
Update smoke script to expect HC020 output

Estimated Effort: 2-3 hours (mostly Rust-side plumbing)

Architecture Decisions

Why Not Merge HC019 and HC020?

Decision: Keep HC019 (method-level) and HC020 (block-level) separate

Rationale:

Different granularity: Methods vs. blocks are different analysis levels
Different use cases: HC019 finds unused code, HC020 finds unreachable paths
Optional CFG: HC019 works without MIR, HC020 requires CFG
User control: --dead-code vs --dead-blocks allows selective analysis

CFG Info Location in Analysis IR

Decision: Add cfg as top-level field in Analysis IR

Alternatives considered:

Embed in methods array → Breaks existing format
Separate IR structure → More complex

Chosen:

{
    "methods": [...],  // Existing
    "calls": [...],    // Existing
    "cfg": {           // NEW
        "functions": [...]
    }
}

Benefits:

Backward compatible (optional field)
Extensible (can add more CFG data later)
Clean separation of concerns

Reachability: MIR vs. Custom Analysis

Decision: Use MIR's built-in block.reachable flag

Rationale:

Already computed during MIR construction
Proven correct (used by optimizer)
No duplication of logic
Consistent with Rust compiler design

Alternative (rejected): Re-compute reachability in DeadBlockAnalyzerBox

Pro: Self-contained
Con: Duplication, potential bugs, slower

Testing Strategy

Unit Tests

✅ cfg_extractor::tests::test_extract_simple_cfg
✅ cfg_extractor::tests::test_unreachable_block

Integration Tests

🔄 Pending CFG bridge (Phase 155)
Test cases ready in apps/tests/hako_check/

Smoke Tests

✅ tools/hako_check_deadblocks_smoke.sh
Currently validates infrastructure, will validate HC020 output once bridge is complete

Performance Considerations

CFG Extraction Cost

Negligible: Already computed during MIR construction
One-time: Extracted once per function
Small output: ~100 bytes per function typically

DeadBlockAnalyzerBox Cost

O(blocks): Linear scan of blocks array
Typical: <100 blocks per function
Fast: Simple boolean check and string formatting

Conclusion: No performance concerns, suitable for CI/CD pipelines.

Future Enhancements (Phase 160+)

Enhanced Diagnostics

Show source code location of unreachable blocks
Suggest how to fix (remove code, change condition, etc.)
Group related unreachable blocks

Deeper Analysis

Constant propagation to find more dead branches
Path sensitivity (combine conditions across blocks)
Integration with type inference

Visualization

DOT graph output showing dead blocks in red
Interactive HTML report with clickable blocks
Side-by-side source and CFG view

Files Modified/Created

New Files

✅ src/mir/cfg_extractor.rs (184 lines)
✅ tools/hako_check/rules/rule_dead_blocks.hako (100 lines)
✅ apps/tests/hako_check/test_dead_blocks_*.hako (4 files, ~20 lines each)
✅ tools/hako_check_deadblocks_smoke.sh (65 lines)
✅ docs/development/current/main/phase154_mir_cfg_inventory.md
✅ docs/development/current/main/phase154_implementation_summary.md

Modified Files

✅ src/mir/mod.rs (added cfg_extractor module and re-export)
✅ tools/hako_check/cli.hako (added --dead-blocks flag and HC020 rule execution)

Total Lines: ~450 lines (code + docs + tests)

Recommendations for Next Phase

Immediate (Phase 155)

Implement CFG data bridge (highest priority)
- Add extract_mir_cfg() builtin function
- Update analysis_consumer.hako to use it
- Test end-to-end with all 4 test cases
Update documentation
- Mark CFG bridge as complete
- Add usage examples to hako_check README
- Update CURRENT_TASK.md

Short-term (Phase 156-160)

Add source location mapping
- Track span information for unreachable blocks
- Show line numbers in HC020 output
Enhance test coverage
- Add tests for complex control flow (nested loops, try-catch, etc.)
- Add negative tests (no false positives)

Long-term (Phase 160+)

Constant folding integration
- Detect more dead branches via constant propagation
- Integrate with MIR optimizer
Visualization tools
- DOT/GraphViz output for CFG
- HTML reports with interactive CFG

Conclusion

Phase 154 successfully establishes the infrastructure for block-level dead code detection. The core components (CFG extractor, analyzer box, CLI integration, tests) are complete and tested.

The remaining work is a straightforward data bridge to connect the Rust-side MIR CFG to the .hako-side Analysis IR. This is a mechanical task estimated at 2-3 hours for Phase 155.

Key Achievement: Demonstrates the power of the boxed modular architecture - DeadBlockAnalyzerBox is completely independent and swappable, just like DeadCodeAnalyzerBox from Phase 153.

Author: Claude (Anthropic) Date: 2025-12-04 Phase: 154 (MIR CFG Integration & Dead Block Detection) Status: Core infrastructure complete, CFG bridge pending (Phase 155) Status: Historical

10 KiB Raw Blame History

Phase 154: Implementation Summary - MIR CFG Integration & Dead Block Detection

Overview

Completed Deliverables

1. CFG Extractor (src/mir/cfg_extractor.rs)

2. DeadBlockAnalyzerBox (tools/hako_check/rules/rule_dead_blocks.hako)

3. CLI Integration (tools/hako_check/cli.hako)

4. Test Cases

5. Smoke Test Script

Known Limitations & Next Steps

Current State: Core Infrastructure Complete ✅

Outstanding: CFG Data Bridge 🔄

Option A: Extend analysis_consumer with MIR access (Recommended)

Option B: Add MIR compilation step to hako_check pipeline

Implementation Roadmap (Phase 155)

Architecture Decisions

Why Not Merge HC019 and HC020?

CFG Info Location in Analysis IR

Reachability: MIR vs. Custom Analysis

Testing Strategy

Unit Tests

Integration Tests

Smoke Tests

Performance Considerations

CFG Extraction Cost

DeadBlockAnalyzerBox Cost

Future Enhancements (Phase 160+)

Enhanced Diagnostics

Deeper Analysis

Visualization

Files Modified/Created

New Files

Modified Files

Recommendations for Next Phase

Immediate (Phase 155)

Short-term (Phase 156-160)

Long-term (Phase 160+)

Conclusion

10 KiB

Raw Blame History

1. CFG Extractor (`src/mir/cfg_extractor.rs`)

2. DeadBlockAnalyzerBox (`tools/hako_check/rules/rule_dead_blocks.hako`)

3. CLI Integration (`tools/hako_check/cli.hako`)