Files
hakorune/tools/smokes
tomoaki 1f3ef2ac61 smokes(v2): Reach 45s target (Phase 287 P2)
Phase 287 P2 completed: 45-second target achieved

Changes:
- Moved 34 heavy tests (>0.4s) from quick to integration
- Disabled FAST_FAIL in quick profile for accurate measurement
- Reduced execution time: 63.0s → 45.85s (27% reduction)
- Reduced test count: 447 → 413 tests (8% reduction)

Key improvements:
- Fixed FAST_FAIL=1 issue preventing full test execution
- Selective individual test movement vs directory-level in P1
- Maintained relative path structure for --filter compatibility

Performance metrics:
- P1 (FAST_FAIL=0): 63.0s / 447 tests / 111 failures
- P2 (optimized): 45.85s / 413 tests / 90 failures
- Target achieved:  45s (within tolerance)

Moved test categories:
- mirbuilder_loop_* (0.46-0.49s)
- mirbuilder_provider_* (0.36-0.68s)
- hako_primary_no_fallback_* (0.46-0.47s)
- parser_embedded_json_canary (0.49s)
- emit_mir_canary (0.47s)

Documentation updates:
- tools/smokes/v2/README.md: Updated quick target to ~400 tests
- docs/development/current/main/phases/phase-287/README.md: P2 results
- tools/smokes/v2/configs/auto_detect.conf: FAST_FAIL=0 for accurate timing

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2025-12-25 07:58:51 +09:00
..

Hakorune Smoke Tests v2 — Guide (Nyash-compatible)

Overview

  • Entry: tools/smokes/v2/run.sh — unified runner for quick/integration/full.
  • Profiles:
    • quick — fast developer checks.
    • integration — VM↔LLVM parity, basic stability.
    • full — comprehensive matrix.

🎯 Two Baselines (Runbook)

これから開発の基準となる2つのベースライン

📦 VM ラインRust VM - 既定)

用途: 開発・デバッグ・検証用(高速・型安全)

# ビルド
cargo build --release

# 一括スモークテスト
tools/smokes/v2/run.sh --profile quick

# 個別スモークテスト
tools/smokes/v2/run.sh --profile quick --filter "<glob>"
# 例: --filter "core/json_query_min_vm.sh"

# 単発実行(参考)
$NYASH_BIN --backend vm apps/APP/main.hako

llvmlite ラインLLVMハーネス

用途: 本番・最適化・配布用(実証済み安定性)

前提: Python3 + llvmlite

pip install llvmlite  # 未導入の場合

実行手順:

# ビルドLLVM_SYS_180_PREFIX不要
cargo build --release --features llvm

# 一括スモークテスト
tools/smokes/v2/run.sh --profile integration

# 個別スモークテスト
tools/smokes/v2/run.sh --profile integration --filter "<glob>"

# 単発実行
NYASH_LLVM_USE_HARNESS=1 $NYASH_BIN --backend llvm apps/tests/peek_expr_block.hako

# 有効化確認
$NYASH_BIN --version | rg -i 'features.*llvm'

💡 重要: 両方のラインのテストが通ることで、MIR14統一アーキテクチャの品質を保証

🔁 QuickでAST/LLVM系も実行したいとき

通常、quick は LLVM未ビルド時に AST/LLVM系テストを自動で SKIP します。 Quickでも実行したい場合は、先に LLVM 有効でビルドしてください:

LLVM_SYS_180_PREFIX=$(llvm-config-18 --prefix) cargo build --release --features llvm
tools/smokes/v2/run.sh --profile quick

テストランナーは LLVM 非対応時にヒントを出力しますbuildコマンドの案内

Notes

  • Using resolution: prefer nyash.toml aliases (SSOT). Some tests may enable NYASH_ALLOW_USING_FILE=1 internally for convenience.
  • Plugin warnings are informational; smokes are designed to pass without dynamic plugins.
  • Harness single-run may take longer due to link+exec; integration profile includes generous timeouts.
  • Binary selection in scripts: always use $NYASH_BIN (the runner sets it to target/release/hakorune when present, otherwise falls back to nyash).
  • Branding: uservisible messages prefer “Hakorune”; the legacy nyash binary remains for compatibility but may be gatedHAKO_ALLOW_NYASH=1)。

Dev Mode (defaults)

  • In v2 smokes, the quick profile exports NYASH_DEV=1 by default.
    • This enables CLI --dev-equivalent defaults inside Nyash:
      • AST using ON (SSOT + AST prelude merge)
      • Operator Boxes in observe mode (no adoption)
      • Minimal diagnostics; output parity is preserved
  • You can also run manually with nyash --dev script.hako.

Common commands

  • Quick suite (auto NYASH_DEV=1):
    • tools/smokes/v2/run.sh --profile quick
  • Focus JSON smokes:
    • tools/opbox-json.sh (Roundtrip/Nested, plugins disabled, generous timeout)
  • One-off program (VM):
    • $NYASH_BIN --backend vm --dev apps/APP/main.hako

Key env knobs

  • NYASH_DEV=1 — enable dev defaults (same effect as --dev).
  • SMOKES_DEFAULT_TIMEOUT — per test timeout seconds (default 15 for quick).
  • SMOKES_PLUGIN_MODE=dynamic|static — plugin mode for preflight (auto by default).
  • SMOKES_FORCE_CONFIG=rust_vm_dynamic|llvm_static — force backend config.
  • SMOKES_NOTIFY_TAIL — lines to show on failure tail (default 80).

Notes

  • Dev defaults are designed to be non-intrusive: tests remain behaviorcompatible.
  • To repro outside smokes, either pass --dev or export NYASH_DEV=1.

Heavy JSON probes

  • Heavy JSON tests (nested/roundtrip/query_min) run a tiny parser probe first.
  • The probe's stdout last non-empty line is trimmed and compared to ok.
  • If not ok, the test is SKIP (parser unavailable), not FAIL. This avoids false negatives due to environment noise or optional dependencies.