Files

Tomoaki d62114c705 Core-13 pure: add CI workflows, VM e2e tests, LLVM parity bridge (minimal); do not touch private docs

2025-09-07 07:28:53 +09:00

3.7 KiB

Raw Blame History

ベンチマークアプリ推奨案（AI会議結果）

Date: 2025-08-31 相談先: Gemini + Codex

🎯 AI先生たちの推奨ベンチマーク

📊 Gemini先生の推奨（論文・実用重視）

Tier 1: 論文の主役候補

レイトレーサー (Ray Tracer)
- 理由: CPU負荷高、JIT/AOT効果が劇的
- 効果: Interpreter vs VM vs LLVM の性能差を視覚的に証明
- 規模: 200-300行
Lispインタープリター
- 理由: 「Everything is Box」の優位性を完璧に証明
- 効果: 動的型付け言語での統一モデルの威力
- 規模: 300-400行

Tier 2: 実用性アピール

静的サイトジェネレータ
- 理由: Markdownパーサー + HTMLコンパイラ
- 効果: Fileプラグイン、文字列処理、実用性
- 規模: 400-500行
REST APIサーバー
- 理由: 現代的アプリ開発での適性証明
- 効果: Netプラグイン、JSON操作、並行性
- 規模: 300-400行

📈 Codex先生の推奨（ベンチマーク標準重視）

CLBG（Computer Language Benchmarks Game）定番

n-body - 数値計算の定番
spectral-norm - 行列・ベクトル演算
mandelbrot - 純粋計算 + 画像出力
fannkuch-redux - 組み合わせ計算
k-nucleotide - 文字列・ハッシュ処理
binary-trees - メモリ割り当て・GC性能

現実的なアプリ（Nyash特化）

JSON/CSV Stream Aggregator
- File/Netプラグインから同じコードで処理
- 「Everything is Box」の威力を実証

🚀 推奨実装順序（論文説得力重視）

Phase 1: CLBG標準ベンチマーク（信頼性）

# 1. binary-trees（メモリ・GC基準）
# 2. n-body（数値計算）  
# 3. mandelbrot（計算 + I/O）

Phase 2: Nyash特色ベンチマーク（独自性）

# 4. JSON Stream Aggregator（プラグイン統一）
# 5. レイトレーサー（視覚的インパクト）

Phase 3: 実用アプリ（実践性）

# 6. Lispインタープリター（言語能力）
# 7. 静的サイトジェネレータ（実用性）

📊 期待される性能差

Interpreter → VM → LLVM

n-body: 1x → 10x → 50x（数値計算）
mandelbrot: 1x → 15x → 80x（ループ最適化）
binary-trees: 1x → 8x → 20x（GC最適化）
JSON Stream: 1x → 12x → 30x（プラグイン + ループ）

🎯 AOTスモークテストの具体例

ChatGPT5さんが言う「代表サンプルのAOTスモーク」：

Array get/set スモーク

// smoke_array_getset.nyash
local arr = new ArrayBox()
arr.set(0, 42)
arr.set(1, 100)
local result = arr.get(0) + arr.get(1)
print("Result: " + result)  // Expected: Result: 142

Console.log スモーク

// smoke_console_log.nyash  
local console = new ConsoleBox()
console.log("Hello from LLVM!")
print("Result: success")

実行・比較

# VM実行
./nyash --backend vm smoke_array_getset.nyash > vm_out.txt

# LLVM AOT実行
./nyash --compile-native smoke_array_getset.nyash -o test_app
./test_app > aot_out.txt

# 結果比較（一致すればOK）
diff vm_out.txt aot_out.txt

💡 論文での活用方法

Section 5: Evaluation
- CLBGベンチマークで他言語と性能比較
- Nyash特色ベンチマークで独自性証明
Section 6: Case Studies
- レイトレーサーでJIT/LLVM効果の視覚化
- Stream Aggregatorで「Everything is Box」実証
Appendix: Reproducibility
- 全ベンチマークコードを掲載
- 実行手順を詳細に記載

これで論文の説得力とインパクトが大幅に向上しますね！

3.7 KiB Raw Blame History Unescape Escape

ベンチマークアプリ推奨案（AI会議結果）

🎯 AI先生たちの推奨ベンチマーク

📊 Gemini先生の推奨（論文・実用重視）

Tier 1: 論文の主役候補

Tier 2: 実用性アピール

📈 Codex先生の推奨（ベンチマーク標準重視）

CLBG（Computer Language Benchmarks Game）定番

現実的なアプリ（Nyash特化）

🚀 推奨実装順序（論文説得力重視）

Phase 1: CLBG標準ベンチマーク（信頼性）

Phase 2: Nyash特色ベンチマーク（独自性）

Phase 3: 実用アプリ（実践性）

📊 期待される性能差

Interpreter → VM → LLVM

🎯 AOTスモークテストの具体例

Array get/set スモーク

Console.log スモーク

実行・比較

💡 論文での活用方法

3.7 KiB

Raw Blame History