benchmarks: add baseline planner comparison #20
Thanks Krish. This is a useful direction and fits the goal of making the toy benchmark easier for new contributors to understand. Before merging, could you make one small correction to the feasibility reporting? Right now Could you either:
Also, GitHub is flagging hidden/bidirectional Unicode text in the file. Please clean that up as well. After those changes, I think this is a good beginner-friendly baseline example to merge.
Thanks, Krish! I rechecked locally, and the file looks fine regarding hidden Unicode / line endings, so please ignore that part of my earlier comment. The only remaining point is the feasibility reporting. Right now,
Addressed the feasibility-reporting review note. The comparison result now reports Validation run:
Merged. Thanks for the fix.
## Summary
- add a beginner-friendly baseline planner comparison example for the toy process-window benchmark
- compare random sampling against grid search using the existing synthetic physics objective
- print objective, quality, defect risk, feasibility, and selected physical process settings

Fixes #15

## Test Report
- `cd benchmarks/toy-process-window/industrial-world-model && PYTHONPATH=src python3 examples/compare_baseline_planners.py`
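For readers who have not opened the example script, the comparison described in the summary can be sketched roughly as below. This is not the code from this PR: the objective, the `temperature` and `pressure` settings, the `DEFECT_RISK_LIMIT` threshold, and every function name are hypothetical stand-ins chosen for illustration. The sketch only shows the general shape of comparing random sampling against grid search and reporting feasibility alongside the objective.

```python
import itertools
import random

# Hypothetical stand-in for the benchmark's synthetic physics objective.
# All names, formulas, and the threshold below are illustrative assumptions.
DEFECT_RISK_LIMIT = 0.3  # assumed feasibility threshold


def evaluate(temperature, pressure):
    """Score one process setting; both inputs are normalized to [0, 1]."""
    quality = 1.0 - (temperature - 0.6) ** 2 - (pressure - 0.4) ** 2
    defect_risk = max(0.0, temperature + pressure - 1.0)
    return {
        "temperature": temperature,
        "pressure": pressure,
        "quality": quality,
        "defect_risk": defect_risk,
        "objective": quality - defect_risk,
        "feasible": defect_risk <= DEFECT_RISK_LIMIT,
    }


def best_of(candidates):
    """Return the best result, preferring feasible settings over infeasible ones."""
    results = [evaluate(t, p) for t, p in candidates]
    # Tuples compare left to right, so feasibility outranks the objective.
    return max(results, key=lambda r: (r["feasible"], r["objective"]))


def random_search(n_samples, seed=0):
    """Sample settings uniformly at random with a fixed seed for repeatability."""
    rng = random.Random(seed)
    return best_of((rng.random(), rng.random()) for _ in range(n_samples))


def grid_search(points_per_axis):
    """Evaluate every point of an evenly spaced grid over both settings."""
    axis = [i / (points_per_axis - 1) for i in range(points_per_axis)]
    return best_of(itertools.product(axis, axis))


def compare(n_samples=25, points_per_axis=5):
    """Run both planners on the same budget of 25 evaluations."""
    return {
        "random": random_search(n_samples),
        "grid": grid_search(points_per_axis),
    }


if __name__ == "__main__":
    for name, result in compare().items():
        print(
            f"{name}: objective={result['objective']:.3f} "
            f"quality={result['quality']:.3f} "
            f"defect_risk={result['defect_risk']:.3f} "
            f"feasible={result['feasible']} "
            f"settings=({result['temperature']:.2f}, {result['pressure']:.2f})"
        )
```

Ranking candidates by `(feasible, objective)` is one way to keep an infeasible setting from being reported as the winner just because its raw objective is higher. Whether the merged script handles feasibility this way is not stated in this thread.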