benchmarks: add baseline planner comparison by kgarg2468 · Pull Request #20 · programmablemanufacturing/programmable-manufacturing-lab

kgarg2468 · 2026-05-25T19:57:10Z

Summary\n- add a beginner-friendly baseline planner comparison example for the toy process-window benchmark\n- compare random sampling against grid search using the existing synthetic physics objective\n- print objective, quality, defect risk, feasibility, and selected physical process settings\n\nFixes #15\n\n## Test Report\n- `cd benchmarks/toy-process-window/industrial-world-model && PYTHONPATH=src python3 examples/compare_baseline_planners.py`

programmablemanufacturing · 2026-05-26T00:29:41Z

Thanks Krish. This is a useful direction and fits the goal of making the toy benchmark easier for new contributors to understand.

Before merging, could you make one small correction to the feasibility reporting? Right now feasible appears to mean whether any feasible candidate was found in the evaluated set, while the selected candidate is chosen by best objective. That could be misleading in the printed comparison.

Could you either:

set feasible=best.feasible so it reports whether the selected candidate is feasible, or
keep a separate field such as any_feasible / feasible_count?

Also, GitHub is flagging hidden/bidirectional Unicode text in the file. Please clean that up as well.

After those changes, I think this is a good beginner-friendly baseline example to merge.

programmablemanufacturing · 2026-05-26T01:14:42Z

Thanks, Krish! I rechecked locally, and the file looks fine regarding hidden Unicode / line endings, so please ignore that part of my earlier comment.

The only remaining point is the feasibility reporting. Right now, feasible=feasible_count > 0 means any evaluated candidate was feasible, not whether the selected candidate is feasible. Could you change it to feasible=best.feasible, or rename it to something like any_feasible?

kgarg2468 · 2026-05-26T23:06:42Z

Addressed the feasibility-reporting review note.

The comparison result now reports feasible for the selected/best candidate action rather than whether any evaluated candidate was feasible. This makes the table match the action that is actually being reported.

Validation run: PYTHONPATH=src python3 examples/compare_baseline_planners.py completed successfully.

programmablemanufacturing · 2026-05-29T01:12:50Z

Merged. Thanks for the fix.

benchmarks: add baseline planner comparison

1ace504

fix: report feasibility for selected planner action

fa2f17d

programmablemanufacturing merged commit bffdb6d into programmablemanufacturing:main May 29, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

benchmarks: add baseline planner comparison#20

benchmarks: add baseline planner comparison#20
programmablemanufacturing merged 2 commits into
programmablemanufacturing:mainfrom
kgarg2468:kgarg/baseline-planner-comparison

kgarg2468 commented May 25, 2026

Uh oh!

programmablemanufacturing commented May 26, 2026

Uh oh!

programmablemanufacturing commented May 26, 2026 •

edited

Loading

Uh oh!

kgarg2468 commented May 26, 2026

Uh oh!

programmablemanufacturing commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

kgarg2468 commented May 25, 2026

Uh oh!

programmablemanufacturing commented May 26, 2026

Uh oh!

programmablemanufacturing commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kgarg2468 commented May 26, 2026

Uh oh!

programmablemanufacturing commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

programmablemanufacturing commented May 26, 2026 •

edited

Loading