Modify legal actions of rocksample #84

HarryXuancy · 2025-10-03T12:15:53Z

In this PR I modify the legal actions in RockSample environment:

Sample action is always allowed without any penalty --> Only allow to sample when the agent is in the position of a rock.
After a rock is sampled, it is replaced by a bad rock --> After a rock is sampled, it can not be sampled or checked anymore. We can regard it as removed from the environment.

These new settings align with Silver's cpp code and can bring better results since the search space is smaller. I am not sure if pomdp-py wants to adopt this new setting, you can decide whether to merge based on the need.

zkytony

Thank you! Great work.

Requesting a few changes; see comments.

Also, could you add some unit tests? Those can go in pomdp_py/tests. There's a custom test framework here and it should be easy to understand. These tests are run by CI so ensures your work is preserved in a working state.

.gitignore

pomdp_py/problems/rocksample/rocksample_problem.py

zkytony · 2025-10-03T13:02:59Z

pomdp_py/problems/rocksample/rocksample_problem.py

+    print(f"Max total reward: {max(total_rewards)}")
+    print(f"Min discounted reward: {min(total_discounted_rewards):.3f}")
+    print(f"Max discounted reward: {max(total_discounted_rewards):.3f}")
+    print("="*50)


Could you separate this part into a function called run_experiment or benchmark? Then, if a user runs

python -m pomdp_py -r rocksample --benchmark

it will run this. Otherwise, it still runs a single-run example?

I added this here: #85

zkytony · 2025-10-03T13:07:48Z

@HarryXuancy also notice the pre-commit check fails.

zkytony · 2025-10-25T01:30:35Z

@HarryXuancy I see this has become stale. I'll see if I can get this pass CI. Also I noticed two of my comments were resolved without reason. Specifically, it doesn't look like the code is generating output.txt files so I don't think we need to add it in gitignore.

zkytony · 2025-10-25T03:13:20Z

@HarryXuancy please check out this one #85
Feel free to review. It has basically all your changes + the benchmark flag. I am waiting for the benchmark to finish & will report the numbers.

HarryXuancy · 2025-10-26T15:57:55Z

@HarryXuancy I see this has become stale. I'll see if I can get this pass CI. Also I noticed two of my comments were resolved without reason. Specifically, it doesn't look like the code is generating output.txt files so I don't think we need to add it in gitignore.

Sorry for my big delay! And I resolved the comments because of my personal habit. If I think a comment is absolutely correct, I usually just apply the corresponding change without adding a reply. Sorry I probably should have added some responses before resolving the comments.

* Modify legal actions of rocksample * allow --benchmark in rocksample * pre-commit * remove gitignore * minor hash improvement --------- Co-authored-by: Chunyu Xuan <lhsxxcy@126.com>

zkytony · 2025-10-28T03:55:33Z

Merged by #85

Modify legal actions of rocksample

d54aca9

zkytony requested changes Oct 3, 2025

View reviewed changes

zkytony mentioned this pull request Oct 25, 2025

Merge rocksample improvements from #84 and enable --benchmark flag #85

Merged

zkytony closed this Oct 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Modify legal actions of rocksample #84

Modify legal actions of rocksample #84

Uh oh!

HarryXuancy commented Oct 3, 2025

Uh oh!

zkytony left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zkytony Oct 3, 2025

Uh oh!

zkytony Oct 25, 2025

Uh oh!

zkytony commented Oct 3, 2025

Uh oh!

zkytony commented Oct 25, 2025

Uh oh!

zkytony commented Oct 25, 2025

Uh oh!

HarryXuancy commented Oct 26, 2025

Uh oh!

zkytony commented Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Modify legal actions of rocksample #84

Modify legal actions of rocksample #84

Uh oh!

Conversation

HarryXuancy commented Oct 3, 2025

Uh oh!

zkytony left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zkytony Oct 3, 2025

Choose a reason for hiding this comment

Uh oh!

zkytony Oct 25, 2025

Choose a reason for hiding this comment

Uh oh!

zkytony commented Oct 3, 2025

Uh oh!

zkytony commented Oct 25, 2025

Uh oh!

zkytony commented Oct 25, 2025

Uh oh!

HarryXuancy commented Oct 26, 2025

Uh oh!

zkytony commented Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants