Fix NullPointerException in WAGED rebalancer when instance config is deleted #93
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Issues
Fixes a NullPointerException in WagedInstanceCapacity that occurs when an instance config is deleted while partitions are still assigned to it. The exception prevents the controller pipeline from completing, blocking rebalancing and leaving stale assignments in IdealState.
(apache#200 - Link your issue number here: You can write "Fixes #XXX". Please use the proper keyword so that the issue gets closed automatically. See https://docs.github.com/en/github/managing-your-work-on-github/linking-a-pull-request-to-an-issue
Any of the following keywords can be used: close, closes, closed, fix, fixes, fixed, resolve, resolves, resolved)
Description
When an instance config is deleted:
Added null checks in methods to handle deleted instance configs:
checkAndReduceInstanceCapacity(): Returns false and logs when instance config is missing
This allows the pipeline to continue, and the rebalancer removes the deleted instance from assignments in the next cycle.
Here are some details about my PR, including screenshots of any UI changes:
(Write a concise description including what, why, how)
Tests
Reproduced the issue locally by deleting an instance config while it had partition assignments
Verified the fix prevents the NPE and allows rebalancing to proceed
Confirmed the deleted instance is removed from IdealState after rebalancing
The following tests are written for this issue:
(List the names of added unit/integration tests)
(If CI test fails due to known issue, please specify the issue and test PR locally. Then copy & paste the result of "mvn test" to here.)
Changes that Break Backward Compatibility (Optional)
(Consider including all behavior changes for public methods or API. Also include these changes in merge description so that other developers are aware of these changes. This allows them to make relevant code changes in feature branches accounting for the new method/API behavior.)
Documentation (Optional)
(Link the GitHub wiki you added)
Commits
Code Quality
(helix-style-intellij.xml if IntelliJ IDE is used)