feat: Added an update to CoreDNS #530

Merged
merged 21 commits into from
Aug 4, 2025

Conversation

DannyLiCom
Collaborator

@DannyLiCom DannyLiCom commented Jul 14, 2025

Fixes / Features

Enhance DNS Scalability for Large-Scale Testing:

  • Issue: During large-scale load testing, the existing Kube-DNS setup could not keep up with the demands of the McJAX and Pathways TPU paths, creating potential performance bottlenecks.
  • Solution: Changed the configuration so that the McJAX and Pathways TPU paths default to CoreDNS.

Testing / Documentation

When a cluster is created with python3 xpk.py cluster create-pathways or python3 xpk.py cluster create, CoreDNS is used by default.

  • [ y/n ] Tests pass
  • [ y/n ] Appropriate changes to documentation are included in the PR

@ycchenzheng
Collaborator

This PR was moved here from #501 because of a fork issue.
@pawloch00 @SujeethJinesh @wstcliyu please review when you get time

@DannyLiCom
Collaborator Author

@pawloch00 Please review it again.

@pawloch00
Collaborator

@sharabiani ptal

@DannyLiCom
Collaborator Author

@wstcliyu @pawloch00 It also seems to be caused by an IP connection refused error.

@pawloch00
Collaborator

The test that failed wasn't on any of the paths mentioned in the PR description.

@DannyLiCom
Collaborator Author

Right, the problem only appeared after I updated the branch.

@pawloch00
Collaborator

pawloch00 commented Jul 25, 2025

Please add a GitHub workflow test that clearly exercises this path and checks for the presence of the CoreDNS deployment. Also, if possible, add checks to the existing tests that the existing DNS deployment was not changed.
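
For illustration only, a check like the one requested here could be sketched as follows. This is not the test that was added in this PR. It assumes the workflow captures the JSON output of kubectl get deployments -n kube-system -o json, and that the deployments are named coredns and kube-dns; both names, the helper functions, and the sample data are hypothetical.

```python
import json


def deployment_names(kubectl_json: str) -> set:
    """Return the names of all Deployments found in the JSON list output."""
    items = json.loads(kubectl_json).get("items", [])
    return {item["metadata"]["name"] for item in items}


def has_ready_deployment(kubectl_json: str, name: str) -> bool:
    """True if a Deployment called `name` exists with at least one ready replica."""
    for item in json.loads(kubectl_json).get("items", []):
        if item["metadata"]["name"] == name:
            # readyReplicas is omitted from the status when no replica is ready.
            return item.get("status", {}).get("readyReplicas", 0) >= 1
    return False


# Hypothetical sample standing in for real kubectl output (assumed names).
SAMPLE = json.dumps({
    "items": [
        {"metadata": {"name": "coredns"}, "status": {"readyReplicas": 2}},
        {"metadata": {"name": "kube-dns"}, "status": {}},
    ]
})

if __name__ == "__main__":
    print("coredns ready:", has_ready_deployment(SAMPLE, "coredns"))
    print("deployments:", sorted(deployment_names(SAMPLE)))
```

In a workflow step, the captured kubectl output would replace SAMPLE. Comparing deployment_names before and after cluster creation is one possible way to check that an existing DNS deployment was left untouched.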

@DannyLiCom DannyLiCom merged commit 48012d8 into develop Aug 4, 2025
32 of 33 checks passed
@DannyLiCom DannyLiCom deleted the lidanny/feature/update-to-CoreDNS branch August 4, 2025 02:31
3 participants