Skip to content

cuda : add softcap fusion #14907

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 6 commits into from
Jul 29, 2025
Merged

cuda : add softcap fusion #14907

merged 6 commits into from
Jul 29, 2025

Conversation

CISC
Copy link
Collaborator

@CISC CISC commented Jul 27, 2025

Adds softcap fusion (scale->tanh->scale).

Minor refactoring to ggml_cuda_can_fuse to handle unary ops.

@CISC CISC requested a review from JohannesGaessler July 27, 2025 19:45
@github-actions github-actions bot added testing Everything test related Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Jul 27, 2025
@CISC CISC requested a review from JohannesGaessler July 28, 2025 20:36
@CISC CISC merged commit 138b288 into master Jul 29, 2025
86 of 88 checks passed
@CISC CISC deleted the cisc/cuda-fuse-softcap branch July 29, 2025 12:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants