
Conversation

@kshyatt (Member) commented Nov 17, 2025

Base.require_one_based_indexing returns a Bool on some versions, not n (integer).

Converting to the full CuMatrix/ROCMatrix is inefficient and ugly, but dispatching on a SubArray of a Diagonal of a CuVector (or ROCVector) is also really ugly. I'm not sure what the best approach is here, but this works for now (we can revisit if it becomes a real problem).

@kshyatt kshyatt requested a review from lkdvos November 17, 2025 18:31
@lkdvos (Member) left a comment


Is it only the Diagonal giving issues?
We could also just add a specialization for Diagonal in general, which we should be able to handle in a GPU-friendly way:

function project_hermitian_native!(A::Diagonal, B::Diagonal, ::Val{anti}) where {anti}
    if anti
        diagview(A) .= imag.(diagview(B)) .* im
    else
        diagview(A) .= real.(diagview(B))
    end
    return A
end

[edit] I think we have a utility function somewhere for the imaginary part, I can't remember the name though

@kshyatt (Member Author) commented Nov 17, 2025

That's also fine. I wrote this in the post-lunch carb coma, so there's probably a better way.

@lkdvos (Member) commented Nov 17, 2025

I guess the diagonal specialization is something we want anyways so that might make sense?

codecov bot commented Nov 17, 2025

Codecov Report

❌ Patch coverage is 96.77419% with 1 line in your changes missing coverage. Please review.

File with missing lines          | Patch % | Lines
src/common/matrixproperties.jl   | 92.85%  | 1 Missing ⚠️

File                                                  | Coverage <Patch>  | Δ
...ixAlgebraKitAMDGPUExt/MatrixAlgebraKitAMDGPUExt.jl | 87.30% <100.00%>  | (+5.21%) ⬆️
...MatrixAlgebraKitCUDAExt/MatrixAlgebraKitCUDAExt.jl | 61.81% <100.00%>  | (+4.19%) ⬆️
src/MatrixAlgebraKit.jl                               | 100.00% <ø>       | (ø)
src/implementations/projections.jl                    | 96.42% <100.00%>  | (+0.32%) ⬆️
src/common/matrixproperties.jl                        | 87.85% <92.85%>   | (-1.85%) ⬇️


return USVᴴ
end
svd_full!(A::Diagonal, USVᴴ, alg::GPU_SVDAlgorithm) = svd_full!(diagm(A.diag), USVᴴ, alg)
Review comment (Member):

Is this actually being used? This is to cover the case where somehow a Diagonal ends up with a GPU_SVDAlgorithm?

Review comment (Member Author):

Yes

Review comment (Member Author):

We currently don't have a DiagonalGPUAlgorithm, which is why this happens, I suppose

Review comment (Member):

I think this might also be a problem with the specific testing setup: we are manually picking the GPUAlgorithm and then calling this on a Diagonal, while the automated algorithm selection would have picked DiagonalAlgorithm instead.
I agree that we should make sure that the diagonal gpu arrays work, but I don't think we have to have specializations to make sure the GPU_SVDAlgorithm implementations work for Diagonal.
TL;DR: instead of these specializations, I think I would rather just remove the tests, since this really is supposed to error?

Comment on lines 135 to 136
MatrixAlgebraKit.ishermitian_approx(A::StridedROCMatrix; kwargs...) =
@invoke MatrixAlgebraKit.ishermitian_approx(A::Any; kwargs...)
Review comment (Member):

Is this kind of definition useful? What happens if it is removed?

Review comment (Member Author):

cc @lkdvos who added this

Review comment (Member):

This is to avoid the StridedMatrix implementation, which does scalar indexing, and instead use norm(project_hermitian(A)) <= ..., as that is the fallback definition for Any.
I updated this and simply copied over that code now; in hindsight, the benefit of not repeating code probably doesn't outweigh the loss in clarity.
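The norm-based fallback described above can be sketched as follows. This is illustrative only (the function name, tolerance choice, and signature are assumptions, not MatrixAlgebraKit's actual API); the point is that every operation is a whole-array operation, so no scalar indexing is needed on GPU arrays:

```julia
using LinearAlgebra

# Hedged sketch of a norm-based approximate hermiticity check.
# Instead of comparing entries pairwise (scalar indexing), measure the
# norm of the anti-hermitian part of A; it is zero iff A is hermitian.
function ishermitian_approx_sketch(A::AbstractMatrix;
                                   rtol = sqrt(eps(real(float(eltype(A))))))
    nskew = norm((A - A') / 2)          # anti-hermitian residual
    return nskew <= rtol * max(norm(A), one(norm(A)))
end
```

Since `-`, `adjoint`, and `norm` all have GPU-friendly implementations, this definition works unchanged for CuArray/ROCArray inputs.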

@kshyatt (Member Author) commented Nov 26, 2025 via email

@kshyatt (Member Author) commented Dec 1, 2025

SVD now has tests, but I rather dislike the approach here, and I think I will implement GPUDiagonalAlgorithm for CUDA and ROCm in a separate PR. This could however be merged as a stop-gap to unblock the TensorKit PR, with another small update PR made to TensorKit if needed to address GPUDiagonalAlgorithm. Does that sound reasonable?

github-actions bot commented Dec 1, 2025

Your PR no longer requires formatting changes. Thank you for your contribution!

@lkdvos (Member) commented Dec 1, 2025

Apologies for the PR hijack here, I hope everything now turns green, and would be happy to merge.

  • I migrated the ishermitian implementations for Diagonal from the GPU extensions to the main package, and implemented them in a GPU-friendly manner there.
  • I also reverted the specific diagm casts for the GPUAlgorithm implementations, which I'm happy to re-revert if need be. My main idea here is that we shouldn't ever end up calling svd_compact!(A::GPUDiagonal, alg::GPU_SVDAlgorithm); algorithm selection should have picked svd_compact!(A::GPUDiagonal, alg::DiagonalAlgorithm) instead.

@kshyatt (Member Author) commented Dec 1, 2025

I think the followup PR #106 can handle the SVD case, so I'm happy to turn on automerge if you are :)

@lkdvos (Member) commented Dec 2, 2025

As a note for a potential (follow-up) improvement: I think the polar decompositions are now defaulting to PolarViaSvd(DiagonalAlgorithm()) for Diagonal inputs, which means the output has to be non-diagonal because the svd sorts the singular values.
Am I missing something, or can we bypass that and directly return diagonal W and P arrays, similar to what a positive QR would return?
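The bypass suggested above can be sketched directly: for D = Diagonal(d), the polar factors can be read off entrywise, with W holding the phases and P the absolute values, so both outputs stay Diagonal and no SVD (with its singular-value sorting) is involved. The function name is hypothetical, not an existing MatrixAlgebraKit API:

```julia
using LinearAlgebra

# Hypothetical sketch: polar decomposition of a Diagonal matrix, entrywise.
# D == W * P with W unitary (diagonal phases) and P positive semidefinite.
function polar_diagonal_sketch(D::Diagonal)
    phase(x) = iszero(x) ? one(x) : x / abs(x)  # map zeros to 1 so W stays unitary
    W = Diagonal(phase.(D.diag))
    P = Diagonal(abs.(D.diag))
    return W, P
end
```

Both factors remain Diagonal (and the broadcasts are GPU-friendly), analogous to what a positive QR would return.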

@kshyatt kshyatt enabled auto-merge (squash) December 2, 2025 08:38
@kshyatt kshyatt merged commit ba2c9ef into main Dec 2, 2025
10 checks passed
@kshyatt kshyatt deleted the ksh/tk2 branch December 2, 2025 10:01