Reduce number of `mul!` methods #472

dkarrasch · 2023-05-12T16:10:05Z

In the context of JuliaGPU/CUDA.jl#1904 I realized that between LinearAlgebra and CUDA there is yet another layer, which is this GPUArrays.jl. I believe that if we hook into the method hierarchy one level lower, many things will greatly simplify. I don't know if this should be annotated with @inline so that potentially constant propagation could eliminate the character fiddling. Comments welcome!

maleadt · 2023-05-16T18:19:58Z

Seeing the Metal.jl CI failure, I guess this is a breaking change?

dkarrasch · 2023-05-17T09:50:52Z

Sorry for seemingly abandoning this PR. I was working on JuliaLang/julia#49806, and now I'm working on including HermOrSym wrappers into that mechanism. Once that is done, I'll continue with SparseArrays.jl, so that these changes are included in Julia v1.10. Afterwards, I'll return to this one.

Regarding your question, this PR by itself is breaking, that is correct. But we can preempt breakage by introducing a few methods in Metal.jl first. Eventually, starting with Julia v1.10, there will be only one method overload that will handle multiplication by once-wrapped GPUArrays, where the wrappers include Adjoint, Transpose, Hermitian and Symmetric.

maleadt · 2023-05-17T10:01:55Z

Sorry for seemingly abandoning this PR. I was working on JuliaLang/julia#49806, and now I'm working on including HermOrSym wrappers into that mechanism. Once that is done, I'll continue with SparseArrays.jl, so that these changes are included in Julia v1.10. Afterwards, I'll return to this one.

Of course, no problem; thanks for doing this!

andreasnoack · 2023-05-22T07:03:12Z

src/host/linalg.jl

+    transA = tA == 'N' ? identity : tA == 'T' ? transpose : adjoint
+    generic_matmatmul!(C, transA(A), B, a, b)
+end
+function LinearAlgebra.generic_matvecmul!(C::AbstractGPUVector, tA::AbstractChar, A::AbstractGPUMatrix, B::AbstractGPUVector, _add::MulAddMul = MulAddMul())


It's not really related to this PR but I think it's a shame the the generic matmul in LinearAlgebra isn't like the one defined in this file, i.e. just three nested loops without trying to be smart about memory. The fact that it's necessary to define a "generic" version here is evidence that the version in LinearAlgebra isn't generic.

Do you think this is fixable by rewriting the generic version as you say, and have a more specific signature for the smart-about-memory version?

maleadt · 2023-05-31T14:17:45Z

Ah, a new issue with Metal.jl; a bad convert method is being called:

  MethodError: convert(::Type{Union{}}, ::MtlMatrix{ComplexF32}) is ambiguous. Candidates:
    convert(T::Type{<:SparseArrays.AbstractSparseMatrixCSC}, m::AbstractMatrix) in SparseArrays at /Users/julia/.julia/scratchspaces/a66863c6-20e8-4ff4-8a62-49f30b1f605e/agent-cache/default-macmini-aarch64-2.0/julia_installs/bin/mac/aarch64/1.8/julia-1.8-latest-macaarch64/share/julia/stdlib/v1.8/SparseArrays/src/sparsematrix.jl:745
    convert(T::Type{<:LinearAlgebra.Bidiagonal}, m::AbstractMatrix) in LinearAlgebra at /Users/julia/.julia/scratchspaces/a66863c6-20e8-4ff4-8a62-49f30b1f605e/agent-cache/default-macmini-aarch64-2.0/julia_installs/bin/mac/aarch64/1.8/julia-1.8-latest-macaarch64/share/julia/stdlib/v1.8/LinearAlgebra/src/bidiag.jl:203
    convert(::Type{Union{}}, a::AbstractArray) in Base at array.jl:618
    convert(::Type{T}, a::AbstractArray) where T<:GPUArraysCore.AbstractGPUArray in GPUArrays at /Users/julia/.julia/scratchspaces/a66863c6-20e8-4ff4-8a62-49f30b1f605e/agent-cache/default-macmini-aarch64-2.0/build/default-macmini-aarch64-2-0/julialang/gpuarrays-dot-jl/src/host/construction.jl:4
    convert(T::Type{<:BitArray}, a::AbstractArray) in Base at bitarray.jl:580
    convert(::Type{T}, a::AbstractArray) where T<:Array in Base at array.jl:617
    convert(::Type{SA}, a::AbstractArray) where SA<:StaticArraysCore.StaticArray in StaticArrays at /Users/julia/.julia/scratchspaces/a66863c6-20e8-4ff4-8a62-49f30b1f605e/agent-cache/default-macmini-aarch64-2.0/depots/c9f52312-b528-44e4-9501-6d408762012b/packages/StaticArrays/J9itA/src/convert.jl:194
    convert(::Type{Union{}}, x) in Base at essentials.jl:213
    convert(::Type{T}, arg) where T<:VecElement in Base at baseext.jl:19
  Possible fix, define
    convert(::Type{Union{}}, ::AbstractMatrix)
  Stacktrace:
   [1] to_power_type(x::MtlMatrix{ComplexF32})
     @ Base ./intfuncs.jl:250

maleadt · 2023-05-31T20:33:37Z

/AppleInternal/Library/BuildRoots/9941690d-bcf7-11ed-a645-863efbbaf80d/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSMatrix/LinearAlgebra/ARM64/MPSMatrixMultiplication.mm:3274: failed assertion `Number of requested rows in left input matrix exceeds left input matrix size.'

Interesting; I guess Metal.jl should protect against that to avoid an abort.

maleadt · 2023-06-01T12:28:20Z

Finally, all green. Let's tag things to get this out there!

dkarrasch added 3 commits May 12, 2023 18:06

Reduce number of mul! methods

a554819

Update linalg.jl

1185b64

Update linalg.jl

b57b795

dkarrasch mentioned this pull request May 12, 2023

Reduce number of mul! methods JuliaGPU/oneAPI.jl#318

Merged

dkarrasch added 2 commits May 12, 2023 20:01

catch mv mul cases

410fe47

Update linalg.jl

1feb1f7

andreasnoack reviewed May 22, 2023

View reviewed changes

dkarrasch mentioned this pull request May 24, 2023

Refactor matmatmul code for faster load time JuliaGPU/Metal.jl#186

Merged

Merge branch 'master' into master

3fc3e3a

dkarrasch added 2 commits May 31, 2023 20:49

update

e50f24a

add type annotations

23382b2

maleadt merged commit 443dab4 into JuliaGPU:master Jun 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reduce number of `mul!` methods #472

Reduce number of `mul!` methods #472

Uh oh!

dkarrasch commented May 12, 2023

Uh oh!

maleadt commented May 16, 2023

Uh oh!

dkarrasch commented May 17, 2023

Uh oh!

maleadt commented May 17, 2023

Uh oh!

andreasnoack May 22, 2023

Uh oh!

dkarrasch May 24, 2023

Uh oh!

maleadt commented May 31, 2023

Uh oh!

maleadt commented May 31, 2023

Uh oh!

maleadt commented Jun 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Reduce number of mul! methods #472

Reduce number of mul! methods #472

Uh oh!

Conversation

dkarrasch commented May 12, 2023

Uh oh!

maleadt commented May 16, 2023

Uh oh!

dkarrasch commented May 17, 2023

Uh oh!

maleadt commented May 17, 2023

Uh oh!

andreasnoack May 22, 2023

Choose a reason for hiding this comment

Uh oh!

dkarrasch May 24, 2023

Choose a reason for hiding this comment

Uh oh!

maleadt commented May 31, 2023

Uh oh!

maleadt commented May 31, 2023

Uh oh!

maleadt commented Jun 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Reduce number of `mul!` methods #472

Reduce number of `mul!` methods #472