[Sparse conv] Implement implicit gemm algo for SubmConv3D #62747

Wong4j · 2024-03-15T06:56:47Z

PR Category

Performance Optimization

PR Types

Performance

Description

Support implicit GEMM algorithm for SubmConv3D.

Usage:

nn.functional.subm_conv3d_igemm

# 3D
y = paddle.sparse.nn.functional.subm_conv3d(x, weight, key='key1', padding=[0, 1, 1])   # original
y = paddle.sparse.nn.functional.subm_conv3d_igemm(x, weight, key='key2', padding=[0, 1, 1])   # use implicit gemm

# 2D
y = paddle.sparse.nn.functional.subm_conv2d(x, weight, key='key1', padding=[1, 1])   # original
y = paddle.sparse.nn.functional.subm_conv2d_igemm(x, weight, key='key2', padding=[1, 1])   # use implicit gemm

nn.SubmConv3D

# 3D
paddle.nn.SubmConv3D(32, 32, kernel_size=[1, 3, 3], key="key1")  # original
paddle.nn.SubmConv3D(32, 32, kernel_size=[1, 3, 3], key="key2", backend="igemm")  # use implicit gemm

# 2D
paddle.nn.SubmConv2D(32, 32, kernel_size=[3, 3], key="key1")  # original
paddle.nn.SubmConv2D(32, 32, kernel_size=[3, 3], key="key2", backend="igemm")  # use implicit gemm

Perf:
GPU: 3080
Prec: FP16
case: single SubmConv
nnz=214202 dense_shape=[1, 1, 4608, 4608, 32] kernel_size=[1, 3, 3] stride=1 in_channel=out_channel=32
(This perf numbers do not include the overhead of hashmap/rulebook creation, which I assume has been cached.)

---	cutlass	igemm
time(us)	703	194

Note:

Implicit gemm only supports forward now.
I have only verified the correctness for SubmConv, so I'm asserting subm==True and stride==1 and dilation==1 in the code.
The cuda kernels are modified based on the torchsparse's implementation.
~~The input must be 3D (NDHWC), and kernel must has dims=3. For 2D case, please insert zeros to to the D dimention of indices and set kernel sizes to (1, 3, 3).~~
The input can be 3D (NDHWC) or 2D (NHWC)

paddle-bot · 2024-03-15T06:56:52Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

paddle-ci-bot · 2024-03-23T03:08:08Z

Sorry to inform you that ee0d63d's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.

Wangzheee · 2024-04-15T12:42:42Z

paddle/phi/infermeta/sparse/binary.cc

  counter->set_dims({1});
 }

+void Conv3dImplicitGemmInferMeta(const MetaTensor& x,


ci的代码没有覆盖到这个OP，可以针对这个OP增加单测

Wangzheee · 2024-04-15T12:43:21Z

paddle/phi/core/kmap_cache.h

+  // std::vector<int>* spatial_range;
+
+  // destructor
+  ~KmapCache() {


单测中没有执行这个析构，可以增加一下

我增加了单测，本地跑单测会跑到这个析构，但是CI仍然显示没有跑到。

paddle-ci-bot · 2024-04-17T03:11:15Z

Sorry to inform you that 4851677's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.

qingqing01

后续需要更新中文文档

qingqing01 · 2024-04-19T06:49:45Z

python/paddle/sparse/nn/layer/conv.py

        weight_attr=None,
        bias_attr=None,
        data_format="NDHWC",
+        backend=None,


对用户暴露的接口需增加注释

jzhang533 · 2024-04-22T03:14:12Z

python/paddle/sparse/nn/layer/conv.py

        weight_attr=None,
        bias_attr=None,
        data_format="NDHWC",
+        backend=None,


the newly introduced arg backend should be documented in the docstring.

python/paddle/sparse/nn/layer/conv.py

jzhang533

LGTM

…le#62747) * sparse conv: implement implicit gemm algo

paddle-bot bot added the contributor External developers label Mar 15, 2024

Wong4j requested a review from Wangzheee March 15, 2024 06:59

jeng1220 added the NVIDIA label Mar 15, 2024

Wong4j force-pushed the jaywan/sparse_conv branch 2 times, most recently from 064892d to 10eb558 Compare April 7, 2024 07:49

onecatcn assigned heavengate Apr 8, 2024

Wong4j added 9 commits April 9, 2024 04:58

sparse conv: implement implicit gemm algo

98a427b

add ut

3f8fc96

remove redundant file

fb17ddd

fix error

881bea1

support 2d input for sparse_conv_igemm

b0b3c68

fix 3d bug

32ae706

skip ut when arch < 75

bc12530

minor change

8a9ee31

fix cache

dcc9b0b

Wong4j force-pushed the jaywan/sparse_conv branch from 7a5bec7 to dcc9b0b Compare April 9, 2024 05:13

Wong4j added 3 commits April 9, 2024 05:37

rm print

87bc6a5

skip ut for win32

289e8c7

fix code style

4851677

Wong4j changed the title ~~[WIP] [Sparse conv] Implement implicit gemm algo for SubmConv3D~~ [Sparse conv] Implement implicit gemm algo for SubmConv3D Apr 11, 2024

Wangzheee reviewed Apr 15, 2024

View reviewed changes

Wong4j added 3 commits April 17, 2024 08:14

add more ut

7530c0a

minor change

bd1cebf

add more UTs

ba131b2

tianshuo78520a approved these changes Apr 19, 2024

View reviewed changes

qingqing01 approved these changes Apr 19, 2024

View reviewed changes

ming1753 assigned jzhang533 Apr 19, 2024

Aurelius84 approved these changes Apr 19, 2024

View reviewed changes

jzhang533 reviewed Apr 22, 2024

View reviewed changes

jzhang533 approved these changes Apr 22, 2024

View reviewed changes

Wangzheee approved these changes Apr 22, 2024

View reviewed changes

Wangzheee merged commit 0663608 into PaddlePaddle:develop Apr 22, 2024

co63oc pushed a commit to co63oc/Paddle that referenced this pull request Apr 22, 2024

[Sparse conv] Implement implicit gemm algo for SubmConv3D (PaddlePadd…

9bc75dd

…le#62747) * sparse conv: implement implicit gemm algo

Wong4j mentioned this pull request Apr 25, 2024

Fix PR-CI-Hygon-DCU failure #63864

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Sparse conv] Implement implicit gemm algo for SubmConv3D #62747

[Sparse conv] Implement implicit gemm algo for SubmConv3D #62747

Uh oh!

Wong4j commented Mar 15, 2024 •

edited

Loading

Uh oh!

paddle-bot bot commented Mar 15, 2024

Uh oh!

paddle-ci-bot bot commented Mar 23, 2024

Uh oh!

Wangzheee Apr 15, 2024

Uh oh!

Wong4j Apr 18, 2024

Uh oh!

Wangzheee Apr 15, 2024

Uh oh!

Wong4j Apr 18, 2024

Uh oh!

paddle-ci-bot bot commented Apr 17, 2024

Uh oh!

qingqing01 left a comment

Uh oh!

qingqing01 Apr 19, 2024

Uh oh!

jzhang533 Apr 22, 2024

Uh oh!

Uh oh!

jzhang533 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

[Sparse conv] Implement implicit gemm algo for SubmConv3D #62747

[Sparse conv] Implement implicit gemm algo for SubmConv3D #62747

Uh oh!

Conversation

Wong4j commented Mar 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Category

PR Types

Description

Uh oh!

paddle-bot bot commented Mar 15, 2024

Uh oh!

paddle-ci-bot bot commented Mar 23, 2024

Uh oh!

Wangzheee Apr 15, 2024

Choose a reason for hiding this comment

Uh oh!

Wong4j Apr 18, 2024

Choose a reason for hiding this comment

Uh oh!

Wangzheee Apr 15, 2024

Choose a reason for hiding this comment

Uh oh!

Wong4j Apr 18, 2024

Choose a reason for hiding this comment

Uh oh!

paddle-ci-bot bot commented Apr 17, 2024

Uh oh!

qingqing01 left a comment

Choose a reason for hiding this comment

Uh oh!

qingqing01 Apr 19, 2024

Choose a reason for hiding this comment

Uh oh!

jzhang533 Apr 22, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jzhang533 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

Wong4j commented Mar 15, 2024 •

edited

Loading