Commit cd89e54
[METAX] Modify CI logic (#2213)
* [fix] fix fail test when backend is mack
* [metax]change_cupti_and_fix_softmax (#7)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* [Metax] fix dgc & mklml compile product path problem (#8)
* [Metax] fix accuracy kernel & add test_accuracy_op_metax.py unit test (#9)
* [Metax] fix dgc & mklml compile product path problem
* [Metax] fix accuracy kernel & add test_accuracy_op_metax.py unit test
* [Metax] add mixed_vector fix & update change patch
* [Metax] update metax_gpu CMakeLists.txt (#10)
* [Metax] fix dgc & mklml compile product path problem
* [Metax] fix accuracy kernel & add test_accuracy_op_metax.py unit test
* [Metax] add mixed_vector fix & update change patch
* [Metax] update metax_gpu CMakeLists.txt
* [metax] updata_qr_kernel (#11)
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* [Metax] fix illegal address access error in test_momentum_op (#12)
* [Metax] fix illegal address access error in test_momentum_op
* [Metax] fix cufft and fix some blas kernel apply (#13)
* [Metax] fix cufft and fix some blas kernel apply
* [metax] add warpctc_warprnn (#14)
* [metax] fix bug
* [Metax] update metax CI (#15)
* [Metax] update metax CI
* [Metax] update metax CI CMakeLists (#16)
* [Metax] update metax CI
* [Metax] update metax CI CMakeLists
* [Metax] add github action (#18)
* [Metax] add github action
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* [metax] chang build (#19)
* [metax]chaneg build
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* change_build (#20)
* [metax]chaneg build
---------
* change_build (#21)
* change_build (#22)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* 【metax】modify cmake for warpctc and warprnnt (#17)
* modify cmake for warpctc and warprnnt
* modify conv for tf32 and fp32
* modify conv kernel
* [metax]modify library to static library (#24)
* modify cmake for warpctc and warprnnt
* modify conv for tf32 and fp32
* modify conv kernel
* modify library to static library
* [Metax] organize documents (#25)
* [Metax] fix dgc & mklml compile product path problem
* [Metax] update metax_gpu CMakeLists.txt
* [Metax] organize documents
* [metax]fix_code style and index_elementwise_put_kernel (#27)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* change_build_917 (#29)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* chang_build (#30)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
* [metax]change_build
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* [metax]modify kernel (#31)
* modify cmake for warpctc and warprnnt
* modify conv for tf32 and fp32
* modify conv kernel
* modify library to static library
* modify kernel
* change_metax_work (#32)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
* [metax]change_build
* change_metax_work
* change_metax_work
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* change_build (#33)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
* [metax]change_build
* change_metax_work
* change_metax_work
* change_metax_work
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* [metax] modify fused_bias_dropout_residual_layer_norm (#34)
* modify cmake for warpctc and warprnnt
* modify conv for tf32 and fp32
* modify conv kernel
* modify library to static library
* modify kernel
* modify fused_bias_dropout_residual_layer_norm
* change_build (#35)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
* [metax]change_build
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* change_build (#36)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
* [metax]change_build
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* change_warpctc.cmake (#38)
* change_warpctc.cmake
* change_warpctc.cmake (#39)
* change warpctc.cmake
* test (#40)
* test
---------
* test_ut (#41)
* change_run_ut
---------
* tets (#43)
* remove_tets
---------
* test (#44)
* test
---------
* [metax] modify compile (#42)
* modify cmake for warpctc and warprnnt
* modify conv for tf32 and fp32
* modify conv kernel
* modify library to static library
* modify kernel
* modify fused_bias_dropout_residual_layer_norm
* modify compile
* modify blas
* [Metax] add log analysis script (#46)
* [Metax] fix dgc & mklml compile product path problem
* [Metax] update metax_gpu CMakeLists.txt
* [Metax] organize documents
* [Metax] add log analysis script
* add_generate_pb (#47)
* add_generate_pb
---------
* modify blas (#51)
* modify cmake for warpctc and warprnnt
* modify conv for tf32 and fp32
* modify conv kernel
* modify library to static library
* modify kernel
* modify fused_bias_dropout_residual_layer_norm
* modify compile
* modify blas
* modify blas
* modify blas
* modify blas
* [metax] modify tf32 (#52)
* modify cmake for warpctc and warprnnt
* modify conv for tf32 and fp32
* modify conv kernel
* modify library to static library
* modify kernel
* modify fused_bias_dropout_residual_layer_norm
* modify compile
* modify blas
* modify blas
* modify blas
* modify blas
* modify context
* [Metax] update metax backend CI test (#53)
* [Metax] fix dgc & mklml compile product path problem
* [Metax] update metax_gpu CMakeLists.txt
* [Metax] organize documents
* [Metax] add log analysis script
* [Metax] update metax backend CI test
* [Metax] fix log_analysis.py bug (#54)
* [Metax] fix dgc & mklml compile product path problem
* [Metax] update metax_gpu CMakeLists.txt
* [Metax] organize documents
* [Metax] add log analysis script
* [Metax] update metax backend CI test
* [Metax] fix log_analysis.py bug
* [Metax] update metax CI CMakeLists & scripts (#56)
* [Metax] fix dgc & mklml compile product path problem
* [Metax] update metax_gpu CMakeLists.txt
* [Metax] organize documents
* [Metax] add log analysis script
* [Metax] update metax backend CI test
* [Metax] fix log_analysis.py bug
* [Metax] update metax CI CMakeLists & scripts
* [Metax] fix MatmulKernel problem (#57)
* [Metax] fix dgc & mklml compile product path problem
* [Metax] update metax_gpu CMakeLists.txt
* [Metax] organize documents
* [Metax] add log analysis script
* [Metax] update metax backend CI test
* [Metax] fix log_analysis.py bug
* [Metax] update metax CI CMakeLists & scripts
* [Metax] fix MatmulKernel problem
* [Metax] update metax CI program
* [metax]fix paddle bug" (#58)
* [metax]fix paddle bug
* change—ut (#59)
* change_ut
* change_ut (#60)
* change_ut
---------
* change_ut (#63)
* change_ut
* change_ut
---------
* [Metax] add keyword filter in CI CMakeLists.txt (#64)
* [Metax] add keyword filter in CI CMakeLists.txt
* [Metax] add ignore case list
* [metax] modify kernels (#67)
* modify cmake for warpctc and warprnnt
* modify conv for tf32 and fp32
* modify conv kernel
* modify library to static library
* modify kernel
* modify fused_bias_dropout_residual_layer_norm
* modify compile
* modify blas
* modify blas
* modify blas
* modify blas
* modify context
* modify kernels
* Fix part of the missing kernel issues (#66)
Co-authored-by: root <[email protected]>
* [Metax] fix index_elementwise_get kernel (#68)
* [Metax] add keyword filter in CI CMakeLists.txt
* [Metax] add ignore case list
* [Metax] fix phi::backends::gpu::DnnVersion() symbol not found
* Revert "[Metax] fix phi::backends::gpu::DnnVersion() symbol not found"
This reverts commit 087a9c1240f024210d536e543a2fc55db1175529.
* [Metax] fix index_elementwise_get kernel
* [metax]fix patch and fix missing kernel (#72)
* [metax]fix patch and fix missing kernel
* [metax] modify kernels (#73)
* modify kernels
* [metax] modify kernels (#74)
* modify kernels
* [metax] link mccl and fix missing kernel (#76)
* [metax] link mccl and fix missing kernel
* [metax] rename yaml file (#77)
* [metax]fix patch and fix missing kernel
* [metax] link mccl and fix missing kernel
* [metax] rename yaml file
---------
* [metax] rm file (#78)
* [metax]fix patch and fix missing kernel
* [metax] link mccl and fix missing kernel
* [metax] rename yaml file
* [metax] rm file
* [metax] rm file
---------
* metax_fix_ci (#79)
* [metax] add Rules
---------
* [metax] add print tensor (#91)
* modify cmake for warpctc and warprnnt
* modify conv for tf32 and fp32
* modify conv kernel
* modify library to static library
* modify kernel
* modify fused_bias_dropout_residual_layer_norm
* modify compile
* modify blas
* modify blas
* modify blas
* modify blas
* modify context
* modify kernels
* modify kernels
* modify kernels
* add print tensor
* [Metax] change_patch (#94)
* [metax] change_patch
---------
* update paddle (#95)
* update paddle
---------
* [metax] fix dot error (#96)
* [metax] fix dot error
---------
* Update metax_work.yaml
* [metax]rm opt path and fix activation_kernel bug (#98)
* [metax]rm opt path and fix activation_kernel bug
---------
* updata_paddle (#99)
* updata paddle
---------
* [Metax] Fix some tests (#102)
* fix some tests
* [metax] support wint4 in quantize (#103)
* updata_metax (#104)
* test
* test
---------
* updata_metax (#105)
* chang_meatx_yaml
* chang_meatx_yaml
* updata_metax
* test
* test
* test
* test
---------
* add one test to metax (#107)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* fix some tests
* add one test
---------
Co-authored-by: sw <[email protected]>
Co-authored-by: duqimeng <[email protected]>
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
* uodata_metax (#106)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
* [metax]change_build
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
* change_warpctc.cmake
* change warpctc.cmake
* test
* change_run_ut
* remove_tets
* test
* add_generate_pb
* [metax]fix paddle bug
* change_ut
* change_ut
* change_ut
* [metax]fix patch and fix missing kernel
* [metax] link mccl and fix missing kernel
* [metax] rename yaml file
* [metax] rm file
* [metax] rm file
* [metax] add Rules
* [metax] change_patch
* update paddle
* [metax] fix dot error
* [metax]rm opt path and fix activation_kernel bug
* updata paddle
* chang_meatx_yaml
* chang_meatx_yaml
* updata_metax
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* updata eigen_and fix_bug (#109)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
* [metax]change_build
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
* change_warpctc.cmake
* change warpctc.cmake
* test
* change_run_ut
* remove_tets
* test
* add_generate_pb
* [metax]fix paddle bug
* change_ut
* change_ut
* change_ut
* [metax]fix patch and fix missing kernel
* [metax] link mccl and fix missing kernel
* [metax] rename yaml file
* [metax] rm file
* [metax] rm file
* [metax] add Rules
* [metax] change_patch
* update paddle
* [metax] fix dot error
* [metax]rm opt path and fix activation_kernel bug
* updata paddle
* chang_meatx_yaml
* chang_meatx_yaml
* updata_metax
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* updata_enigen
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* updata paddle (#110)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
* [metax]change_build
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
* change_warpctc.cmake
* change warpctc.cmake
* test
* change_run_ut
* remove_tets
* test
* add_generate_pb
* [metax]fix paddle bug
* change_ut
* change_ut
* change_ut
* [metax]fix patch and fix missing kernel
* [metax] link mccl and fix missing kernel
* [metax] rename yaml file
* [metax] rm file
* [metax] rm file
* [metax] add Rules
* [metax] change_patch
* update paddle
* [metax] fix dot error
* [metax]rm opt path and fix activation_kernel bug
* updata paddle
* chang_meatx_yaml
* chang_meatx_yaml
* updata_metax
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* updata_enigen
* updata_paddle
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
* test
* [metax] modify kernels (#117)
* modify kernels
* modify kernels
* fix activation_grad kernel (#118)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* fix some tests
* add one test
* fix one kernel
---------
Co-authored-by: sw <[email protected]>
Co-authored-by: duqimeng <[email protected]>
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
* updata flag_and_fix_activation
* updata flag_and_fix_activation
* updataignore
---------
* updata_patch (#120)
* updata_patch
---------
* Update Paddle submodule to latest develop (#121)
Co-authored-by: tianshuo78520a <[email protected]>
* [metax] modify kernels (#122)
* modify kernels
* [Metax] fix weight_quant & weight_only_linear bug (#125)
* [Metax] fix weight_quant & weight_only_linear bug
* fix and add some kernels (#126)
* fix and add some kernels
* [Metax] fix 'WeightQuantizeKernel' wint4 branch (#133)
* [Metax] fix 'WeightQuantizeKernel' wint4 branch
* [Metax] add quanted weight layout transformation using CPU programming (#135)
* [Metax] adjust quanted weight layout transformation
* [Metax] add quanted weight layout transformation using GPU programming (#136)
* [Metax] add quanted weight layout transformation using GPU programming
* [Metax] updata_softmax (#138)
* updata_softmax
* udata patch (#139)
* updata_patch
---------
* [Metax] optimize wint4 quantization implementation (#140)
* [Metax] optimize wint4 quantization implementation
* change_flag (#141)
* change_flag
* [Metax] register fused_fc_elementwise_layernorm kernel (#143)
* [Metax] register fused_fc_elementwise_layernorm kernel
* updata paddle
* [Metax] add private CI (#144)
* [Metax] add private CI
* [Metax] add Upload (#145)
* [Metax] add Upload
* test (#154)
* ReRun CI (#150)
* [metax]fix collect_fpn_proposals (#157)
* [Metax_change_ut]
* fix sum&collect_fpn_proposals op register
* modify profile
* [Metax] fix paddle bug replace 'MoeGradDispatchKernel' to 'MoeGateDispatchKernel'
* [Metax] register bce_loss_grad & bce_loss & index_add_grad kernels
* [Metax] con2d_grad use gpudnn
* blas handle support
* [Metax] register some kernels & update CMakeLists
* [Metax] fix metax unittest fail
* [Metax] add group_norm & label_smooth kernel and update matmul kernel
* [Metax] fix rmsprop kernel register and add meshgrid & meshgrid_grad kernel register
* add test
* add test
* [test] chang the logic of workspace_host in cholesky_kernel_register
alloc(cpuplace,size), test pass
alloc(cpuplace, size, stream), crash
* [Metax] fix compile fail
* Revert "[Metax] fix compile fail"
This reverts commit 83bc87f686227962b0262e044225c6ed5507b824.
* [Metax] fix compile fail by 'conv_transpose_grad_kernel_impl.h'
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] con2d_grad use gpudnn
* [Metax]fix bug and add qr lstsq logsoftmax
* [Metax] change_patch
* [Metax] update unit test CMakeLists.txt
* [Metax] update unit test CMakeLists.txt
* [feature] add unique_consecutive kernel
* [metax] add some kernel
* [metax] add some kernel
* [Metax] register baddbmm kernel & update blas api
* [Metax] register baddbmm kernel & update blas api
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [feature] add add unique_consecutive kernel.cu
* [fix] fix some test case due to missing op register
* [fix] fix some fail text
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax]fix lu eigvalshsqueeze rnn kernel
* add and fix some kernels
* [Metax] register deformable_conv kernel & fix 'ModulatedDeformableCol2imCoord' symbol undefined
* [Metax] fix conflict
* [Metax] adapt to paddle-cpu-20250901 & resolve the issue of 'test_elementwise_mul_op_metax' failure
* [Metax] update repeat_interleave kernel & ignore max op test
* [metax]fix lu eigvalshsqueeze rnn kernel
* [metax] chang patch fix copy
* [metax] chang patch fix copy
* [Metax] update metax_gpu unit test
* [Metax] fix test CMakeList.txt
* [metax]change_cupti_and_fix_softmax
* [metax]change_patch
* [metax]change_patch
* [metax] updata_qr_kernel
* [metax] updata_qr_kernel
* [Metax] fix cufft and fix some blas kernel apply
* [metax] fix bug
* [Metax] add github action
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]chaneg build
* [metax]fix_code style and index_elementwise_put_kernel
* [metax]change_build
* [metax]change_build
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
* change_metax_work
* change_warpctc.cmake
* change warpctc.cmake
* test
* change_run_ut
* remove_tets
* test
* add_generate_pb
* [metax]fix paddle bug
* change_ut
* change_ut
* change_ut
* [metax]fix patch and fix missing kernel
* [metax] link mccl and fix missing kernel
* [metax] rename yaml file
* [metax] rm file
* [metax] rm file
* [metax] add Rules
* [metax] change_patch
* update paddle
* [metax] fix dot error
* [metax]rm opt path and fix activation_kernel bug
* updata paddle
* chang_meatx_yaml
* chang_meatx_yaml
* updata_metax
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* updata_enigen
* updata_paddle
* test
* updata ignore
* updata_ignore
* updata flag_and_fix_activation
* updataignore
* updata_patch
* feat: add gammaln_grad_kernel.cu
* updata_softmax
* updata_patch
* change_flag
* [metax] add private CI
* [metax] add private CI
* [metax] add private CI
* [Metax] add private CI
* [Metax] add private CI
* [Metax] add private CI
* [Metax] add private CI
* [Metax] add private CI
* [Metax] add private CI
* [Metax] add Upload
* chang yaml
* chang ut
* updata_paddle
* [metax] add schedule
* test
* [metax]fix collect_fpn_proposals
---------
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: metax666 <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: chezhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
Co-authored-by: root <[email protected]>
* [Metax]Update version information (#158)
* [Metax] update env (#163)
* [metax] Timed trigger (#164)
* 【Metax】update (#165)
* [Metax] fix version (#166)
* [Metax] fix nterpolate_grad_kernel (#167)
* [metax]fix version.txt (#169)
* test (#170)
* update yaml (#171)
* [Metax]add parameterized (#172)
* [Metax] Assign data stream to CUDA (#174)
* [Metax] fix CUDA Kernel No.50 (#175)
* [metax] change yaml (#176)
* [metax] Add some tests for CI (#173)
* Change test script to use 8 jobs instead of 16
* 【Metax】fix patch (#178)
* [METAX] Modify CI logic (#179)
* [Metax] fix patch (#180)
* ignore bilinear_interp_v2_op (#181)
* change yaml-yml (#182)
* test (#183)
* rm metax ci (#184)
* updata paddle (#185)
* updata_paddle (#186)
* tets
---------
Co-authored-by: chezhang <[email protected]>
Co-authored-by: duqimeng <[email protected]>
Co-authored-by: Mingkun.Zhang <[email protected]>
Co-authored-by: jiaxinWang-metax <[email protected]>
Co-authored-by: MingkunZhang <[email protected]>
Co-authored-by: zhang-chenyi <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
Co-authored-by: Theendlessofhell <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: ZhouDuan <[email protected]>
Co-authored-by: sw <[email protected]>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: tianshuo78520a <[email protected]>
Co-authored-by: Yuqiang Ge <[email protected]>
Co-authored-by: metax666 <[email protected]>1 parent 78f6295 commit cd89e54
File tree
5 files changed
+134
-121
lines changed- .github/workflows
- backends/metax_gpu
- patch
- tests
5 files changed
+134
-121
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
47 | 47 | | |
48 | 48 | | |
49 | 49 | | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
50 | 54 | | |
51 | 55 | | |
52 | 56 | | |
| |||
This file was deleted.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
19 | 19 | | |
20 | 20 | | |
21 | 21 | | |
22 | | - | |
| 22 | + | |
23 | 23 | | |
24 | 24 | | |
25 | | - | |
| 25 | + | |
26 | 26 | | |
27 | 27 | | |
28 | 28 | | |
| |||
34 | 34 | | |
35 | 35 | | |
36 | 36 | | |
37 | | - | |
38 | | - | |
39 | | - | |
40 | | - | |
41 | | - | |
42 | | - | |
43 | | - | |
44 | | - | |
45 | | - | |
46 | | - | |
47 | | - | |
48 | | - | |
49 | | - | |
50 | 37 | | |
51 | 38 | | |
52 | 39 | | |
| |||
98 | 85 | | |
99 | 86 | | |
100 | 87 | | |
101 | | - | |
| 88 | + | |
102 | 89 | | |
103 | 90 | | |
104 | 91 | | |
| |||
134 | 121 | | |
135 | 122 | | |
136 | 123 | | |
137 | | - | |
| 124 | + | |
138 | 125 | | |
139 | 126 | | |
140 | 127 | | |
| |||
148 | 135 | | |
149 | 136 | | |
150 | 137 | | |
151 | | - | |
| 138 | + | |
152 | 139 | | |
153 | 140 | | |
154 | 141 | | |
| |||
191 | 178 | | |
192 | 179 | | |
193 | 180 | | |
194 | | - | |
| 181 | + | |
195 | 182 | | |
196 | 183 | | |
197 | 184 | | |
| |||
229 | 216 | | |
230 | 217 | | |
231 | 218 | | |
232 | | - | |
| 219 | + | |
233 | 220 | | |
234 | 221 | | |
235 | 222 | | |
| |||
347 | 334 | | |
348 | 335 | | |
349 | 336 | | |
350 | | - | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
| 348 | + | |
| 349 | + | |
| 350 | + | |
| 351 | + | |
| 352 | + | |
351 | 353 | | |
352 | 354 | | |
353 | 355 | | |
| |||
651 | 653 | | |
652 | 654 | | |
653 | 655 | | |
654 | | - | |
| 656 | + | |
655 | 657 | | |
656 | 658 | | |
657 | 659 | | |
| |||
995 | 997 | | |
996 | 998 | | |
997 | 999 | | |
998 | | - | |
| 1000 | + | |
999 | 1001 | | |
1000 | 1002 | | |
1001 | 1003 | | |
| |||
1008 | 1010 | | |
1009 | 1011 | | |
1010 | 1012 | | |
1011 | | - | |
| 1013 | + | |
1012 | 1014 | | |
1013 | 1015 | | |
1014 | 1016 | | |
| |||
1021 | 1023 | | |
1022 | 1024 | | |
1023 | 1025 | | |
1024 | | - | |
| 1026 | + | |
1025 | 1027 | | |
1026 | 1028 | | |
1027 | 1029 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
| 15 | + | |
0 commit comments