Adam operator takes too much time on CPU and launches multiple kernels in CUDA #6985

@reyoung

Description

The current implementation can be slow. We also need a simple, unified way to write element-wise code.
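To make the request concrete, here is a minimal sketch of what a fused element-wise Adam update could look like. It assumes the textbook Adam update with a bias-corrected learning rate, which the issue itself does not spell out. The names `AdamFunctor`, `ForEachIndex`, and `AdamStep` are hypothetical and are not the existing operator's API. The idea is that a single functor does all the per-element work, so the CPU path is one loop and a CUDA path could be one kernel launch instead of several.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical functor (not an existing API): applies the full Adam update
// to element i in one pass, so no intermediate tensors are materialized.
struct AdamFunctor {
  float beta1, beta2, epsilon;
  float lr_t;  // bias-corrected learning rate, computed once per step
  const float* grad;
  float* moment1;
  float* moment2;
  float* param;

  void operator()(std::size_t i) const {
    const float g = grad[i];
    const float m = beta1 * moment1[i] + (1.0f - beta1) * g;
    const float v = beta2 * moment2[i] + (1.0f - beta2) * g * g;
    moment1[i] = m;
    moment2[i] = v;
    param[i] -= lr_t * m / (std::sqrt(v) + epsilon);
  }
};

// Hypothetical CPU driver: a single loop over all indices. A CUDA driver
// could invoke the same functor from one kernel, one thread per index.
template <typename Functor>
void ForEachIndex(std::size_t n, const Functor& f) {
  for (std::size_t i = 0; i < n; ++i) f(i);
}

// One Adam step over flat buffers; `step` is the 1-based iteration count.
void AdamStep(std::vector<float>& param, const std::vector<float>& grad,
              std::vector<float>& moment1, std::vector<float>& moment2,
              float lr, float beta1, float beta2, float epsilon, int step) {
  assert(step >= 1);
  assert(param.size() == grad.size());
  assert(param.size() == moment1.size());
  assert(param.size() == moment2.size());

  // Assumed textbook form: lr_t = lr * sqrt(1 - beta2^t) / (1 - beta1^t).
  const float lr_t = static_cast<float>(
      lr * std::sqrt(1.0 - std::pow(beta2, step)) /
      (1.0 - std::pow(beta1, step)));

  AdamFunctor f{beta1, beta2, epsilon, lr_t,
                grad.data(), moment1.data(), moment2.data(), param.data()};
  ForEachIndex(param.size(), f);
}
```

Keeping the math in a functor and the iteration in a separate driver is one way to get the unified style asked for above: the same functor body could be shared between a CPU loop and a GPU kernel, with only the driver differing.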
