
Conversation

ZzSean (Contributor) commented Apr 19, 2021

PR types

Performance optimization

PR changes

OPs

Describe

Unify the implementation of activation operators.
This PR modifies 25 activation operators in total, covering both their forward and backward passes:

  • 9 trigonometric ops: sin, cos, tan, asin, acos, atan, sinh, cosh, tanh
  • 3 relu-family ops: relu, leaky_relu, elu
  • 3 sigmoid-family ops: sigmoid, silu, logsigmoid
  • 3 rounding ops: ceil, floor, round
  • 6 math ops: sqrt, rsqrt, square, log, exp, reciprocal
  • 1 shrink op: softshrink

The performance improvement is similar within each category, so one operator per category is shown as a representative in the table below.
Test case shape: [16, 128, 257, 257]

| OP Name | FP32 old | FP32 new | Improvement | FP16 old | FP16 new | Improvement |
| --- | --- | --- | --- | --- | --- | --- |
| elu fwd | 1.6077 ms | 1.3114 ms | 22.6% | 898.68 us | 670.20 us | 34.1% |
| elu bwd | 2.1628 ms | 1.9057 ms | 13.5% | 1.5737 ms | 963.07 us | 63.4% |
| sigmoid fwd | 1.4083 ms | 1.3123 ms | 7.3% | 1.0360 ms | 674.24 us | 53.7% |
| sigmoid bwd | 2.0002 ms | 1.9059 ms | 4.9% | 1.1890 ms | 961.18 us | 23.7% |
| ceil fwd | 1.5198 ms | 1.3116 ms | 15.9% | 904.18 us | 670.92 us | 34.8% |
| ceil bwd | 1.4069 ms | 603.45 us | 133.1% | 909.00 us | 302.23 us | 200% |
| sin fwd | 1.5071 ms | 1.3132 ms | 14.8% | 989.87 us | 673.57 us | 47.0% |
| sin bwd | 2.0647 ms | 1.9062 ms | 8.3% | 1.3319 ms | 970.76 us | 37.2% |
| sqrt fwd | 1.4051 ms | 1.3121 ms | 7.1% | 950.01 us | 672.92 us | 41.2% |
| sqrt bwd | 2.0164 ms | 1.9069 ms | 5.7% | 1.3418 ms | 966.90 us | 38.7% |
| softshrink fwd | 1.5230 ms | 1.3118 ms | 16.1% | 910.58 us | 669.97 us | 35.9% |
| softshrink bwd | 2.0644 ms | 1.9057 ms | 8.3% | 1.2642 ms | 963.07 us | 31.3% |
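
For context on what "unify" means here, the change moves these ops onto a single elementwise-functor pattern. The sketch below is illustrative only: the functor and kernel names and the launcher signature are assumptions for this comment, not the committed Paddle code.

```cpp
// Illustrative sketch (assumed names, not the actual Paddle code):
// each activation becomes a small device functor, and one shared
// elementwise kernel launches all 25 ops through the same path.
template <typename T>
struct ExpFunctor {
  __device__ __forceinline__ T operator()(const T* args) const {
    return static_cast<T>(exp(static_cast<float>(args[0])));  // exp(x)
  }
};

template <typename T, typename Functor>
__global__ void UnaryElementwiseKernel(const T* x, T* out, int n,
                                       Functor func) {
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) out[i] = func(&x[i]);
}
```

With one shared launch path, per-op tuning (block size, vectorization) only has to be done once, which is where the uniform speedups in the table come from.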

@paddle-bot-old commented

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

ZzSean force-pushed the activation_op_impl branch several times between April 20 and April 26, 2021.
// Quoted grad-functor snippet; note that exp(-x) is written twice.
CT dout = static_cast<CT>(args[0]);
CT x = static_cast<CT>(args[1]);
CT temp1 = one + exp(-x);
CT temp2 = x * exp(-x);
Contributor

Written this way, will exp() be called twice?

Contributor Author

The implementation has been revised accordingly.
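
For illustration, a minimal sketch of the kind of revision described, hoisting the shared exp(-x) into a temporary so it is evaluated only once (names follow the quoted snippet; the surrounding functor is assumed):

```cpp
// Sketch: evaluate the shared sub-expression exp(-x) a single time.
CT dout = static_cast<CT>(args[0]);
CT x = static_cast<CT>(args[1]);
CT temp = exp(-x);      // computed once
CT temp1 = one + temp;  // was: one + exp(-x)
CT temp2 = x * temp;    // was: x * exp(-x)
```

The compiler may already eliminate the duplicate call as a common sub-expression, but hoisting it makes the cost explicit and independent of optimization level.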

// logsigmoid forward, written with the log-sum-exp trick for numerical
// stability: logsigmoid(x) = -temp - log(exp(-temp) + exp(-x - temp)),
// where temp = max(-x, 0).
__device__ __forceinline__ T operator()(const T* args) const {
  CT x = static_cast<CT>(args[0]);
  CT temp = x > zero ? zero : -x;
  return T(-temp - log(exp(-temp) + exp(-x - temp)));
}
Contributor

Since only -temp is ever used, couldn't the negation be dropped when computing temp?

ZzSean (Contributor Author) commented Apr 27, 2021

To stay consistent with the original implementation and with the formula, I'd rather leave it unchanged for now.

// logsigmoid backward: dx = dout * sigmoid(-x), stabilized with the
// same shift temp = max(-x, 0) as the forward pass.
CT dout = static_cast<CT>(args[0]);
CT x = static_cast<CT>(args[1]);
CT temp = x > zero ? zero : -x;
return T(dout * (exp(-x - temp) / (exp(-temp) + exp(-x - temp))));
Contributor

  • Same as above: since only -temp is ever used, couldn't the negation be dropped when computing temp?
  • exp(-x - temp) appears in both the numerator and the denominator; could it be hoisted out?

Contributor Author

Simplified as suggested.
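
A minimal sketch of the simplification being discussed, hoisting the repeated exp(-x - temp) into a local (same functor shape as above; the exact committed code may differ):

```cpp
// Sketch: compute the shared term once and reuse it.
CT dout = static_cast<CT>(args[0]);
CT x = static_cast<CT>(args[1]);
CT temp = x > zero ? zero : -x;
CT shared = exp(-x - temp);  // used in numerator and denominator
return T(dout * (shared / (exp(-temp) + shared)));
```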

// elu forward: x if x >= 0, else alpha * (exp(x) - 1).
// Inputs: args[0], the input x
__device__ __forceinline__ T operator()(const T* args) const {
  CT x = static_cast<CT>(args[0]);
  return x >= zero ? args[0] : T(static_cast<CT>(alpha) * (exp(x) - one));
}
Contributor
Contributor Author
No change for now.

// elu backward: dx = dout if x >= 0, else dout * alpha * exp(x).
// Inputs: args[0], dout; args[1], the input x
__device__ __forceinline__ T operator()(const T* args) const {
  CT dout = static_cast<CT>(args[0]);
  CT x = static_cast<CT>(args[1]);
  return x >= zero ? args[0] : T(dout * static_cast<CT>(alpha) * exp(x));
}
Contributor

Contributor Author

No change for now.

  auto temp1 = x < static_cast<T>(threshold * -1.f);
  auto temp2 = x > static_cast<T>(threshold);
- out.device(d) = x * (temp1 + temp2).template cast<T>();
+ out.device(d) = x * (temp1 || temp2).template cast<T>();
Contributor

Wouldn't addition work just as well?

Contributor Author

The unit tests include a case where threshold is negative; there both masks are true at the same time, so with + the output is doubled. A standalone illustration follows below.
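
A scalar stand-in for the Eigen expressions above (plain C++, not Paddle code; the values are made up for the example):

```cpp
#include <cstdio>

int main() {
  float threshold = -0.5f;  // negative threshold, as in the unit test
  float x = 0.2f;
  bool temp1 = x < -threshold;  // 0.2 < 0.5  -> true
  bool temp2 = x > threshold;   // 0.2 > -0.5 -> true
  // With '+', both masks fire and the kept value is doubled:
  printf("plus: %f\n", x * static_cast<float>(temp1 + temp2));   // 0.4
  // With '||', the mask saturates at 1 and the value is kept once:
  printf("or:   %f\n", x * static_cast<float>(temp1 || temp2));  // 0.2
  return 0;
}
```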

                               ThresholdedReluFunctor,
                               ThresholdedReluGradFunctor);
REGISTER_ACTIVATION_GPU_KERNEL(hard_swish, HardSwish, HardSwishFunctor,
                               HardSwishGradFunctor);
Contributor

The registration macro is getting a bit long; let's clean it up in a follow-up.

@Xreki Xreki (Contributor) left a comment

LGTM and great work~

@Xreki Xreki merged commit eca8dcc into PaddlePaddle:develop Apr 27, 2021