add new post-quant methods #32208

Merged

wanghaoshuang merged 3 commits into PaddlePaddle:develop from XGZhang11:add_ptq_methods

Apr 14, 2021

Contributor

XGZhang11 commented Apr 12, 2021

PR types

Performance optimization

PR changes

APIs

Describe

1. Add new methods 'mse', 'hist', and 'avg' for computing the threshold values of activations in post-training quantization, in 'post_training_quantization.py'.
2. Add the bias correction method from https://arxiv.org/abs/1810.05723 for post-training quantization, in 'quantization_pass.py'. Bias correction adjusts the quantized weights using the formulation given in that paper.
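
The idea behind bias correction can be sketched in NumPy roughly as follows. This is a minimal illustration, not the code in this PR: it assumes symmetric per-output-channel quantization along axis 0 with non-zero scales, and the helper names `quantize_per_channel`, `dequantize_per_channel`, and `bias_correct` are hypothetical. Each channel of the dequantized weights is shifted and rescaled so that its mean and standard deviation match the original weights, and the result is quantized again.

```python
import numpy as np


def quantize_per_channel(w, scales, bits=8):
    """Symmetric quantization along axis 0 (hypothetical helper)."""
    bnt = (1 << (bits - 1)) - 1  # largest integer level, e.g. 127 for 8 bits
    flat = w.reshape(w.shape[0], -1)
    s = np.asarray(scales, dtype=np.float64).reshape(-1, 1)
    q = np.round(np.clip(flat, -s, s) / s * bnt)
    return q.reshape(w.shape)


def dequantize_per_channel(q, scales, bits=8):
    """Map integer levels back to real values (hypothetical helper)."""
    bnt = (1 << (bits - 1)) - 1
    s = np.asarray(scales, dtype=np.float64)
    s = s.reshape((-1,) + (1,) * (q.ndim - 1))
    return q * s / bnt


def bias_correct(w, scales, bits=8, eps=1e-8):
    """Re-quantize weights after matching per-channel mean and std."""
    q = quantize_per_channel(w, scales, bits)
    w_deq = dequantize_per_channel(q, scales, bits)

    flat = w.reshape(w.shape[0], -1)
    flat_deq = w_deq.reshape(w.shape[0], -1)

    # Per-channel mean of the quantization error.
    mean_shift = (flat - flat_deq).mean(axis=1, keepdims=True)
    # Per-channel ratio of original to dequantized standard deviation.
    std_ratio = flat.std(axis=1, keepdims=True) / (
        flat_deq.std(axis=1, keepdims=True) + eps)

    corrected = ((flat_deq + mean_shift) * std_ratio).reshape(w.shape)
    return quantize_per_channel(corrected, scales, bits)
```

With 6-bit weights, for example, `bias_correct(w, scales, bits=6)` returns integer levels in the range -31 to 31, where `scales` holds one absolute-maximum value per output channel.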

Experimental results on 6-bit MobileNetV1, calibrated with one batch of 32 images:

Method          Without bias correction    With bias correction
abs_max         44.99                      47.91
avg             56.34                      56.17
mse             61.32                      61.42
hist (0.9999)   60.72                      62.44
KL              53.11                      58.46

Time cost: abs_max and avg take about 1 minute; hist and KL take about 6 minutes; mse takes about 10 minutes. Bias correction adds little time.


          add new post-quant methods

7ab63d8

paddle-bot-old bot commented Apr 12, 2021

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

wanghaoshuang requested review from juncaipeng, qingqing01 and wanghaoshuang

April 12, 2021 09:50

Contributor

wanghaoshuang commented Apr 13, 2021

Please add unit tests for the newly added code.

juncaipeng reviewed

Contributor

juncaipeng left a comment

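Please add the corresponding unit tests; otherwise the code coverage requirement will not be met. See the unit test examples under ../tests/ for reference.

It would be best to post the experimental data for the different methods in the PR, including the accuracy of the quantized models and the time taken by the quantization process.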

python/paddle/fluid/contrib/slim/quantization/post_training_quantization.py Outdated

    
    batch_size=10,
    batch_nums=None,
    algo="KL",
    hist_perc=0.99999,

Contributor

juncaipeng Apr 12, 2021

Suggest using the full name, hist_percent.
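
For context, a percentile such as 0.99999 is typically used to pick the clipping threshold from a histogram of absolute activation values: the threshold is the bin edge below which that fraction of the values falls. A minimal NumPy sketch under that assumption follows; the function name `hist_threshold` and the bin count are hypothetical, and the implementation in this PR may differ.

```python
import numpy as np


def hist_threshold(values, hist_percent=0.99999, bins=2048):
    """Return the upper edge of the first bin where the CDF reaches hist_percent."""
    abs_vals = np.abs(np.asarray(values, dtype=np.float64)).ravel()
    abs_max = float(abs_vals.max())
    if abs_max == 0.0:
        return 0.0  # all-zero tensor: nothing to clip

    hist, edges = np.histogram(abs_vals, bins=bins, range=(0.0, abs_max))
    cdf = np.cumsum(hist) / hist.sum()

    # First bin index whose cumulative fraction is >= hist_percent.
    idx = min(int(np.searchsorted(cdf, hist_percent)), bins - 1)
    return float(edges[idx + 1])
```

A lower percentile clips more aggressively, which discards outliers at the cost of saturating the largest activations.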

python/paddle/fluid/contrib/slim/quantization/post_training_quantization.py Outdated

    
    hist_perc=0.99999,
    quantizable_op_type=["conv2d", "depthwise_conv2d", "mul"],
    is_full_quantize=False,
    bias_correct=False,

Contributor

juncaipeng Apr 12, 2021

bias_correction?

python/paddle/fluid/contrib/slim/quantization/post_training_quantization.py Outdated

    
        if self._batch_nums and batch_id >= self._batch_nums:
            break
    if self._algo == 'avg':

Contributor

juncaipeng Apr 12, 2021

This is threshold-retrieval logic; it belongs in the later part where the thresholds are computed.

python/paddle/fluid/contrib/slim/quantization/post_training_quantization.py Outdated

    
    if self._algo == "abs_max":
        self._sample_abs_max()
    if self._algo in ["avg", "abs_max"]:
        self._sample_abs_max_avg()

Contributor

juncaipeng Apr 12, 2021

These are two different sampling methods; split them into two functions.
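
To make the distinction concrete, here is a minimal sketch of the two statistics as separate functions. It assumes the activation tensor has the batch dimension first, as the `avg` computation elsewhere in this PR does; the function names are hypothetical and this is not the code that was merged.

```python
import numpy as np


def sample_abs_max(var_tensor):
    """Largest absolute value over the whole tensor."""
    return float(np.max(np.abs(var_tensor)))


def sample_abs_max_avg(var_tensor):
    """Mean, over the samples in the batch, of each sample's largest absolute value."""
    per_sample = np.abs(var_tensor.reshape(var_tensor.shape[0], -1))
    return float(np.mean(np.max(per_sample, axis=1)))
```

For a batch `[[1, -4], [2, 3]]`, the first function gives 4.0 and the second gives 3.5, so a single outlier sample has less influence on the averaged statistic.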

python/paddle/fluid/contrib/slim/quantization/post_training_quantization.py Outdated

    
        if mse_loss <= self._best_mse_loss[var_name]:
            self._best_mse_loss[var_name] = mse_loss
            best_scale = scale
    if best_scale > 0.0:

Contributor

juncaipeng Apr 12, 2021

This check is unnecessary; self._quantized_threshold[var_name] = best_scale can be moved inside the if mse_loss <= self._best_mse_loss[var_name]: block.
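
The suggested simplification can be sketched as a stand-alone function. This is an illustration, not the PR's code: the candidate range (0.3 to 1.0 of the absolute maximum in steps of 0.02), the symmetric clipping, and the name `search_mse_threshold` are assumptions. The point is that the threshold is recorded in the same branch that updates the best loss, so no separate `best_scale > 0.0` check is needed.

```python
import numpy as np


def search_mse_threshold(var_tensor, bits=8):
    """Pick the clipping scale that minimizes quantize-dequantize MSE."""
    x = np.asarray(var_tensor, dtype=np.float64).ravel()
    abs_max = float(np.max(np.abs(x)))
    if abs_max == 0.0:
        abs_max = 1e-8  # avoid dividing by zero on an all-zero tensor

    bnt = (1 << (bits - 1)) - 1
    best_loss = float("inf")
    threshold = abs_max

    for ratio in np.arange(0.3, 1.0 + 1e-9, 0.02):
        scale = ratio * abs_max
        # Simulate quantization followed by dequantization at this scale.
        quant_dequant = np.round(np.clip(x, -scale, scale) / scale * bnt) / bnt * scale
        mse_loss = float(np.mean((x - quant_dequant) ** 2))
        if mse_loss <= best_loss:
            best_loss = mse_loss
            threshold = scale  # recorded together with the best loss

    return threshold
```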

python/paddle/fluid/contrib/slim/quantization/post_training_quantization.py Outdated

    
    save_info(
        op_node, out_var_name, self._quantized_threshold,
        argname_index[0] + str(argname_index[1]) + "_threshold",
        "post_absmax")

Contributor

juncaipeng Apr 12, 2021

The three methods differ slightly; suggest handling them separately.

python/paddle/fluid/contrib/slim/quantization/post_training_quantization.py Outdated

    
    if self._algo == 'avg':
        if (var_name not in self._quantized_var_avg):
            self._quantized_var_avg[var_name] = []
        abs_avg_value = float(np.mean(np.max(np.abs(var_tensor.reshape(var_tensor.shape[0], -1)), axis=(1))))

Contributor

juncaipeng Apr 12, 2021

Mind the length of each line of code.

python/paddle/fluid/contrib/slim/quantization/quantization_pass.py Outdated

    
      quantized_param_v = self._quant(
-         param_v, scale_v, self._weight_bits, quant_axis)
+         param_v.copy(), scale_v, self._weight_bits, quant_axis)
+     if self._bias_correct == True:

Contributor

juncaipeng Apr 13, 2021

Implement the bias_correction functionality as a separate function.

python/paddle/fluid/contrib/slim/quantization/quantization_pass.py Outdated

    
    if isinstance(scale_v, list):
        if quant_axis == 0:
            for i, s in enumerate(scale_v):
                quantized_param_v[i] = quantized_param_v[i] * s / bnt

Contributor

juncaipeng Apr 13, 2021

At this point this is actually dequantized_param_v.


          add new methods and tests

216e946

XGZhang11 requested a review from juncaipeng

April 13, 2021 09:39


          code style changed

724e56c

wanghaoshuang approved these changes

juncaipeng approved these changes

Contributor

juncaipeng left a comment

LGTM

wanghaoshuang merged commit 4281eb4 into PaddlePaddle:develop

Labels

None yet