#### Design

- Add the design doc.

PRs:
- [x] design doc: https://github.com/PaddlePaddle/Paddle/issues/10552

#### C++

- fake_quantize_op
  - Calculate the sliding maximum value for the dequantization range.
  - Quantize implementation.
- fake_dequantize_op

PRs:
- [x] fake_dequantize_op: https://github.com/PaddlePaddle/Paddle/pull/10965
- [x] fake_quantize_op:

#### Python transpiler:

Requirement:
- Develop a quantization transpiler that rewrites the ProgramDesc to insert fake_quantize_op and fake_dequantize_op.
- Only insert fake_quantize_op and fake_dequantize_op in the forward pass.
- Do not change the inputs and outputs of the backward operators.
- Consider batch-norm folding and quantization.

PRs:
- [x] Implement training transpiler prototype #10693
- [x] Enhance the delay quantization for training.
- [x] Implement inference transpiler

#### Model verification.

- Need to determine the baseline.
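
For reference, the quantize/dequantize pair listed under the C++ items can be sketched in plain Python. This is only an illustration and not the Paddle kernel: the class `SlidingMaxFakeQuantizer`, the function `fake_dequantize`, the `window` parameter, and the reading of "sliding maximum" as the largest absolute value over the last few batches are all assumptions made for this example.

```python
from collections import deque


class SlidingMaxFakeQuantizer:
    """Hypothetical helper: symmetric fake quantization whose scale is the
    maximum absolute value seen over the last `window` batches (assumed
    interpretation of the 'sliding maximum')."""

    def __init__(self, bits=8, window=3):
        self.qmax = (1 << (bits - 1)) - 1      # 127 for 8 bits
        self.history = deque(maxlen=window)    # per-batch abs-max values

    def quantize(self, values):
        # Record this batch's abs-max, then take the max over the window.
        self.history.append(max(abs(v) for v in values))
        scale = max(self.history)
        if scale == 0.0:
            return [0 for _ in values], scale
        return [int(round(v / scale * self.qmax)) for v in values], scale


def fake_dequantize(quantized, scale, qmax=127):
    """Map integer levels back to floats using the same scale."""
    return [q * scale / qmax for q in quantized]


quantizer = SlidingMaxFakeQuantizer(bits=8, window=2)
levels, scale = quantizer.quantize([1.0, -2.0, 0.3])
restored = fake_dequantize(levels, scale)
```

In a training transpiler these two steps would sit back to back in the forward pass, so the network sees the rounding error while the backward operators keep their original inputs and outputs.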