The training framework of fixed point quantization.

####  Desgin
   - Add the Design doc  

PRs:
- [x] design doc:  https://github.com/PaddlePaddle/Paddle/issues/10552

####  C++  
  - fake_quantize_op
    - calculate sliding maximum value for the dequantization range.
    - quantize implementation
  - fake_dequantize_op  

PRs: 
- [x] fake_dequantize_op: https://github.com/PaddlePaddle/Paddle/pull/10965
- [x] fake_quantize_op:

#### Python transpiler:
Requirement: 
  - develop quantization transpiler to rewrite ProgramDesc to insert the fake_quantize_op and fake_dequantize_op.
    - Only insert the fake_quantize_op and fake_dequantize_op in the forward pass.
    - Do not change the inputs and outputs of the backward operator.
  - consider the batch-norm folding and quantization.

PRs:
- [x] implement training transpiler prototype #10693 
- [x] Enhance the delay quantization for training.
- [x] implement inference transpiler

#### Model verification.
  - Need to determine the baseline.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The training framework of fixed point quantization. #10551

Desgin

C++

Python transpiler:

Model verification.

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

The training framework of fixed point quantization. #10551

Description

Desgin

C++

Python transpiler:

Model verification.

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions