Skip to content

TRT: Support set dynamic range in int8 mode.#17524

Merged
NHZlX merged 7 commits intoPaddlePaddle:developfrom
NHZlX:feature/align_fluid_quant_trt_int8
May 25, 2019
Merged

TRT: Support set dynamic range in int8 mode.#17524
NHZlX merged 7 commits intoPaddlePaddle:developfrom
NHZlX:feature/align_fluid_quant_trt_int8

Conversation

@NHZlX
Copy link
Contributor

@NHZlX NHZlX commented May 21, 2019

Align fluid int8 training and trt int8 inference .

We first do this on resnet50 model and alignment is made on accuracy and speed.

Env: p4 card
Dataset: imagenet

resnet50 speed accuracy
fp32 5.28776ms 76.35%
int8 2.22264ms 76.26%

image

NHZlX added 6 commits May 7, 2019 05:20
trt int8 predict init
op converter
enhance quant dequant fuse pass
enhance op converter, trt engine, trt engine op, trt subgraph pass.
@NHZlX NHZlX requested review from Superjomn and wzzju May 21, 2019 09:29
@NHZlX NHZlX force-pushed the feature/align_fluid_quant_trt_int8 branch 3 times, most recently from 52d088c to 4b2366f Compare May 22, 2019 05:51
fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
test=develop
@NHZlX NHZlX force-pushed the feature/align_fluid_quant_trt_int8 branch from 4b2366f to 0475ecb Compare May 22, 2019 08:39
Copy link
Contributor

@Superjomn Superjomn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@NHZlX NHZlX merged commit 61221eb into PaddlePaddle:develop May 25, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants