Enhance gc to support deleting tensor buffer in advance#16409

Merged
sneaxiy merged 7 commits into PaddlePaddle:develop from sneaxiy:feature/advance_gc
Mar 27, 2019
Conversation

Collaborator

@sneaxiy sneaxiy commented Mar 23, 2019

For example, gc can collect the data buffer of input Y of elementwise_add_grad op before this op runs. Meanwhile, the shape and lod of Y are kept alive when elementwise_add_grad op runs.

Op developers can use DECLARE_NO_NEED_BUFFER_VARS_INFERENCE to declare a class that indicates which input tensors' buffers are not needed.

For example, the data buffers of concat op inputs x0, x1, x2 can be deleted before concat_grad op runs.
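The core idea — free a tensor's data buffer early while keeping its shape/lod metadata alive so the grad op can still infer dims — can be sketched as a standalone C++ miniature (illustrative types only, not the actual framework code):

```cpp
#include <cassert>
#include <cstdint>
#include <memory>
#include <vector>

// Illustrative miniature (not framework code): a tensor whose data
// buffer can be collected early while shape metadata stays alive.
struct MiniTensor {
  std::vector<int64_t> dims;        // shape/lod metadata, kept alive
  std::unique_ptr<float[]> buffer;  // data buffer, may be freed early

  int64_t numel() const {
    int64_t n = 1;
    for (int64_t d : dims) n *= d;
    return n;
  }
};

// gc frees only the data buffer; metadata survives.
inline void CollectBuffer(MiniTensor* t) { t->buffer.reset(); }

// A grad op like elementwise_add_grad only needs Y's shape to size
// the gradient, so it still works after Y's buffer was collected.
inline std::vector<int64_t> InferGradDims(const MiniTensor& y) {
  return y.dims;
}
```

This is why the PR can delete no-need-buffer inputs before the grad op runs without breaking shape inference.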

@sneaxiy sneaxiy force-pushed the feature/advance_gc branch 5 times, most recently from dbb80d6 to 134e9a3 Compare March 24, 2019 03:22
refine gc code
test=develop
@sneaxiy sneaxiy force-pushed the feature/advance_gc branch from 134e9a3 to a93a9ee Compare March 24, 2019 08:15
test=develop
@sneaxiy sneaxiy force-pushed the feature/advance_gc branch 4 times, most recently from 7b675aa to 493cd06 Compare March 25, 2019 07:56
test=develop
@sneaxiy sneaxiy force-pushed the feature/advance_gc branch 4 times, most recently from 06b1fd6 to adf5e09 Compare March 25, 2019 12:12
@sneaxiy sneaxiy requested review from chengduoZH and panyx0718 March 25, 2019 12:35
sneaxiy added 2 commits March 26, 2019 04:05
fix ctest eager deletion disable bug
test=develop
@sneaxiy sneaxiy force-pushed the feature/advance_gc branch from adf5e09 to 7b72c11 Compare March 26, 2019 06:04
@sneaxiy sneaxiy requested a review from liupluswei March 26, 2019 06:09
@sneaxiy sneaxiy force-pushed the feature/advance_gc branch from 7b72c11 to 796cc2c Compare March 26, 2019 07:17
@sneaxiy sneaxiy force-pushed the feature/advance_gc branch from 796cc2c to 78fb3a6 Compare March 26, 2019 09:30
template <typename T>
struct OpInfoFiller<T, kNoNeedBufferVarsInference> {
  void operator()(const char* op_type, OpInfo* info) const {
    info->infer_no_need_buffer_vars_ = [](const VariableNameMap& inputs,
Contributor

Curious about when these three parameters will be used to get the NoNeedBufferVars; it seems that for now we just return the parameters specified in the macro as an unordered_set?

Collaborator Author
@sneaxiy sneaxiy Mar 27, 2019

I reserved these parameters for future use. Some ops may not need some forward inputs or outputs when a certain attribute is true/false. For example, batch_norm_grad_op does not need Bias when use_mkldnn is false.
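Such an attribute-dependent inference could look like the following standalone sketch (the function name and signature are hypothetical, not the actual framework interface):

```cpp
#include <cassert>
#include <map>
#include <string>
#include <unordered_set>

// Hypothetical sketch: an attribute-dependent no-need-buffer
// inference that marks Bias as not needed only when use_mkldnn
// is false, matching the batch_norm_grad example above.
using AttrMap = std::map<std::string, bool>;

inline std::unordered_set<std::string> BatchNormGradNoNeedBufferVars(
    const AttrMap& attrs) {
  std::unordered_set<std::string> result;
  auto it = attrs.find("use_mkldnn");
  const bool use_mkldnn = (it != attrs.end()) && it->second;
  if (!use_mkldnn) {
    result.insert("Bias");  // Bias buffer not needed on this code path
  }
  return result;
}
```

The reserved inputs/outputs parameters would let similar logic also inspect which variables are actually connected to the op.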

static constexpr OpInfoFillType kFillType = kType;
};

using OpRegistryClasses = std::tuple< // NOLINT
Collaborator Author
@sneaxiy sneaxiy Mar 27, 2019

Ugly but scalable code here. I rewrote the OpInfoFillTypeID::ID() method because the line length limit is 80 characters.

if (ctx->HasOutput(framework::GradVarName("X"))) {
  auto out_dims = ctx->GetInputDim(framework::GradVarName("Out"));
  ctx->SetOutputDim(framework::GradVarName("X"), out_dims);
}
Contributor

Maybe the above code is confusing: if ctx->HasOutput(framework::GradVarName("X")) is false, there is no need to call AddPositionEncodingOpGrad at all.

Collaborator Author

I am confused by this code too. I just followed the original logic.

chengduoZH
chengduoZH previously approved these changes Mar 27, 2019
++iter) {
bool ok;
auto result =
ExtractComputationOpFromLastLivedVar(*iter, i, shrink_func, &ok);
Contributor

Actually, I am a little bit confused about the ExtractComputationOpFromLastLivedVar function. I know it wants to get the last op which uses this variable, but in the reference-count part, do we need to care about which op generates this variable? What is the impact if we only care about the ops that use this variable (i.e., where the variable appears as an input)?

Collaborator Author
@sneaxiy sneaxiy Mar 27, 2019

ExtractComputationOpFromLastLivedVar returns (1) the last ops which read this variable, if any reading op exists, or (2) the last op which writes this variable, if no reading op exists. See HERE for details.
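As a standalone miniature of this rule (illustrative names only; the real pass may keep several "last" readers at the same dependency level):

```cpp
#include <cassert>
#include <string>
#include <vector>

// Illustrative miniature of the rule described above: a variable's
// buffer may be freed after its last reader, or, if nothing reads
// it, after its last writer.
struct VarUsage {
  std::vector<std::string> readers;  // ops reading the var, in order
  std::vector<std::string> writers;  // ops writing the var, in order
};

// Returns the op(s) after which the variable's buffer may be freed.
inline std::vector<std::string> LastLivedOps(const VarUsage& u) {
  if (!u.readers.empty()) return {u.readers.back()};
  if (!u.writers.empty()) return {u.writers.back()};
  return {};
}
```

The writer case matters for variables that are produced but never read, which would otherwise never be collected.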

continue;
}

for (auto &out_pair : op_base->Outputs()) {
Contributor

Do we need to care about outputs here?

Collaborator Author

Yes, we should care about outputs because of in-place operations.
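The in-place concern can be shown with a small standalone sketch (hypothetical op, not framework code): an in-place op makes its output alias the input's buffer, so a reference count built only from inputs would free the buffer while the output still needs it.

```cpp
#include <cassert>
#include <memory>

// Illustrative miniature: an in-place op's output shares the
// input's storage, so scanning op_base->Outputs() is needed to
// count this alias in the reference count.
struct Var {
  std::shared_ptr<int> buffer;
};

// Hypothetical in-place op: the output aliases the input's buffer.
inline Var RunInplaceOp(const Var& in) { return Var{in.buffer}; }
```

Counting only the input occurrence would drop the buffer's count to zero once the input dies, even though the aliased output is still live.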

@sneaxiy sneaxiy merged commit c7c6eeb into PaddlePaddle:develop Mar 27, 2019
3 participants