Add sequence reshape operator #7662
Conversation
```cpp
namespace ops = paddle::operators;
REGISTER_OP_CUDA_KERNEL(
    sequence_reshape,
    ops::SequenceReshapeKernel<paddle::platform::CUDADeviceContext, float>);
```
You also need to register the double type for sequence_reshape.
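The fix the reviewer asks for would look like the following. This is a sketch based on the diff above (not compiled here); `REGISTER_OP_CUDA_KERNEL` accepts multiple kernel instantiations, so the double kernel can be added alongside the float one:

```cpp
// Register both float and double CUDA kernels (sketch, mirroring the diff):
namespace ops = paddle::operators;
REGISTER_OP_CUDA_KERNEL(
    sequence_reshape,
    ops::SequenceReshapeKernel<paddle::platform::CUDADeviceContext, float>,
    ops::SequenceReshapeKernel<paddle::platform::CUDADeviceContext, double>);
```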
```
then out is a LoDTensor:
    out.lod  = [[0, 1, 3]]
    out.data = [[0.1, 0.2, 0.3, 0.4],
                [0.5, 0.6, 0.7, 0.8],
                [0.9, 1.0, 1.1, 1.2]]
```
The example data can be written as integers so that it is easier to read.
```cpp
          "to 0 after reshaped.",
          i + 1);
      out_lod[0].push_back(out_lod[0].back() + offset);
    }
```
I think that lines 50~64 should be put in InferShape. This code belongs to input-data validity checking.
I think it's ok to do this in the kernel.
```cpp
auto x_dims = ctx->GetInputDim("X");
PADDLE_ENFORCE_EQ(x_dims.size(), 2U, "Rank of Input(X) should be 2.");
int dimension = ctx->Attrs().Get<int>("new_dim");
ctx->SetOutputDim("Out", {x_dims[0], static_cast<int64_t>(dimension)});
```
The output dim may not be {x_dims[0], dimension}, and the output dim can be computed in InferShape.
```cpp
auto& out_lod = *out->mutable_lod();
out_lod.resize(1);
out_lod[0].clear();
out_lod[0].push_back(0);
```
What if out_width equals in_dims[1]?
```cpp
                     p_in_data + in_offset, bytes, dev_ctx.stream());
#endif
    }
  }
```
From the description in the example, you only need to copy the input to the output and reset out_lod and out_dim; it does not need to be this complex.
```cpp
}

out->mutable_data<T>(context.GetPlace());
framework::Copy(*in, context.GetPlace(), out);
```
Line 65 can be moved to line 40, and out->mutable_data<T>(context.GetPlace()); can then be removed.
It seems Copy will invoke mutable_data on the destination tensor, so L64 is not necessary.
```cpp
} else {
  auto& out_lod = *out->mutable_lod();
  out_lod.resize(1);
  out_lod[0].clear();
```
From the C++ reference for push_back: "this effectively increases the container size by one, which causes an automatic reallocation of the allocated storage space if - and only if - the new vector size surpasses the current vector capacity."
You can replace out_lod[0].clear(); with out_lod[0].resize(seq_num); to avoid repeated reallocations in the loop.
```cpp
op_desc_ptr->SetOutput(framework::GradVarName("X"), InputGrad("X"));
op_desc_ptr->SetAttrMap(Attrs());
return std::unique_ptr<framework::OpDesc>(op_desc_ptr);
}
```
I don't think you need to override Apply. You can use the default xxxGradOpMaker.
You can refer to this, and register the op with REGISTER_OP.
I think adding a GradOpMaker explicitly is not harmful.
Yes, I agree with you.
But it seems unnecessary, and sequence_reshape_op should be consistent with the other ops.
I think Apply should only be overridden in complex ops such as while_op and recurrent_op, because the default GradOpMaker does not meet those ops' needs.
The default GradOpMaker will make the prototxt contain many unnecessary variables.
I see, it is helpful for memory optimization.
Resolves #6678