Add CTC align op #7527
```cpp
auto stream = ctx.cuda_device_context().stream();
ArgmaxCudaKernel<T, PADDLE_CUDA_NUM_THREADS><<<
    num_tokens, PADDLE_CUDA_NUM_THREADS, 0, stream>>>(seq_width, logits,
                                                      tokens);
```
Is this kernel computing the top 1? If so, it could reuse the top_k_op implementation.
You are right. I will remove the argmax computation from both the CPU and GPU kernels.
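As background to the exchange above: taking the top 1 along the class axis is just an argmax, i.e. the k = 1 special case of top-k, which is why the dedicated kernel is redundant. A quick numpy check of the equivalence (illustrative only, not the Paddle kernel):

```python
import numpy as np

# Toy logits: 2 time steps x 3 classes (made-up values).
logits = np.array([[0.1, 0.7, 0.2],
                   [0.5, 0.3, 0.2]])

# top-1 via argmax.
top1 = np.argmax(logits, axis=1)

# top-k with k = 1: partition on negated logits so the largest
# value lands in the first k positions, then squeeze the k axis.
k = 1
topk = np.argpartition(-logits, k - 1, axis=1)[:, :k].squeeze(axis=1)

assert (top1 == topk).all()  # top-1 == top-k with k = 1
```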
Please create an issue and add it to https://github.com/PaddlePaddle/Paddle/projects/39
```cpp
AddInput("Input",
         "(LoDTensor, default: LoDTensor<float>), the unscaled "
         "probabilities of variable-length sequences, which is a 2-D "
         "Tensor with LoD information. Its shape is "
```
```python
result = []
for token in np.argmax(softmax, axis=1):
    if (token != blank) and not (merge_repeated and token == prev_token):
        result.append(token)
```
Should there be a `prev_token = token` line at the end of the loop body?
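The reference loop above never updates `prev_token`, so `merge_repeated` has no effect. A minimal corrected sketch of the greedy (best-path) decode, assuming a 2-D `softmax` array of per-step class probabilities and an integer `blank` id:

```python
import numpy as np

def ctc_greedy_decode(softmax, blank, merge_repeated=True):
    """Best-path CTC decode: argmax per time step, optionally
    collapse consecutive repeats, then drop blanks."""
    result = []
    prev_token = -1  # sentinel: no previous token yet
    for token in np.argmax(softmax, axis=1):
        if (token != blank) and not (merge_repeated and token == prev_token):
            result.append(int(token))
        prev_token = token  # the update flagged in the review
    return result
```

For example, an argmax path of `[1, 1, 0, 2, 2]` with `blank=0` decodes to `[1, 2]` when `merge_repeated=True` and to `[1, 1, 2, 2]` when it is `False`.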
```python
def test_check_output(self):
    self.check_output()
```
Please add another test case for `merge_repeated = False`.
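Such a case could be sketched against a numpy-style reference as below; the class and function names here are illustrative, not the actual OpTest subclass from the PR:

```python
import unittest

def ctc_align_ref(tokens, blank, merge_repeated):
    """Reference: optionally collapse repeats, then strip blanks."""
    result = []
    prev_token = -1
    for token in tokens:
        if (token != blank) and not (merge_repeated and token == prev_token):
            result.append(int(token))
        prev_token = token
    return result

class TestCTCAlignNoMerge(unittest.TestCase):
    def test_merge_repeated_false(self):
        tokens = [0, 1, 1, 0, 2, 2, 0]
        # With merging disabled, repeated non-blank tokens survive.
        self.assertEqual(
            ctc_align_ref(tokens, blank=0, merge_repeated=False),
            [1, 1, 2, 2])
        # With merging enabled they collapse to a single token.
        self.assertEqual(
            ctc_align_ref(tokens, blank=0, merge_repeated=True),
            [1, 2])
```

The case can be run with `python -m unittest` against the module containing it.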
```cpp
CTCGreedyDecodeOpMaker(OpProto* proto, OpAttrChecker* op_checker)
    : OpProtoAndCheckerMaker(proto, op_checker) {
  AddInput("Input",
           "(LoDTensor, default: LoDTensor<float>), the unscaled "
```
```cpp
           "merge repeated elements between two blanks. ")
      .SetDefault(true);
  AddComment(R"DOC(
CTCGreedyDecoder is an implementation of the simple best path decoding
```
The documentation here needs more detail.
1. Remove 'top 1' (argmax) from the CPU and GPU kernels.
2. Add a new test case.
3. Refine the doc.
Please keep the name.
```cpp
auto stream = ctx.cuda_device_context().stream();
MergeAndDelCudaKernel<T><<<1, 1, 0, stream>>>(
    num_tokens, tokens, num_seq, input_lod[level].data(), blank,
    merge_repeated, dev_out_lod0_ptr, output_data);
```
The CUDA kernel is less efficient here. We can profile the speed when training the model, then decide whether to delete the GPU kernel in this op and in the edit-distance op.
1. Allocate memory for the output before computing.
2. Rename 'ctc_decode' to 'ctc_align'.
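For reference, the per-batch merge-and-delete step that the single-thread kernel performs can be sketched in Python (assuming a flat `tokens` array and level-0 LoD offsets; names are illustrative):

```python
def merge_and_del(tokens, lod0, blank, merge_repeated):
    """Sequential merge/delete over a LoD batch.

    tokens : flat list of argmax token ids for all sequences
    lod0   : level-0 offsets, e.g. [0, 4, 7] for two sequences
    Returns (output_tokens, output_lod0).
    """
    output, out_lod0 = [], [0]
    for seq in range(len(lod0) - 1):
        prev_token = -1
        for t in range(lod0[seq], lod0[seq + 1]):
            token = tokens[t]
            if token != blank and not (merge_repeated and token == prev_token):
                output.append(token)
            prev_token = token
        out_lod0.append(len(output))  # running prefix sum of output lengths
    return output, out_lod0
```

Each output offset depends on the lengths of all earlier sequences, which is why the kernel above runs with a single thread; computing the output LoD as a parallel prefix sum would be one way to parallelize it.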
Force-pushed from a1cdeb0 to 6089b50.
Have removed the debug code.