Skip to content

Comments

Enable CPU on Parallel executor#11306

Merged
chengduoZH merged 13 commits intoPaddlePaddle:developfrom
chengduoZH:enable_cpu_on_pe
Jun 11, 2018
Merged

Enable CPU on Parallel executor#11306
chengduoZH merged 13 commits intoPaddlePaddle:developfrom
chengduoZH:enable_cpu_on_pe

Conversation

@chengduoZH
Copy link
Contributor

@chengduoZH chengduoZH commented Jun 8, 2018

Enable CPU on the Parallel executor.

@chengduoZH chengduoZH force-pushed the enable_cpu_on_pe branch 9 times, most recently from 0ac07d8 to a4f5fa5 Compare June 9, 2018 14:32
@chengduoZH chengduoZH force-pushed the enable_cpu_on_pe branch 7 times, most recently from 591340e to f5f039b Compare June 10, 2018 09:52
@chengduoZH chengduoZH force-pushed the enable_cpu_on_pe branch 2 times, most recently from 59681d8 to 61b14f1 Compare June 10, 2018 11:39
@chengduoZH chengduoZH force-pushed the enable_cpu_on_pe branch 3 times, most recently from f98634a to e77874c Compare June 11, 2018 03:38
Copy link
Contributor

@luotao1 luotao1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How fluid_benckmark.py benchmark CPU multi-thread training? Would you implement it in next PR?

@chengduoZH
Copy link
Contributor Author

@luotao1 Certainly.

@chengduoZH chengduoZH force-pushed the enable_cpu_on_pe branch 3 times, most recently from dd8969e to 7289a1a Compare June 11, 2018 06:47

if(WITH_GPU)
nv_library(nccl_all_reduce_op_handle SRCS nccl_all_reduce_op_handle.cc DEPS op_handle_base scope lod_tensor ddim memory
nv_library(nccl_all_reduce_op_handle SRCS all_reduce_op_handle.cc DEPS op_handle_base scope lod_tensor ddim memory
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nccl_all_reduce_op_handle --> all_reduce_op_handle ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

@PaddlePaddle PaddlePaddle deleted a comment from chengduoZH Jun 11, 2018
Copy link
Collaborator

@reyoung reyoung left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cool

However, MultiCPU version should not broadcast parameters to multi-scopes. Parameter should be shared in global.

Looking forward to next PRs.

@chengduoZH chengduoZH merged commit 183377f into PaddlePaddle:develop Jun 11, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants