Skip to content

make fleet support mpi job submit directly#18441

Merged
guru4elephant merged 1 commit intoPaddlePaddle:developfrom
guru4elephant:upgrade_fleet_transpiler
Jul 2, 2019
Merged

make fleet support mpi job submit directly#18441
guru4elephant merged 1 commit intoPaddlePaddle:developfrom
guru4elephant:upgrade_fleet_transpiler

Conversation

@guru4elephant
Copy link
Member

test=develop
support qsub to submit mpi job, depend on mpi4py.

mpirun -npernode 2 python trainer.py

will startup 2 process one for trainer, the other one for server.
A user has to use MPISymetricRoleMaker to initialize fleet.

@guru4elephant guru4elephant requested a review from seiriosPlus July 2, 2019 00:40
Copy link
Collaborator

@gavin1332 gavin1332 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@guru4elephant guru4elephant merged commit 357311f into PaddlePaddle:develop Jul 2, 2019
seiriosPlus pushed a commit to seiriosPlus/Paddle that referenced this pull request Aug 28, 2019
seiriosPlus added a commit that referenced this pull request Aug 29, 2019
* fix bug in Class MultiSlotDataGenerator's function _gen_str, test=develop (#18222)
* fix some bug when merge sparse embedding parameters, test=develop (#18223)
* fix communicator with pyreader (#18350)
* delete AllocatorFacade destructor  (#18606)
* fix distribute transpiler GRPC error code 4, RPC Deadline (#18984)
* merge pr #18441
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants