TODO 1. Kernel Selection with Fallback
Our current kernel selection mechanism is defined below:

Paddle/paddle/framework/operator.cc, lines 404 to 429 at commit 7d85b6d:
```cpp
void OperatorWithKernel::Run(const Scope& scope,
                             const platform::DeviceContext& dev_ctx) const {
  RuntimeInferShapeContext infer_shape_ctx(*this, scope);
  this->InferShape(&infer_shape_ctx);
  ExecutionContext ctx(*this, scope, dev_ctx);
  // check if op[type] has kernel registered.
  auto& all_op_kernels = AllOpKernels();
  auto kernels_iter = all_op_kernels.find(type_);
  if (kernels_iter == all_op_kernels.end()) {
    PADDLE_THROW(
        "There are no kernels which are registered in the %s operator.", type_);
  }
  // check if op[type] have kernel for kernel_key
  OpKernelMap& kernels = kernels_iter->second;
  auto kernel_key = GetKernelType(ctx);
  auto kernel_iter = kernels.find(kernel_key);
  if (kernel_iter == kernels.end()) {
    PADDLE_THROW("The operator %s does not support %s", type_, kernel_key);
  }
  kernel_iter->second->Compute(ctx);
}
```
Please be aware that all our computational operators (i.e., all operators except control-flow operators like WhileOp and IfElseOp, and I/O operators like Send, Recv, ListenAndDo, and ReadFile) are derived from the base class OperatorWithKernel.
Each computational operator class has multiple kernels, each a function that targets a specific acceleration library or device, e.g., MKL or CUDA. The OperatorWithKernel::Run method posted above selects a kernel by kernel_key and runs it.
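To make the two-level lookup in Run concrete, here is a minimal, illustrative-only model of the registry: one map from operator type to that operator's per-device kernel map. The type and function names (KernelFn, HasKernel, string-valued kernel keys) are simplified stand-ins, not Paddle's actual OpKernelMap/AllOpKernels() types.

```cpp
#include <functional>
#include <map>
#include <string>
#include <unordered_map>

// Sketch only: kernel keys and op types are plain strings here.
using KernelFn = std::function<void()>;
using OpKernelMap = std::map<std::string, KernelFn>;             // kernel_key -> kernel
using AllKernels = std::unordered_map<std::string, OpKernelMap>; // op type -> kernels

// Mirrors the two checks in Run: first find the op's kernel map,
// then look for a kernel under the requested key.
bool HasKernel(const AllKernels& all, const std::string& op_type,
               const std::string& kernel_key) {
  auto it = all.find(op_type);
  if (it == all.end()) return false;        // no kernels registered for this op
  return it->second.count(kernel_key) > 0;  // kernel for this exact key?
}
```

An operator like CRFOp would appear in this registry with a single CPU entry, so a lookup under a CUDA key fails, which is exactly the case the current Run cannot recover from.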
Our current implementation assumes that all computational operators in a program run on the same device. However, this is not always true. For example, it is technically difficult to implement CRFOp on CUDA, so our CRFOp has only a CPU kernel. If we assign a program that includes CRFOp to run on a CUDA device, it crashes.
Thus we need a fallback mechanism for finding the right kernel. In particular, we need to change the system to provide a priority list of devices, e.g., [ROCm, CUDA, MKL, CPU], to a program instead of a single device, and to change the implementation of OperatorWithKernel::Run so that it takes such a priority list, then finds and runs the registered kernel of the highest priority.
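The proposed fallback could be sketched as follows. This is a hypothetical outline, not the actual Paddle implementation: Place is a plain string, kernels return a string naming the device they ran on, and RunWithFallback is an invented helper that walks the priority list and dispatches the first registered kernel.

```cpp
#include <functional>
#include <map>
#include <stdexcept>
#include <string>
#include <vector>

// Illustrative stand-ins for Paddle's place and kernel types.
using Place = std::string;
using KernelFn = std::function<std::string()>;
using OpKernelMap = std::map<Place, KernelFn>;  // place -> kernel

// Walk the priority list and run the first kernel that is registered.
// Only if no place in the list has a kernel do we fail, which is the
// behavior the current single-key lookup in Run lacks.
std::string RunWithFallback(const OpKernelMap& kernels,
                            const std::vector<Place>& priorities) {
  for (const Place& p : priorities) {
    auto it = kernels.find(p);
    if (it != kernels.end()) return it->second();  // highest-priority hit
  }
  throw std::runtime_error("no kernel registered for any requested place");
}
```

With this scheme, a CRFOp that registers only a CPU kernel still runs under the priority list [ROCm, CUDA, MKL, CPU]: the first three lookups miss and the CPU kernel is selected, instead of the crash we get today.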