What should we do to add an MKLDNN kernel #8305

Closed
jacquesqiao opened this issue Feb 9, 2018 · 5 comments

jacquesqiao commented Feb 9, 2018

Base classes that need to be implemented:

  • MKLDNNLoDTensor, derived from LoDTensor, like MKLDNNMatrix.h in the former (v2) PaddlePaddle.

  • MKLDNNKernelBase, derived from KernelBase, like MKLDNNLayer.h in the former (v2) PaddlePaddle.

  • Tensor transform functions used to transform between LoDTensor and the MKLDNN representation. A rough sketch of all three pieces follows this list.
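
A rough sketch of how these pieces could fit together (nothing below exists yet; the type names come from the list above, and the members are only assumptions about the intent):

// Proposal sketch only -- none of these types exist yet.
// MKLDNNLoDTensor carries the MKLDNN memory layout alongside the data.
class MKLDNNLoDTensor : public framework::LoDTensor {
  // e.g. an mkldnn::memory::format describing the current layout
};

// MKLDNNKernelBase holds setup shared by all MKLDNN kernels,
// mirroring what MKLDNNLayer.h did for v2 layers.
class MKLDNNKernelBase : public framework::KernelBase {
};

// Hypothetical transform helpers between the two representations.
void TransformToMKLDNN(const framework::LoDTensor& src, MKLDNNLoDTensor* dst);
void TransformFromMKLDNN(const MKLDNNLoDTensor& src, framework::LoDTensor* dst);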

Steps to add a new kernel

Take pool_cudnn_op as an example:

  1. Add pool_cudnn_op.cu.cc.
  2. Implement PoolCUDNNOpKernel (a minimal skeleton is sketched after the registration snippet below).
  3. Register the kernel with library type CUDNN:
REGISTER_OP_KERNEL(pool2d, CUDNN, ::paddle::platform::CUDAPlace,
                   ops::PoolCUDNNOpKernel<float>,
                   ops::PoolCUDNNOpKernel<double>);
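
For step 2, the kernel is a class template whose Compute method does the actual work. A minimal skeleton (assuming the usual fluid OpKernel interface and the pool2d input/output names "X"/"Out"; the cuDNN calls themselves are elided):

template <typename T>
class PoolCUDNNOpKernel : public framework::OpKernel<T> {
 public:
  void Compute(const framework::ExecutionContext& ctx) const override {
    const Tensor* input = ctx.Input<Tensor>("X");
    Tensor* output = ctx.Output<Tensor>("Out");
    // ... set up the cuDNN pooling descriptors and run the forward pass ...
  }
};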

If we want to add a pool_mkldnn kernel, the steps would be:

  1. Add a new file, pool_mkldnn_op.cc.
  2. Implement PoolMKLDNNOpKernel (sketched below).
  3. Register PoolMKLDNNOpKernel with library type MKLDNN:
REGISTER_OP_KERNEL(pool2d, MKLDNN, ::paddle::platform::MKLDNNPlace,
                   ops::PoolMKLDNNOpKernel<float>,
                   ops::PoolMKLDNNOpKernel<double>);
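
Step 2 would look much the same as the CUDNN skeleton above, except that the kernel runs on MKLDNNPlace and drives MKLDNN instead of cuDNN (a sketch, not an existing implementation):

template <typename T>
class PoolMKLDNNOpKernel : public framework::OpKernel<T> {
 public:
  void Compute(const framework::ExecutionContext& ctx) const override {
    const Tensor* input = ctx.Input<Tensor>("X");
    Tensor* output = ctx.Output<Tensor>("Out");
    // ... create an mkldnn::pooling_forward primitive and execute it ...
  }
};
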
luotao1 added the Intel label Feb 9, 2018

mrysztow commented Mar 6, 2018

Can we move these instructions to a wiki page, link it here, and close the issue?


luotao1 commented Mar 7, 2018

Your suggestion sounds good. The related documentation is new_op_kernel_en.md. Could you move the instructions there?


mrysztow commented Mar 8, 2018

@pzelazko-intel can you validate the instructions above and, if they are still accurate, insert them into the file pointed to by @luotao1?

pzelazko-intel commented

@jacquesqiao For MKLDNN kernels we actually derive from framework::OpKernel directly. Please take a look at #8879.

Does the class representing an MKLDNN matrix have to derive from LoDTensor? Couldn't it derive from Tensor?


luotao1 commented Mar 12, 2018

Does the class representing an MKLDNN matrix have to derive from LoDTensor? Couldn't it derive from Tensor?

It should derive from LoDTensor. The reason (sorry, we don't have documentation about this yet; we will add it) is:

All of the inputs and outputs of ops are in fact LoDTensor. Ops must pass the LoD information along one by one; if any of them is not a LoDTensor, the chain is broken. For example, with A->B->C, if B doesn't use LoDTensor, C will not get the LoD information. (#3717)
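
For illustration, a kernel keeps the chain intact by explicitly forwarding the LoD from its input to its output (a minimal sketch; PassThroughKernel is hypothetical, but set_lod()/lod() are the real LoDTensor accessors):

template <typename T>
class PassThroughKernel : public framework::OpKernel<T> {
 public:
  void Compute(const framework::ExecutionContext& ctx) const override {
    auto* in = ctx.Input<framework::LoDTensor>("X");
    auto* out = ctx.Output<framework::LoDTensor>("Out");
    out->set_lod(in->lod());  // without this, C in A->B->C never sees the LoD
    // ... fill out's data from in's data ...
  }
};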

The related framework code is:

// Resolve the Tensor stored inside a Variable: every op input/output
// is either a LoDTensor or a SelectedRows, so anything else is an error.
static const Tensor* GetTensorFromVar(Variable* var) {
  if (var->IsType<LoDTensor>()) {
    return var->GetMutable<LoDTensor>();
  } else if (var->IsType<SelectedRows>()) {
    return var->GetMutable<SelectedRows>()->mutable_value();
  } else {
    PADDLE_THROW("Variable type_id %s, expect LoDTensor/SelectedRows.",
                 var->Type().name());
  }
}

static Tensor* GetMutableTensorFromVar(Variable* var) {
  if (var->IsType<LoDTensor>()) {
    return var->GetMutable<LoDTensor>();
  } else if (var->IsType<SelectedRows>()) {
    return var->GetMutable<SelectedRows>()->mutable_value();
  } else {
    PADDLE_THROW("Variable type_id %s, expect LoDTensor/SelectedRows.",
                 var->Type().name());
  }
}

// Input/Output<Tensor> are specialized so that asking for a plain Tensor
// transparently unwraps the underlying LoDTensor/SelectedRows.
template <>
const Tensor* ExecutionContext::Input<Tensor>(const std::string& name) const {
  auto* var = InputVar(name);
  return var == nullptr ? nullptr
                        : GetTensorFromVar(const_cast<Variable*>(var));
}

template <>
const std::vector<const Tensor*> ExecutionContext::MultiInput<Tensor>(
    const std::string& name) const {
  auto names = op().Inputs(name);
  std::vector<const Tensor*> res;
  res.reserve(names.size());
  std::transform(names.begin(), names.end(), std::back_inserter(res),
                 [&](const std::string& sub_name) {
                   auto var = scope_.FindVar(sub_name);
                   return var == nullptr ? nullptr : GetTensorFromVar(var);
                 });
  return res;
}

template <>
Tensor* ExecutionContext::Output<Tensor>(const std::string& name) const {
  auto var = OutputVar(name);
  return var == nullptr ? nullptr : GetMutableTensorFromVar(var);
}

template <>
std::vector<Tensor*> ExecutionContext::MultiOutput<Tensor>(
    const std::string& name) const {
  auto names = op().Outputs(name);
  std::vector<Tensor*> res;
  res.reserve(names.size());
  std::transform(names.begin(), names.end(), std::back_inserter(res),
                 [&](const std::string& sub_name) {
                   auto var = scope_.FindVar(sub_name);
                   return var == nullptr ? nullptr
                                         : GetMutableTensorFromVar(var);
                 });
  return res;
}
