DL4J: Add way to load/convert models to specific datatype #7520
AlexDBlack changed the title from "DL4J: Need an efficient way to load and run pretrained FP16 models on CPU" to "DL4J: Add way to load/convert models to specific datatype" on Apr 11, 2019.
Yes, we will want this functionality. I was thinking we could have an API like:

Of course we'll want to add the equivalent methods for ComputationGraph.
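The code snippet originally attached to the comment above is missing from this copy. As a rough illustration of what such an API could look like: the linked commit message mentions `MultiLayerNetwork.convertDataType(DataType)`, so a sketch along those lines (exact signature and the model filename are assumptions, and DL4J/ND4J must be on the classpath):

```java
import org.deeplearning4j.nn.multilayer.MultiLayerNetwork;
import org.deeplearning4j.util.ModelSerializer;
import org.nd4j.linalg.api.buffer.DataType;

import java.io.File;

public class ConvertModelDtype {
    public static void main(String[] args) throws Exception {
        // Load a pretrained model (path is a placeholder)
        MultiLayerNetwork net =
                ModelSerializer.restoreMultiLayerNetwork(new File("resnet50.zip"));

        // Hypothetical one-shot conversion of all parameters to half precision,
        // returning a new network whose params and activations use FP16
        MultiLayerNetwork fp16Net = net.convertDataType(DataType.HALF);
    }
}
```

A `ComputationGraph.convertDataType(DataType)` counterpart would presumably mirror this.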
AlexDBlack added the DL4J (General DeepLearning4j issues) and SameDiff (Autodiff related issues) labels on Apr 11, 2019.
AlexDBlack added commits that referenced this issue on Apr 11, Apr 12, Apr 15, and Apr 17 (twice), 2019. This was referenced Apr 11, 2019.
* Fix BaseNDArray.equalsWithEps issue for scalars of different ranks
* #7447 Fix slice on row vector
* #7483 Remove old deserialization warnings
* #6861 SameDiff datatype validation, round 1
* #6861 SameDiff datatype validation, round 2
* #6861 SameDiff datatype validation, round 3
* More rank 2 minimum shape fixes
* Multiple test fixes after changing rank2 minimum shapes
* Test fixes
* #7520 add MultiLayerNetwork.convertDataType(DataType) + test
* Datatype cleanup and fixes
* DL4J: Fixes for global (default) vs. network datatypes
* Fix incorrect datatype when arrays (different to default dtype) are detached
* Multiple fixes, improve tests
* Test
* #7532 New network datatype configuration
* Pass network dtype to layer/vertex initialization
* Yolo datatype fixes
* More fixes, more tests
* More fixes, more tests
* Fix bug in PoolHelperVertex backprop
* Vertex dtype tests; misc fixes
* Fix for BaseReduce3Op dtype
* More fixes; finally all layers/vertices/preprocessors tested for dtypes
* Fix slices()
* Fixes - gradient check dtype issues
* Pass network dtype when constructing layers
* Pass network dtype when constructing vertices
* Layer dtype/casting fixes
* Various fixes
* Fix Shape.elementWiseStride for 1d view case
* #7092 INDArray.get(point,x)/get(x,point) returns 1d array
* More 1d getRow/getCol fixes
* Indexing/sub-array fixes
* More test and indexing fixes
* More test fixes, add getRow(i,keepDim) and getColumn(i,keepDim)
* More indexing/test fixes
* More fixes
* More fixes
* More fixes
* #7550 Evaluation dtype tests + fixes
* Nd4j.gemm result dtype fix
* Next round of fixes
* Even more dtype fixes...
* Datavec and more DL4J fixes
* Next round of fixes
* DL4J cuDNN helpers - dtype improvements/fixes
* Another round of fixes
* Datavec fixes
* DL4J fixes
* Keras/Spark/elementwisevertex fixes
* Final (hopefully) fixes
* Last set of fixes
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
Issue Description
I was looking for a way to load a pretrained FP16 Resnet50 model and run it on CPU. There is no direct way to do that, only a roundabout way, as suggested by @raver119 on Gitter chat (4/10/2019).
@raver119 suggested:
"to get resnet on fp16 right now you'll probably have to cast params to fp16, and assign these params to your model
but i think we should improve that
cc @AlexDBlack ^^^
there's also support for bfloat16 planned
it's already available in c++, just wasn't introduced to java yet
but params of your nn is just INDArray
but you'd better file an issue
and we'll provide convenient method to do that
we'll need to do that for quantized types anyway"
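The workaround @raver119 describes (cast the parameters to FP16, then assign them back to the model) can be sketched roughly as follows. This is an untested sketch, not the project's recommended approach; it assumes DL4J/ND4J on the classpath, "resnet50.zip" is a placeholder path, and whether the network then actually runs its forward pass in FP16 is exactly the gap this issue asks to close:

```java
import org.deeplearning4j.nn.multilayer.MultiLayerNetwork;
import org.deeplearning4j.util.ModelSerializer;
import org.nd4j.linalg.api.buffer.DataType;
import org.nd4j.linalg.api.ndarray.INDArray;

import java.io.File;
import java.util.Map;

public class Fp16Workaround {
    public static void main(String[] args) throws Exception {
        // Load the pretrained model (placeholder path)
        MultiLayerNetwork net =
                ModelSerializer.restoreMultiLayerNetwork(new File("resnet50.zip"));

        // paramTable() maps parameter names (e.g. "0_W") to their INDArrays;
        // cast each one to half precision and assign it back
        for (Map.Entry<String, INDArray> e : net.paramTable().entrySet()) {
            INDArray half = e.getValue().castTo(DataType.HALF);
            net.setParam(e.getKey(), half);
        }
    }
}
```

A built-in `convertDataType`-style method (as proposed above in this issue) would replace this per-parameter loop and also handle the network's configured datatype consistently.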