dl4j java.lang.RuntimeException: Can't allocate [HOST] memory && java.lang.OutOfMemoryError: Physical memory usage is too high #4335

Closed
kfiring opened this issue Nov 28, 2017 · 5 comments
Labels: Bug

kfiring commented Nov 28, 2017

Issue Description

I'm training an RNN on a 4-GPU machine, but I get a "java.lang.RuntimeException: Can't allocate [HOST] memory: 1997324; threadId: 35" error when workspaces are enabled. With workspaces disabled, training works fine.

Environment Information

CPU: 2 * 8 cores
Memory: 64 GB
GPU: 4 * GeForce GTX 1080 Ti (11 GB RAM each)
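
A quick check that may help narrow this down: the startup log below reports `Memory: [7.1GB]`, far less than the 64 GB of physical RAM, so it seems worth confirming which limits the JVM is actually running with. The sketch below prints the JVM max heap and two JavaCPP system properties (`org.bytedeco.javacpp.maxbytes` and `org.bytedeco.javacpp.maxphysicalbytes`) that, as far as I understand, bound off-heap and physical memory use. The class and helper names are hypothetical. That these limits are the cause of the error here is an assumption, not something the report confirms.

```java
import java.util.Locale;

// Diagnostic sketch (hypothetical class name): prints the memory limits the
// current JVM was started with. It does not change any setting.
final class MemoryLimits {

    /** Formats a byte count as GiB with two decimals. */
    static String gib(long bytes) {
        return String.format(Locale.ROOT, "%.2f GiB", bytes / (1024.0 * 1024.0 * 1024.0));
    }

    /** Returns the value of a system property, or "(not set)" if it is absent. */
    static String propertyOrUnset(String key) {
        String value = System.getProperty(key);
        return value == null ? "(not set)" : value;
    }

    public static void main(String[] args) {
        // Upper bound of the Java heap (controlled by -Xmx).
        System.out.println("JVM max heap: " + gib(Runtime.getRuntime().maxMemory()));

        // JavaCPP limits; assumed relevant to the "Physical memory usage is too high" error.
        System.out.println("org.bytedeco.javacpp.maxbytes: "
                + propertyOrUnset("org.bytedeco.javacpp.maxbytes"));
        System.out.println("org.bytedeco.javacpp.maxphysicalbytes: "
                + propertyOrUnset("org.bytedeco.javacpp.maxphysicalbytes"));
    }
}
```

If both properties print "(not set)", the limits in effect are whatever JavaCPP derives by default for the version in use, which is worth checking against its documentation before raising them with `-D` flags.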

Version Information

jdk version: openjdk version "1.8.0_151"
dl4j version: 0.9.1
os: Ubuntu 16.04.3 LTS
cuda version: 8.0
NVRM version: NVIDIA UNIX x86_64 Kernel Module 384.98 Thu Oct 26 15:16:01 PDT 2017
GCC version: gcc version 5.4.0 20160609 (Ubuntu 5.4.0-6ubuntu1~16.04.5)
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2016 NVIDIA Corporation
Built on Tue_Jan_10_13:22:03_CST_2017
Cuda compilation tools, release 8.0, V8.0.61

pom file

(There are some dependencies on Spark because I can choose to train the network either on Spark or on GPUs.)
<dependencies>
  <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-core_${scala.binary.version}</artifactId>
  </dependency>
  <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-sql_${scala.binary.version}</artifactId>
  </dependency>
  <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-hive_${scala.binary.version}</artifactId>
  </dependency>
  <dependency>
    <groupId>org.deeplearning4j</groupId>
    <artifactId>deeplearning4j-core</artifactId>
  </dependency>
  <dependency>
    <groupId>org.deeplearning4j</groupId>
    <artifactId>dl4j-spark_${scala.binary.version}</artifactId>
  </dependency>
  <dependency>
    <groupId>org.deeplearning4j</groupId>
    <artifactId>deeplearning4j-parallel-wrapper_${scala.binary.version}</artifactId>
  </dependency>
  <dependency>
    <groupId>org.nd4j</groupId>
    <artifactId>nd4j-kryo_${scala.binary.version}</artifactId>
  </dependency>
  <dependency>
    <groupId>org.datavec</groupId>
    <artifactId>datavec-api</artifactId>
  </dependency>
  <dependency>
    <groupId>org.datavec</groupId>
    <artifactId>datavec-spark_${scala.binary.version}</artifactId>
  </dependency>
  <dependency>
    <groupId>log4j</groupId>
    <artifactId>log4j</artifactId>
  </dependency>
</dependencies>
<profiles>
  <profile>
    <id>use_cpu</id>
    <activation>
      <activeByDefault>true</activeByDefault>
    </activation>
    <dependencies>
      <dependency>
        <groupId>org.nd4j</groupId>
        <artifactId>nd4j-native-platform</artifactId>
      </dependency>
    </dependencies>
  </profile>
  <profile>
    <id>use_gpu</id>
    <activation>
      <activeByDefault>false</activeByDefault>
    </activation>
    <dependencies>
      <dependency>
        <groupId>org.nd4j</groupId>
        <artifactId>nd4j-cuda-${cuda.version}-platform</artifactId>
      </dependency>
    </dependencies>
  </profile>
</profiles>

code

(Network: 2 hidden LSTM layers with 80 units each, batch size = 10, input feature size = 1011, time series length = 10 to 100, output size = 1011, TBPTT length = 10.)
```java
Nd4j.setDataType(DataBuffer.Type.HALF);
// DataTypeUtil.setDTypeForContext(DataBuffer.Type.HALF);
// CudaEnvironment.getInstance().getConfiguration()
//         .allowMultiGPU(true)
//         .setMaximumDeviceCacheableLength(1024 * 1024 * 1024L)
//         .setMaximumDeviceCache(6L * 1024 * 1024 * 1024L)
//         .setMaximumHostCacheableLength(1024 * 1024 * 1024L)
//         .setMaximumHostCache(6L * 1024 * 1024 * 1024L)
//         .allowCrossDeviceAccess(true);

// Load training and test data
long st = System.currentTimeMillis();
ItemSeqIterator train_data = dataloader.prepareData(params.train_data_file, params.batch_size, params.augment_sample);
logger.info("get {} training data, cost {} seconds", train_data.numExamples(), (System.currentTimeMillis() - st) / 1000);

st = System.currentTimeMillis();
ItemSeqIterator test_data = dataloader.prepareData(params.test_data_file, 1, 0);
logger.info("get {} test data, cost {} seconds", test_data.numExamples(), (System.currentTimeMillis() - st) / 1000);

// Two GravesLSTM layers followed by a softmax RNN output layer, trained with truncated BPTT
MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
        .optimizationAlgo(OptimizationAlgorithm.STOCHASTIC_GRADIENT_DESCENT)
        .iterations(1)
        .learningRate(params.learning_rate)
        .trainingWorkspaceMode(WorkspaceMode.SEPARATE)
        .inferenceWorkspaceMode(WorkspaceMode.SEPARATE)
        .seed(rd_seed)
        .regularization(true)
        .l2(params.l2_norm_coff)
        .weightInit(WeightInit.XAVIER)
        .updater(Updater.RMSPROP)
        .list()
        .layer(0, new GravesLSTM.Builder()
                .nIn(train_data.inputColumns())
                .nOut(params.lstm_layer_size)
                .activation(Activation.TANH)
                .dropOut(params.dropout)
                .build())
        .layer(1, new GravesLSTM.Builder()
                .nIn(params.lstm_layer_size)
                .nOut(params.lstm_layer_size)
                .activation(Activation.TANH)
                .dropOut(params.dropout)
                .build())
        .layer(2, new RnnOutputLayer.Builder(LossFunctions.LossFunction.MCXENT)
                .activation(Activation.SOFTMAX)
                .nIn(params.lstm_layer_size)
                .nOut(train_data.totalOutcomes())
                .build())
        .backpropType(BackpropType.TruncatedBPTT)
        .tBPTTForwardLength(params.tbptt_length)
        .tBPTTBackwardLength(params.tbptt_length)
        .pretrain(false)
        .backprop(true)
        .build();

MultiLayerNetwork net = new MultiLayerNetwork(conf);
net.init();

logger.info("network has {} parameters", net.numParams());
net.setListeners(new ScoreIterationListener(1));

// One worker per GPU, averaging parameters after every iteration
ParallelWrapper wrapper = new ParallelWrapper.Builder(net)
        .prefetchBuffer(4)
        .workers(4)
        .averagingFrequency(1)
        .reportScoreAfterAveraging(true)
        .workspaceMode(WorkspaceMode.SEPARATE)
        .build();

Nd4j.getMemoryManager().setAutoGcWindow(5000);
Nd4j.getMemoryManager().togglePeriodicGc(false);
logger.info("Starting training");
for (int i = 0; i < params.num_epochs; i++) {
    st = System.currentTimeMillis();
    logger.info("epoch {} start", i);
    wrapper.fit(train_data);
    // net.fit(train_data);
    logger.info("epoch {} complete, cost {} seconds, start evaluating", i, (System.currentTimeMillis() - st) / 1000);

    Evaluation evaluation = net.evaluate(test_data);
    logger.info(evaluation.stats());
    train_data.reset();
    train_data.shuffle();
}
```
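
For a sense of scale, the figures quoted above (batch size 10, feature size 1011, up to 100 time steps, HALF precision) can be turned into a rough size for one feature array. This is a back-of-the-envelope sketch with hypothetical names, assuming the usual `[miniBatch, featureSize, timeSteps]` RNN layout and 2 bytes per HALF element. It ignores masks, activations, gradients and workspace overhead.

```java
// Sizing sketch (hypothetical class name): bytes needed for one dense
// feature array of shape [batchSize, featureSize, timeSteps].
final class MinibatchSize {

    /** Returns batchSize * featureSize * timeSteps * bytesPerElement, failing on overflow. */
    static long bytes(long batchSize, long featureSize, long timeSteps, long bytesPerElement) {
        long elements = Math.multiplyExact(Math.multiplyExact(batchSize, featureSize), timeSteps);
        return Math.multiplyExact(elements, bytesPerElement);
    }

    public static void main(String[] args) {
        long halfBytes = 2; // assumption: HALF (FP16) stores 2 bytes per element

        // Longest sequence mentioned in the report: 100 time steps.
        System.out.println("full sequence: " + bytes(10, 1011, 100, halfBytes) + " bytes");

        // One truncated-BPTT segment of length 10.
        System.out.println("tbptt segment: " + bytes(10, 1011, 10, halfBytes) + " bytes");
    }
}
```

The full-sequence figure works out to 2,022,000 bytes, which is the same order of magnitude as the 1997324 in the error message, if that number is a byte count. This does not establish which array the failed allocation was for. Because the output size is also 1011, a label array for the same minibatch would be the same size.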

log

16:34:37.506 [main] INFO org.nd4j.linalg.factory.Nd4jBackend - Loaded [JCublasBackend] backend
16:34:42.266 [main] INFO org.nd4j.nativeblas.NativeOpsHolder - Number of threads used for NativeOps: 32
16:34:43.068 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.074 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.076 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.078 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.080 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.081 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.083 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.085 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.086 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.088 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.090 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.091 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.094 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.095 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.097 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.099 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.101 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.102 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.104 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.106 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.107 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.109 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.111 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.112 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.114 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.116 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.117 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.120 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.121 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.123 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.125 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.126 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
16:34:43.484 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.491 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.494 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.497 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.500 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.503 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.505 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.508 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.511 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.514 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.517 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.519 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.523 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.525 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.528 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.530 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.533 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.535 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.539 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.542 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.545 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.548 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.551 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.554 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.556 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.559 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.562 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.565 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.568 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.570 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.573 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.576 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
16:34:43.921 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.925 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.926 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.928 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.929 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.931 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.932 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.933 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.935 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.936 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.938 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.939 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.941 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.942 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.944 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.945 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.947 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.948 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.949 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.951 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.952 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.954 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.955 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.956 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.958 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.959 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.961 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.963 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.964 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.965 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.967 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:43.968 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
16:34:44.284 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.288 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.289 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.290 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.292 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.293 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.295 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.296 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.298 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.299 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.300 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.302 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.304 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.305 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.307 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.308 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.309 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.311 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.312 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.314 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.315 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.317 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.318 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.319 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.321 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.322 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.324 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.326 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.327 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.329 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.330 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.331 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
16:34:44.336 [main] DEBUG o.n.j.c.CudaAffinityManager - Mapping thread [1] to device [0], out of [4] devices...
16:34:44.336 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [25] to device [0], out of [4] devices...
16:34:44.337 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [26] to device [0], out of [4] devices...
16:34:44.337 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [27] to device [0], out of [4] devices...
16:34:44.337 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [28] to device [0], out of [4] devices...
16:34:44.337 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [29] to device [0], out of [4] devices...
16:34:44.337 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [30] to device [0], out of [4] devices...
16:34:44.365 [main] DEBUG org.reflections.Reflections - going to scan these urls:
jar:file:/data/lib/nd4j-native-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-linux-x86_64.jar!/
jar:file:/data/lib/nd4j-api-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-ppc64le.jar!/
jar:file:/data/lib/nd4j-jackson-0.9.1.jar!/
jar:file:/data/lib/nd4j-parameter-server-model-0.9.1.jar!/
jar:file:/data/lib/nd4j-base64-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-macosx-x86_64.jar!/
jar:file:/data/lib/nd4j-context-0.9.1.jar!/
jar:file:/data/lib/jackson-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-x86_64.jar!/
jar:file:/data/lib/nd4j-aeron-0.9.1.jar!/
jar:file:/data/lib/nd4j-kryo_2.11-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-windows-x86_64.jar!/
jar:file:/data/lib/nd4j-parameter-server-client-0.9.1.jar!/
jar:file:/data/lib/nd4j-common-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-api-0.9.1.jar!/
jar:file:/data/lib/nd4j-buffer-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-linux-ppc64le.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-windows-x86_64.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-macosx-x86_64.jar!/
jar:file:/data/lib/nd4j-parameter-server-0.9.1.jar!/
16:34:44.480 [main] INFO org.reflections.Reflections - Reflections took 112 ms to scan 23 urls, producing 31 keys and 227 values
16:34:44.605 [main] INFO o.n.l.a.o.e.DefaultOpExecutioner - Backend used: [CUDA]; OS: [Linux]
16:34:44.605 [main] INFO o.n.l.a.o.e.DefaultOpExecutioner - Cores: [16]; Memory: [7.1GB];
16:34:44.605 [main] INFO o.n.l.a.o.e.DefaultOpExecutioner - Blas vendor: [CUBLAS]
16:34:44.607 [main] INFO o.n.l.j.o.e.CudaExecutioner - Device name: [GeForce GTX 1080 Ti]; CC: [6.1]; Total/free memory: [11712987136]
16:34:44.607 [main] INFO o.n.l.j.o.e.CudaExecutioner - Device name: [GeForce GTX 1080 Ti]; CC: [6.1]; Total/free memory: [11715084288]
16:34:44.607 [main] INFO o.n.l.j.o.e.CudaExecutioner - Device name: [GeForce GTX 1080 Ti]; CC: [6.1]; Total/free memory: [11715084288]
16:34:44.607 [main] INFO o.n.l.j.o.e.CudaExecutioner - Device name: [GeForce GTX 1080 Ti]; CC: [6.1]; Total/free memory: [11715084288]
16:36:00.970 [main] INFO c.a.r.m.itemseqrnn.TrainByLocalFile - get 6123748 training data, cost 76 seconds
16:36:01.084 [main] INFO c.a.r.m.itemseqrnn.TrainByLocalFile - get 30969 test data, cost 0 seconds
16:36:21.949 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 1
16:36:22.173 [main] DEBUG org.reflections.Reflections - going to scan these urls:
file:/data/lib/scala-java8-compat_2.11-0.3.0.jar
file:/data/lib/nd4j-parameter-server-model-0.9.1.jar
file:/data/lib/commons-cli-1.2.jar
file:/data/lib/scala-stm_2.11-0.7.jar
file:/data/lib/snappy-0.2.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1-macosx-x86_64.jar
file:/data/lib/play-functional_2.11-2.4.6.jar
file:/data/lib/jersey-container-servlet-core-2.22.2.jar
file:/data/lib/mapdb-3.0.5.jar
file:/data/lib/api-util-1.0.0-M20.jar
file:/data/lib/parquet-generator-1.7.0.jar
file:/data/lib/scala-library-2.11.8.jar
file:/data/lib/commons-beanutils-core-1.8.0.jar
file:/data/lib/parquet-column-1.7.0.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-macosx-x86_64.jar
file:/data/lib/jackson-databind-2.6.5.jar
file:/data/lib/commons-codec-1.10.jar
file:/data/lib/nd4j-native-0.9.1-linux-ppc64le.jar
file:/data/lib/aopalliance-1.0.jar
file:/data/lib/derby-10.10.2.0.jar
file:/data/lib/play-netty-utils-2.4.6.jar
file:/data/lib/datanucleus-api-jdo-3.2.6.jar
file:/data/lib/jodd-core-3.5.2.jar
file:/data/lib/avro-ipc-1.7.7-tests.jar
file:/data/lib/openblas-0.2.19-1.3-android-x86.jar
file:/data/lib/htrace-core-3.1.0-incubating.jar
file:/data/lib/slf4j-log4j12-1.7.16.jar
file:/data/lib/leptonica-1.73-1.3-linux-x86_64.jar
file:/data/lib/netty-3.8.0.Final.jar
file:/data/lib/scala-reflect-2.11.7.jar
file:/data/lib/leveldb-api-0.5.jar
file:/data/lib/elsa-3.0.0-M5.jar
file:/data/lib/janino-2.7.8.jar
file:/data/lib/joda-convert-1.7.jar
file:/data/lib/cuda-8.0-6.0-1.3.jar
file:/data/lib/leptonica-1.73-1.3-linux-x86.jar
file:/data/lib/joni-2.1.2.jar
file:/data/lib/RoaringBitmap-0.5.11.jar
file:/data/lib/leptonica-1.73-1.3-android-arm.jar
file:/data/lib/opencv-3.2.0-1.3-linux-x86_64.jar
file:/data/lib/pyrolite-4.9.jar
file:/data/lib/hibernate-validator-5.0.3.Final.jar
file:/data/lib/scala-compiler-2.11.0.jar
file:/data/lib/leptonica-1.73-1.3-linux-ppc64le.jar
file:/data/lib/deeplearning4j-nn-0.9.1.jar
file:/data/lib/guice-assistedinject-4.0.jar
file:/data/lib/findbugs-annotations-1.3.9-1.jar
file:/data/lib/jersey-media-jaxb-2.22.2.jar
file:/data/lib/akka-actor_2.11-2.3.13.jar
file:/data/lib/jtransforms-2.4.0.jar
file:/data/lib/hbase-protocol-1.2.5.jar
file:/data/lib/imageio-bmp-3.1.1.jar
file:/data/lib/jaxb-core-2.2.7.jar
file:/data/lib/c3p0-0.9.5.2.jar
file:/data/lib/commons-collections-3.2.1.jar
file:/data/lib/compress-lzf-1.0.3.jar
file:/data/lib/openblas-0.2.19-1.3-linux-x86.jar
file:/data/lib/logback-core-1.1.3.jar
file:/data/lib/cuda-8.0-6.0-1.3-linux-x86_64.jar
file:/data/lib/javax.annotation-api-1.2.jar
file:/data/lib/httpcore-nio-4.4.4.jar
file:/data/lib/zookeeper-3.4.5.jar
file:/data/lib/bonecp-0.8.0.RELEASE.jar
file:/data/lib/ffmpeg-3.2.1-1.3.jar
file:/data/lib/aopalliance-repackaged-2.4.0-b34.jar
file:/data/lib/datavec-spark_2.11-0.9.1_spark_2.jar
file:/data/lib/bson-3.5.0.jar
file:/data/lib/ivy-2.4.0.jar
file:/data/lib/calcite-core-1.2.0-incubating.jar
file:/data/lib/opencv-3.2.0-1.3-windows-x86.jar
file:/data/lib/breeze_2.11-0.11.2.jar
file:/data/lib/antlr-2.7.7.jar
file:/data/lib/commons-configuration-1.6.jar
file:/data/lib/hk2-locator-2.4.0-b34.jar
file:/data/lib/imageio-psd-3.1.1.jar
file:/data/lib/leveldb-0.5.jar
file:/data/lib/JavaEWAH-0.3.2.jar
file:/data/lib/openblas-0.2.19-1.3-linux-x86_64.jar
file:/data/lib/opencv-platform-3.2.0-1.3.jar
file:/data/lib/opencv-3.2.0-1.3-linux-x86.jar
file:/data/lib/kryo-4.0.0.jar
file:/data/lib/classmate-1.0.0.jar
file:/data/lib/opencsv-2.3.jar
file:/data/lib/spring-core-4.1.6.RELEASE.jar
file:/data/lib/deeplearning4j-ui-components-0.9.1.jar
file:/data/lib/commons-digester-1.8.jar
file:/data/lib/parquet-hadoop-bundle-1.6.0.jar
file:/data/lib/jsr305-1.3.9.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1-windows-x86_64.jar
file:/data/lib/jackson-datatype-jsr310-2.4.4.jar
file:/data/lib/json-20090211.jar
file:/data/lib/stream-2.7.0.jar
file:/data/lib/deeplearning4j-core-0.9.1.jar
file:/data/lib/commons-lang-2.6.jar
file:/data/lib/artoolkitplus-2.3.1-1.3.jar
file:/data/lib/unused-1.0.0.jar
file:/data/lib/hk2-utils-2.4.0-b34.jar
file:/data/lib/deeplearning4j-modelimport-0.9.1.jar
file:/data/lib/hive-exec-1.2.1.spark2.jar
file:/data/lib/objenesis-2.2.jar
file:/data/lib/chill-java-0.8.0.jar
file:/data/lib/play-iteratees_2.11-2.4.6.jar
file:/data/lib/hbase-client-1.2.5.jar
file:/data/lib/nd4j-native-0.9.1-android-x86.jar
file:/data/lib/json4s-jackson_2.11-3.2.11.jar
file:/data/lib/lz4-1.3.0.jar
file:/data/lib/commons-httpclient-3.1.jar
file:/data/lib/univocity-parsers-2.1.1.jar
file:/data/lib/commons-collections-3.2.2.jar
file:/data/lib/leptonica-1.73-1.3-windows-x86_64.jar
file:/data/lib/parquet-format-2.3.0-incubating.jar
file:/data/lib/play-netty-server_2.11-2.4.6.jar
file:/data/lib/hbase-annotations-1.2.5.jar
file:/data/lib/akka-remote_2.11-2.3.13.jar
file:/data/lib/kotlin-runtime-1.0.7.jar
file:/data/lib/openblas-0.2.19-1.3-android-arm.jar
file:/data/lib/nd4j-native-api-0.9.1.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-linux-x86_64.jar
file:/data/lib/asm-5.0.4.jar
file:/data/lib/javacv-1.3.3.jar
file:/data/lib/nd4j-kryo_2.11-0.9.1.jar
file:/data/lib/datavec-api-0.9.1.jar
file:/data/lib/jai-imageio-core-1.3.0.jar
file:/data/lib/unirest-java-1.4.9.jar
file:/data/lib/kryo-shaded-3.0.3.jar
file:/data/lib/play-server_2.11-2.4.6.jar
file:/data/lib/metrics-json-3.1.2.jar
file:/data/lib/jcip-annotations-1.0.jar
file:/data/lib/leptonica-1.73-1.3-windows-x86.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1.jar
file:/data/lib/javax.servlet-api-3.1.0.jar
file:/data/lib/scalap-2.11.0.jar
file:/data/lib/play_2.11-2.4.6.jar
file:/data/lib/netty-http-pipelining-1.1.4.jar
file:/data/lib/nd4j-native-0.9.1.jar
file:/data/lib/javax.inject-2.4.0-b34.jar
file:/data/lib/jackson-datatype-jdk8-2.4.4.jar
file:/data/lib/opencv-3.2.0-1.3-macosx-x86_64.jar
file:/data/lib/javax.ws.rs-api-2.0.1.jar
file:/data/lib/spire_2.11-0.7.4.jar
file:/data/lib/guice-4.0.jar
file:/data/lib/config-1.3.0.jar
file:/data/lib/antlr4-runtime-4.5.3.jar
file:/data/lib/jcl-over-slf4j-1.7.16.jar
file:/data/lib/kryo-serializers-0.41.jar
file:/data/lib/libfb303-0.9.2.jar
file:/data/lib/libdc1394-2.2.4-1.3.jar
file:/data/lib/opencv-3.2.0-1.3-android-arm.jar
file:/data/lib/jul-to-slf4j-1.7.16.jar
file:/data/lib/scala-xml_2.11-1.0.2.jar
file:/data/lib/metrics-graphite-3.1.2.jar
file:/data/lib/stax-api-1.0.1.jar
file:/data/lib/imageio-tiff-3.1.1.jar
file:/data/lib/hamcrest-core-1.3.jar
file:/data/lib/common-lang-3.1.1.jar
file:/data/lib/validation-api-1.1.0.Final.jar
file:/data/lib/junit-4.12.jar
file:/data/lib/pmml-model-1.2.15.jar
file:/data/lib/leptonica-1.73-1.3-macosx-x86_64.jar
file:/data/lib/httpcore-4.4.4.jar
file:/data/models/
file:/data/lib/akka-slf4j_2.11-2.3.13.jar
file:/data/lib/openblas-0.2.19-1.3-linux-ppc64le.jar
file:/data/lib/api-asn1-api-1.0.0-M20.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-linux-ppc64le.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-windows-x86.jar
file:/data/lib/datanucleus-core-3.2.10.jar
file:/data/lib/guice-3.0.jar
file:/data/lib/openblas-0.2.19-1.3.jar
file:/data/lib/cuda-8.0-6.0-1.3-macosx-x86_64.jar
file:/data/lib/eclipse-collections-7.1.1.jar
file:/data/lib/neoitertools-1.0.0.jar
file:/data/lib/jaxb-impl-2.2.7.jar
file:/data/lib/logback-classic-1.1.3.jar
file:/data/lib/jackson-0.9.1.jar
file:/data/lib/pmml-schema-1.2.15.jar
file:/data/lib/datavec-hadoop-0.9.1.jar
file:/data/lib/deeplearning4j-ui-model-0.9.1.jar
file:/data/lib/aeron-all-1.0.4.jar
file:/data/lib/nd4j-native-0.9.1-linux-x86_64.jar
file:/data/lib/pmml-agent-1.1.15.jar
file:/data/lib/imageio-core-3.1.1.jar
file:/data/lib/reflectasm-1.11.3.jar
file:/data/lib/minlog-1.3.0.jar
file:/data/lib/jackson-module-paranamer-2.6.5.jar
file:/data/lib/junit-4.8.2.jar
file:/data/lib/nd4j-native-0.9.1-macosx-x86_64.jar
file:/data/lib/tomcat-servlet-api-8.0.21.jar
file:/data/lib/jackson-module-scala_2.11-2.6.5.jar
file:/data/lib/jackson-core-2.6.5.jar
file:/data/lib/javolution-5.5.1.jar
file:/data/lib/hk2-api-2.4.0-b34.jar
file:/data/lib/kotlin-stdlib-1.0.7.jar
file:/data/lib/jackson-core-asl-1.9.13.jar
file:/data/lib/mesos-0.21.1-shaded-protobuf.jar
file:/data/lib/twirl-api_2.11-1.1.1.jar
file:/data/lib/deeplearning4j-parallel-wrapper_2.11-0.9.1.jar
file:/data/lib/imageio-metadata-3.1.1.jar
file:/data/lib/play-java_2.11-2.4.6.jar
file:/data/lib/uncommons-maths-1.2.2a.jar
file:/data/lib/jetty-util-6.1.26.jar
file:/data/lib/xercesImpl-2.11.0.jar
file:/data/lib/httpmime-4.5.2.jar
file:/data/lib/sqlite-jdbc-3.15.1.jar
file:/data/lib/jdo-api-3.0.1.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-windows-x86_64.jar
file:/data/lib/xz-1.5.jar
file:/data/lib/play-datacommons_2.11-2.4.6.jar
file:/data/lib/avro-mapred-1.7.7-hadoop2.jar
file:/data/lib/commons-logging-1.1.3.jar
file:/data/lib/commons-io-2.4.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-linux-x86.jar
file:/data/lib/openblas-0.2.19-1.3-windows-x86.jar
file:/data/lib/jets3t-0.7.1.jar
file:/data/lib/Agrona-0.5.4.jar
file:/data/lib/commons-net-2.2.jar
file:/data/lib/nd4j-buffer-0.9.1.jar
file:/data/lib/opencv-3.2.0-1.3.jar
file:/data/lib/nd4j-parameter-server-client-0.9.1.jar
file:/data/lib/parquet-jackson-1.7.0.jar
file:/data/lib/akka-contrib_2.11-2.3.13.jar
file:/data/lib/pmml-schema-1.1.15.jar
file:/data/lib/opencv-3.2.0-1.3-windows-x86_64.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-ppc64le.jar
file:/data/lib/nd4j-jackson-0.9.1.jar
file:/data/lib/oro-2.0.8.jar
file:/data/lib/build-link-2.4.6.jar
file:/data/lib/jersey-client-2.22.2.jar
file:/data/lib/commons-dbcp-1.4.jar
file:/data/lib/protobuf-java-2.5.0.jar
file:/data/lib/curator-framework-2.4.0.jar
file:/data/lib/slf4j-api-1.7.25.jar
file:/data/lib/openblas-0.2.19-1.3-windows-x86_64.jar
file:/data/lib/json4s-core_2.11-3.2.11.jar
file:/data/lib/hive-metastore-1.2.1.spark2.jar
file:/data/lib/typetools-0.4.3.jar
file:/data/lib/common-io-3.1.1.jar
file:/data/lib/akka-persistence-experimental_2.11-2.3.13.jar
file:/data/lib/parquet-common-1.7.0.jar
file:/data/lib/jaxb-api-2.2.7.jar
file:/data/lib/stringtemplate-3.2.1.jar
file:/data/lib/leptonica-1.73-1.3.jar
file:/data/lib/commons-pool-1.5.4.jar
file:/data/lib/nearestneighbor-core-0.9.1.jar
file:/data/lib/libfreenect2-0.2.0-1.3.jar
file:/data/lib/curator-client-2.4.0.jar
file:/data/lib/librealsense-1.9.6-1.3.jar
file:/data/lib/javassist-3.19.0-GA.jar
file:/data/lib/openblas-platform-0.2.19-1.3.jar
file:/data/lib/chill_2.11-0.8.0.jar
file:/data/lib/netty-all-4.0.29.Final.jar
file:/data/lib/curator-recipes-2.4.0.jar
file:/data/lib/gson-2.8.1.jar
file:/data/lib/apache-log4j-extras-1.2.17.jar
file:/data/lib/cuda-8.0-6.0-1.3-windows-x86_64.jar
file:/data/lib/calcite-avatica-1.2.0-incubating.jar
file:/data/lib/jcodings-1.0.8.jar
file:/data/lib/metrics-core-3.1.2.jar
file:/data/lib/flandmark-1.07-1.3.jar
file:/data/lib/scala-parser-combinators_2.11-1.0.1.jar
file:/data/lib/spring-beans-4.1.6.RELEASE.jar
file:/data/lib/parquet-encoding-1.7.0.jar
file:/data/lib/leptonica-platform-1.73-1.3.jar
file:/data/lib/opencv-3.2.0-1.3-android-x86.jar
file:/data/lib/datanucleus-rdbms-3.2.9.jar
file:/data/lib/freemarker-2.3.23.jar
file:/data/lib/jboss-logging-3.2.1.Final.jar
file:/data/lib/avro-1.7.7.jar
file:/data/lib/jackson-annotations-2.6.5.jar
file:/data/lib/httpasyncclient-4.1.1.jar
file:/data/lib/videoinput-0.200-1.3.jar
file:/data/lib/guava-18.0.jar
file:/data/lib/metrics-jvm-3.1.2.jar
file:/data/models/al-rec-models-itemseq-1.0.jar
file:/data/lib/cuda-8.0-6.0-1.3-linux-ppc64le.jar
file:/data/lib/ST4-4.0.4.jar
file:/data/lib/jersey-container-servlet-2.22.2.jar
file:/data/lib/jersey-server-2.22.2.jar
file:/data/lib/jersey-common-2.22.2.jar
file:/data/lib/leptonica-1.73-1.3-linux-armhf.jar
file:/data/lib/apacheds-i18n-2.0.0-M15.jar
file:/data/lib/leptonica-1.73-1.3-android-x86.jar
file:/data/lib/commons-math-2.1.jar
file:/data/lib/eigenbase-properties-1.1.5.jar
file:/data/lib/commons-beanutils-1.7.0.jar
file:/data/lib/slf4j-log4j12-1.7.25.jar
file:/data/lib/snakeyaml-1.12.jar
file:/data/lib/snappy-java-1.1.2.6.jar
file:/data/lib/flycapture-2.9.3.43-1.3.jar
file:/data/lib/objenesis-2.1.jar
file:/data/lib/cuda-platform-8.0-6.0-1.3.jar
file:/data/lib/datavec-data-image-0.9.1.jar
file:/data/lib/nd4j-native-0.9.1-windows-x86_64.jar
file:/data/lib/nd4j-native-platform-0.9.1.jar
file:/data/lib/nd4j-base64-0.9.1.jar
file:/data/lib/nd4j-api-0.9.1.jar
file:/data/lib/calcite-linq4j-1.2.0-incubating.jar
file:/data/lib/avro-ipc-1.7.7.jar
file:/data/lib/nd4j-aeron-0.9.1.jar
file:/data/lib/libfreenect-0.5.3-1.3.jar
file:/data/lib/mysql-connector-java-6.0.6.jar
file:/data/lib/nd4j-native-0.9.1-android-arm.jar
file:/data/lib/core-1.1.2.jar
file:/data/lib/metrics-core-2.2.0.jar
file:/data/lib/openblas-0.2.19-1.3-macosx-x86_64.jar
file:/data/lib/xmlenc-0.52.jar
file:/data/lib/paranamer-2.3.jar
file:/data/lib/play-exceptions-2.4.6.jar
file:/data/lib/joda-time-2.9.3.jar
file:/data/lib/common-image-3.1.1.jar
file:/data/lib/apacheds-kerberos-codec-2.0.0-M15.jar
file:/data/lib/opencv-3.2.0-1.3-linux-ppc64le.jar
file:/data/lib/eclipse-collections-forkjoin-7.1.1.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3.jar
file:/data/lib/akka-cluster_2.11-2.3.13.jar
file:/data/lib/nd4j-cuda-8.0-platform-0.9.1.jar
file:/data/lib/javacpp-1.3.3.jar
file:/data/lib/jta-1.1.jar
file:/data/lib/mongodb-driver-3.5.0.jar
file:/data/lib/hdf5-platform-1.10.0-patch1-1.3.jar
file:/data/lib/deeplearning4j-play_2.11-0.9.1.jar
file:/data/lib/commons-compress-1.8.jar
file:/data/lib/scalatest_2.11-2.2.6.jar
file:/data/lib/commons-compiler-2.7.6.jar
file:/data/lib/xbean-asm5-shaded-4.4.jar
file:/data/lib/hbase-common-1.2.5.jar
file:/data/lib/pmml-model-1.1.15.jar
file:/data/lib/reflections-0.9.10.jar
file:/data/lib/jcommander-1.27.jar
file:/data/lib/libthrift-0.9.2.jar
file:/data/lib/nd4j-parameter-server-0.9.1.jar
file:/data/lib/xml-apis-1.4.01.jar
file:/data/lib/commons-math3-3.4.1.jar
file:/data/lib/jersey-guava-2.22.2.jar
file:/data/lib/slf4j-api-1.7.16.jar
file:/data/lib/json4s-ast_2.11-3.2.11.jar
file:/data/lib/mchange-commons-java-0.2.11.jar
file:/data/lib/opencv-3.2.0-1.3-linux-armhf.jar
file:/data/lib/arpack_combined_all-0.1.jar
file:/data/lib/breeze-macros_2.11-0.11.2.jar
file:/data/lib/lombok-1.16.16.jar
file:/data/lib/c3p0-0.9.1.2.jar
file:/data/lib/leveldbjni-all-1.8.jar
file:/data/lib/imageio-jpeg-3.1.1.jar
file:/data/lib/antlr-runtime-3.4.jar
file:/data/lib/javassist-3.18.1-GA.jar
file:/data/lib/log4j-1.2.17.jar
file:/data/lib/stax2-api-3.1.4.jar
file:/data/lib/osgi-resource-locator-1.0.1.jar
file:/data/lib/py4j-0.10.3.jar
file:/data/lib/mongodb-driver-core-3.5.0.jar
file:/data/lib/nd4j-context-0.9.1.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-x86_64.jar
file:/data/lib/httpclient-4.5.2.jar
file:/data/lib/play-json_2.11-2.4.6.jar
file:/data/lib/javax.inject-1.jar
file:/data/lib/spring-context-4.1.6.RELEASE.jar
file:/data/lib/spire-macros_2.11-0.7.4.jar
file:/data/lib/eclipse-collections-api-7.1.1.jar
file:/data/lib/openblas-0.2.19-1.3-linux-armhf.jar
file:/data/lib/al-rec-common-1.0.jar
file:/data/lib/nd4j-common-0.9.1.jar
file:/data/lib/parquet-hadoop-1.7.0.jar
file:/data/lib/dl4j-spark_2.11-0.9.1_spark_2.jar
file:/data/lib/commons-lang3-3.3.2.jar
file:/data/lib/annotations-2.0.1.jar
file:/data/lib/jackson-mapper-asl-1.9.13.jar
file:/data/lib/guava-20.0.jar
16:36:27.254 [main] INFO org.reflections.Reflections - Reflections took 5080 ms to scan 368 urls, producing 4712 keys and 36863 values
16:36:27.514 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.layers.CenterLossOutputLayer as subtype of org.deeplearning4j.nn.conf.layers.Layer
16:36:27.514 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.modelimport.keras.preprocessors.TensorFlowCnnToFeedForwardPreProcessor as subtype of org.deeplearning4j.nn.conf.InputPreProcessor
16:36:27.515 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.ReshapeVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
16:36:27.515 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.PoolHelperVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
16:36:27.515 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.ShiftVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
16:36:27.520 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.layers.CenterLossOutputLayer as subtype of org.deeplearning4j.nn.conf.layers.Layer
16:36:27.521 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.modelimport.keras.preprocessors.TensorFlowCnnToFeedForwardPreProcessor as subtype of org.deeplearning4j.nn.conf.InputPreProcessor
16:36:27.521 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.ReshapeVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
16:36:27.521 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.PoolHelperVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
16:36:27.521 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.ShiftVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
16:36:27.550 [main] INFO o.d.nn.multilayer.MultiLayerNetwork - Starting MultiLayerNetwork with WorkspaceModes set to [training: SEPARATE; inference: SEPARATE]
16:36:47.167 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 5
16:36:47.182 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 3
16:36:47.214 [main] DEBUG org.reflections.Reflections - going to scan these urls:
jar:file:/data/lib/nd4j-native-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-linux-x86_64.jar!/
jar:file:/data/lib/nd4j-api-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-ppc64le.jar!/
jar:file:/data/lib/nd4j-jackson-0.9.1.jar!/
jar:file:/data/lib/nd4j-parameter-server-model-0.9.1.jar!/
jar:file:/data/lib/nd4j-base64-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-macosx-x86_64.jar!/
jar:file:/data/lib/nd4j-context-0.9.1.jar!/
jar:file:/data/lib/jackson-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-x86_64.jar!/
jar:file:/data/lib/nd4j-aeron-0.9.1.jar!/
jar:file:/data/lib/nd4j-kryo_2.11-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-windows-x86_64.jar!/
jar:file:/data/lib/nd4j-parameter-server-client-0.9.1.jar!/
jar:file:/data/lib/nd4j-common-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-api-0.9.1.jar!/
jar:file:/data/lib/nd4j-buffer-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-linux-ppc64le.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-windows-x86_64.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-macosx-x86_64.jar!/
jar:file:/data/lib/nd4j-parameter-server-0.9.1.jar!/
16:37:07.698 [main] INFO org.reflections.Reflections - Reflections took 20484 ms to scan 23 urls, producing 420 keys and 1665 values
16:37:28.795 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 2
16:37:28.814 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 4
16:37:28.814 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 0
16:37:28.859 [main] INFO c.a.r.m.itemseqrnn.TrainByLocalFile - network has 499331 parameters
16:37:28.877 [main] INFO o.d.parallelism.ParallelWrapper - Creating new AveragingTraining instance
16:37:28.878 [main] INFO c.a.r.m.itemseqrnn.TrainByLocalFile - Starting training
16:37:28.878 [main] INFO c.a.r.m.itemseqrnn.TrainByLocalFile - epoch 0 start
16:37:28.878 [main] INFO o.d.parallelism.ParallelWrapper - Using workspaceMode SEPARATE for training
16:37:28.883 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [33] to device [0], out of [4] devices...
16:37:28.883 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [35] to device [1], out of [4] devices...
16:37:28.884 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [37] to device [2], out of [4] devices...
16:37:28.885 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [39] to device [3], out of [4] devices...
16:37:28.885 [main] INFO o.d.parallelism.ParallelWrapper - Creating asynchronous prefetcher...
16:37:28.890 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [40] to device [0], out of [4] devices...
16:37:28.890 [main] INFO o.d.parallelism.ParallelWrapper - Starting ParallelWrapper training round...
16:37:28.896 [ADSI prefetch thread] DEBUG o.n.l.memory.abstracts.Nd4jWorkspace - Steps: 10
16:37:28.912 [ADSI prefetch thread] DEBUG o.n.l.memory.abstracts.Nd4jWorkspace - Steps: 17
16:37:28.928 [ADSI prefetch thread] DEBUG o.n.l.memory.abstracts.Nd4jWorkspace - Steps: 17
16:37:28.943 [ADSI prefetch thread] DEBUG o.n.l.memory.abstracts.Nd4jWorkspace - Steps: 17
16:37:28.958 [ADSI prefetch thread] DEBUG o.n.l.memory.abstracts.Nd4jWorkspace - Steps: 17
CUDA error at /home/jenkins/workspace/dl4j/all-multiplatform@2_linux-x86_64/stream1/libnd4j/blas/cuda/NativeOps.cu:2172 code=77() "cudaStreamSynchronize(*stream)"
CUDA error at /home/jenkins/workspace/dl4j/all-multiplatform@2_linux-x86_64/stream1/libnd4j/blas/cuda/NativeOps.cu:4885 code=77() "result"
CUDA error at /home/jenkins/workspace/dl4j/all-multiplatform@2_linux-x86_64/stream1/libnd4j/blas/cuda/NativeOps.cu:4738 code=77() "result"
Exception in thread "ADSI prefetch thread" 16:37:29.020 [ParallelWrapper training thread 0] DEBUG o.d.p.trainer.DefaultTrainer - Terminating all workspaces for trainer_0
16:37:51.246 [ParallelWrapper training thread 0] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [41] to device [0], out of [4] devices...
CUDA error at /home/jenkins/workspace/dl4j/all-multiplatform@2_linux-x86_64/stream1/libnd4j/blas/cuda/NativeOps.cu:4895 code=77() "result"
CUDA error at /home/jenkins/workspace/dl4j/all-multiplatform@2_linux-x86_64/stream1/libnd4j/blas/cuda/NativeOps.cu:4895 code=77() "result"
CUDA error at /home/jenkins/workspace/dl4j/all-multiplatform@2_linux-x86_64/stream1/libnd4j/blas/cuda/NativeOps.cu:4895 code=77() "result"
CUDA error at /home/jenkins/workspace/dl4j/all-multiplatform@2_linux-x86_64/stream1/libnd4j/blas/cuda/NativeOps.cu:4895 code=77() "result"
CUDA error at /home/jenkins/workspace/dl4j/all-multiplatform@2_linux-x86_64/stream1/libnd4j/blas/cuda/NativeOps.cu:4895 code=77() "result"
Exception in thread "UniGC thread 5" Exception in thread "UniGC thread 1" Exception in thread "UniGC thread 4" Exception in thread "UniGC thread 3" Exception in thread "UniGC thread 2" org.nd4j.linalg.exception.ND4JException: CUDA exception happened. Terminating. Last op: [null]
at org.nd4j.jita.allocator.pointers.cuda.cudaEvent_t.synchronize(cudaEvent_t.java:55)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillFinished(SynchronousFlowController.java:106)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillFinished(GridFlowController.java:47)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillReleased(SynchronousFlowController.java:203)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillReleased(GridFlowController.java:62)
at org.nd4j.jita.handler.impl.CudaZeroHandler.purgeDeviceObject(CudaZeroHandler.java:1113)
at org.nd4j.jita.allocator.impl.AtomicAllocator.purgeDeviceObject(AtomicAllocator.java:515)
at org.nd4j.jita.allocator.impl.AtomicAllocator$UnifiedGarbageCollectorThread.run(AtomicAllocator.java:714)
CUDA error at /home/jenkins/workspace/dl4j/all-multiplatform@2_linux-x86_64/stream1/libnd4j/blas/cuda/NativeOps.cu:4895 code=77() "result"
16:37:51.249 [ParallelWrapper training thread 0] ERROR o.d.parallelism.ParallelWrapper - Uncaught exception: java.lang.RuntimeException: org.nd4j.linalg.exception.ND4JException: CUDA exception happened. Terminating. Last op: [null]
org.nd4j.linalg.exception.ND4JException: CUDA exception happened. Terminating. Last op: [null]
at org.nd4j.jita.allocator.pointers.cuda.cudaEvent_t.synchronize(cudaEvent_t.java:55)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillFinished(SynchronousFlowController.java:106)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillFinished(GridFlowController.java:47)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillReleased(SynchronousFlowController.java:203)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillReleased(GridFlowController.java:62)
at org.nd4j.jita.handler.impl.CudaZeroHandler.purgeDeviceObject(CudaZeroHandler.java:1113)
at org.nd4j.jita.allocator.impl.AtomicAllocator.purgeDeviceObject(AtomicAllocator.java:515)
at org.nd4j.jita.allocator.impl.AtomicAllocator$UnifiedGarbageCollectorThread.run(AtomicAllocator.java:714)
org.nd4j.linalg.exception.ND4JException: CUDA exception happened. Terminating. Last op: [null]
at org.nd4j.jita.allocator.pointers.cuda.cudaEvent_t.synchronize(cudaEvent_t.java:55)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillFinished(SynchronousFlowController.java:106)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillFinished(GridFlowController.java:47)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillReleased(SynchronousFlowController.java:203)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillReleased(GridFlowController.java:62)
at org.nd4j.jita.handler.impl.CudaZeroHandler.purgeDeviceObject(CudaZeroHandler.java:1113)
at org.nd4j.jita.allocator.impl.AtomicAllocator.purgeDeviceObject(AtomicAllocator.java:515)
at org.nd4j.jita.allocator.impl.AtomicAllocator$UnifiedGarbageCollectorThread.run(AtomicAllocator.java:714)
org.nd4j.linalg.exception.ND4JException: CUDA exception happened. Terminating. Last op: [null]
at org.nd4j.jita.allocator.pointers.cuda.cudaEvent_t.synchronize(cudaEvent_t.java:55)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillFinished(SynchronousFlowController.java:106)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillFinished(GridFlowController.java:47)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillReleased(SynchronousFlowController.java:203)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillReleased(GridFlowController.java:62)
at org.nd4j.jita.handler.impl.CudaZeroHandler.purgeDeviceObject(CudaZeroHandler.java:1113)
at org.nd4j.jita.allocator.impl.AtomicAllocator.purgeDeviceObject(AtomicAllocator.java:515)
at org.nd4j.jita.allocator.impl.AtomicAllocator$UnifiedGarbageCollectorThread.run(AtomicAllocator.java:714)
org.nd4j.linalg.exception.ND4JException: CUDA exception happened. Terminating. Last op: [null]
at org.nd4j.jita.allocator.pointers.cuda.cudaEvent_t.synchronize(cudaEvent_t.java:55)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillFinished(SynchronousFlowController.java:106)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillFinished(GridFlowController.java:47)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillReleased(SynchronousFlowController.java:203)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillReleased(GridFlowController.java:62)
at org.nd4j.jita.allocator.impl.AtomicAllocator$UnifiedGarbageCollectorThread.run(AtomicAllocator.java:696)
java.lang.RuntimeException: org.nd4j.linalg.exception.ND4JException: CUDA exception happened. Terminating. Last op: [null]
at org.deeplearning4j.parallelism.trainer.DefaultTrainer.run(DefaultTrainer.java:399)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: org.nd4j.linalg.exception.ND4JException: CUDA exception happened. Terminating. Last op: [null]
at org.nd4j.jita.allocator.pointers.cuda.cudaStream_t.synchronize(cudaStream_t.java:24)
at org.nd4j.jita.handler.impl.CudaZeroHandler.alloc(CudaZeroHandler.java:302)
at org.nd4j.jita.allocator.impl.AtomicAllocator.allocateMemory(AtomicAllocator.java:470)
at org.nd4j.jita.allocator.impl.AtomicAllocator.allocateMemory(AtomicAllocator.java:396)
at org.nd4j.linalg.jcublas.buffer.BaseCudaDataBuffer.<init>(BaseCudaDataBuffer.java:216)
at org.nd4j.linalg.jcublas.buffer.BaseCudaDataBuffer.<init>(BaseCudaDataBuffer.java:327)
at org.nd4j.linalg.jcublas.buffer.CudaIntDataBuffer.<init>(CudaIntDataBuffer.java:53)
at org.nd4j.linalg.jcublas.buffer.CudaIntDataBuffer.<init>(CudaIntDataBuffer.java:81)
at org.nd4j.linalg.jcublas.buffer.factory.CudaDataBufferFactory.createInt(CudaDataBufferFactory.java:356)
at org.nd4j.linalg.factory.Nd4j.createBufferDetached(Nd4j.java:1430)
at org.nd4j.linalg.api.shape.Shape.createShapeInformation(Shape.java:2045)
at org.nd4j.linalg.api.ndarray.BaseShapeInfoProvider.createShapeInformation(BaseShapeInfoProvider.java:47)
at org.nd4j.jita.constant.ProtectedCudaShapeInfoProvider.createShapeInformation(ProtectedCudaShapeInfoProvider.java:64)
at org.nd4j.linalg.jcublas.CachedShapeInfoProvider.createShapeInformation(CachedShapeInfoProvider.java:26)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:163)
at org.nd4j.linalg.jcublas.JCublasNDArray.<init>(JCublasNDArray.java:335)
at org.nd4j.linalg.jcublas.JCublasNDArrayFactory.create(JCublasNDArrayFactory.java:257)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:4231)
at org.nd4j.linalg.api.ndarray.BaseNDArray.create(BaseNDArray.java:1967)
at org.nd4j.linalg.api.ndarray.BaseNDArray.subArray(BaseNDArray.java:2135)
at org.nd4j.linalg.api.ndarray.BaseNDArray.get(BaseNDArray.java:4216)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.doTruncatedBPTT(MultiLayerNetwork.java:1441)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.fit(MultiLayerNetwork.java:1824)
at org.deeplearning4j.parallelism.trainer.DefaultTrainer.fit(DefaultTrainer.java:209)
at org.deeplearning4j.parallelism.trainer.DefaultTrainer.run(DefaultTrainer.java:335)
... 3 more
Exception in thread "UniGC thread 0" java.lang.RuntimeException: java.lang.NullPointerException
at org.deeplearning4j.datasets.iterator.AsyncDataSetIterator$AsyncPrefetchThread.run(AsyncDataSetIterator.java:442)
Caused by: java.lang.NullPointerException
at org.nd4j.jita.allocator.pointers.CudaPointer.<init>(CudaPointer.java:22)
at org.nd4j.jita.allocator.pointers.cuda.cudaEvent_t.<init>(cudaEvent_t.java:33)
at org.nd4j.jita.concurrency.EventsProvider.getEvent(EventsProvider.java:34)
at org.nd4j.jita.flow.impl.SynchronousFlowController.registerAction(SynchronousFlowController.java:249)
at org.nd4j.jita.handler.impl.CudaZeroHandler.registerAction(CudaZeroHandler.java:1258)
at org.nd4j.jita.allocator.impl.AtomicAllocator.registerAction(AtomicAllocator.java:1017)
at org.nd4j.linalg.jcublas.ops.executioner.CudaExecutioner.invoke(CudaExecutioner.java:1638)
at org.nd4j.linalg.jcublas.ops.executioner.CudaGridExecutioner.pushToGrid(CudaGridExecutioner.java:225)
at org.nd4j.linalg.jcublas.ops.executioner.CudaGridExecutioner.processAsGridOp(CudaGridExecutioner.java:307)
at org.nd4j.linalg.jcublas.ops.executioner.CudaGridExecutioner.exec(CudaGridExecutioner.java:112)
at org.nd4j.linalg.api.ndarray.BaseNDArray.assign(BaseNDArray.java:1267)
at org.nd4j.linalg.api.shape.Shape.toOffsetZeroCopyHelper(Shape.java:248)
at org.nd4j.linalg.api.shape.Shape.toOffsetZeroCopy(Shape.java:213)
at org.nd4j.linalg.api.ndarray.BaseNDArray.dup(BaseNDArray.java:1714)
at org.nd4j.linalg.jcublas.JCublasNDArray.dup(JCublasNDArray.java:440)
at org.nd4j.linalg.jcublas.JCublasNDArray.migrate(JCublasNDArray.java:689)
at org.nd4j.linalg.dataset.DataSet.migrate(DataSet.java:1339)
at org.deeplearning4j.datasets.iterator.callbacks.InterleavedDataSetCallback.call(InterleavedDataSetCallback.java:66)
at org.deeplearning4j.datasets.iterator.AsyncDataSetIterator$AsyncPrefetchThread.run(AsyncDataSetIterator.java:420)
org.nd4j.linalg.exception.ND4JException: CUDA exception happened. Terminating. Last op: [null]
at org.nd4j.jita.allocator.pointers.cuda.cudaEvent_t.synchronize(cudaEvent_t.java:55)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillFinished(SynchronousFlowController.java:106)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillFinished(GridFlowController.java:47)
at org.nd4j.jita.flow.impl.SynchronousFlowController.waitTillReleased(SynchronousFlowController.java:203)
at org.nd4j.jita.flow.impl.GridFlowController.waitTillReleased(GridFlowController.java:62)
at org.nd4j.jita.allocator.impl.AtomicAllocator$UnifiedGarbageCollectorThread.run(AtomicAllocator.java:696)
16:37:51.351 [ParallelWrapper training thread 2] DEBUG o.d.p.trainer.DefaultTrainer - Terminating all workspaces for trainer_2
16:37:51.351 [ParallelWrapper training thread 3] DEBUG o.d.p.trainer.DefaultTrainer - Terminating all workspaces for trainer_3
16:38:11.846 [ParallelWrapper training thread 2] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [42] to device [1], out of [4] devices...
16:38:26.784 [ParallelWrapper training thread 3] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [43] to device [2], out of [4] devices...
16:38:26.784 [ParallelWrapper training thread 1] DEBUG o.d.p.trainer.DefaultTrainer - Terminating all workspaces for trainer_1
16:38:26.787 [ParallelWrapper training thread 2] ERROR o.d.parallelism.ParallelWrapper - Uncaught exception: java.lang.RuntimeException: java.lang.RuntimeException: Can't allocate [HOST] memory: 998662; threadId: 37
java.lang.RuntimeException: java.lang.RuntimeException: Can't allocate [HOST] memory: 998662; threadId: 37
16:38:26.787 [ParallelWrapper training thread 3] ERROR o.d.parallelism.ParallelWrapper - Uncaught exception: java.lang.RuntimeException: java.lang.RuntimeException: Can't allocate [HOST] memory: 998662; threadId: 39
at org.deeplearning4j.parallelism.trainer.DefaultTrainer.run(DefaultTrainer.java:399)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
16:38:34.481 [ParallelWrapper training thread 1] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [44] to device [3], out of [4] devices...
at java.lang.Thread.run(Thread.java:748)
16:38:34.482 [ParallelWrapper training thread 1] ERROR o.d.parallelism.ParallelWrapper - Uncaught exception: java.lang.RuntimeException: java.lang.RuntimeException: Can't allocate [HOST] memory: 998662; threadId: 35
Caused by: java.lang.RuntimeException: Can't allocate [HOST] memory: 998662; threadId: 37
at org.nd4j.jita.memory.impl.CudaDirectProvider.malloc(CudaDirectProvider.java:59)
at org.nd4j.jita.memory.impl.CudaCachingZeroProvider.malloc(CudaCachingZeroProvider.java:113)
at org.nd4j.jita.memory.impl.CudaFullCachingProvider.malloc(CudaFullCachingProvider.java:91)
at org.nd4j.jita.handler.impl.CudaZeroHandler.alloc(CudaZeroHandler.java:237)
at org.nd4j.jita.handler.impl.CudaZeroHandler.alloc(CudaZeroHandler.java:258)
at org.nd4j.jita.allocator.impl.AtomicAllocator.allocateMemory(AtomicAllocator.java:470)
at org.nd4j.jita.allocator.impl.AtomicAllocator.allocateMemory(AtomicAllocator.java:396)
at org.nd4j.linalg.jcublas.buffer.BaseCudaDataBuffer.<init>(BaseCudaDataBuffer.java:216)
at org.nd4j.linalg.jcublas.buffer.CudaHalfDataBuffer.<init>(CudaHalfDataBuffer.java:60)
at org.nd4j.linalg.jcublas.buffer.factory.CudaDataBufferFactory.createHalf(CudaDataBufferFactory.java:511)
at org.nd4j.linalg.factory.Nd4j.createBuffer(Nd4j.java:1472)
at org.nd4j.linalg.factory.Nd4j.createBuffer(Nd4j.java:1442)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:247)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:284)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:566)
at org.nd4j.linalg.jcublas.JCublasNDArray.<init>(JCublasNDArray.java:252)
at org.nd4j.linalg.jcublas.JCublasNDArrayFactory.create(JCublasNDArrayFactory.java:238)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:5014)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:4965)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:4093)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.init(MultiLayerNetwork.java:598)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.init(MultiLayerNetwork.java:539)
at org.deeplearning4j.parallelism.trainer.DefaultTrainer.run(DefaultTrainer.java:262)
... 3 more
java.lang.RuntimeException: java.lang.RuntimeException: Can't allocate [HOST] memory: 998662; threadId: 35
at org.deeplearning4j.parallelism.trainer.DefaultTrainer.run(DefaultTrainer.java:399)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: Can't allocate [HOST] memory: 998662; threadId: 35
at org.nd4j.jita.memory.impl.CudaDirectProvider.malloc(CudaDirectProvider.java:59)
at org.nd4j.jita.memory.impl.CudaCachingZeroProvider.malloc(CudaCachingZeroProvider.java:113)
at org.nd4j.jita.memory.impl.CudaFullCachingProvider.malloc(CudaFullCachingProvider.java:91)
at org.nd4j.jita.handler.impl.CudaZeroHandler.alloc(CudaZeroHandler.java:237)
at org.nd4j.jita.handler.impl.CudaZeroHandler.alloc(CudaZeroHandler.java:258)
at org.nd4j.jita.allocator.impl.AtomicAllocator.allocateMemory(AtomicAllocator.java:470)
at org.nd4j.jita.allocator.impl.AtomicAllocator.allocateMemory(AtomicAllocator.java:396)
at org.nd4j.linalg.jcublas.buffer.BaseCudaDataBuffer.<init>(BaseCudaDataBuffer.java:216)
at org.nd4j.linalg.jcublas.buffer.CudaHalfDataBuffer.<init>(CudaHalfDataBuffer.java:60)
at org.nd4j.linalg.jcublas.buffer.factory.CudaDataBufferFactory.createHalf(CudaDataBufferFactory.java:511)
at org.nd4j.linalg.factory.Nd4j.createBuffer(Nd4j.java:1472)
at org.nd4j.linalg.factory.Nd4j.createBuffer(Nd4j.java:1442)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:247)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:284)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:566)
at org.nd4j.linalg.jcublas.JCublasNDArray.<init>(JCublasNDArray.java:252)
at org.nd4j.linalg.jcublas.JCublasNDArrayFactory.create(JCublasNDArrayFactory.java:238)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:5014)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:4965)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:4093)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.init(MultiLayerNetwork.java:598)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.init(MultiLayerNetwork.java:539)
at org.deeplearning4j.parallelism.trainer.DefaultTrainer.run(DefaultTrainer.java:262)
... 3 more
java.lang.RuntimeException: java.lang.RuntimeException: Can't allocate [HOST] memory: 998662; threadId: 39
at org.deeplearning4j.parallelism.trainer.DefaultTrainer.run(DefaultTrainer.java:399)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: Can't allocate [HOST] memory: 998662; threadId: 39
at org.nd4j.jita.memory.impl.CudaDirectProvider.malloc(CudaDirectProvider.java:59)
at org.nd4j.jita.memory.impl.CudaCachingZeroProvider.malloc(CudaCachingZeroProvider.java:113)
at org.nd4j.jita.memory.impl.CudaFullCachingProvider.malloc(CudaFullCachingProvider.java:91)
at org.nd4j.jita.handler.impl.CudaZeroHandler.alloc(CudaZeroHandler.java:237)
at org.nd4j.jita.handler.impl.CudaZeroHandler.alloc(CudaZeroHandler.java:258)
at org.nd4j.jita.allocator.impl.AtomicAllocator.allocateMemory(AtomicAllocator.java:470)
at org.nd4j.jita.allocator.impl.AtomicAllocator.allocateMemory(AtomicAllocator.java:396)
at org.nd4j.linalg.jcublas.buffer.BaseCudaDataBuffer.<init>(BaseCudaDataBuffer.java:216)
at org.nd4j.linalg.jcublas.buffer.CudaHalfDataBuffer.<init>(CudaHalfDataBuffer.java:60)
at org.nd4j.linalg.jcublas.buffer.factory.CudaDataBufferFactory.createHalf(CudaDataBufferFactory.java:511)
at org.nd4j.linalg.factory.Nd4j.createBuffer(Nd4j.java:1472)
at org.nd4j.linalg.factory.Nd4j.createBuffer(Nd4j.java:1442)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:247)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:284)
at org.nd4j.linalg.api.ndarray.BaseNDArray.<init>(BaseNDArray.java:566)
at org.nd4j.linalg.jcublas.JCublasNDArray.<init>(JCublasNDArray.java:252)
at org.nd4j.linalg.jcublas.JCublasNDArrayFactory.create(JCublasNDArrayFactory.java:238)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:5014)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:4965)
at org.nd4j.linalg.factory.Nd4j.create(Nd4j.java:4093)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.init(MultiLayerNetwork.java:598)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.init(MultiLayerNetwork.java:539)
at org.deeplearning4j.parallelism.trainer.DefaultTrainer.run(DefaultTrainer.java:262)
... 3 more

@kfiring
Author

kfiring commented Nov 28, 2017

Unfortunately, without workspaces and ParallelWrapper, training starts, but after 274 iterations a different error appears: "java.lang.OutOfMemoryError: Cannot allocate new FloatPointer(1): totalBytes = 257, physicalBytes = 10G"

The log output is below:

17:27:06.827 [main] INFO org.nd4j.linalg.factory.Nd4jBackend - Loaded [JCublasBackend] backend
17:27:11.306 [main] INFO org.nd4j.nativeblas.NativeOpsHolder - Number of threads used for NativeOps: 32
17:27:11.840 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.847 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.850 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.852 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.854 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.857 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.859 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.861 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.863 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.865 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.867 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.870 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.872 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.875 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.877 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.879 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.881 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.883 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.885 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.887 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.889 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.891 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.893 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.895 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.897 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.899 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.901 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.904 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.906 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.908 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.910 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:11.911 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [0]...
17:27:12.279 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.283 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.285 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.286 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.288 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.289 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.291 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.292 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.294 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.296 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.297 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.299 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.301 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.302 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.304 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.306 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.307 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.309 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.310 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.312 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.314 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.315 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.317 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.318 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.320 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.322 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.323 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.325 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.327 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.329 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.330 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.332 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [1]...
17:27:12.642 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.646 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.648 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.649 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.651 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.652 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.654 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.655 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.657 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.659 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.660 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.662 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.664 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.665 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.667 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.668 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.670 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.671 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.673 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.674 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.676 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.677 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.679 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.681 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.682 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.684 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.685 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.688 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.689 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.691 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.692 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:12.694 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [2]...
17:27:13.027 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.031 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.033 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.034 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.036 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.037 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.039 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.040 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.042 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.043 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.045 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.046 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.048 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.050 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.051 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.053 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.054 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.056 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.059 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.062 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.065 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.068 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.071 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.074 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.077 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.079 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.080 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.082 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.084 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.085 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.087 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.088 [main] DEBUG o.n.j.a.c.impl.BasicContextPool - Creating new stream for thread: [1], device: [3]...
17:27:13.093 [main] DEBUG o.n.j.c.CudaAffinityManager - Mapping thread [1] to device [0], out of [4] devices...
17:27:13.093 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [25] to device [0], out of [4] devices...
17:27:13.094 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [26] to device [0], out of [4] devices...
17:27:13.094 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [27] to device [0], out of [4] devices...
17:27:13.094 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [28] to device [0], out of [4] devices...
17:27:13.094 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [29] to device [0], out of [4] devices...
17:27:13.094 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [30] to device [0], out of [4] devices...
17:27:27.871 [main] DEBUG org.reflections.Reflections - going to scan these urls:
jar:file:/data/lib/nd4j-native-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-linux-x86_64.jar!/
jar:file:/data/lib/nd4j-api-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-ppc64le.jar!/
jar:file:/data/lib/nd4j-jackson-0.9.1.jar!/
jar:file:/data/lib/nd4j-parameter-server-model-0.9.1.jar!/
jar:file:/data/lib/nd4j-base64-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-macosx-x86_64.jar!/
jar:file:/data/lib/nd4j-context-0.9.1.jar!/
jar:file:/data/lib/jackson-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-x86_64.jar!/
jar:file:/data/lib/nd4j-aeron-0.9.1.jar!/
jar:file:/data/lib/nd4j-kryo_2.11-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-windows-x86_64.jar!/
jar:file:/data/lib/nd4j-parameter-server-client-0.9.1.jar!/
jar:file:/data/lib/nd4j-common-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-api-0.9.1.jar!/
jar:file:/data/lib/nd4j-buffer-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-linux-ppc64le.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-windows-x86_64.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-macosx-x86_64.jar!/
jar:file:/data/lib/nd4j-parameter-server-0.9.1.jar!/
17:27:27.987 [main] INFO org.reflections.Reflections - Reflections took 114 ms to scan 23 urls, producing 31 keys and 227 values
17:27:28.106 [main] INFO o.n.l.a.o.e.DefaultOpExecutioner - Backend used: [CUDA]; OS: [Linux]
17:27:28.106 [main] INFO o.n.l.a.o.e.DefaultOpExecutioner - Cores: [16]; Memory: [7.1GB];
17:27:28.106 [main] INFO o.n.l.a.o.e.DefaultOpExecutioner - Blas vendor: [CUBLAS]
17:27:28.108 [main] INFO o.n.l.j.o.e.CudaExecutioner - Device name: [GeForce GTX 1080 Ti]; CC: [6.1]; Total/free memory: [11712987136]
17:27:28.108 [main] INFO o.n.l.j.o.e.CudaExecutioner - Device name: [GeForce GTX 1080 Ti]; CC: [6.1]; Total/free memory: [11715084288]
17:27:28.108 [main] INFO o.n.l.j.o.e.CudaExecutioner - Device name: [GeForce GTX 1080 Ti]; CC: [6.1]; Total/free memory: [11715084288]
17:27:28.108 [main] INFO o.n.l.j.o.e.CudaExecutioner - Device name: [GeForce GTX 1080 Ti]; CC: [6.1]; Total/free memory: [11715084288]
17:27:38.422 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 0
17:27:38.643 [main] DEBUG org.reflections.Reflections - going to scan these urls:
file:/data/lib/scala-java8-compat_2.11-0.3.0.jar
file:/data/lib/nd4j-parameter-server-model-0.9.1.jar
file:/data/lib/commons-cli-1.2.jar
file:/data/lib/scala-stm_2.11-0.7.jar
file:/data/lib/snappy-0.2.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1-macosx-x86_64.jar
file:/data/lib/play-functional_2.11-2.4.6.jar
file:/data/lib/jersey-container-servlet-core-2.22.2.jar
file:/data/lib/mapdb-3.0.5.jar
file:/data/lib/api-util-1.0.0-M20.jar
file:/data/lib/parquet-generator-1.7.0.jar
file:/data/lib/scala-library-2.11.8.jar
file:/data/lib/commons-beanutils-core-1.8.0.jar
file:/data/lib/parquet-column-1.7.0.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-macosx-x86_64.jar
file:/data/lib/jackson-databind-2.6.5.jar
file:/data/lib/commons-codec-1.10.jar
file:/data/lib/nd4j-native-0.9.1-linux-ppc64le.jar
file:/data/lib/aopalliance-1.0.jar
file:/data/lib/derby-10.10.2.0.jar
file:/data/lib/play-netty-utils-2.4.6.jar
file:/data/lib/datanucleus-api-jdo-3.2.6.jar
file:/data/lib/jodd-core-3.5.2.jar
file:/data/lib/avro-ipc-1.7.7-tests.jar
file:/data/lib/openblas-0.2.19-1.3-android-x86.jar
file:/data/lib/htrace-core-3.1.0-incubating.jar
file:/data/lib/slf4j-log4j12-1.7.16.jar
file:/data/lib/leptonica-1.73-1.3-linux-x86_64.jar
file:/data/lib/netty-3.8.0.Final.jar
file:/data/lib/scala-reflect-2.11.7.jar
file:/data/lib/leveldb-api-0.5.jar
file:/data/lib/elsa-3.0.0-M5.jar
file:/data/lib/janino-2.7.8.jar
file:/data/lib/joda-convert-1.7.jar
file:/data/lib/cuda-8.0-6.0-1.3.jar
file:/data/lib/leptonica-1.73-1.3-linux-x86.jar
file:/data/lib/joni-2.1.2.jar
file:/data/lib/RoaringBitmap-0.5.11.jar
file:/data/lib/leptonica-1.73-1.3-android-arm.jar
file:/data/lib/opencv-3.2.0-1.3-linux-x86_64.jar
file:/data/lib/pyrolite-4.9.jar
file:/data/lib/hibernate-validator-5.0.3.Final.jar
file:/data/lib/scala-compiler-2.11.0.jar
file:/data/lib/leptonica-1.73-1.3-linux-ppc64le.jar
file:/data/lib/deeplearning4j-nn-0.9.1.jar
file:/data/lib/guice-assistedinject-4.0.jar
file:/data/lib/findbugs-annotations-1.3.9-1.jar
file:/data/lib/jersey-media-jaxb-2.22.2.jar
file:/data/lib/akka-actor_2.11-2.3.13.jar
file:/data/lib/jtransforms-2.4.0.jar
file:/data/lib/hbase-protocol-1.2.5.jar
file:/data/lib/imageio-bmp-3.1.1.jar
file:/data/lib/jaxb-core-2.2.7.jar
file:/data/lib/c3p0-0.9.5.2.jar
file:/data/lib/commons-collections-3.2.1.jar
file:/data/lib/compress-lzf-1.0.3.jar
file:/data/lib/openblas-0.2.19-1.3-linux-x86.jar
file:/data/lib/logback-core-1.1.3.jar
file:/data/lib/cuda-8.0-6.0-1.3-linux-x86_64.jar
file:/data/lib/javax.annotation-api-1.2.jar
file:/data/lib/httpcore-nio-4.4.4.jar
file:/data/lib/zookeeper-3.4.5.jar
file:/data/lib/bonecp-0.8.0.RELEASE.jar
file:/data/lib/ffmpeg-3.2.1-1.3.jar
file:/data/lib/aopalliance-repackaged-2.4.0-b34.jar
file:/data/lib/datavec-spark_2.11-0.9.1_spark_2.jar
file:/data/lib/bson-3.5.0.jar
file:/data/lib/ivy-2.4.0.jar
file:/data/lib/calcite-core-1.2.0-incubating.jar
file:/data/lib/opencv-3.2.0-1.3-windows-x86.jar
file:/data/lib/breeze_2.11-0.11.2.jar
file:/data/lib/antlr-2.7.7.jar
file:/data/lib/commons-configuration-1.6.jar
file:/data/lib/hk2-locator-2.4.0-b34.jar
file:/data/lib/imageio-psd-3.1.1.jar
file:/data/lib/leveldb-0.5.jar
file:/data/lib/JavaEWAH-0.3.2.jar
file:/data/lib/openblas-0.2.19-1.3-linux-x86_64.jar
file:/data/lib/opencv-platform-3.2.0-1.3.jar
file:/data/lib/opencv-3.2.0-1.3-linux-x86.jar
file:/data/lib/kryo-4.0.0.jar
file:/data/lib/classmate-1.0.0.jar
file:/data/lib/opencsv-2.3.jar
file:/data/lib/spring-core-4.1.6.RELEASE.jar
file:/data/lib/deeplearning4j-ui-components-0.9.1.jar
file:/data/lib/commons-digester-1.8.jar
file:/data/lib/parquet-hadoop-bundle-1.6.0.jar
file:/data/lib/jsr305-1.3.9.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1-windows-x86_64.jar
file:/data/lib/jackson-datatype-jsr310-2.4.4.jar
file:/data/lib/json-20090211.jar
file:/data/lib/stream-2.7.0.jar
file:/data/lib/deeplearning4j-core-0.9.1.jar
file:/data/lib/commons-lang-2.6.jar
file:/data/lib/artoolkitplus-2.3.1-1.3.jar
file:/data/lib/unused-1.0.0.jar
file:/data/lib/hk2-utils-2.4.0-b34.jar
file:/data/lib/deeplearning4j-modelimport-0.9.1.jar
file:/data/lib/hive-exec-1.2.1.spark2.jar
file:/data/lib/objenesis-2.2.jar
file:/data/lib/chill-java-0.8.0.jar
file:/data/lib/play-iteratees_2.11-2.4.6.jar
file:/data/lib/hbase-client-1.2.5.jar
file:/data/lib/nd4j-native-0.9.1-android-x86.jar
file:/data/lib/json4s-jackson_2.11-3.2.11.jar
file:/data/lib/lz4-1.3.0.jar
file:/data/lib/commons-httpclient-3.1.jar
file:/data/lib/univocity-parsers-2.1.1.jar
file:/data/lib/commons-collections-3.2.2.jar
file:/data/lib/leptonica-1.73-1.3-windows-x86_64.jar
file:/data/lib/parquet-format-2.3.0-incubating.jar
file:/data/lib/play-netty-server_2.11-2.4.6.jar
file:/data/lib/hbase-annotations-1.2.5.jar
file:/data/lib/akka-remote_2.11-2.3.13.jar
file:/data/lib/kotlin-runtime-1.0.7.jar
file:/data/lib/openblas-0.2.19-1.3-android-arm.jar
file:/data/lib/nd4j-native-api-0.9.1.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-linux-x86_64.jar
file:/data/lib/asm-5.0.4.jar
file:/data/lib/javacv-1.3.3.jar
file:/data/lib/nd4j-kryo_2.11-0.9.1.jar
file:/data/lib/datavec-api-0.9.1.jar
file:/data/lib/jai-imageio-core-1.3.0.jar
file:/data/lib/unirest-java-1.4.9.jar
file:/data/lib/kryo-shaded-3.0.3.jar
file:/data/lib/play-server_2.11-2.4.6.jar
file:/data/lib/metrics-json-3.1.2.jar
file:/data/lib/jcip-annotations-1.0.jar
file:/data/lib/leptonica-1.73-1.3-windows-x86.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1.jar
file:/data/lib/javax.servlet-api-3.1.0.jar
file:/data/lib/scalap-2.11.0.jar
file:/data/lib/play_2.11-2.4.6.jar
file:/data/lib/netty-http-pipelining-1.1.4.jar
file:/data/lib/nd4j-native-0.9.1.jar
file:/data/lib/javax.inject-2.4.0-b34.jar
file:/data/lib/jackson-datatype-jdk8-2.4.4.jar
file:/data/lib/opencv-3.2.0-1.3-macosx-x86_64.jar
file:/data/lib/javax.ws.rs-api-2.0.1.jar
file:/data/lib/spire_2.11-0.7.4.jar
file:/data/lib/guice-4.0.jar
file:/data/lib/config-1.3.0.jar
file:/data/lib/antlr4-runtime-4.5.3.jar
file:/data/lib/jcl-over-slf4j-1.7.16.jar
file:/data/lib/kryo-serializers-0.41.jar
file:/data/lib/libfb303-0.9.2.jar
file:/data/lib/libdc1394-2.2.4-1.3.jar
file:/data/lib/opencv-3.2.0-1.3-android-arm.jar
file:/data/lib/jul-to-slf4j-1.7.16.jar
file:/data/lib/scala-xml_2.11-1.0.2.jar
file:/data/lib/metrics-graphite-3.1.2.jar
file:/data/lib/stax-api-1.0.1.jar
file:/data/lib/imageio-tiff-3.1.1.jar
file:/data/lib/hamcrest-core-1.3.jar
file:/data/lib/common-lang-3.1.1.jar
file:/data/lib/validation-api-1.1.0.Final.jar
file:/data/lib/junit-4.12.jar
file:/data/lib/pmml-model-1.2.15.jar
file:/data/lib/leptonica-1.73-1.3-macosx-x86_64.jar
file:/data/lib/httpcore-4.4.4.jar
file:/data/models/
file:/data/lib/akka-slf4j_2.11-2.3.13.jar
file:/data/lib/openblas-0.2.19-1.3-linux-ppc64le.jar
file:/data/lib/api-asn1-api-1.0.0-M20.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-linux-ppc64le.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-windows-x86.jar
file:/data/lib/datanucleus-core-3.2.10.jar
file:/data/lib/guice-3.0.jar
file:/data/lib/openblas-0.2.19-1.3.jar
file:/data/lib/cuda-8.0-6.0-1.3-macosx-x86_64.jar
file:/data/lib/eclipse-collections-7.1.1.jar
file:/data/lib/neoitertools-1.0.0.jar
file:/data/lib/jaxb-impl-2.2.7.jar
file:/data/lib/logback-classic-1.1.3.jar
file:/data/lib/jackson-0.9.1.jar
file:/data/lib/pmml-schema-1.2.15.jar
file:/data/lib/datavec-hadoop-0.9.1.jar
file:/data/lib/deeplearning4j-ui-model-0.9.1.jar
file:/data/lib/aeron-all-1.0.4.jar
file:/data/lib/nd4j-native-0.9.1-linux-x86_64.jar
file:/data/lib/pmml-agent-1.1.15.jar
file:/data/lib/imageio-core-3.1.1.jar
file:/data/lib/reflectasm-1.11.3.jar
file:/data/lib/minlog-1.3.0.jar
file:/data/lib/jackson-module-paranamer-2.6.5.jar
file:/data/lib/junit-4.8.2.jar
file:/data/lib/nd4j-native-0.9.1-macosx-x86_64.jar
file:/data/lib/tomcat-servlet-api-8.0.21.jar
file:/data/lib/jackson-module-scala_2.11-2.6.5.jar
file:/data/lib/jackson-core-2.6.5.jar
file:/data/lib/javolution-5.5.1.jar
file:/data/lib/hk2-api-2.4.0-b34.jar
file:/data/lib/kotlin-stdlib-1.0.7.jar
file:/data/lib/jackson-core-asl-1.9.13.jar
file:/data/lib/mesos-0.21.1-shaded-protobuf.jar
file:/data/lib/twirl-api_2.11-1.1.1.jar
file:/data/lib/deeplearning4j-parallel-wrapper_2.11-0.9.1.jar
file:/data/lib/imageio-metadata-3.1.1.jar
file:/data/lib/play-java_2.11-2.4.6.jar
file:/data/lib/uncommons-maths-1.2.2a.jar
file:/data/lib/jetty-util-6.1.26.jar
file:/data/lib/xercesImpl-2.11.0.jar
file:/data/lib/httpmime-4.5.2.jar
file:/data/lib/sqlite-jdbc-3.15.1.jar
file:/data/lib/jdo-api-3.0.1.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-windows-x86_64.jar
file:/data/lib/xz-1.5.jar
file:/data/lib/play-datacommons_2.11-2.4.6.jar
file:/data/lib/avro-mapred-1.7.7-hadoop2.jar
file:/data/lib/commons-logging-1.1.3.jar
file:/data/lib/commons-io-2.4.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3-linux-x86.jar
file:/data/lib/openblas-0.2.19-1.3-windows-x86.jar
file:/data/lib/jets3t-0.7.1.jar
file:/data/lib/Agrona-0.5.4.jar
file:/data/lib/commons-net-2.2.jar
file:/data/lib/nd4j-buffer-0.9.1.jar
file:/data/lib/opencv-3.2.0-1.3.jar
file:/data/lib/nd4j-parameter-server-client-0.9.1.jar
file:/data/lib/parquet-jackson-1.7.0.jar
file:/data/lib/akka-contrib_2.11-2.3.13.jar
file:/data/lib/pmml-schema-1.1.15.jar
file:/data/lib/opencv-3.2.0-1.3-windows-x86_64.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-ppc64le.jar
file:/data/lib/nd4j-jackson-0.9.1.jar
file:/data/lib/oro-2.0.8.jar
file:/data/lib/build-link-2.4.6.jar
file:/data/lib/jersey-client-2.22.2.jar
file:/data/lib/commons-dbcp-1.4.jar
file:/data/lib/protobuf-java-2.5.0.jar
file:/data/lib/curator-framework-2.4.0.jar
file:/data/lib/slf4j-api-1.7.25.jar
file:/data/lib/openblas-0.2.19-1.3-windows-x86_64.jar
file:/data/lib/json4s-core_2.11-3.2.11.jar
file:/data/lib/hive-metastore-1.2.1.spark2.jar
file:/data/lib/typetools-0.4.3.jar
file:/data/lib/common-io-3.1.1.jar
file:/data/lib/akka-persistence-experimental_2.11-2.3.13.jar
file:/data/lib/parquet-common-1.7.0.jar
file:/data/lib/jaxb-api-2.2.7.jar
file:/data/lib/stringtemplate-3.2.1.jar
file:/data/lib/leptonica-1.73-1.3.jar
file:/data/lib/commons-pool-1.5.4.jar
file:/data/lib/nearestneighbor-core-0.9.1.jar
file:/data/lib/libfreenect2-0.2.0-1.3.jar
file:/data/lib/curator-client-2.4.0.jar
file:/data/lib/librealsense-1.9.6-1.3.jar
file:/data/lib/javassist-3.19.0-GA.jar
file:/data/lib/openblas-platform-0.2.19-1.3.jar
file:/data/lib/chill_2.11-0.8.0.jar
file:/data/lib/netty-all-4.0.29.Final.jar
file:/data/lib/curator-recipes-2.4.0.jar
file:/data/lib/gson-2.8.1.jar
file:/data/lib/apache-log4j-extras-1.2.17.jar
file:/data/lib/cuda-8.0-6.0-1.3-windows-x86_64.jar
file:/data/lib/calcite-avatica-1.2.0-incubating.jar
file:/data/lib/jcodings-1.0.8.jar
file:/data/lib/metrics-core-3.1.2.jar
file:/data/lib/flandmark-1.07-1.3.jar
file:/data/lib/scala-parser-combinators_2.11-1.0.1.jar
file:/data/lib/spring-beans-4.1.6.RELEASE.jar
file:/data/lib/parquet-encoding-1.7.0.jar
file:/data/lib/leptonica-platform-1.73-1.3.jar
file:/data/lib/opencv-3.2.0-1.3-android-x86.jar
file:/data/lib/datanucleus-rdbms-3.2.9.jar
file:/data/lib/freemarker-2.3.23.jar
file:/data/lib/jboss-logging-3.2.1.Final.jar
file:/data/lib/avro-1.7.7.jar
file:/data/lib/jackson-annotations-2.6.5.jar
file:/data/lib/httpasyncclient-4.1.1.jar
file:/data/lib/videoinput-0.200-1.3.jar
file:/data/lib/guava-18.0.jar
file:/data/lib/metrics-jvm-3.1.2.jar
file:/data/models/al-rec-models-itemseq-1.0.jar
file:/data/lib/cuda-8.0-6.0-1.3-linux-ppc64le.jar
file:/data/lib/ST4-4.0.4.jar
file:/data/lib/jersey-container-servlet-2.22.2.jar
file:/data/lib/jersey-server-2.22.2.jar
file:/data/lib/jersey-common-2.22.2.jar
file:/data/lib/leptonica-1.73-1.3-linux-armhf.jar
file:/data/lib/apacheds-i18n-2.0.0-M15.jar
file:/data/lib/leptonica-1.73-1.3-android-x86.jar
file:/data/lib/commons-math-2.1.jar
file:/data/lib/eigenbase-properties-1.1.5.jar
file:/data/lib/commons-beanutils-1.7.0.jar
file:/data/lib/slf4j-log4j12-1.7.25.jar
file:/data/lib/snakeyaml-1.12.jar
file:/data/lib/snappy-java-1.1.2.6.jar
file:/data/lib/flycapture-2.9.3.43-1.3.jar
file:/data/lib/objenesis-2.1.jar
file:/data/lib/cuda-platform-8.0-6.0-1.3.jar
file:/data/lib/datavec-data-image-0.9.1.jar
file:/data/lib/nd4j-native-0.9.1-windows-x86_64.jar
file:/data/lib/nd4j-native-platform-0.9.1.jar
file:/data/lib/nd4j-base64-0.9.1.jar
file:/data/lib/nd4j-api-0.9.1.jar
file:/data/lib/calcite-linq4j-1.2.0-incubating.jar
file:/data/lib/avro-ipc-1.7.7.jar
file:/data/lib/nd4j-aeron-0.9.1.jar
file:/data/lib/libfreenect-0.5.3-1.3.jar
file:/data/lib/mysql-connector-java-6.0.6.jar
file:/data/lib/nd4j-native-0.9.1-android-arm.jar
file:/data/lib/core-1.1.2.jar
file:/data/lib/metrics-core-2.2.0.jar
file:/data/lib/openblas-0.2.19-1.3-macosx-x86_64.jar
file:/data/lib/xmlenc-0.52.jar
file:/data/lib/paranamer-2.3.jar
file:/data/lib/play-exceptions-2.4.6.jar
file:/data/lib/joda-time-2.9.3.jar
file:/data/lib/common-image-3.1.1.jar
file:/data/lib/apacheds-kerberos-codec-2.0.0-M15.jar
file:/data/lib/opencv-3.2.0-1.3-linux-ppc64le.jar
file:/data/lib/eclipse-collections-forkjoin-7.1.1.jar
file:/data/lib/hdf5-1.10.0-patch1-1.3.jar
file:/data/lib/akka-cluster_2.11-2.3.13.jar
file:/data/lib/nd4j-cuda-8.0-platform-0.9.1.jar
file:/data/lib/javacpp-1.3.3.jar
file:/data/lib/jta-1.1.jar
file:/data/lib/mongodb-driver-3.5.0.jar
file:/data/lib/hdf5-platform-1.10.0-patch1-1.3.jar
file:/data/lib/deeplearning4j-play_2.11-0.9.1.jar
file:/data/lib/commons-compress-1.8.jar
file:/data/lib/scalatest_2.11-2.2.6.jar
file:/data/lib/commons-compiler-2.7.6.jar
file:/data/lib/xbean-asm5-shaded-4.4.jar
file:/data/lib/hbase-common-1.2.5.jar
file:/data/lib/pmml-model-1.1.15.jar
file:/data/lib/reflections-0.9.10.jar
file:/data/lib/jcommander-1.27.jar
file:/data/lib/libthrift-0.9.2.jar
file:/data/lib/nd4j-parameter-server-0.9.1.jar
file:/data/lib/xml-apis-1.4.01.jar
file:/data/lib/commons-math3-3.4.1.jar
file:/data/lib/jersey-guava-2.22.2.jar
file:/data/lib/slf4j-api-1.7.16.jar
file:/data/lib/json4s-ast_2.11-3.2.11.jar
file:/data/lib/mchange-commons-java-0.2.11.jar
file:/data/lib/opencv-3.2.0-1.3-linux-armhf.jar
file:/data/lib/arpack_combined_all-0.1.jar
file:/data/lib/breeze-macros_2.11-0.11.2.jar
file:/data/lib/lombok-1.16.16.jar
file:/data/lib/c3p0-0.9.1.2.jar
file:/data/lib/leveldbjni-all-1.8.jar
file:/data/lib/imageio-jpeg-3.1.1.jar
file:/data/lib/antlr-runtime-3.4.jar
file:/data/lib/javassist-3.18.1-GA.jar
file:/data/lib/log4j-1.2.17.jar
file:/data/lib/stax2-api-3.1.4.jar
file:/data/lib/osgi-resource-locator-1.0.1.jar
file:/data/lib/py4j-0.10.3.jar
file:/data/lib/mongodb-driver-core-3.5.0.jar
file:/data/lib/nd4j-context-0.9.1.jar
file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-x86_64.jar
file:/data/lib/httpclient-4.5.2.jar
file:/data/lib/play-json_2.11-2.4.6.jar
file:/data/lib/javax.inject-1.jar
file:/data/lib/spring-context-4.1.6.RELEASE.jar
file:/data/lib/spire-macros_2.11-0.7.4.jar
file:/data/lib/eclipse-collections-api-7.1.1.jar
file:/data/lib/openblas-0.2.19-1.3-linux-armhf.jar
file:/data/lib/al-rec-common-1.0.jar
file:/data/lib/nd4j-common-0.9.1.jar
file:/data/lib/parquet-hadoop-1.7.0.jar
file:/data/lib/dl4j-spark_2.11-0.9.1_spark_2.jar
file:/data/lib/commons-lang3-3.3.2.jar
file:/data/lib/annotations-2.0.1.jar
file:/data/lib/jackson-mapper-asl-1.9.13.jar
file:/data/lib/guava-20.0.jar
17:27:43.475 [main] INFO org.reflections.Reflections - Reflections took 4830 ms to scan 368 urls, producing 4712 keys and 36863 values
17:27:43.547 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.modelimport.keras.preprocessors.TensorFlowCnnToFeedForwardPreProcessor as subtype of org.deeplearning4j.nn.conf.InputPreProcessor
17:27:43.547 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.layers.CenterLossOutputLayer as subtype of org.deeplearning4j.nn.conf.layers.Layer
17:27:43.547 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.ReshapeVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
17:27:43.547 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.ShiftVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
17:27:43.547 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.PoolHelperVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
17:27:43.557 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.modelimport.keras.preprocessors.TensorFlowCnnToFeedForwardPreProcessor as subtype of org.deeplearning4j.nn.conf.InputPreProcessor
17:27:43.557 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.layers.CenterLossOutputLayer as subtype of org.deeplearning4j.nn.conf.layers.Layer
17:27:43.557 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.ReshapeVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
17:27:43.557 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.ShiftVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
17:27:43.557 [main] DEBUG o.d.nn.conf.NeuralNetConfiguration - Registering class for JSON serialization: org.deeplearning4j.nn.conf.graph.PoolHelperVertex as subtype of org.deeplearning4j.nn.conf.graph.GraphVertex
17:27:43.595 [main] INFO o.d.nn.multilayer.MultiLayerNetwork - Starting MultiLayerNetwork with WorkspaceModes set to [training: NONE; inference: SEPARATE]
17:27:53.341 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 1
17:27:53.358 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 4
17:27:53.395 [main] DEBUG org.reflections.Reflections - going to scan these urls:
jar:file:/data/lib/nd4j-native-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-linux-x86_64.jar!/
jar:file:/data/lib/nd4j-api-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-ppc64le.jar!/
jar:file:/data/lib/nd4j-jackson-0.9.1.jar!/
jar:file:/data/lib/nd4j-parameter-server-model-0.9.1.jar!/
jar:file:/data/lib/nd4j-base64-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-macosx-x86_64.jar!/
jar:file:/data/lib/nd4j-context-0.9.1.jar!/
jar:file:/data/lib/jackson-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-linux-x86_64.jar!/
jar:file:/data/lib/nd4j-aeron-0.9.1.jar!/
jar:file:/data/lib/nd4j-kryo_2.11-0.9.1.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-windows-x86_64.jar!/
jar:file:/data/lib/nd4j-parameter-server-client-0.9.1.jar!/
jar:file:/data/lib/nd4j-common-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-api-0.9.1.jar!/
jar:file:/data/lib/nd4j-buffer-0.9.1.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-linux-ppc64le.jar!/
jar:file:/data/lib/nd4j-native-0.9.1-windows-x86_64.jar!/
jar:file:/data/lib/nd4j-cuda-8.0-0.9.1-macosx-x86_64.jar!/
jar:file:/data/lib/nd4j-parameter-server-0.9.1.jar!/
17:27:53.561 [main] INFO org.reflections.Reflections - Reflections took 165 ms to scan 23 urls, producing 420 keys and 1665 values
17:27:53.571 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 2
17:27:53.576 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 3
17:27:53.589 [main] DEBUG o.n.j.handler.impl.CudaZeroHandler - Creating bucketID: 5
17:27:53.634 [main] INFO c.a.r.m.itemseqrnn.TrainByLocalFile - network has 499331 parameters
17:27:53.635 [main] INFO c.a.r.m.itemseqrnn.TrainByLocalFile - Starting training
17:27:53.635 [main] INFO c.a.r.m.itemseqrnn.TrainByLocalFile - epoch 0 start
17:27:53.643 [main] DEBUG o.n.j.c.CudaAffinityManager - Manually mapping thread [32] to device [0], out of [4] devices...
17:28:08.063 [main] INFO org.nd4j.nativeblas.Nd4jBlas - Number of threads used for BLAS: 0
17:28:32.758 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 0 is 68.50628185024709
17:28:46.677 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 1 is 41.44992539746616
17:28:51.079 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 2 is 20.16864830277142
17:28:56.171 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 3 is 9.054840155623788
17:29:05.606 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 4 is 4.131898638355524
17:29:10.106 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 5 is 2.7673222913104416
17:29:10.170 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 6 is 1.8611667143960973
17:29:13.811 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 7 is 0.7803778472561304
17:29:19.381 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 8 is 0.23514553787770315
17:29:30.514 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 9 is 67.90872073048077
17:29:34.880 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 10 is 38.27602818538221
17:29:39.133 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 11 is 16.078983572341993
17:29:39.196 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 12 is 8.911213531120797
17:29:43.493 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 13 is 4.707477350164911
17:29:49.055 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 14 is 2.2014184016296396
17:29:53.241 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 15 is 1.2998445337504898
17:29:53.304 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 16 is 0.6895551542123411
17:29:58.173 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 17 is 0.09355633069841028
17:30:07.906 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 18 is 66.59423953364455
17:30:07.967 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 19 is 32.34601283155097
17:30:12.293 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 20 is 14.09388473726949
17:30:17.845 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 21 is 7.879941168437947
17:30:17.908 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 22 is 3.2853097723008444
17:30:23.114 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 23 is 1.187079064031737
17:30:27.520 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 24 is 0.47209667058206223
17:30:31.809 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 25 is 0.2957522332652825
17:30:31.873 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 26 is 0.3070101520079327
17:30:43.019 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 27 is 0.1877214953742392
17:30:47.136 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 28 is 64.06188486756156
17:30:51.409 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 29 is 27.267624324021252
17:30:55.913 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 30 is 11.777486839757877
17:30:55.969 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 31 is 6.0429934165217425
17:31:01.483 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 32 is 4.367176709365673
17:31:05.731 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 33 is 2.412711342631978
17:31:05.794 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 34 is 1.5823521527459679
17:31:11.267 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 35 is 1.1647312923242414
17:31:15.852 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 36 is 0.44081940249125506
17:31:26.452 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 37 is 60.70530592102341
17:31:26.509 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 38 is 27.2741093763446
17:31:31.282 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 39 is 13.835963900748968
17:31:31.347 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 40 is 6.646220603248152
17:31:36.785 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 41 is 5.061646856407678
17:31:41.337 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 42 is 2.589693536997478
17:31:41.389 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 43 is 1.803078748882257
17:31:46.907 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 44 is 0.6754937954022344
17:31:46.962 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 45 is 0.5154287217372642
17:31:56.091 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 46 is 0.37708782374069855
17:32:01.651 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 47 is 57.45421465891764
17:32:06.176 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 48 is 26.72406978727359
17:32:06.233 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 49 is 14.822540145459284
17:32:11.679 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 50 is 8.728569081534198
17:32:16.846 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 51 is 4.348091833367242
17:32:16.908 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 52 is 2.4405388377425252
17:32:22.332 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 53 is 2.1041525092861235
17:32:26.584 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 54 is 1.7732325035447534
17:32:26.648 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 55 is 0.3734297798379157
17:32:31.848 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 56 is 0.05563720040246581
17:32:42.444 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 57 is 56.09157073359899
17:32:42.499 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 58 is 26.796547429965443
17:32:47.218 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 59 is 14.703819414697355
17:32:51.584 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 60 is 7.0850236711792
17:32:51.639 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 61 is 4.75066192681145
17:32:56.046 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 62 is 3.770006731260208
17:32:56.101 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 63 is 2.289542561698051
17:33:00.277 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 64 is 1.311228995525623
17:33:05.094 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 65 is 1.0414858940150256
17:33:13.945 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 66 is 0.2663626875673848
17:33:19.664 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 67 is 52.633383240535366
17:33:24.985 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 68 is 23.579858060863412
17:33:25.040 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 69 is 11.030025291888615
17:33:30.412 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 70 is 6.1239069767956025
17:33:30.470 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 71 is 4.190042406874197
17:33:35.375 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 72 is 2.2150585791119273
17:33:35.430 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 73 is 1.4196516407450062
17:33:40.260 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 74 is 0.7061874089003358
17:33:44.976 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 75 is 0.6440431050443546
17:33:50.637 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 76 is 50.28315733542052
17:33:54.989 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 77 is 18.868277287663496
17:33:55.049 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 78 is 8.750883218536124
17:33:59.232 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 79 is 4.9716494693550946
17:34:04.539 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 80 is 2.6288640854541176
17:34:04.595 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 81 is 1.237159225099644
17:34:10.190 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 82 is 1.256241758135954
17:34:10.246 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 83 is 1.0232032349165108
17:34:14.271 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 84 is 0.5500145155757497
17:34:14.287 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 85 is 0.13254104682250692
17:34:19.781 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 86 is 52.444310638768755
17:34:19.834 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 87 is 26.01138672808942
17:34:24.720 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 88 is 13.58646757826635
17:34:24.777 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 89 is 5.928475116059745
17:34:29.115 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 90 is 3.7113921699982195
17:34:34.686 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 91 is 2.289782884846792
17:34:34.739 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 92 is 1.7705524736921137
17:34:39.625 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 93 is 1.365388252454236
17:34:49.464 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 94 is 0.628883799631014
17:34:49.547 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 95 is 50.349052531762155
17:34:53.619 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 96 is 23.112171046342333
17:34:59.248 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 97 is 10.512133385266257
17:34:59.310 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 98 is 5.709459358636361
17:35:03.556 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 99 is 3.262651387462084
17:35:03.610 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 100 is 2.1386913382949033
17:35:09.197 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 101 is 1.3096797714858868
17:35:13.455 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 102 is 0.9257466975207096
17:35:13.516 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 103 is 0.5134879862429695
17:35:18.959 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 104 is 0.21149269621101324
17:35:23.800 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 105 is 51.633633369718694
17:35:23.865 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 106 is 26.05338503389611
17:35:28.125 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 107 is 8.634967943812846
17:35:28.184 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 108 is 3.215622577470285
17:35:32.560 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 109 is 1.9855975212421697
17:35:36.268 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 110 is 1.4285953321396179
17:35:36.327 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 111 is 1.061127928683176
17:35:41.726 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 112 is 0.5717641363216834
17:35:46.037 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 113 is 0.36682027384036936
17:35:46.105 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 114 is 50.08998497791651
17:35:51.760 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 115 is 24.291879353723115
17:35:56.650 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 116 is 8.061309228780006
17:35:56.712 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 117 is 3.528499664437425
17:36:01.374 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 118 is 1.5967972297134354
17:36:01.433 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 119 is 0.8218345266897668
17:36:06.920 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 120 is 0.7119734920175178
17:36:11.177 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 121 is 0.45448913369323324
17:36:11.189 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 122 is 0.058102670184048424
17:36:20.131 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 123 is 49.57057253964549
17:36:20.190 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 124 is 22.59633975622943
17:36:24.732 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 125 is 12.709103005082412
17:36:24.795 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 126 is 7.288500531785708
17:36:28.524 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 127 is 3.8220001636120737
17:36:33.165 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 128 is 1.8594024133296148
17:36:33.220 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 129 is 1.08474348513834
17:36:38.730 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 130 is 0.6306280639691869
17:36:38.779 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 131 is 0.38399375254795065
17:36:48.548 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 132 is 49.18727868245063
17:36:48.608 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 133 is 29.66926883084545
17:36:53.590 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 134 is 17.32759978575792
17:36:53.644 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 135 is 11.187482079271053
17:36:58.078 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 136 is 7.196496788369121
17:37:03.600 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 137 is 4.4967535775159675
17:37:03.652 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 138 is 3.5780156184644407
17:37:09.162 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 139 is 1.7600584141711866
17:37:09.215 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 140 is 1.1720804501880584
17:37:13.422 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 141 is 0.6248916426803529
17:37:17.685 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 142 is 49.87109593814125
17:37:21.260 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 143 is 25.905399859996265
17:37:21.312 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 144 is 11.79426355717269
17:37:25.496 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 145 is 7.686523376288935
17:37:25.549 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 146 is 4.559129437037942
17:37:29.734 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 147 is 3.3207141338264488
17:37:35.231 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 148 is 2.1400185904956612
17:37:35.285 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 149 is 0.3550522814978635
17:37:40.828 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 150 is 0.1407264585048929
17:37:45.255 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 151 is 0.08684587037302549
17:37:50.786 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 152 is 47.7339880470199
17:37:50.851 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 153 is 25.78214173294715
17:37:55.215 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 154 is 13.409899398523633
17:37:59.464 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 155 is 8.17404856531412
17:37:59.529 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 156 is 5.043779578590293
17:38:03.789 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 157 is 3.221656240166612
17:38:03.849 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 158 is 2.2204950837088906
17:38:08.202 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 159 is 2.053248272811696
17:38:11.675 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 160 is 1.137453510144217
17:38:11.716 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 161 is 0.41937313997205405
17:38:15.920 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 162 is 48.71681160068897
17:38:20.012 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 163 is 26.41310129738507
17:38:20.065 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 164 is 12.36001752711247
17:38:24.414 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 165 is 5.981650087782556
17:38:24.466 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 166 is 2.9035884918882413
17:38:28.750 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 167 is 0.929532845030352
17:38:33.591 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 168 is 0.25540985662153354
17:38:41.918 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 169 is 49.505175601410286
17:38:46.698 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 170 is 25.440101131057514
17:38:46.756 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 171 is 13.464874472781869
17:38:51.293 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 172 is 7.581954049272854
17:38:51.348 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 173 is 4.683895225701594
17:38:55.811 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 174 is 2.7469588734868298
17:38:55.866 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 175 is 1.6657924918560958
17:39:00.721 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 176 is 1.6983258835653785
17:39:05.596 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 177 is 0.5620075404984984
17:39:05.627 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 178 is 0.1616067821632889
17:39:11.018 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 179 is 48.18017929478339
17:39:11.080 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 180 is 23.001272388806655
17:39:16.370 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 181 is 10.70098022127729
17:39:20.678 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 182 is 6.52865815827827
17:39:20.732 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 183 is 4.589806968113061
17:39:26.179 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 184 is 1.7653766444837715
17:39:26.235 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 185 is 0.9998550933142423
17:39:31.059 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 186 is 0.38570851738276246
17:39:31.094 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 187 is 0.09202612110757377
17:39:36.180 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 188 is 47.37815696511363
17:39:41.739 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 189 is 26.61349215825485
17:39:46.153 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 190 is 13.507943216235116
17:39:46.209 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 191 is 8.22523417946924
17:39:49.907 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 192 is 3.6947469828552126
17:39:49.964 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 193 is 0.904356257571332
17:39:59.294 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 194 is 48.95590389268267
17:39:59.356 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 195 is 28.24039995291947
17:40:04.565 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 196 is 18.878200205744847
17:40:04.627 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 197 is 13.497492441746079
17:40:09.324 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 198 is 8.985873063299529
17:40:14.852 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 199 is 5.497495029482541
17:40:14.918 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 200 is 3.302528268051878
17:40:19.251 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 201 is 0.7788409506066093
17:40:24.544 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 202 is 0.5269800343343976
17:40:29.417 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 203 is 47.75686617280183
17:40:29.478 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 204 is 23.032319695818988
17:40:34.964 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 205 is 10.193065297966095
17:40:39.559 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 206 is 5.408254463709212
17:40:39.613 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 207 is 4.102891343633533
17:40:44.146 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 208 is 2.848740934334025
17:40:44.204 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 209 is 1.1003004743258962
17:40:47.756 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 210 is 0.6089954448078667
17:40:47.797 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 211 is 0.26955215772786195
17:40:56.561 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 212 is 45.72274226292803
17:40:56.618 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 213 is 26.6498674101358
17:41:00.971 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 214 is 13.044339988488781
17:41:06.460 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 215 is 7.162028588894649
17:41:06.516 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 216 is 2.7640204947965454
17:41:10.381 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 217 is 0.8611861917436157
17:41:10.438 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 218 is 0.3424075539009794
17:41:14.772 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 219 is 0.25488877239452135
17:41:19.660 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 220 is 45.68110846426218
17:41:24.076 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 221 is 27.644050493360705
17:41:24.131 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 222 is 13.266831945909662
17:41:29.530 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 223 is 7.61904085490927
17:41:33.870 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 224 is 3.7403694333497124
17:41:33.926 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 225 is 0.9470037216478501
17:41:38.887 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 226 is 0.9592077423014206
17:41:38.942 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 227 is 0.7554587504972311
17:41:43.578 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 228 is 45.74332684447889
17:41:48.997 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 229 is 22.7923177355657
17:41:54.402 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 230 is 12.019281335957125
17:41:54.462 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 231 is 5.807190208389684
17:41:58.305 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 232 is 2.0850525003860905
17:42:03.709 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 233 is 1.1206574521807564
17:42:03.771 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 234 is 0.37875360657690427
17:42:08.168 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 235 is 0.13063559148132675
17:42:08.201 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 236 is 0.08933613026208378
17:42:12.506 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 237 is 47.27323088940705
17:42:12.559 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 238 is 26.597784996709855
17:42:16.774 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 239 is 14.63975331576883
17:42:21.126 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 240 is 9.54073068321058
17:42:21.180 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 241 is 5.727606723357777
17:42:25.980 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 242 is 3.6532623589814177
17:42:26.039 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 243 is 3.493070571741675
17:42:30.396 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 244 is 1.9347886691354936
17:42:35.668 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 245 is 0.3213941716124431
17:42:49.884 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 246 is 46.45438620901293
17:42:49.939 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 247 is 29.24205129047561
17:42:54.125 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 248 is 14.743895032037187
17:42:54.178 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 249 is 9.807799518631743
17:42:58.453 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 250 is 4.675074254514863
17:42:58.506 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 251 is 3.171240787380837
17:43:02.113 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 252 is 2.0296526920900133
17:43:06.596 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 253 is 1.5737501568914747
17:43:06.650 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 254 is 0.9525874982316858
17:43:12.163 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 255 is 0.2717783512145162
17:43:12.222 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 256 is 45.400851973048255
17:43:16.550 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 257 is 24.855725680860097
17:43:22.060 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 258 is 11.515900195715458
17:43:22.119 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 259 is 7.549246160536983
17:43:26.583 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 260 is 4.030213266520001
17:43:26.637 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 261 is 2.6034191988231687
17:43:32.096 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 262 is 1.8666779526129642
17:43:32.157 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 263 is 1.3914047983885862
17:43:36.557 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 264 is 0.5783041370175535
17:43:41.564 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 265 is 45.53311966655962
17:43:41.628 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 266 is 27.343618012841397
17:43:47.196 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 267 is 14.139261642043145
17:43:51.402 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 268 is 8.418720894852235
17:43:51.469 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 269 is 3.523150990118793
17:43:55.671 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 270 is 2.257294641414
17:43:55.730 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 271 is 1.5565605275006467
17:44:00.254 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 272 is 1.4781217100708455
17:44:05.789 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 273 is 0.40601370526408437
17:44:05.845 [main] INFO o.d.o.l.ScoreIterationListener - Score at iteration 274 is 0.21918442030622629
Exception in thread "main" java.lang.OutOfMemoryError: Cannot allocate new FloatPointer(1): totalBytes = 257, physicalBytes = 10G
at org.bytedeco.javacpp.FloatPointer.<init>(FloatPointer.java:76)
at org.bytedeco.javacpp.FloatPointer.<init>(FloatPointer.java:41)
at org.nd4j.linalg.jcublas.blas.JcublasLevel3.sgemm(JcublasLevel3.java:107)
at org.nd4j.linalg.api.blas.impl.BaseLevel3.gemm(BaseLevel3.java:84)
at org.nd4j.linalg.factory.Nd4j.gemm(Nd4j.java:930)
at org.deeplearning4j.nn.layers.recurrent.LSTMHelpers.backpropGradientHelper(LSTMHelpers.java:624)
at org.deeplearning4j.nn.layers.recurrent.GravesLSTM.backpropGradientHelper(GravesLSTM.java:102)
at org.deeplearning4j.nn.layers.recurrent.GravesLSTM.tbpttBackpropGradient(GravesLSTM.java:79)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.truncatedBPTTGradient(MultiLayerNetwork.java:1541)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.computeGradientAndScore(MultiLayerNetwork.java:2219)
at org.deeplearning4j.optimize.solvers.BaseOptimizer.gradientAndScore(BaseOptimizer.java:174)
at org.deeplearning4j.optimize.solvers.StochasticGradientDescent.optimize(StochasticGradientDescent.java:60)
at org.deeplearning4j.optimize.Solver.optimize(Solver.java:53)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.doTruncatedBPTT(MultiLayerNetwork.java:1468)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.fit(MultiLayerNetwork.java:1227)
at com.akulaku.recommend.models.itemseqrnn.TrainByLocalFile.train(TrainByLocalFile.java:107)
at com.akulaku.recommend.models.itemseqrnn.TrainByLocalFile.main(TrainByLocalFile.java:163)
Caused by: java.lang.OutOfMemoryError: Physical memory usage is too high: physicalBytes = 10G > maxPhysicalBytes = 10G
at org.bytedeco.javacpp.Pointer.deallocator(Pointer.java:576)
at org.bytedeco.javacpp.Pointer.init(Pointer.java:121)
at org.bytedeco.javacpp.FloatPointer.allocateArray(Native Method)
at org.bytedeco.javacpp.FloatPointer.<init>(FloatPointer.java:68)
... 16 more

@kfiring kfiring changed the title dl4j java.lang.RuntimeException: Can't allocate [HOST] memory dl4j java.lang.RuntimeException: Can't allocate [HOST] memory && java.lang.OutOfMemoryError: Physical memory usage is too high Nov 28, 2017
@raver119 raver119 self-assigned this Nov 28, 2017
@raver119 raver119 added the Bug Bugs and problems label Nov 28, 2017
@saudet
Contributor

saudet commented Nov 29, 2017

Caused by: java.lang.OutOfMemoryError: Physical memory usage is too high: physicalBytes = 10G > maxPhysicalBytes = 10G

Increase the available memory with the java -Xmx command line option, or tune the limit more precisely with the "org.bytedeco.javacpp.maxphysicalbytes" system property.

@kfiring
Author

kfiring commented Nov 29, 2017

I've tried that. Here are my Java options:
-Xmx8g -Dorg.bytedeco.javacpp.maxbytes=10G -Dorg.bytedeco.javacpp.maxphysicalbytes=10G -Dorg.nd4j.versioncheck="false"
Even after I increased Xmx to 16G, it still does not work.
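
A possible reading of these settings, offered as a hedged sketch rather than a confirmed diagnosis: as far as I understand, JavaCPP's physicalBytes measures the resident memory of the whole process, which includes the Java heap as well as native allocations. If that is right, a maxphysicalbytes of 10G leaves no room for an 8G heap plus up to 10G of off-heap buffers, and raising -Xmx alone makes the gap larger. The class below uses only the standard library; `parseSize` and `leavesHeadroom` are hypothetical helpers written for illustration and are not part of JavaCPP or DL4J.

```java
public class MemoryLimitCheck {

    /** Hypothetical helper: parses "8g", "10G", "512m" or a plain byte count. */
    static long parseSize(String s) {
        String t = s.trim().toUpperCase();
        long mult = 1;
        switch (t.charAt(t.length() - 1)) {
            case 'K': mult = 1L << 10; break;
            case 'M': mult = 1L << 20; break;
            case 'G': mult = 1L << 30; break;
            case 'T': mult = 1L << 40; break;
            default: break; // no suffix: value is already in bytes
        }
        if (mult != 1) {
            t = t.substring(0, t.length() - 1);
        }
        return Long.parseLong(t.trim()) * mult;
    }

    /**
     * Assumption: the physical-memory limit has to cover the Java heap
     * plus the off-heap limit, since both count toward process memory.
     */
    static boolean leavesHeadroom(long heapBytes, long maxBytes, long maxPhysicalBytes) {
        return maxPhysicalBytes >= heapBytes + maxBytes;
    }

    public static void main(String[] args) {
        long heap = Runtime.getRuntime().maxMemory(); // effective -Xmx
        String maxBytesProp = System.getProperty("org.bytedeco.javacpp.maxbytes");
        String maxPhysProp = System.getProperty("org.bytedeco.javacpp.maxphysicalbytes");
        if (maxBytesProp == null || maxPhysProp == null) {
            System.out.println("JavaCPP limits not set explicitly; library defaults apply.");
            return;
        }
        long maxBytes = parseSize(maxBytesProp);
        long maxPhysical = parseSize(maxPhysProp);
        System.out.printf("heap=%d maxbytes=%d maxphysicalbytes=%d%n",
                heap, maxBytes, maxPhysical);
        if (!leavesHeadroom(heap, maxBytes, maxPhysical)) {
            System.out.println("Warning: maxphysicalbytes is below heap + maxbytes.");
        }
    }
}
```

Run with the same -Xmx and -D options as the training job, this would flag the combination above, since 10G is less than 8G + 10G. Under the same assumption, something like -Dorg.bytedeco.javacpp.maxphysicalbytes=18G or higher would be the setting to try on a 64G machine.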

@AlexDBlack
Contributor

Memory use should be a lot lower after this PR: #4900
You can access that on snapshots: https://deeplearning4j.org/snapshots

@lock

lock bot commented Sep 22, 2018

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

@lock lock bot locked and limited conversation to collaborators Sep 22, 2018