Skip to content

[deeplab] Evaluation with pre-trained model does not match provided value #3673

@rogercw

Description

@rogercw

System information

  • What is the top-level directory of the model you are using: deeplab
  • Have I written custom code (as opposed to using a stock example script provided in TensorFlow): no
  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Linux Ubuntu 16.04
  • TensorFlow installed from (source or binary): binary (pip install --upgrade)
  • TensorFlow version (use command below): 1.6.0
  • Bazel version (if compiling from source):
  • CUDA/cuDNN version: V9.0.176 / 7.0.5
  • GPU model and memory: GeForce GTX 1080 Ti / 10.91GiB
  • Exact command to reproduce:

Describe the problem

I downloaded the pre-trained model 'xception_coco_voc_trainaug' from model zoo, and used it as "checkpoint_dir" for the evaluation. Since there is no 'checkpoint' file included in the tar file, I manually created one with both "model_checkpoint_path" and "all_model_checkpoint_paths" assigned to the downloaded file "model.ckpt" (evaluation will not run without 'checkpoint' file.).

However, after I ran the 'eval.py' with the command in 'local_test.sh', the "miou_1.0" I got is 0.613665, which is way less than the the expected number 82.20%. May I know what I might do wrong here? Thanks.

P.S. I originally planned to post this question in StackOverflow. However, there is no 'deeplab' avaliable yet. and I do not have enough reputation to create it.

Source code / logs

python "${WORK_DIR}"/eval.py \

--logtostderr
--eval_split="val"
--model_variant="xception_65"
--atrous_rates=6
--atrous_rates=12
--atrous_rates=18
--output_stride=16
--decoder_output_stride=4
--eval_crop_size=513
--eval_crop_size=513
--checkpoint_dir="${TRAIN_LOGDIR}"
--eval_logdir="${EVAL_LOGDIR}"
--dataset_dir="${PASCAL_DATASET}"
--max_number_of_evaluations=1
<local_path>/lib/python2.7/site-packages/h5py/init.py:36: FutureWarning: Conversion of the second argument of issubdtype from float to np.floating is deprecated. In future, it will be treated as np.float64 == np.dtype(float).type.
from ._conv import register_converters as _register_converters
INFO:tensorflow:Evaluating on val set
INFO:tensorflow:Performing single-scale test.
INFO:tensorflow:Eval num images 1449
INFO:tensorflow:Eval batch size 1 and num batch 1449
INFO:tensorflow:Waiting for new checkpoint at <local_path>/pascal_voc/exp/train_on_trainval_set/train0
INFO:tensorflow:Found new checkpoint at <local_path>/pascal_voc/exp/train_on_trainval_set/train0/model.ckpt
WARNING:tensorflow:From <local_path>/lib/python2.7/site-packages/tensorflow/contrib/training/python/training/evaluation.py:303: get_or_create_global_step (from tensorflow.contrib.framework.python.ops.variables) is deprecated and will be removed in a future version.
Instructions for updating:
Please switch to tf.train.get_or_create_global_step
INFO:tensorflow:Graph was finalized.
2018-03-20 11:29:44.459901: I tensorflow/core/platform/cpu_feature_guard.cc:140] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2018-03-20 11:29:48.065719: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1212] Found device 0 with properties:
name: GeForce GTX 1080 Ti major: 6 minor: 1 memoryClockRate(GHz): 1.582
pciBusID: 0000:0e:00.0
totalMemory: 10.91GiB freeMemory: 10.75GiB
2018-03-20 11:29:48.066463: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1312] Adding visible gpu devices: 0
2018-03-20 11:29:48.445285: I tensorflow/core/common_runtime/gpu/gpu_device.cc:993] Creating TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 10409 MB memory) -> physical GPU (device: 0, name: GeForce GTX 1080 Ti, pci bus id: 0000:0e:00.0, compute capability: 6.1)
INFO:tensorflow:Restoring parameters from <local_path>/pascal_voc/exp/train_on_trainval_set/train0/model.ckpt
INFO:tensorflow:Running local_init_op.
INFO:tensorflow:Done running local_init_op.
INFO:tensorflow:Starting evaluation at 2018-03-20-18:29:53
INFO:tensorflow:Evaluation [144/1449]
INFO:tensorflow:Evaluation [288/1449]
INFO:tensorflow:Evaluation [432/1449]
INFO:tensorflow:Evaluation [576/1449]
INFO:tensorflow:Evaluation [720/1449]
INFO:tensorflow:Evaluation [864/1449]
INFO:tensorflow:Evaluation [1008/1449]
INFO:tensorflow:Evaluation [1152/1449]
INFO:tensorflow:Evaluation [1296/1449]
INFO:tensorflow:Evaluation [1440/1449]
INFO:tensorflow:Evaluation [1449/1449]
INFO:tensorflow:Finished evaluation at 2018-03-20-18:31:19
miou_1.0[0.613665]

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions