-
Notifications
You must be signed in to change notification settings - Fork 45.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when training voc2012 with mask rcnn #3972
Comments
I face the same error, and I really need help about how to solve it. |
Me too.Has anyone solved it? |
If you run without the checkpoint do you still get the assertion errors? |
Hi @robieta , When I did this, I got other errors. Could you pls clarify? |
What are the errors that you get when from_detection_checkpoint to false? |
Hi @robieta, EDIT: (robieta) Moved full output to a separate file C:\Users\hedey\AppData\Local\Programs\Python\Python36\lib\site-packages\tensorflow\python\ops\gradients_impl.py:97: UserWarning: Converting sparse IndexedSlices to a dense Tensor of unknown shape. This may consume a large amount of memory. ... WARNING:root:Variable [InceptionResnetV2/Repeat_2/block8_9/Conv2d_1x1/weights/Momentum] is not available in checkpoint |
Do not use checkpoint。like this
you can try |
Hi @lulu12132017 , Now, I get the following errors: EDIT: (robieta) Moved full output to a separate file INFO:tensorflow:Error reported to Coordinator: assertion failed: [] [Condition x == y did not hold element-wise:] [x (Loss/BoxClassifierLoss/assert_equal_2/x:0) = ] [0] [y (Loss/BoxClassifierLoss/assert_equal_2/y:0) = ] [1] ... InvalidArgumentError (see above for traceback): assertion failed: [] [Condition x == y did not hold element-wise:] [x (Loss/BoxClassifierLoss/assert_equal_2/x:0) = ] [0] [y (Loss/BoxClassifierLoss/assert_equal_2/y:0) = ] [1] |
Hi @lulu12132017 & @robieta, I really need your help to get a solution for this, because I need to use the tensorflow object detection API in my master's project. |
I'm going to close this and refer you to the tensorflow StackOverflow, as this appears to be a configuration issue rather than a clear bug in the object detection code. If you think we've misinterpreted a bug, please comment again with a clear explanation, as well as all of the information requested in the issue template. Thanks! |
Although the issue is closed by Robieta, the solution isn't available anywhere. There are multiple bugs on this issue with no suggestion what the configuration is and what is the real way of solving this. Please help. |
Hi @SarvMangal, |
Isn't there any way of reopening this thread? Or I will add one more issue
with all the required details.
Even if it is a configuration issue, the documentation is just not enough
to help us solve the problem.
…On Tue 8 May, 2018, 2:05 AM hedeya1980, ***@***.***> wrote:
Hi @SarvMangal <https://github.com/SarvMangal>,
I agree with you.
We need help by getting a real way of solving this.
Even after I followed @robieta <https://github.com/robieta>'s advice and
posted at StackOverflow, I haven't received any replies yet.
Here is my Stackoverflow post:
https://stackoverflow.com/questions/50009709/assertion-failed-error-when-using-tensorflow-object-detection-api-to-fine-tune-t
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#3972 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AUNFigk--1MPYemBxoLQrVF3s8PsxYJxks5twLAngaJpZM4TTVyi>
.
|
When you convert the MIO-TCD dataset into TFRecord,you should set include_masks parameter like this.
--include_masks=True
You can try.
在 2018-05-08 04:35:51,"hedeya1980" <notifications@github.com> 写道:
Hi @SarvMangal,
I agree with you.
We need help by getting a real way of solving this.
Even after I followed @robieta's advice and posted at StackOverflow, I haven't received any replies yet.
Here is my Stackoverflow post:
https://stackoverflow.com/questions/50009709/assertion-failed-error-when-using-tensorflow-object-detection-api-to-fine-tune-t
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Hi @lulu12132017 ,
|
I have same issue |
@hedeya1980 I could not post my answer in your question in stackoverflow I had this problem, I solved as follow: The name of the TFRecords files should be check the line here Then, I regenerated TFRecord files by this
Then, I got two TFRecords files with names pet_train/val.record, then I used them for training process with Hope this helps |
I have this issue only when I use TFRecord files generated by |
when i set it's solved what's faces_only means ? |
I am still getting this error on this issue?.Has anybody figured this out yet? NotFoundError (see above for traceback): Key Conv/biases/Momentum not found in checkpoint |
faces_only means we display only box on faces not on whole body, and no segmentation is made |
The same error on all datasets and all mask models
System information
(tensorflow) philip_chen@Chen-Lenovo:~/TensorFlow/models/research$ CUDA_VISIBLE_DEVICES=1 python object_detection/train.py --logtostderr --pipeline_config_path=/home/philip_chen/TensorFlow/models/research/object_detection/mask_rcnn_inception_v2_coco_2018_01_28/mask_rcnn_inception_v2_coco.config --train_dir=/home/philip_chen/TensorFlow/models/research/object_detection/mask_rcnn_inception_v2_coco_2018_01_28/train
EDIT: (robieta) Moved full output to a separate file
obj_detection_output.txt
/home/philip_chen/anaconda3/envs/tensorflow/lib/python3.6/site-packages/h5py/init.py:36: FutureWarning: Conversion of the second argument of issubdtype from float to np.floating is deprecated. In future, it will be treated as np.float64 == np.dtype(float).type.
from ._conv import register_converters as _register_converters
INFO:tensorflow:Scale of 0 disables regularizer.
INFO:tensorflow:Scale of 0 disables regularizer.
INFO:tensorflow:Scale of 0 disables regularizer.
...
InvalidArgumentError (see above for traceback): assertion failed: [] [Condition x == y did not hold element-wise:] [x (Loss/BoxClassifierLoss/assert_equal_2/x:0) = ] [0] [y (Loss/BoxClassifierLoss/assert_equal_2/y:0) = ] [2]
[[Node: Loss/BoxClassifierLoss/assert_equal_2/Assert/Assert = Assert[T=[DT_STRING, DT_STRING, DT_STRING, DT_INT32, DT_STRING, DT_INT32], summarize=3, _device="/job:localhost/replica:0/task:0/device:CPU:0"](Loss/BoxClassifierLoss/assert_equal_2/All, Loss/RPNLoss/assert_equal/Assert/Assert/data_0, Loss/RPNLoss/assert_equal/Assert/Assert/data_1, Loss/BoxClassifierLoss/assert_equal_2/Assert/Assert/data_2, Loss/BoxClassifierLoss/assert_equal_2/x, Loss/BoxClassifierLoss/assert_equal_2/Assert/Assert/data_4, Loss/RPNLoss/ones_1/packed)]]
The text was updated successfully, but these errors were encountered: