Training Error? #8

rishabh-ojha98 · 2019-09-09T06:54:03Z

While following the training steps, I repeatedly encountered the training error below, and I can't tell where it comes from.
Could you explain the error and how to resolve it?

`E1: 100.0% (train err 109.10), E0 val err: inf, 3a, GPU:0.
Traceback (most recent call last):
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1356, in _do_call
return fn(*args)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1341, in _run_fn
options, feed_dict, fetch_list, target_list, run_metadata)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1429, in _call_tf_sessionrun
run_metadata)
tensorflow.python.framework.errors_impl.InvalidArgumentError: 2 root error(s) found.
(0) Invalid argument: Need minval < maxval, got 0 >= -39117
[[{{node map/while/random_uniform}}]]
[[map/while/Slice_1/size/_2979]]
(1) Invalid argument: Need minval < maxval, got 0 >= -39117
[[{{node map/while/random_uniform}}]]
0 successful operations.
0 derived errors ignored.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "deepxi.py", line 290, in
if args.train: train(sess, net, args)
File "deepxi.py", line 195, in train
net.s_len_ph: args.val_s_len[start_idx:end_idx], net.d_len_ph: args.val_d_len[start_idx:end_idx], net.snr_ph: args.val_snr[start_idx:end_idx]}) # mini-batch.
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 950, in run
run_metadata_ptr)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1173, in _run
feed_dict_tensor, options, run_metadata)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1350, in _do_run
run_metadata)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1370, in _do_call
raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.InvalidArgumentError: 2 root error(s) found.
(0) Invalid argument: Need minval < maxval, got 0 >= -39117
[[node map/while/random_uniform (defined at lib/feat.py:434) ]]
[[map/while/Slice_1/size/_2979]]
(1) Invalid argument: Need minval < maxval, got 0 >= -39117
[[node map/while/random_uniform (defined at lib/feat.py:434) ]]
0 successful operations.
0 derived errors ignored.

Original stack trace for 'map/while/random_uniform':
File "deepxi.py", line 284, in
net = deepxi_net(args)
File "deepxi.py", line 86, in init
self.feature = feat.xi_mapped(self.s_ph, self.d_ph, self.s_len_ph, self.d_len_ph, self.snr_ph, args.Nw, args.Ns, args.NFFT, args.fs, self.P, args.nconst, self.mu, self.sigma) # feature graph.
File "lib/feat.py", line 43, in xi_mapped
P, nconst), (s, d, s_len, d_len, Q), dtype=(tf.float32, tf.float32, tf.float32)) # padded waveforms.
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/ops/map_fn.py", line 268, in map_fn
maximum_iterations=n)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/ops/control_flow_ops.py", line 3501, in while_loop
return_same_structure)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/ops/control_flow_ops.py", line 3012, in BuildLoop
pred, body, original_loop_vars, loop_vars, shape_invariants)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/ops/control_flow_ops.py", line 2937, in _BuildLoop
body_result = body(*packed_vars_for_body)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/ops/control_flow_ops.py", line 3456, in
body = lambda i, lv: (i + 1, orig_body(*lv))
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/ops/map_fn.py", line 257, in compute
packed_fn_values = fn(packed_values)
File "lib/feat.py", line 43, in
P, nconst), (s, d, s_len, d_len, Q), dtype=(tf.float32, tf.float32, tf.float32)) # padded waveforms.
File "lib/feat.py", line 410, in addnoisepad
(y, d) = addnoise(x, d, Q) # compute noisy waveform.
File "lib/feat.py", line 434, in addnoise
i = tf.random_uniform([1], 0, tf.add(1, tf.subtract(d_len, x_len)), tf.int32)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/ops/random_ops.py", line 245, in random_uniform
shape, minval, maxval, seed=seed1, seed2=seed2, name=name)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/ops/gen_random_ops.py", line 919, in random_uniform_int
seed=seed, seed2=seed2, name=name)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/framework/op_def_library.py", line 788, in _apply_op_helper
op_def=op_def)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/util/deprecation.py", line 507, in new_func
return func(*args, **kwargs)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/framework/ops.py", line 3616, in create_op
op_def=op_def)
File "/home/paperspace/anaconda3/envs/tensorflow_gpuenv/lib/python3.7/site-packages/tensorflow/python/framework/ops.py", line 2005, in init
self._traceback = tf_stack.extract_stack()

`

anicolson · 2019-09-09T07:11:01Z

It seems that there is a problem with your validation set. Can you check the size of args.val_s, args.val_s_len, args.val_d, args.val_d_len, and args.val_snr?
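The traceback points at `tf.random_uniform([1], 0, tf.add(1, tf.subtract(d_len, x_len)), tf.int32)` in lib/feat.py, so the upper bound is `1 + d_len - x_len` and it goes non-positive whenever a noise waveform is shorter than its clean speech waveform. A minimal sketch of that check in plain Python follows. The helper `find_bad_pairs` is hypothetical (not part of DeepXi), and the lengths are made up, chosen only to reproduce the reported value of -39117.

```python
def find_bad_pairs(s_len, d_len):
    """Return (index, maxval) for every pair whose noise is too short."""
    bad = []
    for idx, (n_s, n_d) in enumerate(zip(s_len, d_len)):
        maxval = 1 + n_d - n_s  # upper bound passed to tf.random_uniform
        if maxval <= 0:         # TensorFlow requires minval (0) < maxval
            bad.append((idx, maxval))
    return bad


# Made-up lengths for illustration only.
print(find_bad_pairs([30000, 50000, 40000], [100000, 10882, 40000]))
# [(1, -39117)]
```

On the real data, the same call would presumably be `find_bad_pairs(args.val_s_len, args.val_d_len)`; any index it returns is a validation pair whose noise file has fewer samples than its clean speech file.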

rishabh-ojha98 · 2019-09-09T09:06:58Z

The shapes of the validation set arrays are as follows:

args.val_s
(1000, 273520)

args.val_s_len
(1000,)

args.val_snr
(1000,)

args.val_d
(1000, 4832000)

args.val_d_len
(1000,)

Note: I have also added a few noise recordings from my own use case to both the training set and the validation set. Could that be a problem?

anicolson · 2019-09-09T09:42:05Z

this should solve your problems: https://github.com/anicolson/DeepXi/blob/master/set/info.txt

rishabh-ojha98 · 2019-09-09T10:06:55Z

I have followed this info.txt,
but I assigned the SNR values randomly from [5, 10].

The file names in val_clean_speech are as follows:
1001-134707-0015_noise-free-sound-0210_10dB.wav
1081-125237-0001_noise-free-sound-0531_10dB.wav
1224-121064-0078_noise-free-sound-0768_10dB.wav
1335-163935-0006_noise-free-sound-0592_5dB.wav
1001-134707-0021_uspolo_10dB.wav
1081-125237-0002_noise-free-sound-0365_10dB.wav
1224-121064-0107_noise-free-sound-0157_10dB.wav
1335-163935-0011_noise-free-sound-0768_5dB.wav

The file names in val_noise are as follows:
1001-134707-0015_noise-free-sound-0210_5dB.wav
1081-125237-0001_noise-free-sound-0531_10dB.wav
1224-121064-0078_noise-free-sound-0768_10dB.wav
1335-163935-0006_noise-free-sound-0592_5dB.wav
1001-134707-0021_uspolo_5dB.wav
1081-125237-0002_noise-free-sound-0365_10dB.wav
1224-121064-0107_noise-free-sound-0157_10dB.wav
1335-163935-0011_noise-free-sound-0768_10dB.wav

Is there any issue with randomly generating the SNR values? Please let me know.

anicolson · 2019-09-09T10:09:13Z

does each pair have the same number of samples?

rishabh-ojha98 · 2019-09-09T10:13:28Z

Got it!!
Thanks @anicolson
The issue is solved now. I will let you know if I encounter any other problems.

anicolson · 2019-09-09T10:42:23Z

glad to hear

rishabh-ojha98 · 2019-09-10T08:59:46Z

does each pair have the same number of samples?
~ Do we need to have an equal number of samples in each pair?

anicolson · 2019-09-10T09:18:40Z

Yes. For example, you can have a clean speech waveform with 30,000 samples and a noise waveform with 100,000 samples. Just extract a section of the noise waveform that is 30,000 samples long and save it. This is done so that the validation set is identical each time. Otherwise, the code will mix the clean speech with a random section of the noise signal, meaning that the validation set will be different each time. Does that make sense?
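The cropping step described above could be sketched as follows, using only the Python standard library and assuming uncompressed PCM .wav files. The helper `crop_noise_to_speech` and its arguments are hypothetical (not part of DeepXi). It picks the section with a fixed seed so that the saved noise file is the same on every run.

```python
import random
import wave


def crop_noise_to_speech(speech_path, noise_path, out_path, seed=0):
    """Save a section of the noise file with as many samples as the speech file.

    Hypothetical helper, not part of DeepXi. Returns (start, n_samples).
    """
    with wave.open(speech_path, "rb") as s:
        n_speech = s.getnframes()
    with wave.open(noise_path, "rb") as d:
        params = d.getparams()
        n_noise = d.getnframes()
        if n_noise < n_speech:
            raise ValueError(
                "noise is shorter than speech: %d < %d" % (n_noise, n_speech))
        # Fixed seed so the same section is chosen every time.
        start = random.Random(seed).randint(0, n_noise - n_speech)
        d.setpos(start)
        frames = d.readframes(n_speech)
    with wave.open(out_path, "wb") as out:
        out.setparams(params)    # same channels, sample width, and rate
        out.writeframes(frames)  # frame count in the header is fixed on close
    return start, n_speech
```

Running it once per validation pair and writing the result into the noise directory would give each pair the same number of samples. It raises an error instead of padding when the noise is too short, so such files show up early rather than at training time.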


rishabh-ojha98 · 2019-09-10T10:22:49Z

yeah got it, thanks

rishabh-ojha98 · 2019-09-18T05:43:31Z

Just want to clear up a doubt:
Is there any significance to having the dB value in the name of my inference wav file?
CA_CA01_03_voice_babble_5dB.wav

anicolson · 2019-09-18T05:56:13Z

no, only matters for val set.
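A minimal sketch of reading the SNR level from a validation file name, assuming the level is encoded as the `_<N>dB` suffix seen in the file names quoted in this thread. The helper `snr_from_filename` is hypothetical and is not DeepXi's own parsing code.

```python
import re


def snr_from_filename(name):
    """Return the integer SNR in dB from a name ending in '_<N>dB.wav', else None."""
    match = re.search(r"_(-?\d+)dB\.wav$", name)
    return int(match.group(1)) if match else None


print(snr_from_filename("1335-163935-0006_noise-free-sound-0592_5dB.wav"))  # 5
print(snr_from_filename("speech.wav"))  # None
```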


rishabh-ojha98 closed this as completed Sep 9, 2019

rishabh-ojha98 reopened this Sep 10, 2019

anicolson closed this as completed Sep 11, 2019
