Question about dataset structure #8

drydenwiebe · 2021-05-26T20:20:55Z

Hello.

Thank you so much for this dataset, it is very large and well thought out!

I have a question about the structure of the dataset. The audio files are in the form: audio/audio_<mic_number_from_0_to_7>_.mp3

When I untar the audio directories they are mostly like this audio/audio_<mic_number_from_0_to_7>.mp3
but sometimes they are of the form audio/audio<mic_number_from_0_to_7><extra_number>.mp3 where there is another number after the time stamp.

For example in /drive_day_2020_04_14_15_56_26/audio there is audio_0_1586873154_433877998_1.mp3 and audio_0_1586873154_433877998_4.mp3 and when I diff them, they seem to be the same file.

Why is this the case. Can I just ignore all but one when processing the audio?

Thanks!

franchuterivera · 2021-05-31T19:42:45Z

Hello, thanks for your interest in the project.

To answer your question, you can just use the _0.mp3.

Why?

The format isaudio_<mic_number>_<timestamp>.mp3. We consider 1586873154_433877998_1 as timestamp because it represents the instant of time when all of the modalities are aligned. Going deeper into this timestamp, it actually follows the format <seconds>_<nano_seconds>_<sequence_number>.

The sequence_number is a product of our alignment and recording technique. Our microphone cannot go to this granularity, but other modalities can.

drydenwiebe · 2021-05-31T20:28:52Z

Thank you for the response!

That makes sense.

hxixixh · 2021-07-17T20:29:54Z

Thanks for your explanation and it makes a lot more sense. But I'm still confused about what the sequence number is. For example, I get a sequence of frames from the dataset

fl_rgb_1590957096_772363597_0.jpg
fl_rgb_1590957096_772363597_1.jpg 
fl_rgb_1590957096_772363597_2.jpg
fl_rgb_1590957096_772363597_4.jpg
fl_rgb_1590957096_806343405_0.jpg
fl_rgb_1590957096_806343405_1.jpg
fl_rgb_1590957096_806343405_2.jpg
fl_rgb_1590957096_806343405_3.jpg
fl_rgb_1590957096_806343405_4.jpg
fl_rgb_1590957096_840317597_0.jpg

Every single frame is different, and I suppose one unit of the sequence number indicates 1/5 nanoseconds. However, if I generate a video from the frames, they don't seem to be consecutive. fl_rgb_1590957096_772363597_4.jpg appears to be a later frame than fl_rgb_1590957096_806343405_0.jpg

drydenwiebe closed this as completed May 31, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about dataset structure #8

Question about dataset structure #8

drydenwiebe commented May 26, 2021 •

edited

franchuterivera commented May 31, 2021

drydenwiebe commented May 31, 2021

hxixixh commented Jul 17, 2021

Question about dataset structure #8

Question about dataset structure #8

Comments

drydenwiebe commented May 26, 2021 • edited

franchuterivera commented May 31, 2021

drydenwiebe commented May 31, 2021

hxixixh commented Jul 17, 2021

drydenwiebe commented May 26, 2021 •

edited