Question about dataset structure #8

Closed
drydenwiebe opened this issue May 26, 2021 · 3 comments
Comments

@drydenwiebe

drydenwiebe commented May 26, 2021

Hello.

Thank you so much for this dataset, it is very large and well thought out!

I have a question about the structure of the dataset. The audio files are documented as having the form audio/audio_<mic_number_from_0_to_7>_<timestamp>.mp3

When I untar the audio directories, most files do follow the form audio/audio_<mic_number_from_0_to_7>_<timestamp>.mp3, but some have the form audio/audio_<mic_number_from_0_to_7>_<timestamp>_<extra_number>.mp3, where there is another number after the timestamp.

For example in /drive_day_2020_04_14_15_56_26/audio there is audio_0_1586873154_433877998_1.mp3 and audio_0_1586873154_433877998_4.mp3 and when I diff them, they seem to be the same file.
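
The same check can be scripted instead of diffing pairs by hand. Below is a minimal sketch, assuming the audio files sit in a single directory and are small enough to read into memory; the function names `group_by_digest` and `duplicate_audio_files` are hypothetical and are not part of the dataset tooling.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def group_by_digest(blobs):
    """Group file names whose byte content is identical (same SHA-256 digest).

    `blobs` maps a file name to its raw bytes.
    """
    groups = defaultdict(list)
    for name, data in blobs.items():
        groups[hashlib.sha256(data).hexdigest()].append(name)
    # Keep only digests shared by more than one file, in a stable order.
    return sorted(sorted(names) for names in groups.values() if len(names) > 1)


def duplicate_audio_files(audio_dir):
    """Read every .mp3 in `audio_dir` and return groups of identical files."""
    blobs = {p.name: p.read_bytes() for p in Path(audio_dir).glob("*.mp3")}
    return group_by_digest(blobs)
```

Calling `duplicate_audio_files` on the audio directory of a drive would list each group of byte-identical files, which shows whether the files with different trailing numbers really are copies of one another.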

Why is this the case? Can I just ignore all but one when processing the audio?

Thanks!

@franchuterivera
Collaborator

Hello, thanks for your interest in the project.

To answer your question, you can just use the _0.mp3.

Why?

The format is audio_<mic_number>_<timestamp>.mp3. We consider 1586873154_433877998_1 to be the timestamp because it represents the instant in time at which all of the modalities are aligned. Going deeper into this timestamp, it actually follows the format <seconds>_<nano_seconds>_<sequence_number>.

The sequence_number is a product of our alignment and recording technique. Our microphone cannot go to this granularity, but other modalities can.
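
The naming scheme described above can be sketched as a small parser. This is only an illustration built from the file names quoted in this thread: the regular expression, the `Stamp` type, and the helper names are hypothetical, and the pattern assumes 10-digit epoch seconds, at most 9 nanosecond digits, and a sequence number on every file. Keeping the lowest available sequence number per instant is also an assumption, following the advice to use the _0.mp3 file.

```python
import re
from typing import NamedTuple

# Assumed layout: <prefix>_<seconds>_<nano_seconds>_<sequence_number>.<ext>
NAME_RE = re.compile(
    r"^(?P<prefix>.+)_(?P<sec>\d{10})_(?P<nsec>\d{1,9})_(?P<seq>\d+)\.(?P<ext>\w+)$"
)


class Stamp(NamedTuple):
    prefix: str           # e.g. "audio_0" or "fl_rgb"
    seconds: int
    nanoseconds: int
    sequence_number: int


def parse_name(filename):
    """Split a file name into prefix, seconds, nanoseconds, sequence number."""
    m = NAME_RE.match(filename)
    if m is None:
        raise ValueError(f"unexpected file name: {filename!r}")
    return Stamp(m["prefix"], int(m["sec"]), int(m["nsec"]), int(m["seq"]))


def one_file_per_instant(filenames):
    """Keep the lowest sequence number per (prefix, seconds, nanoseconds)."""
    best = {}
    for name in filenames:
        stamp = parse_name(name)
        key = stamp[:3]
        if key not in best or stamp.sequence_number < best[key][0]:
            best[key] = (stamp.sequence_number, name)
    return sorted(name for _, name in best.values())
```

For audio, `one_file_per_instant` keeps a single file per microphone and aligned instant, so the copies that differ only in their sequence number are dropped. A name that lacks the trailing sequence number raises `ValueError` rather than being parsed incorrectly.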

@drydenwiebe
Author

Thank you for the response!

That makes sense.

@hxixixh

hxixixh commented Jul 17, 2021

Thanks for your explanation; it makes a lot more sense now. But I'm still confused about what the sequence number is. For example, I get this sequence of frames from the dataset:

fl_rgb_1590957096_772363597_0.jpg
fl_rgb_1590957096_772363597_1.jpg
fl_rgb_1590957096_772363597_2.jpg
fl_rgb_1590957096_772363597_4.jpg
fl_rgb_1590957096_806343405_0.jpg
fl_rgb_1590957096_806343405_1.jpg
fl_rgb_1590957096_806343405_2.jpg
fl_rgb_1590957096_806343405_3.jpg
fl_rgb_1590957096_806343405_4.jpg
fl_rgb_1590957096_840317597_0.jpg

Every single frame is different, and I suppose one unit of the sequence number indicates 1/5 nanoseconds. However, if I generate a video from the frames in this order, they don't seem to be consecutive: fl_rgb_1590957096_772363597_4.jpg appears to be a later frame than fl_rgb_1590957096_806343405_0.jpg.
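
To make the ordering being tested explicit, here is a minimal sketch that sorts frame names numerically by (seconds, nano_seconds, sequence_number) and groups them by timestamp. The helper names `sort_key` and `summarize` are hypothetical, and treating this numeric order as the capture order is exactly the assumption in question; nothing in this thread confirms it.

```python
from itertools import groupby


def sort_key(filename):
    """Numeric (seconds, nano_seconds, sequence_number) key for a frame name."""
    stem = filename.rsplit(".", 1)[0]
    # The last three underscore-separated fields are the timestamp parts.
    sec, nsec, seq = stem.rsplit("_", 3)[1:]
    return int(sec), int(nsec), int(seq)


def summarize(filenames):
    """Return [(timestamp_in_ns, [sequence numbers])] in numeric order."""
    ordered = sorted(filenames, key=sort_key)
    summary = []
    for (sec, nsec), group in groupby(ordered, key=lambda n: sort_key(n)[:2]):
        timestamp_ns = sec * 1_000_000_000 + nsec
        summary.append((timestamp_ns, [sort_key(name)[2] for name in group]))
    return summary
```

Applied to the ten names listed above, this groups the frames under three timestamps roughly 34 ms apart and shows that sequence number 3 is absent from the first group, which may help when comparing the file-name order against the visual order of the frames.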
