Eigenscape Raw Park 6 and 8 recordings are not really "Raw" #8
Comments
@sakshamsingh1, could you run a simple test to check whether the Park 6 and Park 8 files are the same in both versions of the Eigenscape dataset? This would show that the format is "B" for these files in both the Raw and B-format versions of the dataset.
Hi @iranroman, I think Raw and B-format for Park-6 and Park-8 files have been flipped (i.e. Park-6(8)-Raw is in B-format and Park-6(8)-B is in Raw format). Because Park-6(8)-Raw has 25 channels and Park-6(8)-B has 32 channels. Below is the code that I used for confirming this.
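A channel-count check of this kind could look roughly like the sketch below. It is an illustration, not the snippet originally posted in this comment: the function name `wav_channel_count` is hypothetical, and the files to inspect are passed on the command line because the real dataset paths are not given in this thread. It reads the channel count straight from the `fmt ` chunk of each WAV header, so it needs no audio library.

```python
import struct
import sys


def wav_channel_count(path):
    """Return the channel count stored in the 'fmt ' chunk of a RIFF/WAVE file."""
    with open(path, "rb") as f:
        preamble = f.read(12)
        if len(preamble) < 12:
            raise ValueError(f"{path} is too short to be a WAV file")
        riff, _size, wave_id = struct.unpack("<4sI4s", preamble)
        if riff != b"RIFF" or wave_id != b"WAVE":
            raise ValueError(f"{path} is not a RIFF/WAVE file")
        while True:
            header = f.read(8)
            if len(header) < 8:
                raise ValueError(f"no 'fmt ' chunk found in {path}")
            chunk_id, chunk_size = struct.unpack("<4sI", header)
            if chunk_id == b"fmt ":
                # The chunk starts with the format tag (uint16),
                # followed by the channel count (uint16).
                _format_tag, channels = struct.unpack("<HH", f.read(4))
                return channels
            # Skip this chunk; chunks are padded to an even byte count.
            f.seek(chunk_size + (chunk_size & 1), 1)


if __name__ == "__main__":
    # Print the channel count of every file given on the command line.
    for wav_path in sys.argv[1:]:
        print(wav_path, wav_channel_count(wav_path))
```

Running the script on the Park 6 and Park 8 files from both downloads would print one channel count per file, which is enough to compare the two versions.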
Hi, author of EigenScape here. I've just checked my original files and it does look as though the raw Park 6 & 8 are missing, rather than flipped. I'm honestly not sure where those extra channels are coming from in this code snippet because I certainly don't have them! Perhaps the code is retrieving a different file?
Hello @marc1701. Thanks a lot for joining the discussion. Using I also went ahead and downloaded a fresh version of I think we found a real bug. |
Hi all - I have created a new version of the dataset on Zenodo (version 3), replacing I am not sure how this issue crept in as the 'original' files I have on my hard drive are all the correct versions. My apologies for any problems this may have caused.
Thank you very much @marc1701 for addressing this on the B-format version of the dataset. I have started the process to update the dataset loaders, first in For the A-format (raw mics) version of the dataset, is the verdict that the equivalent files are missing? Could they actually be the files previously mistaken in the B-format version? |
Yes, I have checked the raw files and it does seem as though the two formats got mixed up. So the raw files are actually those mistakenly filed with the B-format, and the B-format files were filed as raw. Again, I'm not sure how this happened and can only apologise for the confusion! Unfortunately it will be slightly more difficult for me to amend the raw files, as these are hosted at the University of York rather than Zenodo, so I will have to liaise with University admin to get them changed, which could be a pain. Might it be possible to engineer a workaround?
We have engineered a solution with the @marc1701 at soundata (see soundata/soundata#102 and soundata/soundata#98). If you are interacting with this dataset using |
TLDR: Do not use the Park 6 and Park 8 files in the Eigenscape Raw dataset. The files included in the dataset are in a different format.
@sakshamsingh1 brought to my attention that, in the Eigenscape Raw dataset, the recordings for Park 6 and Park 8 have 25 channels instead of 32.
In essence, Raw Park 6 and Park 8 files look more like B-format than A-format (raw format).
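The channel counts are what give the format away: the Eigenmike has 32 capsules, so a raw (A-format) recording has 32 channels, while a full-sphere ambisonic (B-format) signal of order N has (N + 1)² channels, which is 25 for order 4. A minimal sketch of that reasoning follows; the helper names `ambisonic_order` and `guess_format` are hypothetical and not part of micarraylib or soundata.

```python
import math

EIGENMIKE_CAPSULES = 32  # raw (A-format) Eigenmike recordings have one channel per capsule


def ambisonic_order(channels):
    """Return N if channels == (N + 1) ** 2 for some N >= 0, else None."""
    if channels < 1:
        return None
    root = math.isqrt(channels)
    return root - 1 if root * root == channels else None


def guess_format(channels):
    """Guess the recording format from the channel count alone."""
    if channels == EIGENMIKE_CAPSULES:
        return "A-format (raw)"
    order = ambisonic_order(channels)
    if order is not None:
        return f"B-format (order {order})"
    return "unknown"


print(guess_format(32))  # A-format (raw)
print(guess_format(25))  # B-format (order 4)
```

Since 32 is not a perfect square, the two cases cannot be confused on channel count alone.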
I talked with Marc Green, author of the Eigenscape dataset, and he confirmed that there was a mix-up when the A-format files were released and that the "raw" recordings for Park 6 and Park 8 in are wrong. The real recordings are missing and unavailable to the public.
This is something that micarraylib has no control over, so users are advised NOT to use those specific files.