Script for automating process for converting files and running generateSoundMetadata.rb #27615

kimj42 · 2019-03-20T22:34:25Z

What changed:
A new script: autorun_generateSoundMetadata.rb has been added.
Where it was added:
- Location: code-dot-org/tools/scripts/autorun_generateSoundMetadata.rb.
Which parts of the site does this change affect?
- This script is part of the process of updating the sound library manifest which affects the sound library dialog for App lab, Sprite Lab, and Game Lab. Once the script is run, the developer that uses that script will now have mp3 files and corresponding JSON on their local machine to upload to S3.
- The script is currently set up for the developer to enter the full path to the folder AND the csv file so that multiple sound files in that folder can be converted/used by generateSoundMetadata.rb. I uploaded the mp3 and JSON manually to S3 by going to AWS and clicking upload.
- Steps:
  - Download one folder from: such as Achievements
  - Unzip Achievements folder in your ~/Downloads
  - Run ./autorun_generateSoundMetadata.rb ~/Downloads/Achievements ./sound_metadata.csv
  - Script will create a folder called "mp3_files" which will hold all the mp3 files and corresponding JSONs
  - Go to cdo-sound-library in S3 -> click appropriate category folder (ex: category_achievements) -> upload all files under mp3_files
Why are we making this change?
- The first PR in order to update the sound library manifest included the a script similar to this as well as the taglib-ruby gem but the drone test continued to fail. Brad mentioned that it is worth removing the gem and the script to pass the test and to only include the updated manifest. Thus, it was removed and the sound library was updated with all of the new changes applied to it.
- However, for the next time that we update the sound library manifest, which isn't often, it will be helpful to have this script to have an automated process for updating the sound library. Thus one script that did the converting and the other script that autoran the generateSoundMetadata has been combined to this one script.
- A test was written after the sound library was pushed with some corrupted filenames but the test logic is best placed in the script. It is because that allows the developer that is updating the manifest to catch corrupted filenames earlier than later after they push their PR and drone is testing whether or not there is a corrupted one which is time consuming.
How I tested the script:
- A lot of the original sound filenames were corrupted with spaces in between, underscores in the beginning and/or end of the filename, uppercases, multiple underscores instead of one in between words, and typos. All of the spaces, multiple underscores, underscores before and after, and uppercases were fixed and visible on my local machine after running this test.
- For the current filenames that are corrupted in the live site can be re-run by running this script on those files on a local machine. I am 95% confident that this script will fix corrupted filenames now.
What's next?
- One file was found with spaces still and it is live on the site. Ryan noted one file is fine to drop the live sound and upload the new file with the correct filename with underscores. So I will submit a PR for that soon.
- If the script doesn't pass the drone test again, then we will have to end up adding some comments in the generateSoundmetadata.rb regarding instructions/precautions for the next developer that updates the sound library without a script.

islemaster

Hey Karis! This is a neat tool. I've got lots of feedback on this code, but you can consider most of it optional since this is a one-off script and we'll probably need to make some changes to it next time we use it one way or another. Thank you for the detailed PR description!

islemaster · 2019-03-20T23:14:35Z