New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extracting MFCCs on Mobile Devices in React Native #1280
Comments
|
Hi! I haven't tried this on react native. Have you tried this? I would love to see a flame graph if you have it. |
I'm working on it, but the main problem is the naive implementation of the DCT. So far I'm using a different implementation to extract MFCCs, but I'll test with Meyda's implementation too, maybe I have to use a different DCT. In the implementation I'm using, the DCT needs to process an array with 17k elements. |
|
Oh wow that's a lot. Have you considered splitting your array into chunks, running the extraction on each chunk and taking the average result? |
Yes, now I'm working on an implementation that cuts the audio by several frames, but the previous implementation didn't use this approach. |
|
Which previous implementation? Was it working with meyda before and has since stopped working? |
|
Sorry, I was referring to a function for MFCC made by a friend of mine. Anyway you were right about the strategy of slicing the audio into frames. I found a Jupyter Notebook that made things pretty clear to me. I'll leave the link here, although I know it won't add much to what you already know. https://www.kaggle.com/code/ilyamich/mfcc-implementation-and-tutorial/notebook |
|
Cool, thanks for the update, and I'm glad you got it working :) |
Hi guys, I'm trying to extract MFCCs from 5 second audios on mobile devices. The problem is that this requires some CPU stress, so I believe it's not that simple to do this with Javascript in React Native. I would like to know if anyone has any experience or information to help me.
The text was updated successfully, but these errors were encountered: