
Exporting audio model to python? #36

Open
td0m opened this issue Nov 11, 2019 · 13 comments · Fixed by tensorflow/tfjs-models#464
Labels
duplicate: This issue or pull request already exists
feature request: New feature or request

Comments

td0m commented Nov 11, 2019

Is it possible to export the audio model to TFLite, and to include a snippet explaining its usage in Python?

@HalfdanJ (Contributor)

This is high on our wishlist. The issue we haven't solved yet is that the pre-processing of the audio data into the format the network expects hasn't been written for Python yet. As I understand it, FFT processing of audio is handled differently natively in JavaScript and Python, which makes it tricky.

The model used for audio training is https://github.com/tensorflow/tfjs-models/tree/master/speech-commands, which unfortunately does not yet have a Python counterpart.

Contributions to this are very welcome!

@HalfdanJ added the feature request label Nov 12, 2019
td0m (Author) commented Nov 12, 2019

Thanks for the reply @HalfdanJ. Does the speech-commands package work on node.js? I tried running it in a non-browser environment and it didn't seem to work.

@HalfdanJ (Contributor)

I don't believe so

caisq commented Nov 13, 2019

@d0minikt Can you say a little more about your use case? The audio model is tied to WebAudio's frequency analyzer (FFT). This means that in order to use the model in Python, you'll need to find a way to replicate the audio input parameters and the frequency analysis.
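
For concreteness, here is a rough numpy sketch of what replicating WebAudio's AnalyserNode.getFloatFrequencyData might look like. The Blackman window and dB conversion follow the WebAudio spec, but the FFT size, the normalization, and the omission of the analyser's time smoothing are assumptions, not code from Speech Commands:

```python
# Rough numpy approximation of WebAudio's AnalyserNode.getFloatFrequencyData.
# Assumptions: Blackman window and 20*log10 dB conversion per the WebAudio
# spec; fft_size, the 1/fft_size normalization, and the omitted time
# smoothing are simplifications, not Speech Commands code.
import numpy as np

def webaudio_like_fft(frame, fft_size=1024):
    """frame: 1-D array of fft_size time-domain samples in [-1, 1]."""
    windowed = frame * np.blackman(fft_size)   # WebAudio windows before the FFT
    spectrum = np.fft.rfft(windowed)
    magnitude = np.abs(spectrum) / fft_size    # assumed normalization
    return 20.0 * np.log10(np.maximum(magnitude, 1e-12))  # magnitudes in dB

# Example: one 1024-sample frame of a 440 Hz tone at 44.1 kHz.
t = np.arange(1024) / 44100.0
print(webaudio_like_fft(0.5 * np.sin(2 * np.pi * 440.0 * t)).shape)  # (513,)
```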

td0m (Author) commented Nov 14, 2019

@caisq if that's hard to do, would it be easier to port the speech-commands package so that it also runs on node.js? I'm sure I'm not the only one with a headless use case.

lc0 commented Nov 17, 2019

There are DSP ops directly in TensorFlow [1], but I guess it's hard to maintain them across platforms like TFLite and TF.js.

Also, most likely you rely on the browser's optimized FFT.

  1. https://www.tensorflow.org/api_docs/python/tf/signal
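
As a minimal sketch of those tf.signal ops (the frame, step, and FFT sizes below are illustrative, not the values the Speech Commands model expects):

```python
import tensorflow as tf

# One second of audio at 44.1 kHz (zeros as a stand-in for real samples).
waveform = tf.zeros([44100], dtype=tf.float32)

# Short-time Fourier transform via tf.signal.
stft = tf.signal.stft(waveform, frame_length=1024, frame_step=512,
                      fft_length=1024)
spectrogram = tf.abs(stft)  # linear-frequency magnitude spectrogram
print(spectrogram.shape)    # (num_frames, 513)
```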

lc0 commented Nov 17, 2019

Also, are there any plans to support exporting a SavedModel? Currently I only see export to TensorFlow.js.

@nickoala

> As I understand it, FFT processing of audio is handled differently natively in JavaScript and Python, which makes it tricky.
>
> The model used for audio training is ...... which unfortunately does not yet have a Python counterpart.

In case anyone hasn't noticed, the Coral example project Keyphrase detector seems to have the necessary pre-processing code. I'm not sure it's equivalent to the pre-processing in Speech Commands, but at least they both compute Mel spectrograms.

I'm just mentioning this in case it's helpful to someone.
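
For anyone comparing the two pipelines, a Mel spectrogram can be derived from a linear one with tf.signal roughly like this (all parameter values are illustrative; the Coral example may use different ones):

```python
import tensorflow as tf

# Linear-frequency magnitude spectrogram, e.g. from tf.signal.stft
# (1 s of silence at 16 kHz as a stand-in input).
waveform = tf.zeros([16000], dtype=tf.float32)
stft = tf.signal.stft(waveform, frame_length=512, frame_step=256,
                      fft_length=512)
linear_spectrogram = tf.abs(stft)  # shape (num_frames, 257)

# Warp the linear frequency bins onto the Mel scale.
mel_matrix = tf.signal.linear_to_mel_weight_matrix(
    num_mel_bins=40,
    num_spectrogram_bins=linear_spectrogram.shape[-1],
    sample_rate=16000,
    lower_edge_hertz=20.0,
    upper_edge_hertz=7600.0)
mel_spectrogram = tf.matmul(linear_spectrogram, mel_matrix)
print(mel_spectrogram.shape)  # (num_frames, 40)
```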

caisq commented Dec 15, 2019

@nickoala To be clear, I'm pretty sure the preprocessing steps in the Coral example don't fit TF.js Speech Commands, because Speech Commands is based on the browser's WebAudio FFT, which produces a linear-frequency spectrum, not a Mel one.

@nickoala

@caisq, but there is a SOFT_FFT option to speechCommands.create(), right? This file does seem to compute a Mel spectrogram.

caisq commented Dec 15, 2019

@nickoala My apologies: the documentation is not very clear, and some of the code is obsolete. The SOFT_FFT mode does use a Mel spectrum, but the default mode of Speech Commands (BROWSER_FFT) uses the linear spectrum from WebAudio.

@nickoala

I see. Thank you for the clarification.

caisq added a commit to tensorflow/tfjs-models that referenced this issue Jul 7, 2020
…464)

* Fixes googlecreativelab/teachablemachine-community#36

The notebook shows
* how to convert a speech-commands model from the TF.js format
  to the Python (tf.keras) and TFLite formats
* how to run the Python (tf.keras) model for inference.
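
In outline, the conversion path that notebook describes might look roughly like this in Python (the file paths are placeholders; see the notebook itself for the exact steps):

```python
import tensorflow as tf
import tensorflowjs as tfjs

# Load the TF.js Layers model (model.json + weight shards) as tf.keras.
# 'model.json' is a placeholder path to a downloaded Teachable Machine model.
model = tfjs.converters.load_keras_model('model.json')
model.save('speech_commands.h5')  # Python (tf.keras) format

# Convert the Keras model to TFLite.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
with open('speech_commands.tflite', 'wb') as f:
    f.write(converter.convert())
```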
@charlielito

Hey guys, I had the same problem trying to run an audio model on a headless device with Python. I managed to make it work, but with node.js; the same trick could also work with Python. The trick was to launch a headless Chromium with puppeteer, where the JavaScript runs the model; inside the node.js script the predictions are parsed, and then you are free to do whatever you want with them.

I used it to turn my room's light on and off. If you want to check out the code and see how to do it, go to: https://github.com/charlielito/teachable-machines-audio-demo

Any feedback is welcome!
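
A minimal Python sketch of the same trick using pyppeteer (the page URL, the exposed callback name, and the in-page JavaScript it assumes are all hypothetical; the repo above has the real node.js version):

```python
import asyncio
from pyppeteer import launch

def on_prediction(label, score):
    # Called from the page whenever the in-browser model emits a prediction;
    # act on it here (e.g. toggle a light).
    print(label, score)

async def main():
    # The media-stream flag auto-grants the microphone permission prompt.
    browser = await launch(headless=True,
                           args=['--use-fake-ui-for-media-stream'])
    page = await browser.newPage()
    # Expose a Python function that the in-page JavaScript can call.
    await page.exposeFunction('onPrediction', on_prediction)
    # Hypothetical local page that loads the audio model and calls
    # window.onPrediction(label, score) for each recognition result.
    await page.goto('http://localhost:8000/listen.html')
    await asyncio.sleep(60)  # keep listening for a minute
    await browser.close()

asyncio.get_event_loop().run_until_complete(main())
```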
