How to embed audio stream data into a k-vector (512) #9

Closed
buaapengbo opened this issue Nov 29, 2018 · 1 comment
Labels
question Further information is requested


@buaapengbo

Hi, thank you for open-sourcing this!

I read your paper and tests/integration_test.py. My question is how you embed the audio stream data into vectors with D = 512.
It is essentially the same as the question here.
That is, how do you generate training or test data from an audio stream?

Is it something like librosa.feature.mfcc(y=X, sr=sample_rate, n_mfcc=40)?
Your paper says:
In this system, audio signals are first transformed into frames of width 25ms and step 10ms, and log-mel-filterbank energies of dimension 40 are extracted from each frame as the network input. These frames form overlapping sliding windows of a fixed length, on which we run the LSTM network. The last-frame output of the LSTM is then used as the d-vector representation of this sliding window
How can I reproduce this part?

I would appreciate your help and look forward to your response!
Thanks,
Bo

@wq2012
Member

wq2012 commented Nov 29, 2018

The feature extraction system and d-vector system at Google are proprietary code, and cannot be open-sourced. You need to either find a third-party implementation, or use your own implementation. This repo is dedicated to the UIS-RNN system.
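
For anyone writing their own front end, here is a minimal sketch of the feature step described in the paper excerpt quoted above: 25 ms frames with a 10 ms step, 40 log-mel-filterbank energies per frame, then overlapping sliding windows. It is not Google's implementation. The 16 kHz sample rate, the Hamming window, the 512-point FFT, the HTK-style mel formula, and the window length and shift (24 and 12 frames) are all assumptions picked for illustration, and the helper names are hypothetical. Note that librosa.feature.mfcc returns MFCCs (a DCT applied on top of log-mel energies), which is not the same feature as the log-mel energies the paper describes; librosa.feature.melspectrogram followed by a log would be closer.

```python
import numpy as np


def hz_to_mel(hz):
    # HTK-style mel scale (an assumption; other variants exist).
    return 2595.0 * np.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sr):
    """Triangular mel filters, shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    fbank = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(n_mels):
        left, center, right = hz_points[m], hz_points[m + 1], hz_points[m + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        fbank[m] = np.maximum(0.0, np.minimum(rising, falling))
    return fbank


def log_mel_frames(signal, sr=16000, win_ms=25, step_ms=10, n_mels=40, n_fft=512):
    """Return log-mel energies, shape (n_frames, n_mels)."""
    win = int(sr * win_ms / 1000)    # 25 ms frame width in samples
    step = int(sr * step_ms / 1000)  # 10 ms frame step in samples
    n_frames = 1 + (len(signal) - win) // step
    # Index matrix that slices the signal into overlapping frames.
    idx = np.arange(win)[None, :] + step * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(win)
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2
    mel_energy = power @ mel_filterbank(n_mels, n_fft, sr).T
    return np.log(mel_energy + 1e-6)  # small offset avoids log(0)


def sliding_windows(features, window_frames=24, shift_frames=12):
    """Stack overlapping fixed-length windows, shape (n_windows, window_frames, n_mels).

    window_frames and shift_frames are placeholder values, not taken from this thread.
    """
    starts = range(0, features.shape[0] - window_frames + 1, shift_frames)
    return np.stack([features[s:s + window_frames] for s in starts])
```

Each window would then be passed through a trained speaker-encoder LSTM, and the last-frame output taken as that window's d-vector. That network is not part of this repo and is not sketched here; the embedding dimension (for example 512) is determined by the encoder, not by this feature step.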

@wq2012 wq2012 added the question Further information is requested label Nov 29, 2018
@wq2012 wq2012 closed this as completed Nov 29, 2018