Varying the sound length #34

Fantastic work! I have been evaluating the model on sound files of different lengths. For sounds shorter than the 2-second audio clips used for training (500 ms in this example), I get the following warning:
WARNING:root:Large gap between audio n_frames(48) and target_length (204). Is the audio_target_length setting correct?

My question is: how do sound clips of varying length affect the embedding output? In other words, can I still use embeddings from shorter clips, or should I duplicate shorter sounds to approximate the 2 seconds the model expects?
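
To make the "duplicate" option concrete, here is a minimal sketch of what I have in mind, next to plain zero-padding for comparison. The 16 kHz sample rate and the helper names `tile_to_length` and `zero_pad_to_length` are my own assumptions for illustration; they are not part of this repository, and I do not know which of the two (if either) matches what the model does internally.

```python
from itertools import cycle, islice

SAMPLE_RATE = 16_000            # assumed sample rate, not taken from the repo
TARGET_SECONDS = 2.0            # clip length used for training, per the question
TARGET_LEN = int(SAMPLE_RATE * TARGET_SECONDS)


def tile_to_length(samples, target_len):
    """Repeat the clip end to end, then cut to exactly target_len samples."""
    if not samples:
        raise ValueError("cannot tile an empty clip")
    return list(islice(cycle(samples), target_len))


def zero_pad_to_length(samples, target_len):
    """Keep the clip once and fill the remainder with silence (zeros)."""
    kept = list(samples[:target_len])
    return kept + [0.0] * (target_len - len(kept))


# A stand-in for a 500 ms clip: 8000 samples of a simple ramp.
short_clip = [i / 8000 for i in range(SAMPLE_RATE // 2)]

tiled = tile_to_length(short_clip, TARGET_LEN)
padded = zero_pad_to_length(short_clip, TARGET_LEN)
```

Either result has the full 2-second length, so the warning about the frame gap should no longer apply; whether the resulting embeddings are comparable to those from genuine 2-second clips is exactly what I am unsure about.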