Bad inference result.

Hello, I'm trying to reproduce [this ](https://github.com/SeanNaren/deepspeech.pytorch/issues/308) issue #308 using the same audio but I'm still receiving Gibberish (ish) inferences. 

Since I could not find any information on which model and which command they were using in the issue, I'm posting here the info I'm using:

``` bash

# Download test audio and resample (sampling rate) to 16k

deepspeech.pytorch# wget https://dare.wiscweb.wisc.edu/wp-content/uploads/sites/1051/2008/04/Arthur.mp3
deepspeech.pytorch# sox Arthur.mp3 -c 1 -r 16000 arthur_clip.wav trim 0 15

# Running the inference on the audio clip
deepspeech.pytorch# python transcribe.py --model-path librispeech_pretrained_v2.pth --audio-path arthur_clip.wav --lm-path 3-gram.pruned.3e-7.arpa --alpha 1.65 --beta 0.35

>>>
{
    "output": [
        {
            "transcription": "THE STARY OF OWT OF THE WRAPTH ONCE UPON A TIME THERE WAS A YOUNG RAG AND CUTD IN MYE GUFF ERS MOINE WHENEVER THE HAD THE RIHT SAYES HIM IF HE WOULD LIKE TO COME OUT HUNTING BOT THEM HE WHEN ANSWER IN A HORSE"
        }
    ],
    "_meta": {
        "acoustic_model": {
            "name": "librispeech_pretrained_v2.pth"
        },
        "language_model": {
            "name": "3-gram.pruned.3e-7.arpa"
        },
        "decoder": {
            "lm": true,
            "alpha": 1.65,
            "beta": 0.35,
            "type": "greedy"
        }
    }
}
```

I'm using the [latest ](https://github.com/SeanNaren/deepspeech.pytorch/releases/tag/v2.0) release  (v2) as well as its [respective commit](https://github.com/SeanNaren/deepspeech.pytorch/commit/9b9c96a16ffb0ebf05a9aa899ed8e0e14fdb0349).

As you can see, those are fairly different results from the one @ryanleary got in [aforementioned comment](https://github.com/SeanNaren/deepspeech.pytorch/issues/308#issuecomment-396279059)

I tested several different configurations with the different models and both `3-gram.pruned.3e-7.arpa` and `3-gram.3e-7.arpa` as arguments for the `transcribe.py` script but in every case I got weird results with those uppercase characters and random words.

Am I doing something wrong here?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bad inference result. #497

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Bad inference result. #497

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions