speech_recognize_continuous_from_file : how to print all results together in a text file #345

Khushu06 · 2019-08-11T13:45:18Z

# coding: utf-8

# Copyright (c) Microsoft. All rights reserved.
# Licensed under the MIT license. See LICENSE.md file in the project root for full license information.

import time
import wave

try:
    import azure.cognitiveservices.speech as speechsdk
except ImportError:
    print("""
    Importing the Speech SDK for Python failed.
    Refer to
    https://docs.microsoft.com/azure/cognitive-services/speech-service/quickstart-python for
    installation instructions.
    """)
    import sys
    sys.exit(1)

# Set up the subscription info for the Speech Service:
# Replace with your own subscription key and service region (e.g., "westus").
speech_key, service_region = "key", "region"

# Specify the path to an audio file containing speech (mono WAV / PCM with a sampling rate of 16
# kHz).
hindi = "C:/Users/Khushboo.Girotra/Desktop/audionew1.wav"

def speech_recognize_continuous_from_file():
    """performs continuous speech recognition with input from an audio file"""
    # <SpeechContinuousRecognitionWithFile>
    speech_config = speechsdk.SpeechConfig(subscription=speech_key, region=service_region, speech_recognition_language='hi-IN')
    audio_config = speechsdk.audio.AudioConfig(filename=hindi)

    speech_recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config, audio_config=audio_config)

    done = False

    def stop_cb(evt):
        """callback that stops continuous recognition upon receiving an event `evt`"""
        print('CLOSING on {}'.format(evt))
        speech_recognizer.stop_continuous_recognition()
        nonlocal done
        done = True
    
                
    # Connect callbacks to the events fired by the speech recognizer
    speech_recognizer.recognizing.connect(lambda evt: print('RECOGNIZING: {}'.format(evt)))
    all_results = []
    def handle_final_result(evt):
    all_results.append(evt.result.text)

    speech_recognizer.recognized.connect(handle_final_result)
    speech_recognizer.start_continuous_recognition()
    print(all_results)
    
    speech_recognizer.recognized.connect(lambda evt: print('RECOGNIZED: {}'.format(evt)))
    speech_recognizer.session_started.connect(lambda evt: print('SESSION STARTED: {}'.format(evt)))
    speech_recognizer.session_stopped.connect(lambda evt: print('SESSION STOPPED {}'.format(evt)))
    speech_recognizer.canceled.connect(lambda evt: print('CANCELED {}'.format(evt)))
    # stop continuous recognition on either session stopped or canceled events
    speech_recognizer.session_stopped.connect(stop_cb)
    speech_recognizer.canceled.connect(stop_cb)
    
    #speech_recognizer.start_continuous_recognition()

    # Start continuous speech recognition
    speech_recognizer.start_continuous_recognition()
    
    while not done:
        time.sleep(.5)
    # </SpeechContinuousRecognitionWithFile>```

**Above code is not working

The text was updated successfully, but these errors were encountered:

Khushu06 · 2019-08-17T11:20:41Z

Hi , Kindly help!

chlandsi · 2019-08-18T19:18:38Z

You're trying to print results before any recognition has happened. Move the print(all_results) command after the call to stop_continuous_recognition, and you should see the accumulated transcribed utterances.

Khushu06 · 2019-08-19T05:20:01Z


# Copyright (c) Microsoft. All rights reserved.
# Licensed under the MIT license. See LICENSE.md file in the project root for full license information.

import time
import wave

try:
    import azure.cognitiveservices.speech as speechsdk
except ImportError:
    print("""
    Importing the Speech SDK for Python failed.
    Refer to
    https://docs.microsoft.com/azure/cognitive-services/speech-service/quickstart-python for
    installation instructions.
    """)
    import sys
    sys.exit(1)

# Set up the subscription info for the Speech Service:
# Replace with your own subscription key and service region (e.g., "westus").
speech_key, service_region = "key", "region"

# Specify the path to an audio file containing speech (mono WAV / PCM with a sampling rate of 16
# kHz).
hindi = "C:/Users/Khushboo.Girotra/Desktop/audionew1.wav"

def speech_recognize_continuous_from_file():
    """performs continuous speech recognition with input from an audio file"""
    # <SpeechContinuousRecognitionWithFile>
    speech_config = speechsdk.SpeechConfig(subscription=speech_key, region=service_region, speech_recognition_language='hi-IN')
    audio_config = speechsdk.audio.AudioConfig(filename=hindi)

    speech_recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config, audio_config=audio_config)

    done = False

    def stop_cb(evt):
        """callback that stops continuous recognition upon receiving an event `evt`"""
        print('CLOSING on {}'.format(evt))
        speech_recognizer.stop_continuous_recognition()
        print(all_results)
        nonlocal done
        done = True
    
                
    # Connect callbacks to the events fired by the speech recognizer
    speech_recognizer.recognizing.connect(lambda evt: print('RECOGNIZING: {}'.format(evt)))
    
    all_results = []
    def handle_final_result(evt):
        all_results.append(evt.result.text)
        speech_recognizer.recognized.connect(handle_final_result)
        speech_recognizer.start_continuous_recognition()
        
    speech_recognizer.recognized.connect(lambda evt: print('RECOGNIZED: {}'.format(evt)))
    speech_recognizer.session_started.connect(lambda evt: print('SESSION STARTED: {}'.format(evt)))
    speech_recognizer.session_stopped.connect(lambda evt: print('SESSION STOPPED {}'.format(evt)))
    speech_recognizer.canceled.connect(lambda evt: print('CANCELED {}'.format(evt)))
    # stop continuous recognition on either session stopped or canceled event
    
    speech_recognizer.session_stopped.connect(stop_cb)
    speech_recognizer.canceled.connect(stop_cb)
    
   
    #speech_recognizer.start_continuous_recognition()

    # Start continuous speech recognition
    speech_recognizer.start_continuous_recognition()
    
    while not done:
        time.sleep(.5)
    # </SpeechContinuousRecognitionWithFile>```

As you suggested I moved the print(all_results) command after the call to stop_continuous_recognition but it is still reflecting the same output. I am not getting all the text together .  Kindly help

chlandsi · 2019-08-19T07:24:02Z

Please try this snippet:

def speech_recognize_continuous_from_file():
    """performs continuous speech recognition with input from an audio file"""
    # <SpeechContinuousRecognitionWithFile>
    speech_config = speechsdk.SpeechConfig(subscription=speech_key, region=service_region)
    audio_config = speechsdk.audio.AudioConfig(filename=weatherfilename)

    speech_recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config, audio_config=audio_config)

    done = False

    def stop_cb(evt):
        """callback that stops continuous recognition upon receiving an event `evt`"""
        print('CLOSING on {}'.format(evt))
        speech_recognizer.stop_continuous_recognition()
        nonlocal done
        done = True

    all_results = []
    def handle_final_result(evt):
        all_results.append(evt.result.text)

    speech_recognizer.recognized.connect(handle_final_result)
    # Connect callbacks to the events fired by the speech recognizer
    speech_recognizer.recognizing.connect(lambda evt: print('RECOGNIZING: {}'.format(evt)))
    speech_recognizer.recognized.connect(lambda evt: print('RECOGNIZED: {}'.format(evt)))
    speech_recognizer.session_started.connect(lambda evt: print('SESSION STARTED: {}'.format(evt)))
    speech_recognizer.session_stopped.connect(lambda evt: print('SESSION STOPPED {}'.format(evt)))
    speech_recognizer.canceled.connect(lambda evt: print('CANCELED {}'.format(evt)))
    # stop continuous recognition on either session stopped or canceled events
    speech_recognizer.session_stopped.connect(stop_cb)
    speech_recognizer.canceled.connect(stop_cb)

    # Start continuous speech recognition
    speech_recognizer.start_continuous_recognition()
    while not done:
        time.sleep(.5)

    print("Printing all results:")
    print(all_results)

Khushu06 · 2019-08-19T07:43:59Z

There are 2 speakers in a conversation . How can I separate the texts/statements of 2 speakers?

chlandsi · 2019-08-19T07:51:31Z

This feature (diarization) is currently not available in the real-time transcription service, but it is available via the batch service. You can find a Python sample here, and #286 describes how to enable diarization.

Khushu06 · 2019-08-19T07:54:14Z

One more question , The accuracy of conversion is very very less(irrelevant conversion).
Is there anyway , I can improve the accuracy of text conversion.

chlandsi · 2019-08-19T08:15:40Z

If possible, try to improve the audio quality of the input (different audio settings, better microphone equipment, try to reduce environment noise, etc.). On the service side, Custom Speech can be used to train models that are specific to your applications.

Khushu06 · 2019-08-19T08:43:47Z

Can I export this file or download it ?
Like right now I am getting text result in shell . If I want a text file to be downloaded what code shall I write

chlandsi · 2019-08-19T09:35:35Z

You can use standard Python methods to save the text to a file, i.e.

with open('output.txt') as f:
    f.write('\n'.join(all_results))

Please note that this forum is primarily for SDK-related issues, not for general programming questions (try for example stackoverflow for these), so I'm closing this issue.

Khushu06 changed the title ~~speech_recognize_continuous_from_file : print all results together in a text file~~ speech_recognize_continuous_from_file : how to print all results together in a text file Aug 11, 2019

chlandsi closed this as completed Aug 19, 2019

chlandsi mentioned this issue Oct 11, 2019

Multi-utterance extension of speech_recognizer.recognize_once() API #397

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech_recognize_continuous_from_file : how to print all results together in a text file #345

speech_recognize_continuous_from_file : how to print all results together in a text file #345

Khushu06 commented Aug 11, 2019 •

edited

Khushu06 commented Aug 17, 2019

chlandsi commented Aug 18, 2019

Khushu06 commented Aug 19, 2019

chlandsi commented Aug 19, 2019

Khushu06 commented Aug 19, 2019

chlandsi commented Aug 19, 2019

Khushu06 commented Aug 19, 2019

chlandsi commented Aug 19, 2019

Khushu06 commented Aug 19, 2019

chlandsi commented Aug 19, 2019

speech_recognize_continuous_from_file : how to print all results together in a text file #345

speech_recognize_continuous_from_file : how to print all results together in a text file #345

Comments

Khushu06 commented Aug 11, 2019 • edited

Khushu06 commented Aug 17, 2019

chlandsi commented Aug 18, 2019

Khushu06 commented Aug 19, 2019

chlandsi commented Aug 19, 2019

Khushu06 commented Aug 19, 2019

chlandsi commented Aug 19, 2019

Khushu06 commented Aug 19, 2019

chlandsi commented Aug 19, 2019

Khushu06 commented Aug 19, 2019

chlandsi commented Aug 19, 2019

Khushu06 commented Aug 11, 2019 •

edited