A first-pass Conversation Skill using Misty's Key Phrase Recognition (wake word), Azure Speech-to-Text (STT), and Azure Text-to-Speech (TTS)
https://github.com/wwlib/misty-azure
- Misty II - Conversation Skill (V2): https://youtu.be/-DtE3KhRmNQ
- Misty II - Conversation Skill (V2) w/Console: https://youtu.be/HJD_yYEE2v8
- Notes about using LUIS NLU: https://medium.com/@andrew.rapo/robokit-setting-up-azure-cognitive-services-bing-speech-luis-nlu-fbb39f5dc957
The Azure services are implemented as Azure Function Apps:
- AudioToIntent takes an audio input and returns an NLU intent (used by Conversation Skill V2)
- TextToSpeech takes a text input and returns a TTS response (used by Conversation Skill V2); its core TTS call is sketched below
- AudioToTTSResponse takes an audio input and returns a TTS response (used by Conversation Skill V1)
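The heart of the TextToSpeech function app is a call to the Azure TTS REST endpoint. A minimal sketch, assuming Node 18+ (for global fetch); the westus region and the voice name are assumptions, so swap in your own:

```javascript
// Sketch only: region and voice are assumptions; adjust to your Speech resource.
async function textToSpeech(accessToken, text) {
    const ssml =
        `<speak version='1.0' xml:lang='en-US'>` +
        `<voice xml:lang='en-US' name='en-US-JennyNeural'>${text}</voice>` +
        `</speak>`;
    const res = await fetch("https://westus.tts.speech.microsoft.com/cognitiveservices/v1", {
        method: "POST",
        headers: {
            Authorization: `Bearer ${accessToken}`,
            "Content-Type": "application/ssml+xml",
            // A WAV format Misty can play directly
            "X-Microsoft-OutputFormat": "riff-16khz-16bit-mono-pcm"
        },
        body: ssml
    });
    return Buffer.from(await res.arrayBuffer()); // synthesized audio bytes
}
```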
Conversation Skill V2 calls the AudioToIntent function app using Misty's misty.SendExternalRequest() on-robot API call. The function app makes a call to LUIS NLU and then returns an intent as a string. Conversation Skill V2 then calls the TextToSpeech function app and plays the audio that is returned.
Conversation Skill V2 uses misty.StartKeyPhraseRecognition() to listen for the "Hey, Misty" key phrase (i.e., the wake-up word).
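A minimal sketch of the wake-word loop on the robot. This is not the repo's actual skill code: the function URL and payload shape are placeholders, the audio-capture details (fetching the recording as base64 via misty.GetAudioFile()) are stubbed out, and the misty.SendExternalRequest() parameter order should be verified against the current Misty JavaScript API docs:

```javascript
// Listen for "Hey, Misty" and register for the resulting event
misty.StartKeyPhraseRecognition();
misty.RegisterEvent("KeyPhraseRecognized", "KeyPhraseRecognized", 10, false);

// Misty's default callback naming: underscore + event name
function _KeyPhraseRecognized() {
    misty.ChangeLED(0, 0, 255);               // BLUE: listening
    misty.StartRecordingAudio("capture.wav"); // record the utterance
    misty.Pause(3000);
    misty.StopRecordingAudio();
    const audioBase64 = "";                   // stand-in: fetch via misty.GetAudioFile()
    misty.SendExternalRequest(
        "POST",
        "<YOUR-AUDIO-TO-INTENT-FUNCTION-URL>",  // placeholder
        null, null,                             // no auth
        JSON.stringify({ audio: audioBase64 }), // assumed payload shape
        false, false, null,                     // don't save as an asset
        "application/json",
        "_IntentResponse"                       // callback receives the intent
    );
}
```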
With the Conversation Skill V2 running:

- Say "Hey, Misty"
- Misty will set her LED to BLUE to indicate that she is listening
- Say "do you know any jokes?"
- Misty will say: "Where does the general keep his armies? ... In his sleevies."
- Say "Hey, Misty"
- Misty will set her LED to BLUE to indicate that she is listening
- Say "what time is it?"
- Misty will say: "The time is <current-time>."
This folder contains the code for an Azure Function App that processes audio from Misty and returns an intent. The function manages:
- speech to text
- text to intent (NLU via LUIS)
```javascript
try {
    // Exchange the Speech subscription key for a short-lived bearer token
    const accessToken = await getAccessToken();
    // Transcribe the audio sent by Misty
    const utterance = await speechToText(accessToken, audioBase64, context);
    // Resolve the transcription to a LUIS intent and return it to the caller
    context.res = { body: await textToIntent(accessToken, utterance, context) };
} catch (err) {
    context.log(`Something went wrong: ${err}`);
}
```
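The getAccessToken and speechToText helpers are not shown above. A minimal sketch of what they might look like, assuming Node 18+ (global fetch), the westus region, and 16 kHz 16-bit mono WAV audio:

```javascript
const config = require("./config.json");

async function getAccessToken() {
    // Exchange the subscription key for a bearer token (valid ~10 minutes)
    const res = await fetch("https://westus.api.cognitive.microsoft.com/sts/v1.0/issueToken", {
        method: "POST",
        headers: { "Ocp-Apim-Subscription-Key": config.Microsoft.AzureSpeechSubscriptionKey }
    });
    return res.text();
}

async function speechToText(accessToken, audioBase64, context) {
    // The Speech REST API expects raw audio bytes, not base64
    const audio = Buffer.from(audioBase64, "base64");
    const res = await fetch(
        "https://westus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=en-US",
        {
            method: "POST",
            headers: {
                Authorization: `Bearer ${accessToken}`,
                "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000"
            },
            body: audio
        }
    );
    const result = await res.json();
    context.log(`Recognized: ${result.DisplayText}`);
    return result.DisplayText; // the transcribed utterance
}
```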
Note: Each function requires data from its config.json:

```json
{
    "Microsoft": {
        "AzureSpeechSubscriptionKey": "<YOUR-AZURE-SPEECH-SUBSCRIPTION-KEY>",
        "nluLUIS_endpoint": "https://westus.api.cognitive.microsoft.com/luis/v2.0/apps/",
        "nluLUIS_appId": "<YOUR-LUIS-APP-ID>",
        "nluLUIS_subscriptionKey": "<YOUR-LUIS-SUBSCRIPTION-KEY>"
    }
}
```
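The textToIntent step can then use the LUIS settings from config.json. A sketch against the LUIS v2 endpoint, which takes the subscription key and query as URL parameters and returns the best match as topScoringIntent:

```javascript
async function textToIntent(accessToken, utterance, context) {
    // accessToken is unused here; LUIS v2 authenticates via the subscription key
    const { nluLUIS_endpoint, nluLUIS_appId, nluLUIS_subscriptionKey } = config.Microsoft;
    const url = `${nluLUIS_endpoint}${nluLUIS_appId}` +
        `?subscription-key=${nluLUIS_subscriptionKey}&q=${encodeURIComponent(utterance)}`;
    const res = await fetch(url);
    const result = await res.json();
    context.log(`Intent: ${JSON.stringify(result.topScoringIntent)}`);
    // Fall back to a default when LUIS has no confident match
    return result.topScoringIntent ? result.topScoringIntent.intent : "None";
}
```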
The tools folder contains Node.js/JavaScript tools for testing the calls to Azure:
- post-to-azure-audio-to-intent.js
- post-to-azure-audio-to-tts-response.js
- post-to-azure-text-to-tts.js
- test-azure-speech-bae64.js
- test-azure-speech-wav.js
- test-azure-tts.js
- test-luis-nlu.js
Note: Each of these tools requires data from config.json:

```json
{
    "Microsoft": {
        "AudioToTTSFunctionURL": "",
        "AudioToIntentFunctionURL": "",
        "TextToTTSFunctionURL": ""
    }
}
```
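For example, a tool like post-to-azure-text-to-tts.js might POST a text string to the deployed function and save the synthesized audio for playback. A sketch, with the request body shape as an assumption (assumes Node 18+ for global fetch):

```javascript
const fs = require("fs");
const config = require("./config.json");

async function main() {
    const res = await fetch(config.Microsoft.TextToTTSFunctionURL, {
        method: "POST",
        headers: { "Content-Type": "application/json" },
        body: JSON.stringify({ text: "Hello from Misty" }) // assumed field name
    });
    // Save the returned audio so it can be auditioned locally
    fs.writeFileSync("tts-response.wav", Buffer.from(await res.arrayBuffer()));
}

main().catch(console.error);
```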
- https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/rest-speech-to-text
- https://www.henkboelman.com/speech-to-text-in-an-azure-function-using-the-bing-speech-api/
- https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/quickstart-js-node
- https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/troubleshooting
- https://www.twilio.com/docs/usage/tutorials/serverless-webhooks-azure-functions-and-node-js