dialog flow improvement ideas #168

Open
1 of 5 tasks
maxbachmann opened this issue Dec 23, 2019 · 0 comments
Labels
enhancement New feature or request

Comments

@maxbachmann
Contributor

maxbachmann commented Dec 23, 2019

Since we are leaving Snips behind, we can change some things in the dialog flow. This issue is a list of ideas for what we could do better:

  • split sentences before sending them to the tts
    If the speech is generated sentence by sentence, the next sentence can be synthesized while alice is still saying the previous one. This results in faster reactions from alice; with the Who Are You question, for example, there is currently a noticeable delay. It does not change pricing for online tts, since those services are paid per character anyway.

  • allow asr to listen while playing the feedback sound
    Using some kind of acoustic echo cancellation, the user could respond to alice directly while she is still playing the feedback sound. This results in a faster reaction time for alice and therefore improves usability.

  • add offline only intents
    Right now, when an online asr is configured, we always use it unless we lose the internet connection. I did not find a way for us to detect which asr to use for the first intent (without serious time loss), but it is rather simple for every other intent in a dialog flow. For these intents, each skill could specify whether an intent should be offline only, since e.g. a YesOrNo intent has no need for an online asr. When all currently activated intents are offline only, alice could simply use an offline asr. This would be faster, since the audio does not have to be sent over the internet; it would improve privacy for the user, since fewer audio recordings are sent over the internet; and it would reduce costs, since users do not have to pay for these simple speech to text conversions.

  • allow reactions while tts is speaking
    Allowing reactions while alice speaks, e.g. to cancel her speaking, would be useful. This requires further consideration as to what the expected behaviour would be in different scenarios.

  • allow concurrent sessions
    People might want to use alice in different rooms of the house, so having concurrent sessions would make sense. This requires further consideration as to what a session should be bound to. Binding it to a room appears to be a logical approach, but leaves open what happens when a user walks into a new room while the session is still active.
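
To make the sentence splitting idea more concrete, here is a minimal sketch of how the pipelining could look. All names (`split_sentences`, `speak_pipelined`, and the `synthesize` / `play` callables) are hypothetical and not existing ProjectAlice APIs, and the regex splitter is deliberately naive (it will break on abbreviations like "e.g.").

```python
import re
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, List


def split_sentences(text: str) -> List[str]:
    """Naive splitter: break after '.', '!' or '?' when followed by whitespace."""
    parts = re.split(r'(?<=[.!?])\s+', text.strip())
    return [part for part in parts if part]


def speak_pipelined(text: str,
                    synthesize: Callable[[str], Any],
                    play: Callable[[Any], None]) -> None:
    """Play sentence n while sentence n+1 is synthesized in a worker thread.

    `synthesize` and `play` are placeholders for the real tts call and the
    (blocking) audio playback.
    """
    sentences = split_sentences(text)
    if not sentences:
        return

    with ThreadPoolExecutor(max_workers=1) as pool:
        # Start synthesizing the first sentence right away.
        pending = pool.submit(synthesize, sentences[0])
        for index in range(len(sentences)):
            audio = pending.result()
            # Kick off the next sentence before playback blocks.
            if index + 1 < len(sentences):
                pending = pool.submit(synthesize, sentences[index + 1])
            play(audio)
```

With this shape the delay before alice starts talking is only the synthesis time of the first sentence instead of the whole text.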
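
The selection rule for offline only intents could be sketched roughly like this. The `Intent` class, its `offline_only` flag and `choose_asr` are assumptions made up for illustration; how skills would actually declare the flag is still open. The function is meant for follow-up intents in a dialog, not the first one.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Intent:
    name: str
    offline_only: bool = False  # hypothetical flag set by the skill


def choose_asr(active_intents: Iterable[Intent], online_available: bool) -> str:
    """Return 'offline' or 'online' for the next listening step."""
    intents = list(active_intents)
    # Without a connection there is no choice anyway.
    if not online_available:
        return 'offline'
    # Only switch to offline when every currently activated intent allows it.
    if intents and all(intent.offline_only for intent in intents):
        return 'offline'
    return 'online'
```

For example, `choose_asr([Intent('YesOrNo', offline_only=True)], online_available=True)` picks the offline asr, while adding a single intent without the flag falls back to the online one.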

@maxbachmann maxbachmann added the enhancement New feature or request label Dec 23, 2019
@philipp2310 philipp2310 added this to To do in ProjectAlice via automation Aug 4, 2020