Skip to content

[Exploration-POC] Voice can potentially be way easier as a a HA app #30

@laupalombi

Description

@laupalombi

Problem statement

Voice has a couple of major issues in evolving moving forward, with the perceived complexity on jumping around for configuration (not enough easy paths).

Adding to this, most times other issues are reported are bad configurations of the implementation.

There are other pains to solve, but there is a real opportunity in making Voice perceived at market level in ease to configure, trust of the actions and answers and understanding of timing issues if we get it on a different structure.

This is huge, and I don't feel we should make this a decision based upon theoretical reasoning - we need a solid POC. @synesthesiam is already working on this, but I want to formalize this as a bet.

Here is a map of the information architecture changes proposed and the user values it could bring https://www.figma.com/board/0WFCJW3P77MWQJoOJyIIQD/Voice-Pipeline?node-id=0-1&p=f&t=UoEYS9TiTptQWkdt-0

Scope & Boundaries

In scope

  • POC of Voice as an app that is functional enough to make an educated decision about moving forward or not, based on the user value added, the simplicity in setup and the not deterioration of current system. (aka it has to be at least better to grant a change).
  • Valid way to use this for non-app users (docker)
  • Testeable/installable by everyone (open), but not in an easy way or safe from breaking (labs maybe)

-- STRETCH GOAL - we see the experimental POC is in good enough shape to ship it under certain conditions, more publicly.

Out of scope

  • Finessed version
  • Edge cases or nice tweaks - the whole thing could be discarded, we need a finalized POC

Foreseen solution

  • A POC that makes Voice run testable/installable by everyone (open), but not in an easy way or safe from breaking (labs maybe)

-- STRETCH GOAL - we see the experimental POC is in good enough shape to ship it under certain conditions, more publicly.

Community signals

We did a survey at the begining of the year (link)where you can see how different pain points are referring to complexity as the issue in some way or another.

Although this is heavily biased - we don't have a way to research Voice new users that are not technical, so the expectation is this representation of complexity would be much bigger.

Risks & open questions

  • How much can we sacrifice in easy paths vs heavy configurations?
  • Is it really going to be easier?
  • Can we make this and not loose the "tinkering" angle? (custom wake words, LLMs)
  • what is the loss and transition expected from migrating to one system to the other?

Appetite

Medium - should be done before the .6 release, if just the POC. I would advocate that unless we have other goals, we can use the remaining time to see this live tested and collect results, since we need that to make the decission.

Execution issues

No response

Decision log

Date Decision Outcome

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

Status

Shaping

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions