Creating a voice audio satellite with the help of an ESP32

Split from here…

https://community.openhab.org/t/ideas-and-discussion-what-features-do-you-want-in-openhab-5-0/160573/25

I can help with the ESP32-S3 code, @florian-h05.

@florian-h05 quick ping: for me it’s unclear how to proceed here… let me know if you need anything…

I would say we need to wait for the merge of “[audio] Add pcm audio websocket with dialog support” by GiviMAD (Pull Request #4032, openhab/openhab-core on GitHub); after that, work on the ESP code can start.
I would suggest you subscribe to that PR on GitHub so you get notified once it is merged and I don’t have to remember to tell you :wink:

I also think that following the Home Assistant solution closely would be a great idea.
This way we can benefit from their client code and avoid fragmenting the young voice ecosystem.
I see at least three implementations to watch carefully:
1- The S3-BOX
2- The newly announced Home Assistant Voice (preview edition)
3- The Wyoming protocol for custom remote satellites (such as the Raspberry Pi) (I would like to do something about it for openHAB sooner or later)

I also wonder whether 1 and 2 are based on the same protocol, at least loosely?

Hi,
Over the last few months, if not years, I have looked at the voice assistant topic from time to time and was pleased to see that there were corresponding topics in the forum. But unfortunately, no solution has yet emerged as a “standard” that I could use.

A decisive point is always the hardware. Of course I can connect a microphone to a microcontroller, but in terms of quality I will always run into problems that an Amazon Echo doesn’t have.

The Home Assistant people haven’t created anything insanely new, but they have the user base needed for such a DIY voice assistant to survive for a few years.

Even though Home Assistant Voice was designed for Home Assistant, they are also committed to open source. So why not write to them and see if they can provide documentation for an interface?

And who do you think should do that?
Why don’t you just contact them yourself?

Yes, of course I can contact them.
I would rate the chances higher if a maintainer did it.
I will give it a try :wink:

Hi again,

Most of the stuff is open source (I was not aware of that before actually checking):

So it seems that we can understand everything they have done.

Besides that, I found another project: https://futureproofhomes.net/
It is very similar to the Home Assistant solution, a bit more hacker-friendly, but smaller.

That looks really great, finally some nice looking open-source hardware.
It seems to run ESPHome, so with some adjustments to the source code, we should be able to get it running with openHAB :wink:

The biggest issue I am facing is that all these voice satellites are designed around the ESP32-S3. Unfortunately, the S3 only supports Bluetooth LE, not Bluetooth Classic, so it cannot stream audio via Bluetooth, which makes these satellites dedicated solely to voice recognition. A better solution would be to use an ESP32-WROVER, which features PSRAM (essential for voice recognition) and supports Bluetooth audio streaming. This approach offers a significant advantage: you could use a Google Mini housing and only need a single device to achieve the same functionality as the Google Mini.

The “solution” from FutureProofHomes is designed so that the ESP/CPU can be replaced.
But of course, it is still not that easy in the end.
