I very much appreciate your taking the time to answer these questions. And it’s great to know that you are providing a WIS instance for the community to use. (I had to look up what best-effort means in this context, so if anyone else is wondering, here is what GPT-4 told me:
“Best-effort” is a term used in computer networking to describe a service that provides the best possible effort to deliver data packets but does not guarantee delivery.”)
I have no problem using that server to get started (once my ESP box arrives in July ). I suppose information about how to access it will reveal itself once I start digging into the documentation…
As others have noted, the documentation is currently not very accessible (and I completely understand that making it more accessible is not a priority at this point. It may even be a good gatekeeper, intentionally or not). For example, I am confused by there being two repositories, one for “Willow” and one for “Willow Inference Server”…
Anyway, regarding the self-hosted solution with a GPU: never mind the cost to buy a GPU, but when the thing is running 24/7, aren’t we incurring significant electricity costs? According to this page, the GTX 1070 idles around 7-10 Watts. Is that correct for our use case here?
When I built my home server it used around 16 W (yes, the whole machine). Since then, I have added some HDDs so consumption is surely higher now, but adding 10 W to whatever it is at now seems considerable…
One more question: I read that in order to optimize for speed, you are not using auto-detection of the language. I’d be curious, though, how much delay is added when autodetecting the language. I’m not sure if it is possible with whisper, but for our purposes it would suffice to limit the range of possible languages to three, which should reduce the delay.
Background: in our household, three languages are spoken and while we have gotten used to Alexa and Google only understanding one of them, the annoying limitation is that they don’t understand song titles or shopping list items in the other languages and while we have invented English names for some frequently needed grocery items, there is no workaround when it comes to song titles…