Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Honestly, there are so many Project on Github doing STT - LLM - TTS that I lost count. The only revolutionary thing that feels like magic is if the STT supports Voice Activity Detection and low latency LLM inference on Groq, so conversations feel natural.


What we have learnt is that big enterprises do not really want to use close source models due to the random bursts in usage which might drain their bills.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: