Question for folks who work a lot with STT models - What is your favorite model that supports word-level timestamps, has good dysfluency detection (whisper isn't great), and is also supported by transformers.js?
The prices are listed as crypto currencies, so the payments are on a blockchain. Is the artwork itself sold as a kind of NFT on a blockchain as well? If so, what data is actually stored on the blockchain? The parameters that created the image?
That's a great question - these are sold as NFTs, and I think you only get the art, not the input parameters used to generate the art. I really like that idea though
Speech MCP Server - uses Kokoro TTS under the hood to give your LLM the ability to speak. I use it to have Cursor agents notify me when long running tasks are complete.