Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I'm using this kind of technology for temporary voice tracks in animated shorts.

I'd really like something like Img2Img for voices so I can translate a performance to an arbitrary (synthetic) voice.



Tortoise TTS can do this. You just pass it your example as a conditioning latent.


Thanks!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: