The demo video sounds really good. In my testing, though, the output is very noisy and poor quality, even with the 80M model. It is also not all that fast, not instant like I would expect. It’s okay for memory-constrained environments, but without voice cloning, you may as well stick with whatever built-in TTS your OS has.