• mudkip@lemdro.idOP
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 months ago

      Agree. I think the developers stated they added cartoon voices on purpose to demonstrate expressiveness.

  • PrimaCora@lemmy.today
    link
    fedilink
    English
    arrow-up
    2
    ·
    27 days ago

    The demo video sounds really good. In testing, it is very noisy and poor quality even with the 80M model. It is not all that fast, not instant like I would expect. It’s okay for memory constrained environments, but without voice cloning, may as well stick with whatever built-in TTS your OS has.