[HN Gopher] Another Text to Speech API
___________________________________________________________________
 
Another Text to Speech API
 
Author : vigneshv59
Score  : 24 points
Date   : 2023-09-25 18:38 UTC (4 hours ago)
 
web link (www.fluxon.ai)
w3m dump (www.fluxon.ai)
 
| sterlind wrote:
| the prosody seems a little robotic, and kind of jarring. maybe
| I'm spoiled by Bark, even in its rough and slow state, but is
| this really that much of a step up from Tacotron2?
 
| splatzone wrote:
| Interesting but there's a weird pause before the word 'danced' in
| each demo? Sounds unnatural
 
| g_xing wrote:
| The price is lower than the competitors... I wonder how good it
| actually is though. Guessing they just sacrifice quality
 
| willsmith72 wrote:
| maybe it's just me but the play buttons take forever to become
| clickable
 
  | vmatsiiako wrote:
  | works well for me
 
    | willsmith72 wrote:
    | seems fixed now
 
| pruthvishetty wrote:
| Which is the best one, so far? Eleven labs? OpenAI apparently has
| something in the works to (going by today's Spotify podcast
| updates).
 
  | airstrike wrote:
  | I'm also curious. A review of what's state-of-the-art today
  | would be a great idea for a blog post. Just don't post it on
  | medium.com please
 
  | robga wrote:
  | Still Azure Speech in my experience.
 
    | radicalriddler wrote:
    | ElevenLabs is better quality wise, but it's vastly more
    | expensive. Azure Speech hits a really good price:quality
    | ratio.
 
  | smusamashah wrote:
  | Google Soundstorm had the best demo so far. It takes few
  | seconds of original audio and continues it with the same
  | voices. Just hearing those examples you wont figure out where
  | original finished and generated one started.
 
| willsmith72 wrote:
| What are the Ts&Cs when it comes to cloning voices?
 
| vigneshv59 wrote:
| All these TTS services look cool but I don't know how any of them
| are different from each other...
 
| akshayys wrote:
| [dead]
 
| fuddle wrote:
| "Ultrarealistic AI Voice Generator" - I initially read the title
| as "Unrealistic AI Voice Generator". I'd suggest adding a space
| to "Ultrarealistic".
 
  | airstrike wrote:
  | I also read the same thing
 
  | stavros wrote:
  | A hyphen, please.
 
| BeetleB wrote:
| Frankly, these don't sound any better than Google Cloud's TTS,
| and is orders of magnitude more expensive.
 
  | ameliaquining wrote:
  | I think the killer feature here is supposed to be voice
  | cloning, which IIUC Google Cloud offers only as a custom
  | enterprise thing that takes weeks (which suggests that it's not
  | fully automated).
 
| mynegation wrote:
| Every time I see one of those, as a big fan of TV crime dramas, I
| cannot help but think that voice recordings as proof are going to
| be a thing in the past very soon.
 
___________________________________________________________________
(page generated 2023-09-25 23:01 UTC)