|
| sterlind wrote:
| the prosody seems a little robotic, and kind of jarring. maybe
| I'm spoiled by Bark, even in its rough and slow state, but is
| this really that much of a step up from Tacotron2?
| splatzone wrote:
| Interesting but there's a weird pause before the word 'danced' in
| each demo? Sounds unnatural
| g_xing wrote:
| The price is lower than the competitors... I wonder how good it
| actually is though. Guessing they just sacrifice quality
| willsmith72 wrote:
| maybe it's just me but the play buttons take forever to become
| clickable
| vmatsiiako wrote:
| works well for me
| willsmith72 wrote:
| seems fixed now
| pruthvishetty wrote:
| Which is the best one, so far? Eleven labs? OpenAI apparently has
| something in the works to (going by today's Spotify podcast
| updates).
| airstrike wrote:
| I'm also curious. A review of what's state-of-the-art today
| would be a great idea for a blog post. Just don't post it on
| medium.com please
| robga wrote:
| Still Azure Speech in my experience.
| radicalriddler wrote:
| ElevenLabs is better quality wise, but it's vastly more
| expensive. Azure Speech hits a really good price:quality
| ratio.
| smusamashah wrote:
| Google Soundstorm had the best demo so far. It takes few
| seconds of original audio and continues it with the same
| voices. Just hearing those examples you wont figure out where
| original finished and generated one started.
| willsmith72 wrote:
| What are the Ts&Cs when it comes to cloning voices?
| vigneshv59 wrote:
| All these TTS services look cool but I don't know how any of them
| are different from each other...
| akshayys wrote:
| [dead]
| fuddle wrote:
| "Ultrarealistic AI Voice Generator" - I initially read the title
| as "Unrealistic AI Voice Generator". I'd suggest adding a space
| to "Ultrarealistic".
| airstrike wrote:
| I also read the same thing
| stavros wrote:
| A hyphen, please.
| BeetleB wrote:
| Frankly, these don't sound any better than Google Cloud's TTS,
| and is orders of magnitude more expensive.
| ameliaquining wrote:
| I think the killer feature here is supposed to be voice
| cloning, which IIUC Google Cloud offers only as a custom
| enterprise thing that takes weeks (which suggests that it's not
| fully automated).
| mynegation wrote:
| Every time I see one of those, as a big fan of TV crime dramas, I
| cannot help but think that voice recordings as proof are going to
| be a thing in the past very soon.
___________________________________________________________________
(page generated 2023-09-25 23:01 UTC) |