dragonscave.space is one of the many independent Mastodon servers you can use to participate in the fediverse.
A fun, happy little Mastodon/Glitch instance.

Server stats:

238
active users

Alex Hall
Public

With all the neat AI stuff these days, is there a service that takes a person's voice and turns it into a SAPI5 synthesizer? I know companies like LyreBird are working on the voice part, but they never let you use the generated voice outside of their website.

Mikołaj Hołysz
Public

@alexhall RH Voice can do it, but it’s a bitch and a half to train and English still needs some work on the phonetic transcription side. Works well enough for most Eastern-european languages though.

Alex Hall
Public

@miki I didn't see anything about training RHVoice. I installed their English voices a while ago, and one is pretty good. If one can train it, how is it still bad at English?

Alex Hall
Public

@miki That sounds both hard and confusing, especially for someone like me who lacks any real understanding of the details.

Mikołaj Hołysz
Public

@alexhall I on’t really understand the math either. Most of it is handled automatically behind the scenes, the only thing you need to know is that text processing (whether plugin is pronounced like plug in or like ploo gin) is controlled programmatically, via language rules, and whether the voice sounds male, female, old, young, like you or like me is based on the training data.

Alex Hall
Public

@miki Can anyone do this? It seems like it should be pretty language-agnostic, if someone can adjust things.

@alexhall Do what? Change the language rules? Yeah, they use something called Foma for doing the processing, so if you get the hang of the syntax it uses, you can do whatever. Getting all the linguistic knowledge to figure out what the rules should be is a different problem entirely. English is notoriously difficult in that aspect, see E.G. the o in woman vs. the o in women, the th in three VS. the th in though VS. the th in lighthouse. I haven’t used RH Voice extensively enough to know how much of that work is already done and how many elusive exceptions remain.