**Satsuma** @Satsuma@cat.family · 2025-05-14T15:23:17Z

Satsuma @Satsuma@cat.family

re: ai audiobooks, question

@JessMahler @zersiax they’re using AI to generate the voices, using similar training methods to how we’ve trained chatGPT to generate text which lets them create larger corpuses of sounds faster and more cheaply than it does to record a person speaking & break that down into constituent parts. It also theoretically gives improvements in that AI can do more natural pattern matching as to which of the hundred a sounds its generated is appropriate than the extremely complex if then statements a normal TTS would use could (more accurately I think it actually generates the new clips on the fly but for discussions sake we’ll just focus on how this allows a larger set of sounds)

However this is all pretty marginal, it’s most just being touted as revolutionary bc AI makes the hot new thing makes firing all your voice actors okay while previously we judged companies for doing that sort of thing.

May 14, 2025, 15:23 · · Metatext · · ·

Trending now

Resources

Developers

What is Mastodon?

cat.family

More…