Just have llama.cpp emit an “um”, “uhh”, etc. when the buffer’s down to one word...
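For what it's worth, here is a rough Python sketch of that idea, not anything llama.cpp actually provides. The names `FILLERS` and `next_utterance`, and the `generation_done` flag, are made up for illustration; the assumption is that generated words land in a queue that the TTS side drains.

```python
import random
from collections import deque

# Fillers quoted in the comment above; the list itself is an assumption.
FILLERS = ["um", "uhh"]


def next_utterance(buffer, generation_done, rng=random):
    """Return the next thing to speak, or None when nothing is left.

    buffer: deque of words already generated but not yet spoken.
    generation_done: True once the model has finished producing text.
    """
    # While more text is still expected and the buffer is down to one
    # word or fewer, stall with a filler instead of draining the buffer.
    if not generation_done and len(buffer) <= 1:
        return rng.choice(FILLERS)
    if buffer:
        return buffer.popleft()
    return None
```

The last buffered word is held back on purpose, so there is always something real to say right after the filler.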

umtksa · on Nov 2, 2023

You laugh at the end, but I love this solution.

ryanklee · on Nov 2, 2023

Humans have loved the same solution since we first started talking, as well.

mirekrusin · on Nov 2, 2023

Don’t forget to mix it with “apparently”, “you know what I’m saying”, “I mean”, “you know” etc.

ryanklee · on Nov 2, 2023

I can't tell if you're disparaging the usage or not (truly, I can't tell), but such utterances exist because they serve a real function. Disfluency is an integral part of speech.

vidarh · on Nov 2, 2023

I think it's a good idea, if done well. It could also potentially be combined with dynamically adjusting the speed of the speech, reducing or increasing the use of shortcuts and contractions, and making word replacements.

I now wish for a model built to be a low-computation filter: it takes text in and produces padded text out, intended for TTS and annotated with pauses, sounds, and extra words. The output keeps the same meaning, but the level of verbosity can be adjusted dynamically to maintain a fixed rate of words per minute.
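A crude rule-based stand-in for that filter, just to show the arithmetic, might look like the Python below. It is not a model and it only inserts a single filler word; `pad_to_rate`, its parameters, and the 150 words-per-minute default are all assumptions for illustration.

```python
def pad_to_rate(text, seconds_to_fill, words_per_minute=150, filler="um"):
    """Pad text with fillers so it lasts roughly seconds_to_fill when
    spoken at words_per_minute. Text that is already long enough is
    returned unchanged."""
    words = text.split()
    # How many words the time slot needs at the target speaking rate.
    target = int(seconds_to_fill * words_per_minute / 60)
    missing = target - len(words)
    if missing <= 0 or not words:
        return text
    padded = []
    for i, word in enumerate(words):
        padded.append(word)
        # Spread the missing words as evenly as possible across the text;
        # the first (missing % len(words)) words get one extra filler.
        share = missing // len(words) + (1 if i < missing % len(words) else 0)
        padded.extend([filler] * share)
    return " ".join(padded)
```

A real version would presumably vary the fillers, rewrite contractions, and emit pause annotations for the TTS engine rather than repeating one word.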

taneq · on Nov 2, 2023

I always thought of them as the human equivalent of hard drive noises. <brrrrr brrbrrbr>

taneq · on Nov 2, 2023

array_rand($verbal_fry[$locale]) /* :D */