XTTS v2 beams issue #189

madey83 · 2024-12-04T09:12:14Z

madey83
Dec 4, 2024

i have tried to generate voice based on subtitles and when i set beams=3 or 4 i get wav file with additional ~20s silent.

first is the original audio
second is genereted voice with beams=2
third is genereted voice with beams=3 ... the same is happenning for 4.

strange thing is that this 20 secound of silence is added only to few sentens like: "Jak wam idzie?", "nie ładnie", "co kiedy" rest of wavs are fine:

Why silence is not added to other short sentens like: "Nieładnie!" , "Tato.", etc...

I'm using code from documentation Inference available on this page
https://coqui-tts.readthedocs.io/en/latest/models/xtts.html

madey83 · 2024-12-12T08:14:23Z

n/a

0 replies