Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why not use markers when the duration of the audio exceed 30s? #12

Open
ILG2021 opened this issue May 19, 2023 · 2 comments
Open

Why not use markers when the duration of the audio exceed 30s? #12

ILG2021 opened this issue May 19, 2023 · 2 comments

Comments

@ILG2021
Copy link

ILG2021 commented May 19, 2023

No description provided.

@ILG2021 ILG2021 changed the title why not use marker when the duration exceed 30s? Why not use markers when the duration of the audio exceed 30s? May 19, 2023
@EtienneAb3d
Copy link
Owner

@ILG2021
Two reasons:

  1. to get a good marker recognition, the "prompt" texts are very useful. But, Whisper is processing audio files by segments of 30s. The "prompt" is used only on the first 30s. After 30s, without the "prompt" information, it is really less efficient to produce good marker outputs.
  2. in my tests, when using noise and silence removal, it is sufficient to produce quite good results on files larger than 30s, without hallucination (and not sufficient on smaller files).

@ILG2021
Copy link
Author

ILG2021 commented May 19, 2023

Yes, WhisperHallu is a very good solution for the silence sound which cause youtube ads. But it seem can not solve the repeat sentence problem. I have a Chinese fine tune faster whisper and when I speak two similar sentence, the model will produce repeat sentence. I don't know why. The result is:你可以保存,保存在你的手机上或者是笔记本电脑上,这样你可以直接找一些朋友,如果说在你非常忙的时候你没有时间跟他们一句一句聊天,你要很长时间的话,你可以直接发送这些短语,这样的话会让他们看见你真贴心,让他们夸赞你真贴心,让他们看你真贴心,让他们夸赞你真贴心,让他们看见你真贴心,让他们夸赞你真贴心,让他们看见你真贴心,让他们夸赞你真贴心,让他们看见你真贴心,让他们夸赞你真贴心,让他们看见你

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants