WhisperX #482

Open
1 of 2 tasks
boegel opened this issue Dec 17, 2024 · 6 comments
Assignees
Labels
difficulty: medium software that likely require a fair amount of effort to support GPU new new software priority: lowest Python site:ugent Software installation request for UGent Tier-2 sources-only Only sources available (no conda, binaries, container)

Comments


boegel commented Dec 17, 2024

@boegel boegel added difficulty: easy software that should be easy to support new new software priority: medium Python site:ugent Software installation request for UGent Tier-2 GPU sources-only Only sources available (no conda, binaries, container) labels Dec 17, 2024
boegel added a commit that referenced this issue Dec 17, 2024

boegel commented Dec 17, 2024

WIP easyconfig available in https://github.com/vscentrum/vsc-software-stack/tree/wip/482_WhisperX

TODO: easyconfig for CTranslate2 dependency

effort so far: 30min


laraPPr commented Jan 2, 2025

There are problems caused by the dependencies of the current installation:

wrt whisperx, this is quite troublesome to get working. the deps in the
project are not sufficient; with pip, to get something that starts, you
need to do

pip install whisperx 'numpy<2' nvidia-cublas-cu11 nvidia-cudnn-cu11

then whisperx does start, but doesn't do anything.

the dep hell comes from other older deps, esp. pyannote-audio vs
ctranslate2 is troublesome
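
A minimal sketch (not from the thread) of how one might inspect an environment after the pip workaround above: it reports which of the packages named in that command are installed and whether the `numpy<2` pin holds. The helper names `installed_versions` and `numpy_is_pre_2` are hypothetical, and the check only looks at the major version number.

```python
from importlib import metadata

# Package names taken from the pip command quoted above.
WORKAROUND_PACKAGES = ["whisperx", "numpy", "nvidia-cublas-cu11", "nvidia-cudnn-cu11"]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def numpy_is_pre_2(version):
    """Return True if a version string has a major version below 2 (the 'numpy<2' pin)."""
    if version is None:
        return False
    return int(version.split(".")[0]) < 2


if __name__ == "__main__":
    versions = installed_versions(WORKAROUND_PACKAGES)
    for name, version in versions.items():
        print(f"{name}: {version or 'not installed'}")
    print("numpy<2 satisfied:", numpy_is_pre_2(versions["numpy"]))
```

This only confirms what is installed; it does not explain why whisperx starts but does nothing.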

@pavelToman pavelToman self-assigned this Jan 2, 2025
pavelToman added a commit that referenced this issue Jan 2, 2025
pavelToman added a commit that referenced this issue Jan 3, 2025
pavelToman added a commit that referenced this issue Jan 3, 2025
@boegel boegel added difficulty: medium software that likely require a fair amount of effort to support and removed difficulty: easy software that should be easy to support labels Jan 6, 2025

boegel commented Jan 6, 2025

@pavelToman Can you estimate how much time you've spent on this already, and how much progress you've made (are you stuck, almost done, etc.)?


pavelToman commented Jan 6, 2025

@boegel There was a problem with CTranslate2, but it seems to work now. WhisperX's deps will take a few more hours (2-3) of work.
But there is a problem with torchaudio - easybuilders/easybuild-framework#4706.
whisperx 3.2.0 has requirement torchaudio>=2, but you have torchaudio 0.16.2 - there is a problem with PyTorch-bundle-2.1.2-2023a-CUDA: torchaudio reports the wrong version and cannot pass pip check.
So far I have spent 12h on this.

EDIT: it may take longer - it seems I have to create another EC for all the pyannote software, something like this bundle: easybuilders/easybuild-easyconfigs#19992
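
To illustrate why the pip check message quoted above fires, here is a simplified sketch of a minimum-version comparison. It is not pip's actual implementation (pip follows the full PEP 440 rules); `parse_release` and `meets_minimum` are hypothetical helpers that compare only the numeric release segments.

```python
def parse_release(version):
    """Turn '2.1.2' or '2.1.2+cu118' into (2, 1, 2).

    Simplification: local version labels and pre-release suffixes are ignored.
    """
    public = version.split("+")[0]
    parts = []
    for piece in public.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Return True if the installed version satisfies '>= minimum'."""
    return parse_release(installed) >= parse_release(minimum)


if __name__ == "__main__":
    # The version torchaudio reports vs. the 'torchaudio>=2' requirement.
    print(meets_minimum("0.16.2", "2"))
    # A 2.x version string would satisfy the requirement.
    print(meets_minimum("2.1.2", "2"))
```

Under this comparison, the reported `0.16.2` fails `torchaudio>=2` regardless of which torchaudio code is actually installed, which is consistent with the reported version string being the problem.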


boegel commented Jan 6, 2025

@pavelToman Can you already open a pull request for CTranslate2, and shift your attention to #483 instead? (and then later get back to this one)

Note that both this one and #483 are lowest priority, so please treat them as such.
