Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

nvidia-cdi-hook fails to execute properly in NixOS (Possibly related to dynamic linking and glibc conflicts) #841

Open
colonelpanic8 opened this issue Jan 1, 2025 · 0 comments

Comments

@colonelpanic8
Copy link

I'm not fully sure I understand why (although this issue in nixpkgs might be related NixOS/nixpkgs#338511)

but the nvidia-cdi-hook commands generated by calling cdi generate seem to fail in certain settings:

Here is the actual call that is used to generate the cdi json:

https://github.com/NixOS/nixpkgs/blob/85163d3c74112fa8033d18881af9410cebb38323/nixos/modules/services/hardware/nvidia-container-toolkit/cdi-generate.nix#L37

I don't get too much from the crashes (I have some core dumps, but looking at them didn't yield much). There is just a warning about stack smashing.

I'm assuming that this has something to do with dynamic linking and perhaps that environment that is being used by containerd when it is excuting runc, but its not at all obvious how to debug the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant