Support for Copilot+ PCs #92

Open
pkbullock opened this issue Sep 2, 2024 · 6 comments

Comments

@pkbullock

It would be great if AI Toolkit could leverage the NPU in Copilot+ PCs.
Currently it uses the CPU. It's nice and quick on the Snapdragon processors, but it is not using the AI processor when running models.

@sirredbeard
Contributor

I wonder if this is related to onnxruntime-genai still awaiting QNN support.

@pkbullock
Author

This is listed in the docs as supporting Copilot+ PCs, but it doesn't: my NPU activity is 0%. So how do I use this?

@sirredbeard
Contributor

I don't see any reference yet to Copilot+ PCs in the AI Toolkit docs, at least not here. Because it relies on onnxruntime-genai, I believe QNN support must land there first before AI Toolkit can take full advantage of it. You might be able to take some advantage of the NPU now, indirectly, by using DirectML with a model like Phi-3-mini-4k-directml-int4-awq-block-128-onnx, which is optimized for that. I have been using DirectML on my non-Copilot Qualcomm-based WDK23 to speed up training.
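
To make the QNN-versus-DirectML point concrete, here is a minimal sketch of how one might check which execution providers a locally installed ONNX Runtime build exposes and pick the most capable one. It assumes the `onnxruntime` Python package; the helper names `pick_provider` and `installed_providers` and the preference order are hypothetical, chosen for illustration only. It says nothing about what AI Toolkit or onnxruntime-genai do internally.

```python
from typing import Iterable, List, Optional, Sequence

# Assumed preference order (illustrative only): NPU via QNN, then DirectML, then CPU.
PREFERRED: Sequence[str] = (
    "QNNExecutionProvider",
    "DmlExecutionProvider",
    "CPUExecutionProvider",
)


def pick_provider(
    available: Iterable[str], preferred: Sequence[str] = PREFERRED
) -> Optional[str]:
    """Return the first preferred provider that is available, or None."""
    available_set = set(available)
    for name in preferred:
        if name in available_set:
            return name
    return None


def installed_providers() -> List[str]:
    """List providers reported by onnxruntime, or [] if it is not installed."""
    try:
        import onnxruntime as ort
    except ImportError:
        return []
    return list(ort.get_available_providers())


if __name__ == "__main__":
    providers = installed_providers()
    print("available:", providers)
    print("selected:", pick_provider(providers))
```

If `QNNExecutionProvider` is missing from the list, that particular ONNX Runtime build cannot target the NPU through QNN, regardless of the hardware.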

@pkbullock
Author

Hi @sirredbeard - I saw the mention of support in the release notes shown when installing the VS Code extension. But I agree, it seems many frameworks depend on the QNN runtimes/SDKs being released.
[image attachment]

@wmmc88

wmmc88 commented Sep 16, 2024

It seems like DirectML models don't show up in the model catalog on my PC, which has a Qualcomm (QC) NPU.

@rockcat

rockcat commented Oct 24, 2024

Me neither. What is the course of action to get models to show up on Snapdragon machines?
