[CI/Build] custom build backend and dynamic build dependencies #7525

dtrifiro · 2024-08-14T17:24:48Z

The rationale for this is that the current build setup uses different methods in each Dockerfile:

python setup.py bdist_wheel
python setup.py develop
python setup.py install
pip install -e ./vllm

It also uses different approaches for installing build requirements: requirements-build.txt, pip-installing the expected dependencies directly, or relying on the build dependencies defined in pyproject.toml (see discussion here).

As the first three methods above are deprecated, one might expect modern PEP517/PEP518 style builds to work (e.g. pip install git+https://github.com/vllm-project/vllm).

This PR attempts to solve these issues by:

adding a custom build backend _custom_backend/vllm.py that dynamically resolves build dependencies at build time, based on the value of VLLM_TARGET_DEVICE, consolidating build dependency requirements in a single place.
building in an isolated environment by default, unless pip install --no-build-isolation or python -m build --no-isolation is used.
removing torch from requirements-build.txt (the required torch version is computed per device by the custom build backend).
modifying Dockerfiles to use the PEP517/518 isolated build environment (when possible).
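
A minimal sketch of how such a backend could resolve build requirements per device, assuming it implements the PEP 517 `get_requires_for_build_wheel` hook. The base requirements and the `torch==2.5.1` pin are taken from the build log quoted later in this thread; the per-device table, the `cuda` default, and the helper name `resolve_build_requirements` are illustrative assumptions, not the contents of the PR's backend file.

```python
import os

# Requirements shared by every target device (as seen in the build log in this thread).
BASE_REQUIREMENTS = [
    "cmake>=3.26",
    "ninja",
    "packaging",
    "setuptools>=61",
    "setuptools-scm>=8",
    "wheel",
]

# Hypothetical per-device extras; the real values live in the PR's backend.
DEVICE_REQUIREMENTS = {
    "cuda": ["torch==2.5.1"],
    "cpu": ["torch==2.5.1"],  # placeholder pin, for illustration only
    "rocm": [],  # assumption: torch is provided by the base image
}


def resolve_build_requirements(target_device=None):
    """Return the build requirements for the given (or configured) device."""
    # Assumption: fall back to "cuda" when VLLM_TARGET_DEVICE is unset.
    device = target_device or os.environ.get("VLLM_TARGET_DEVICE", "cuda")
    if device not in DEVICE_REQUIREMENTS:
        raise ValueError(f"unsupported VLLM_TARGET_DEVICE: {device!r}")
    return [*BASE_REQUIREMENTS, *DEVICE_REQUIREMENTS[device]]


def get_requires_for_build_wheel(config_settings=None):
    """PEP 517 hook: the frontend installs whatever this returns before building."""
    return resolve_build_requirements()
```

A complete backend would also need to expose the mandatory `build_wheel` and `build_sdist` hooks, for example by delegating to `setuptools.build_meta`.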

github-actions · 2024-08-14T17:25:02Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which consists of a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of the default ones by unblocking the steps in your fast-check build on the Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

youkaichao · 2024-09-22T07:49:24Z

why do we need build isolation? vllm build needs to build against pytorch, and it does not make sense to install pytorch (can be complicated) in an isolated environment and build against it.

I'd like to specify the build dependency in pyproject.toml, get rid of the requirements.txt, while still doing non-isolated build.

it looks strange that pip does not support this out of the box.

dtrifiro · 2024-09-23T15:28:25Z

@youkaichao

> why do we need build isolation? vllm build needs to build against pytorch, and it does not make sense to install pytorch (can be complicated) in an isolated environment and build against it.

As long as the build-time and run-time torch dependencies match, there's no issue with this.

> I'd like to specify the build dependency in pyproject.toml, get rid of the requirements.txt, while still doing non-isolated build.
> it looks strange that pip does not support this out of the box.

We can do this using pip install --no-build-isolation or python -m build --no-isolation, although you will have to install every build dependency beforehand.

youkaichao · 2024-09-23T16:49:57Z

> As long as the build-time and run-time torch dependencies match, there's no issue with this.

it does not make sense for vllm. in our case, the build will build against pytorch, with binary dependency. and then it needs to run with that pytorch. I don't see any benefit w.r.t. isolated build, while it introduces additional cost of unnecessarily installing pytorch again.

and as i mentioned, installing pytorch in an isolated environment can be complicated, or even impossible. users might bring their own custom build, and the pytorch might come directly from the base container.

since you are building a custom build backend here, I assume you can disable the isolated build by default, and still read the build time dependency and use pip to install them if they are not installed (e.g. for cmake etc)
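
The suggestion above (read the build-time dependencies and install only the ones that are missing) could look roughly like the following sketch. It only checks whether a distribution is present, not whether its version satisfies the specifier, and it builds the pip command without running it. All function names here are hypothetical and not part of the PR.

```python
import re
import sys
from importlib import metadata


def requirement_name(requirement):
    """Extract the distribution name from a requirement such as 'cmake>=3.26'."""
    match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", requirement.strip())
    if match is None:
        raise ValueError(f"cannot parse requirement: {requirement!r}")
    return match.group(0)


def is_installed(name):
    """Return True if a distribution with this name is installed."""
    try:
        metadata.version(name)
    except metadata.PackageNotFoundError:
        return False
    return True


def missing_requirements(requirements, installed=is_installed):
    """Keep only the requirements whose distribution is not installed."""
    return [r for r in requirements if not installed(requirement_name(r))]


def pip_install_command(requirements):
    """Build (but do not run) a pip command that installs the given requirements."""
    return [sys.executable, "-m", "pip", "install", *requirements]
```

With this approach, a torch that already exists in the environment (a custom build, or one shipped in the base container) would be left untouched, while tools such as cmake or ninja would be installed on demand.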

youkaichao · 2025-01-20T16:11:40Z

_build_backend/vllm.py

+        *requirements_extras,
+    ]
+    print(
+        f"vllm build-backend: resolved build dependencies to: {complete_requirements}"


print the list in a nicer way?

Ended up removing this altogether, before removal it looked like this:

$ python -m build -v --wheel --installer=uv
* Creating isolated environment: venv+uv...
* Using external uv from /usr/bin/uv
* Installing packages in isolated environment:
- setuptools
- setuptools-scm
> /usr/bin/uv pip install setuptools-scm setuptools
< Using Python 3.12.8 environment at: /tmp/build-env-t84pkm2n
< Resolved 3 packages in 3ms
< warning: Failed to hardlink files; falling back to full copy. This may lead to degraded performance.
< If the cache and target directories are on different filesystems, hardlinking may not be supported.
< If this is intentional, set `export UV_LINK_MODE=copy` or use `--link-mode=copy` to suppress this warning.
< Installed 3 packages in 16ms
< + packaging==24.2
< + setuptools==75.8.0
< + setuptools-scm==8.1.0
* Getting build dependencies for wheel...
vllm build-backend: resolved build dependencies to: setuptools>=61 setuptools-scm>=8 cmake>=3.26 ninja packaging setuptools setuptools-scm wheel torch==2.5.1
* Installing packages in isolated environment:
- cmake>=3.26
- ninja
- packaging
- setuptools
- setuptools-scm
- setuptools-scm>=8
- setuptools>=61
- torch==2.5.1
- wheel
> /usr/bin/uv pip install torch==2.5.1 ninja cmake>=3.26 setuptools-scm wheel setuptools setuptools>=61 setuptools-scm>=8 packaging
[...]

which ended up duplicating a lot of information
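
For reference, a nicer print could have deduplicated the list and emitted one requirement per line. This is only a sketch of the idea: `format_requirements` is a hypothetical helper, not code from the PR, and it deduplicates by exact string, so `setuptools` and `setuptools>=61` would still both appear.

```python
def format_requirements(requirements):
    """Render requirements as a sorted, deduplicated, one-per-line listing."""
    unique = sorted(set(requirements))
    lines = ["vllm build-backend: resolved build dependencies to:"]
    lines.extend(f"  - {requirement}" for requirement in unique)
    return "\n".join(lines)
```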

youkaichao · 2025-01-20T16:12:07Z

requirements-neuron.txt

are these files still required?

Yes, these are runtime dependencies, not build time.

mergify · 2025-01-21T11:33:06Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @dtrifiro.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

dtrifiro · 2025-01-22T11:14:56Z

Dockerfile.cpu

@@ -22,7 +22,7 @@ ENV LD_PRELOAD="/usr/lib/x86_64-linux-gnu/libtcmalloc_minimal.so.4:/usr/local/li

 RUN echo 'ulimit -c 0' >> ~/.bashrc

-RUN pip install intel_extension_for_pytorch==2.5.0
+RUN pip install intel_extension_for_pytorch==2.5.0 # FIXME: should this be a dependency in requirements-cpu.txt?


Does anybody know if there's a specific reason why this was added here instead of being added to requirements-cpu.txt?

One potential issue could be that requirements-cpu.txt is also used by the ppc and arm Dockerfiles, although this could be fixed by adding

intel_extension_for_pytorch==2.5.0; platform_machine == "x86_64"

This does not solve the issue when using AMD processors, but this is already the case when building Dockerfile.cpu
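
The `platform_machine` marker is defined (PEP 508) as the value of Python's `platform.machine()`. The sketch below mimics what the marker would select; `cpu_requirements` is a hypothetical helper used only for illustration. It also shows why the marker cannot tell Intel from AMD: both typically report `x86_64` on Linux.

```python
import platform


def cpu_requirements(machine=None):
    """Return the extra CPU requirements the marker would select for a machine."""
    # platform.machine() is what the `platform_machine` marker evaluates to.
    machine = platform.machine() if machine is None else machine
    requirements = []
    if machine == "x86_64":
        # Selected on any x86_64 host, regardless of CPU vendor.
        requirements.append("intel_extension_for_pytorch==2.5.0")
    return requirements
```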

dtrifiro · 2025-01-22T12:29:46Z

setup.py



 def _is_hip() -> bool:
-    return (VLLM_TARGET_DEVICE == "cuda"
-            or VLLM_TARGET_DEVICE == "rocm") and torch.version.hip is not None
+    return VLLM_TARGET_DEVICE == "rocm"


I want to highlight this change: I think it's better to explicitly set the ROCm target device rather than inferring it based on the value of torch.version.hip. Dockerfile.rocm has been updated accordingly.
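
To make the behavioural difference concrete, here is a small sketch contrasting the two predicates. The function names are hypothetical, and `torch_hip_version` stands in for `torch.version.hip` so that the example does not need torch installed.

```python
from typing import Optional


def is_hip_inferred(target_device: str, torch_hip_version: Optional[str]) -> bool:
    """Old behaviour: HIP is inferred from the installed torch build."""
    return target_device in ("cuda", "rocm") and torch_hip_version is not None


def is_hip_explicit(target_device: str) -> bool:
    """New behaviour: HIP is selected only by VLLM_TARGET_DEVICE=rocm."""
    return target_device == "rocm"


# A ROCm torch build with the target device left at "cuda":
# the old check selects HIP, the new one does not.
old_result = is_hip_inferred("cuda", "6.2")
new_result = is_hip_explicit("cuda")
```

In other words, builds that previously relied on a ROCm torch being detected would now need to set VLLM_TARGET_DEVICE=rocm explicitly.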

dtrifiro · 2025-01-22T13:35:31Z

async-engine-inputs-utils-worker-test failure seems unrelated

dtrifiro force-pushed the pep517-pep518-improvements branch 2 times, most recently from ff4d15f to b5edbeb Compare August 14, 2024 17:37

dtrifiro changed the title ~~] [CI/Build] custom build backend and dynamic build dependencies~~ [CI/Build] custom build backend and dynamic build dependencies Aug 14, 2024

dtrifiro mentioned this pull request Aug 14, 2024

[CI/Build] PEP 517/518 improvements #4791

Closed

dtrifiro force-pushed the pep517-pep518-improvements branch from b5edbeb to fdcae32 Compare August 20, 2024 09:10

njhill mentioned this pull request Aug 21, 2024

[ci] Cleanup & refactor Dockerfile to pass different Python versions and sccache bucket via build args #7705

Merged

dtrifiro mentioned this pull request Sep 13, 2024

[CI/Build] use setuptools-scm to set __version__ #4738

Merged

dtrifiro force-pushed the pep517-pep518-improvements branch from fdcae32 to 05292a4 Compare September 17, 2024 12:16

dtrifiro marked this pull request as ready for review September 17, 2024 13:09

dtrifiro force-pushed the pep517-pep518-improvements branch from 05292a4 to 6a1ae68 Compare September 23, 2024 15:37

dtrifiro marked this pull request as draft October 16, 2024 10:40

dtrifiro mentioned this pull request Dec 16, 2024

[Installation]: no version of pip install vllm works - Failed to initialize NumPy: No Module named 'numpy' #11037

Open

1 task

dtrifiro force-pushed the pep517-pep518-improvements branch from 6a1ae68 to 4121ba1 Compare January 20, 2025 14:56

mergify bot added the ci/build label Jan 20, 2025

youkaichao reviewed Jan 20, 2025

dtrifiro force-pushed the pep517-pep518-improvements branch from fb84bf8 to 85ff124 Compare January 20, 2025 16:27

mergify bot added the documentation Improvements or additions to documentation label Jan 20, 2025

mergify bot added the needs-rebase label Jan 21, 2025

dtrifiro force-pushed the pep517-pep518-improvements branch 4 times, most recently from 26905c7 to e754ed3 Compare January 21, 2025 17:49

mergify bot removed the needs-rebase label Jan 21, 2025

dtrifiro force-pushed the pep517-pep518-improvements branch from e754ed3 to ccc246d Compare January 22, 2025 10:12

dtrifiro force-pushed the pep517-pep518-improvements branch 2 times, most recently from 379867a to bd42837 Compare January 22, 2025 10:53

dtrifiro commented Jan 22, 2025

dtrifiro force-pushed the pep517-pep518-improvements branch from bd42837 to 7b29fd6 Compare January 22, 2025 11:19

dtrifiro marked this pull request as ready for review January 22, 2025 11:20

dtrifiro force-pushed the pep517-pep518-improvements branch 4 times, most recently from b26d4b4 to f9ea699 Compare January 22, 2025 12:27

dtrifiro commented Jan 22, 2025

dtrifiro force-pushed the pep517-pep518-improvements branch from f9ea699 to bafc285 Compare January 22, 2025 12:34

dtrifiro force-pushed the pep517-pep518-improvements branch 3 times, most recently from a6a7288 to f510738 Compare January 22, 2025 15:27

dtrifiro added 15 commits January 22, 2025 16:29

initial implementation for custom build backend

756a909

Signed-off-by: Daniele Trifirò <[email protected]>

Dockerfile: use PEP517/518-style builds

69b2f6a

Signed-off-by: Daniele Trifirò <[email protected]> fix Dockerfile build Signed-off-by: Daniele Trifirò <[email protected]>

Dockerfile.tpu: use PEP-517/518 -style builds

427a829

Signed-off-by: Daniele Trifirò <[email protected]>

Dockerfile.neuron: use PEP-517/518 -style builds

f7691ac

Signed-off-by: Daniele Trifirò <[email protected]>

Dockerfile.cpu: use PEP-517/518 -style builds

fec15e8

Signed-off-by: Daniele Trifirò <[email protected]>

Dockerfile.ppc64le: use PEP-517/518 -style builds

05a0ec2

Signed-off-by: Daniele Trifirò <[email protected]>

Dockerfile.openvino: update for PEP-517/518 -style builds

6d4ab76

Signed-off-by: Daniele Trifirò <[email protected]>

Dockerfile.arm: update for PEP-517/518 -style builds

e9573a6

Signed-off-by: Daniele Trifirò <[email protected]>

Dockerfile.hpu: update for PEP-517/518 -style builds

0387fa0

Signed-off-by: Daniele Trifirò <[email protected]>

Dockerfile.rocm: use PEP-517/518 -style builds

2e48ecf

Signed-off-by: Daniele Trifirò <[email protected]>

update docs

88e610b

Signed-off-by: Daniele Trifirò <[email protected]>

setup.py remove deprecated flash-attn requirement step

142383b

Signed-off-by: Daniele Trifirò <[email protected]>

setup.py: fix sdist builds

ee1c980

Signed-off-by: Daniele Trifirò <[email protected]>

setup.py: cleanup get_rocm_version

6c907cb

Signed-off-by: Daniele Trifirò <[email protected]>

setup.py: nit

f510738

Signed-off-by: Daniele Trifirò <[email protected]>
