-
-
Notifications
You must be signed in to change notification settings - Fork 5.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bugfix][Frontend] Fix missing /metrics
endpoint
#6463
[Bugfix][Frontend] Fix missing /metrics
endpoint
#6463
Conversation
👋 Hi! Thank you for contributing to the vLLM project. Full CI run is still required to merge this PR so once the PR is ready to go, please make sure to run it. If you need all test signals in between PR commits, you can trigger full CI as well. To run full CI, you can do one of these:
🚀 |
@robertgshaw2-neuralmagic can you help update the test to check #4523? Since I haven't worked with prometheus metrics before, I'm worried that I might break something with this refactor. |
@robertgshaw2-neuralmagic since this PR has been merged already, you can open a new PR if you need to update the tests to be more robust. |
Can we get a new docker release with this fix? |
v0.5.4 docker release still missing /metrics endpoint |
I try use main branch to build docker image,The metrics information has become completely unavailable,no longer contains any statistical information related to the model.
|
Can you revert to the commit from this PR to ensure that the problem isn't caused by subsequent commits? (Notably #6883) cc @robertgshaw2-neuralmagic |
The fix https://github.com/vllm-project/vllm/pull/6463/files for the former issue #6461 also contained tests to ensure metrics are there. Are they not being used @DarkLight1337 ? |
The test only checks that the metrics endpoint exists. It doesn't check which metrics are being exposed. |
That might be a cool thing to test then 🙄 ;-) |
Signed-off-by: Alvant <[email protected]>
Mount does not work on router, so I'm reverting the change in #5090 so that mount is performed on the application object. Since
app
is no longer global, I've moved the code into a function to be called afterapp
is constructed.To avoid similar issues in the future, I've added some tests to check that the endpoints remain accessible.
FIX #6461