fix: Too many requests to GET /v1/servers/{id} #661
This issue has been marked as stale because it has not had recent activity. The bot will close the issue if no further action occurs.
lukasmetzner added a commit that referenced this issue on Dec 16, 2024:
This includes metrics about internal operations from `k8s.io/cloud-provider`, like the workqueue depth and requests to the Kubernetes API. These metrics were already exposed on `:8233/metrics`, but this was not documented or scraped. This commit now uses the same registry for our metrics and the Kubernetes libraries, and also exposes them on both ports for backwards compatibility. Besides having all data available, this will also help us with debugging #661. Co-authored-by: Lukas Metzner <[email protected]>
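For context, a minimal sketch of the general approach the commit describes, serving the Kubernetes library metrics and custom metrics from one registry. This is illustrative, not the actual hccm change; it assumes the `k8s.io/component-base` packages that back the cloud-provider metrics:

```go
package main

import (
	"log"
	"net/http"

	"k8s.io/component-base/metrics/legacyregistry"

	// Blank imports register the client-go request metrics and the
	// workqueue depth metrics with the shared legacy registry.
	_ "k8s.io/component-base/metrics/prometheus/restclient"
	_ "k8s.io/component-base/metrics/prometheus/workqueue"
)

func main() {
	mux := http.NewServeMux()
	// One handler backed by a single registry serves both the
	// Kubernetes library metrics and any custom collectors
	// registered with the same registry.
	mux.Handle("/metrics", legacyregistry.Handler())

	// :8233 is the port mentioned in the commit; serving the same
	// handler on a second port would keep backwards compatibility.
	log.Fatal(http.ListenAndServe(":8233", mux))
}
```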
lukasmetzner pushed a commit that referenced this issue on Jan 10, 2025:
<!-- section-start changelog -->
This release includes an extension of our current metrics to also include the internals of `k8s.io/cloud-provider` with respect to the work queue depth and requests to the Kubernetes API. Besides having all data available, this will also help us with debugging [#661](#661).

### Features

- **metrics**: add metrics from cloud-provider library (#824)
- **load-balancer**: emit warning if unsupported port protocol is configured (#828)
- allow arbitrary length API tokens (#752)
<!-- section-end changelog -->

---

<details>
<summary><h4>PR by <a href="https://github.com/apricote/releaser-pleaser">releaser-pleaser</a> 🤖</h4></summary>

If you want to modify the proposed release, add your overrides here. You can learn more about the options in the docs.

## Release Notes

### Prefix / Start

This will be added to the start of the release notes.

```rp-prefix
This release includes an extension of our current metrics to also include the internals of `k8s.io/cloud-provider` with respect to the work queue depth and requests to the Kubernetes API. Besides having all data available, this will also help us with debugging [#661](#661).
```

### Suffix / End

This will be added to the end of the release notes.

```rp-suffix
```

</details>

Co-authored-by: releaser-pleaser <>
TL;DR

In some situations hcloud-cloud-controller-manager starts to spam `GET /v1/servers/{id}` for a subset of the nodes in the cluster every few seconds.

Expected behavior
I would expect hccm to send only a single request per `--node-status-update-frequency` (defaults to 5 minutes).
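For reference, this is roughly the request pattern I would expect: one `GET /v1/servers/{id}` per node per interval. A hypothetical sketch using hcloud-go v2, not the actual hccm code:

```go
package sketch

import (
	"context"
	"log"
	"time"

	"github.com/hetznercloud/hcloud-go/v2/hcloud"
)

// pollNodeStatus issues one GET /v1/servers/{id} per node per tick,
// matching the --node-status-update-frequency expectation.
func pollNodeStatus(ctx context.Context, client *hcloud.Client, serverIDs []int64) {
	ticker := time.NewTicker(5 * time.Minute) // the documented default
	defer ticker.Stop()

	for {
		select {
		case <-ctx.Done():
			return
		case <-ticker.C:
			for _, id := range serverIDs {
				// Expected load: exactly one request per node per interval.
				if _, _, err := client.Server.GetByID(ctx, id); err != nil {
					log.Printf("get server %d: %v", id, err)
				}
			}
		}
	}
}
```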
Observed behavior

We get reports from customers who exhaust the rate limit, and we can see that hcloud-cloud-controller-manager sends > 1 req/s for `GET /v1/servers/{id}`. The default rate limit is 1 request per second. A restart of the pod fixes the behaviour.
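To make the arithmetic concrete, here is a client-side token-bucket model of that limit. The refill rate of 1 req/s matches the default; a burst size of 1 is assumed here purely for illustration:

```go
package main

import (
	"fmt"
	"time"

	"golang.org/x/time/rate"
)

func main() {
	// 1 request per second refill, burst of 1 (assumed for illustration).
	limiter := rate.NewLimiter(rate.Limit(1), 1)

	allowed, denied := 0, 0
	for i := 0; i < 10; i++ {
		if limiter.Allow() {
			allowed++
		} else {
			denied++ // the API would answer 429 Too Many Requests here
		}
		time.Sleep(500 * time.Millisecond) // 2 req/s, i.e. above the limit
	}
	// Roughly half of the requests are rejected at this rate.
	fmt.Printf("allowed=%d denied=%d\n", allowed, denied)
}
```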
Minimal working example
We are not sure how to reproduce this yet.
Log output
No response
Additional information

Looking at some request logs, this seems to affect even very old versions of HCCM (more than 2 years old).