Support prom discovery #93

lujiajing1126 · 2023-07-03T06:25:17Z

Closes #59

According to the discussion in the thread, an extra option is added,

--discovery-method           TEXT  Method to discover workload in the cluster. [default: api-server]

The default mode is api-server but can be switched to prometheus.

Currently, the following entity is supported,

Deployment
STS
...

robusta_krr/core/runner.py

robusta_krr/core/models/config.py

LeaveMyYard · 2023-09-04T14:21:18Z

Planned to merge in 1.7

danielhass · 2024-03-17T15:40:15Z

@LeaveMyYard this apparently didn't make it into 1.7. Can you guys share the current plans for this PR?

aantn · 2024-03-28T06:49:28Z

We're looking into this! It needs some updating for the current codebase and is currently incomplete.

So we can't commit to a timeline yet, but it is something we want to merge in principle.

There are two scenarios we're familiar with where this would be useful:

For people who don't have kubectl access to the cluster
For ephemeral workloads that no longer exist at the time of the scan (e.g. gitlab runners)

Just to confirm we're on the right track with this, is your use case one of those two?

aantn · 2024-03-28T06:50:05Z

And if there are any volunteers for updating this PR to the current version of Robusta and completing it - that would definitely accelerate how fast we can merge it.

danielhass · 2024-03-28T07:45:30Z

@aantn thanks for providing this feedback here!

Our use case falls into these cases and extends beyond I would say: we too don't want to require kubectl to the scanned cluster to run krr. Additionally in our landscape we consolidate our metrics on multiple central "infrastructure clusters" via Prom remote-writes. These infra clusters also store metrics with a much larger retention than the local Prom instances in the source/workload clusters. Running krr on these infra clusters would allow us to fit the tool much better in the centralized metrics approach we chose and possibly lead to better results as we have much more data (larger metrics retention) available there.

Does this makes sense?

aantn · 2024-04-01T09:13:20Z

Yep, makes sense.

You may be able to do that today without waiting for the PR. You can pass an explicit Prometheus url with -p and use --prometheus-label and -l to specify which metrics belong to each cluster. See "Scanning with a Centralized Prometheus" in this part of the README.

Let me know if that works for your case?

edit: To clarify, you'll still need kubectl access to each cluster, so it wont solve that problem. But I think it will solve the second goal of letting krr use the centralized metrics with larger retention.

danielhass · 2024-04-09T10:31:26Z

@aantn we could reproduce your edited findings 💯 on our side.

We were indeed able to make use of the larger retention on the central Prom instance with mentioned parameters. However krr failed as soon as we removed the kubectl access to the target cluster as this connection was always used for workload discovery.

aantn · 2024-04-10T10:46:39Z

Thank you for the update! That makes sense.

Co-authored-by: Megrez Lu <[email protected]>

LeaveMyYard · 2024-04-26T07:38:50Z

Closing this one, as #266 is now a successor, updated for a new code version
Thanks @lujiajing1126 for the work done, it helped a lot

lujiajing1126 added 8 commits July 3, 2023 10:48

first commit

d756d49

fix all bugs

67f264c

support labels in the table

180b19a

fix None cluster issue

9279ce8

fix name

28bf5a9

abstract workload discovery

8bb2857

revert BaseFilteredMetricLoader

88b98f8

revert table format

85c62d5

lujiajing1126 commented Jul 3, 2023

View reviewed changes

robusta_krr/core/runner.py Outdated Show resolved Hide resolved

LeaveMyYard reviewed Jul 5, 2023

View reviewed changes

robusta_krr/core/models/config.py Outdated Show resolved Hide resolved

lujiajing1126 and others added 7 commits July 8, 2023 10:07

fix Literal

325274b

Merge branch 'main' into support-prom-discovery

308edf6

Merge branch 'main' into support-prom-discovery

c301571

Merge branch 'main' into support-prom-discovery

dc97454

remove coroutine limit

e4b443b

remove labels

f4267f5

Merge branch 'main' into support-prom-discovery

f6268c5

LeaveMyYard added a commit that referenced this pull request Apr 22, 2024

Moved the logic from #93 for a new refined structure

c7ad1cd

Co-authored-by: Megrez Lu <[email protected]>

LeaveMyYard closed this Apr 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support prom discovery #93

Support prom discovery #93

lujiajing1126 commented Jul 3, 2023

LeaveMyYard commented Sep 4, 2023

danielhass commented Mar 17, 2024

aantn commented Mar 28, 2024

aantn commented Mar 28, 2024

danielhass commented Mar 28, 2024

aantn commented Apr 1, 2024 •

edited

Loading

danielhass commented Apr 9, 2024 •

edited

Loading

aantn commented Apr 10, 2024

LeaveMyYard commented Apr 26, 2024

Support prom discovery #93

Support prom discovery #93

Conversation

lujiajing1126 commented Jul 3, 2023

LeaveMyYard commented Sep 4, 2023

danielhass commented Mar 17, 2024

aantn commented Mar 28, 2024

aantn commented Mar 28, 2024

danielhass commented Mar 28, 2024

aantn commented Apr 1, 2024 • edited Loading

danielhass commented Apr 9, 2024 • edited Loading

aantn commented Apr 10, 2024

LeaveMyYard commented Apr 26, 2024

aantn commented Apr 1, 2024 •

edited

Loading

danielhass commented Apr 9, 2024 •

edited

Loading