Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Waiting on Kubernetes API on startup #7989

Closed
lyngekpdxc opened this issue Dec 20, 2024 · 9 comments
Closed

Waiting on Kubernetes API on startup #7989

lyngekpdxc opened this issue Dec 20, 2024 · 9 comments
Assignees
Labels
kind/bug Something isn't working platform/windows

Comments

@lyngekpdxc
Copy link

Actual Behavior

New installation won't finish starting up. Stuck on "Waiting on Kubernetes API"

Steps to Reproduce

Install Rancher Desktop
Start Rancher Desktop

Result

Stuck on this state without any further progress. Restarting the application. Uninstall/reinstall does not resolve the issue.

Expected Behavior

Install + Start should properly finish setup and give a working installation.

Additional Information

I have previous to installing Rancher desktop already used WSL with Ubuntu 24.04 with a docker installation. I would think that this should have no effect on Rancher. But perhaps it might be relevant to mention?

Rancher Desktop Version

1.16.0

Rancher Desktop K8s Version

1.30.6

Which container engine are you using?

moby (docker cli)

What operating system are you using?

Windows

Operating System / Build Version

Windows 11 Enterprise 23H2

What CPU architecture are you using?

x64

Linux only: what package format did you use to install Rancher Desktop?

None

Windows User Only

Appgade SDP and Ivanti Secure Client. Used for work environment. I have tried disabling both without any change.

@lyngekpdxc lyngekpdxc added the kind/bug Something isn't working label Dec 20, 2024
@lyngekpdxc
Copy link
Author

logs.zip
I have attached logs here

@mook-as
Copy link
Contributor

mook-as commented Dec 20, 2024

wsl.log contains a lot of loops of:

2024-12-20T12:10:56.541Z: Capturing output: wsl.exe --distribution rancher-desktop --exec cat /proc/net/route
2024-12-20T12:10:56.892Z: captureCommand:
command: (cat '/proc/net/route')
output: <Iface  Destination     Gateway         Flags   RefCnt  Use     Metric  Mask            MTU     Window  IRTT                                                       
eth3    00000000        0101A8C0        0003    0       0       25      00000000        0       0       0                                                                              
docker0 000011AC        00000000        0001    0       0       0       0000FFFF        0       0       0                                                                            
eth3    0001A8C0        00000000        0001    0       0       281     00FFFFFF        0       0       0                                                                             
eth3    0101A8C0        00000000        0005    0       0       25      FFFFFFFF        0       0       0                                                                              
veth-rd-wsl     008FA8C0        00000000        0001    0       0       0       00FFFFFF        0       0       0                                                                        
>

That means it's stuck because the code is failing around

const routesString = await this.captureCommand('cat', '/proc/net/route');
const routes = routesString.split(/\r?\n/).map(line => line.split(/\s+/));
const route = routes.find(route => route[0] === 'eth0' && route[1] !== '00000000');
if (!route) {
return undefined;
}

So the issue seems to be that we're unexpectedly not seeing eth0 (but there's eth3 instead). We may need to fall back to any interface that has a destination of all zeros, but I'd be curious to know about why the interface name is different.

(Having a different distribution with a local dockerd should be no issue; I have that too, just with openSUSE. Also, thanks for uploading the logs!)

@lyngekpdxc
Copy link
Author

lyngekpdxc commented Jan 6, 2025

Thanks for clarifying. I don't know the exact mapping for the network interfaces in WSL. Are they directly referring to the actual network interfaces on the host system, or is there some internal mapping in the WSL layer to avoid issues with this?

I am using a dock, so my PC has two ethernet ports, the native "Ethernet" is not in use, while the one on the dock "Ethernet 2" is. If this is also mapped into WSL, perhaps this answers the question. I can try to plug the network cable directly into my laptop to see if this resolves the issue for now.

@lyngekpdxc
Copy link
Author

No immediate difference in the behaviour, still seems to be looping. However it seems that it now continues from the check mentioned before but is still looping back, perhaps due to a failure in the next steps?

I have uploaded new logs below.
logs.zip

@mook-as
Copy link
Contributor

mook-as commented Jan 6, 2025

The WSL networks should be unrelated to your host networks. There's #6665 (which should hopefully be fixed in 1.17), but I'm not sure that's related to this issue.

Looks like your new logs are a lot more complex, which does make me wonder if you have networkingmode=mirrored set. Can you check if that's the case?

@lyngekpdxc
Copy link
Author

You are exactly right. I have had to setup mirrored networking mode in the past to get wsl to work properly with the proxy/vpn setup that I use in my work. I did not realize that this would have an effect on new distributions as well. But I would believe that rancher would experience similar networking issues in my daily setup where I would need to access company repositories, etc.

Would this still be resolved by supporting other network adapter names. Or is mirrored networking mode not supported on purpose?

@mook-as
Copy link
Contributor

mook-as commented Jan 7, 2025

No, there's a separate code path now to support mirrored networking (by always using localhost to talk to the Kubernetes backend), so that should be fine as of the… current release (1.17.0 was released less than a day ago).

@lyngekpdxc
Copy link
Author

That did indeed seem to do the trick! Thank you for the heads up. I had previously thought about updating, but no new version was available. The new version fixed the issue I had.
Looking forward to trying out rancher desktop.

@mook-as
Copy link
Contributor

mook-as commented Jan 8, 2025

Yay! And yeah, I don't think we've enabled the auto-update yet; we normally wait a few days to see if anybody reports significant issues before doing so.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Something isn't working platform/windows
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants