-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Next steps for distributed deployments #535
Comments
Some relevant places in the code:
|
I think that if this is the specification it should be the same specification across local and remote node, thus breaking changes compared to current implementation.
I would expect this to be an available option all the time as it might not be easy to specify a specific file very far from the daemon spawning path. |
If we pass parameter working_dir when we start daemon then we can manage working_dir just like machine_id, When a node is started, the coordinator passes the working_dir of the corresponding daemon so we don't have to skip checks. However, I'm not sure that's possible. |
This works when both daemons are running on the same machine. However, if a daemon runs on a remote machine, we have no access to its file system, so we cannot check the paths. |
Good point, I added this drawback to the list above.
Yes, it's always available as an option. What I meant is that we don't allow relative paths for remote machines. |
In that case can we maybe try an implementation using option 2 before making Option 1. What do you think @XxChang @Gege-Wang ? |
I think it is good, let me do it. |
I opened a draft PR that implements option 2 a few days ago, maybe that's useful: #534 One challenge is the |
I see. Maybe we can use some environment variable to fix CI? Otherwise, I guess it's fine to hard code GitHub CI path for now. |
There are some issues about
If the cli check the dataflow, this problem should be always here, because cli never know whether the daemon are local or remote.
however, it doesn't work, because the ip of machine A is 127.0.0.1,so we must start dataflow like this
|
Thanks a lot for testing and reporting these issues! This is very useful!
I think we can fix this in the following way:
The issue is around these lines: dora/binaries/coordinator/src/lib.rs Lines 181 to 184 in 9d2ee36
If the
Good catch! So we need a way to check whether a path is an absolute Windows path on Linux systems (and the other way around). Maybe there are some crates that allow this? Alternatively, we could copy the |
This issue outlines the steps that we need to take to make dora dataflow work across multiple machines:
dora check
to skip checking paths on remote machinesdora start
The text was updated successfully, but these errors were encountered: