Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Debug concurrent streams test #624

Merged
merged 5 commits into from
Nov 9, 2023
Merged

Debug concurrent streams test #624

merged 5 commits into from
Nov 9, 2023

Conversation

emcfarlane
Copy link
Contributor

@emcfarlane emcfarlane commented Nov 3, 2023

Locally increasing the number of concurrent streams and running with -race generates:

writeRequest io.ErrClosedPipe
2023/11/03 10:19:46 timeout waiting for SETTINGS frames from pipe
writeRequest io.ErrClosedPipe
--- FAIL: TestConcurrentStreams (35.69s)
    connect_ext_test.go:490: failed to send request: unknown: write envelope: EOF
    connect_ext_test.go:490: failed to send request: unknown: write envelope: EOF

Large number of go routines can timeout past the arbitrary limit of 2 seconds in the HTTP2 libraries. See comment here.
Bind the concurrency to runtime.GOMAXPROCS to reduce resource requirements for testing on different machines.

See golang/go#60359

Occurred in https://github.com/connectrpc/connect-go/actions/runs/6725037978/job/18278596900?pr=611

@emcfarlane emcfarlane marked this pull request as ready for review November 3, 2023 14:42
connect_ext_test.go Outdated Show resolved Hide resolved
@emcfarlane emcfarlane requested review from jhump and removed request for jchadwick-buf November 6, 2023 20:02
Copy link
Member

@jhump jhump left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. But I'm worried that maybe now we're being too conservative with the number of concurrent operations. Any reason not to lift this a bit before merging?

@@ -475,7 +476,7 @@ func TestConcurrentStreams(t *testing.T) {
server := memhttptest.NewServer(t, mux)
var done, start sync.WaitGroup
start.Add(1)
for i := 0; i < 100; i++ {
for i := 0; i < runtime.GOMAXPROCS(0)*2; i++ {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In case GOMAXPROCS is very low, can we use a slightly higher multiplier? Like maybe 10? Since the test is meant to exercise concurrency, having only two concurrent streams (if GOMAXPROCS is 1) seems potentially ineffective. Also, this likely defaults to 8 or 10 on your MBP, which means we're only doing 16 or 20 requests, which seems quite hard to believe it could exceed 2 seconds. So the multiplier of 2 seems like it may be overly cautious.

Also, do we know the actual value of this in CI? It was previously resolving to 100 operations. How many does it do now, in practice? Maybe we also should t.Log the actual duration of the operations, so we can see if/when we're getting close to the 2 second limit?

Copy link
Contributor Author

@emcfarlane emcfarlane Nov 7, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What's the value in increasing the concurrency further? *2 ensures theres some level of concurrency. If GOMAXPROCS is 1, 2 will still be testing concurrent execution.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was hoping for stronger guarantee of concurrent activity -- the scheduler is non-deterministic, so it seems trivial for this to allow one goroutine to complete before the other actually starts. I suppose another way to force concurrency might be to call Send(nil) (to make sure streams are actually started) before calling start.Wait().

But I guess it also depends on what this is trying to test. The previous value of 100 is quite high, so I assumed it wanted to be a sort of stress test. And in that case, reducing all the way from 100 down to 2 streams seems rather insufficient. So, based on my interpretation of the intent, I would expect to see more than just 2 here.

Copy link
Contributor Author

@emcfarlane emcfarlane Nov 9, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The windows runner has more cores. They are relatively low and maybe would catch more with more concurrency. Each stream does do 100 back on forth on the stream per go rountine. So I thought it would be good enough to just have some parallelism for the test. Will bump to 8 and can always lower later.

Windows:

    connect_ext_test.go:514: NUMCPU 4
    connect_ext_test.go:515: GOMAXPROCS 4

Linux:

    connect_ext_test.go:514: NUMCPU 2
    connect_ext_test.go:515: GOMAXPROCS 2

@emcfarlane emcfarlane force-pushed the ed/debugConcurrentStreamsTest branch from 934a209 to e96582b Compare November 9, 2023 15:58
@jhump jhump merged commit 67dceff into main Nov 9, 2023
7 checks passed
@jhump jhump deleted the ed/debugConcurrentStreamsTest branch November 9, 2023 16:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants