test: refactor test suite to (hopefully) fix flakiness by mccutchen · Pull Request #65 · mccutchen/websocket

mccutchen · 2025-10-07T10:20:05Z

The theory here is that the new, more rigorous tests for closing handshake added as part of #63 have caused significantly more flakiness because of setupRawConnWithHandler's use of a throwaway bufio.Reader around the underlying net.Conn just to read the HTTP 101 response for the opening handshake.

This causes a race with tests where the server might write websocket data to the connection immediately after the handshake (e.g. when a test immediately closes the connection), because the bufio.Reader is likely to read (partial) websocket data while reading the HTTP response, leaving subsequent reads with only partial/incomplete data and causing them to block until the test suite times out.

I'm very curious to see if this fixes the tests. If it works, credit goes to Claude Code for helping identify the race condition. If it doesn't, well, obviously LLMs are useless.

Update: while that issue was likely a source of flakiness, fixing it did not actually make the test suite any less flaky.

Ended up doing a much larger refactor to set up complete client and server connections instead of setting up a single connection to simulate a client and server reading from/writing to the same conn.

This seems to have helped a bit, maybe, but we're still seeing intermittent test failures due to timeouts.

github-actions · 2025-10-07T10:20:13Z

🔥 Run benchmarks comparing 856dd89 against main:

gh workflow run bench.yaml -f pr_number=65

Note: this comment will update with each new commit.

mccutchen · 2025-10-07T10:30:09Z

No dice on the more minimal changes, let's see if dropping net.Pipe will help us out here

codecov · 2025-10-18T21:04:10Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 96.99%. Comparing base (ca6c1e4) to head (856dd89).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #65      +/-   ##
==========================================
+ Coverage   96.66%   96.99%   +0.32%     
==========================================
  Files           2        2              
  Lines         570      432     -138     
==========================================
- Hits          551      419     -132     
+ Misses         14        7       -7     
- Partials        5        6       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

mccutchen · 2025-10-19T12:00:01Z

Ended up doing a much bigger refactor to ensure all tests were using real TCP connections between a client and a real server instead of reading from and writing to a single shared connection.

Unfortunately … we're still failing in confusing and weird ways:

--- FAIL: TestProtocolErrors (0.00s)
    --- FAIL: TestProtocolErrors/max_message_size (0.02s)
        websocket_test.go:612: expected error "message paylaod too large", got "client frame must be masked" (*websocket.Error vs *websocket.Error)
        websocket_test.go:605: incorrect close status code:
            want: 1009
             got: 1002
coverage: 87.0% of statements
panic: test timed out after 1m0s
	running tests:
		TestCloseFrameValidation (1m0s)
		TestCloseFrameValidation/invalid_close_code_(too_high)) (1m0s)

mccutchen added 6 commits October 15, 2025 16:20

test: fix flaky tests (hopefully)

e816fc6

drop net.Pipe usage

ba326cc

meh refactor tests

f8a4a3e

wip

64155e3

meh

22cbc5f

bigger refactor

fe05e87

mccutchen force-pushed the flaky-tests-2 branch from d3363ec to fe05e87 Compare October 17, 2025 13:22

mccutchen added 13 commits October 17, 2025 10:38

claude refactor

3f6e8f0

touchup claude refactoring

cf605bb

fix: close immediately on basically any proto-level error

e1dcdde

update TestProtocolErrors

ccc752d

update TestCloseFrames

901c235

claude refactor TestCloseHandhsake

284c9b8

fixup TestClosehandshake

c82b837

extend clientServerTest w/ conn wrappers

3453a33

refactor TestConnectionLimits

1961cf6

delete

6629dc9

claude refactor TestNetworkErrors

fcbc4c8

update TestServeLoop and TestClose

06cbdcd

cleanup

006a07f

mccutchen added 3 commits October 18, 2025 17:05

cleanup

d6c8aba

cleanup

7770c71

unused?

5ce3495

mccutchen added 2 commits October 19, 2025 08:01

undo test logging change

4eab61e

missed test case

7e629cf

mccutchen changed the title ~~test: fix flaky tests (hopefully)~~ test: refactor test suite to (hopefully) fix flakiness Oct 19, 2025

gaming code coverage

6ac9e78

mccutchen force-pushed the flaky-tests-2 branch from 7a5b6fa to 6ac9e78 Compare October 25, 2025 09:09

mccutchen enabled auto-merge (squash) October 25, 2025 10:31

more generous timeouts for CI flakiness???

856dd89

mccutchen merged commit 8744466 into main Oct 25, 2025
10 checks passed

mccutchen deleted the flaky-tests-2 branch October 25, 2025 10:45

mccutchen mentioned this pull request Oct 27, 2025

bug: flaky test suite under go 1.25 #72

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test: refactor test suite to (hopefully) fix flakiness#65

test: refactor test suite to (hopefully) fix flakiness#65
mccutchen merged 26 commits intomainfrom
flaky-tests-2

mccutchen commented Oct 7, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Oct 7, 2025 •

edited

Loading

Uh oh!

mccutchen commented Oct 7, 2025

Uh oh!

codecov bot commented Oct 18, 2025 •

edited

Loading

Uh oh!

mccutchen commented Oct 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mccutchen commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mccutchen commented Oct 7, 2025

Uh oh!

codecov bot commented Oct 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

mccutchen commented Oct 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mccutchen commented Oct 7, 2025 •

edited

Loading

github-actions bot commented Oct 7, 2025 •

edited

Loading

codecov bot commented Oct 18, 2025 •

edited

Loading