[build_manager] Skip unzipping during fuzz task target discovery by PauloVLB · Pull Request #5298 · google/clusterfuzz

PauloVLB · 2026-05-29T13:46:32Z

Context b/500991018 and b/509600495.

The Problem

During the very first target discovery run of a new engine fuzzer job (or after mappings are reset), no target is selected yet (fuzz_target=None / "unknown").

The build manager was currently attempting to download and uncompress the entire build archive (which are massive: 123 GB on Linux, and up to 397 GB on Windows) just to list fuzzer target names and exit.

This causes space allocation checks (_make_space) to fail on standard GCE bot disks (75GB - 200GB), leaving the job stuck in an infinite crash loop.

The Fix

During fuzz task target discovery runs (where fuzz_target is None for engine jobs with selective unzipping), we bypass zip file extraction entirely. We only open the archive, read fuzzer target names in memory from the catalog index (which takes milliseconds over HTTP without disk allocation), save them to Datastore, and exit early.

Once saved, subsequent runs select a target and run selective unzipping (only ~500 MB), which fits comfortably.

Impact in other workflows

To ensure this optimization has zero impact on other active workflows, the bypass is protected by five guards:

not self.fuzz_target: Restricts only to target-discovery runs (where no target is selected yet).
not self._unpack_everything: Restricts only when selective target unzipping is enabled (if a job disables selective unzipping, it requires a full unpack of all targets, so we must not bypass).
environment.is_engine_fuzzer_job(): Restricts only to engine fuzzers (blackbox fuzzer jobs don't selective-unpack and always need to fully unzip their application binaries).
environment.get_value('TASK_NAME') == 'fuzz': Restricts only to fuzzing tasks (progression/regression tasks working on crashes always need their target build unpacked to disk to run reproductions).
not self.build_prefix: Restricts only to the primary target build and not supporting extra engine binary packages (which must always be fully unpacked to disk).
environment.platform() == 'WINDOWS': Restricts this bypass strictly to Windows bots. This limits the production rollout blast radius exclusively to the Windows platform to resolve the Windows bot crash block (b/509600495) with absolute safety.

Note on Testing

Unit tests were updated and a new discovery test was added. All tests are passing. Note that this cannot be easily tested in dev because we don't have working local Windows bots running right now, but the logic is fully covered by unit tests.

letitz · 2026-06-01T08:58:05Z

Drive-by comments.

Context b/500991018 and b/509600495.

The Problem

During the very first target discovery run of a new engine fuzzer job (or after mappings are reset), no target is selected yet (fuzz_target=None / "unknown").

The build manager was currently attempting to download and uncompress the entire build archive (which are massive: 123 GB on Linux, and up to 397 GB on Windows) just to list fuzzer target names and exit.

This causes space allocation checks (_make_space) to fail on standard GCE bot disks (75GB - 200GB), leaving the job stuck in an infinite crash loop.

The Fix

During fuzz task target discovery runs (where fuzz_target is None for engine jobs with selective unzipping), we bypass zip file extraction entirely. We only open the archive, read fuzzer target names in memory from the catalog index (which takes milliseconds over HTTP without disk allocation), save them to Datastore, and exit early.

Note that checking for fuzz targets in the archive will still involve unzipping a significant portion of the contents of the archive (over HTTP, in-memory only) due to fuzzer_utils.is_fuzz_target() 1 checking the contents of the files 2 if it's unsure about the file.

@notvictorl is working on improving this in crbug.com/508214240, so for chrome archives we will soon only need to unzip a tiny json file.

Once saved, subsequent runs select a target and run selective unzipping (only ~500 MB), which fits comfortably.

Impact in other workflows

To ensure this optimization has zero impact on other active workflows, the bypass is protected by five guards:

not self.fuzz_target: Restricts only to target-discovery runs (where no target is selected yet).

not self._unpack_everything: Restricts only when selective target unzipping is enabled (if a job disables selective unzipping, it requires a full unpack of all targets, so we must not bypass).

environment.is_engine_fuzzer_job(): Restricts only to engine fuzzers (blackbox fuzzer jobs don't selective-unpack and always need to fully unzip their application binaries).

Blackbox fuzzer jobs don't have a concept of fuzz targets to discover anyway.

environment.get_value('TASK_NAME') == 'fuzz': Restricts only to fuzzing tasks (progression/regression tasks working on crashes always need their target build unpacked to disk to run reproductions).

Same, progression and regression task never need to discover fuzz targets anyway? They should always run with a specific fuzz target.

not self.build_prefix: Restricts only to the primary target build and not supporting extra engine binary packages (which must always be fully unpacked to disk).

environment.platform() == 'WINDOWS': Restricts this bypass strictly to Windows bots. This limits the production rollout blast radius exclusively to the Windows platform to resolve the Windows bot crash block (b/509600495) with absolute safety.

This bug has been affecting linux bots too.

Note on Testing

Unit tests were updated and a new discovery test was added. All tests are passing. Note that this cannot be easily tested in dev because we don't have working local Windows bots running right now, but the logic is fully covered by unit tests.

IIRC we hit this issue in dev also. @notvictorl will remember the details, if I'm not hallucinating :)

PauloVLB requested a review from a team as a code owner May 29, 2026 13:46

PauloVLB requested review from ViniciustCosta and javanlacerda May 29, 2026 13:47

PauloVLB force-pushed the fix/target-discovery-space-optimization branch from e3c6c9b to d1f75c6 Compare May 29, 2026 17:50

[build_manager] Skip unzipping during fuzz task target discovery

3be95e5

PauloVLB force-pushed the fix/target-discovery-space-optimization branch from d1f75c6 to 3be95e5 Compare May 29, 2026 18:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[build_manager] Skip unzipping during fuzz task target discovery#5298

[build_manager] Skip unzipping during fuzz task target discovery#5298
PauloVLB wants to merge 1 commit into
masterfrom
fix/target-discovery-space-optimization

PauloVLB commented May 29, 2026 •

edited

Loading

Uh oh!

letitz commented Jun 1, 2026

The Problem

The Fix

Impact in other workflows

Note on Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

PauloVLB commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

The Problem

The Fix

Impact in other workflows

Note on Testing

Uh oh!

letitz commented Jun 1, 2026

The Problem

The Fix

Impact in other workflows

Note on Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

PauloVLB commented May 29, 2026 •

edited

Loading