Multi-page scans could be in parallel

This is tricky, as we'd have to think carefully about stability and also about doing as few queries as possible to support usage as a generator while still offering good performance.

We could do something along the lines of...

1. A scan request creates a ResultGenerator object with the following attributes:
   a. a job queue with the complete set of continuation tokens
   b. a results queue with a maximum number of elements (say, 10,000)
   c. a kill signal
   d. some number W of threaded workers
2. Each worker makes their request and feeds into the queue. If the queue is full, they enter a holding pattern, sleeping for short intervals while waiting for room on the queue and also for their kill signal.
3. Values are taken off the queue by a generator
4. Workers pull the next continuation token and the next request when there is room on the results queue but they have no more results to report
5. When the job queue is exhausted the process is done
6. On ResultGenerator cleanup, the kill_signal is issued, so that garbage collection terminates all query workers.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-page scans could be in parallel #134

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Multi-page scans could be in parallel #134

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions