LazyChunk by castelao · Pull Request #80 · NatLabRockies/reVRt

castelao · 2025-06-18T03:20:30Z

Abstracting the chunk access.

We calculate the cost in full chunks at a time, thus the cost module doesn't need to understand Zarr, but just a collection of variables as ndarrays. We need a collection because the cost definition can be any combination of variables. We don't necessarily use all the variables available in the dataset, so it would be a waste to load all of them (imagine a cost based on a single variable and a features dataset with 50 variables!). Also, one variable can be used in multiple layers, so we want to reduce the I/O and load that chunk-variable only one to be used in all layers.

Instead of passing a Zarr object, we use a LazyChunk, which behaves like a HashMap, so the cost function asks for a variable, and the LazyChunk reads it in the first time, and just access on the subsequent.

The most important change here is to reduce the context in the cost module.

ppinchuk · 2025-06-18T04:03:02Z

To fix the failing Python pixi tests, just merge (rebase) onto main

ppinchuk

LGTM!

castelao · 2025-06-18T16:11:27Z

@ppinchuk , I did some few improvements/adjustments but it is not ideal yet. This could take more documentation and tests. I have to move to something else, so I'll close this one as it is and return when I have a chance.

ppinchuk · 2025-06-18T17:08:09Z

No objections, but maybe a quick commit to fix the Rust linter?

We certainly won't need u64 for this, but let's keep everything consistent on u64 for now and reduce that when it's time to optimize.

The `calculate` is now agnostic on the domain. It just respond to the given collection subset. Whoever calls this function is in charge of defining the target domain.

Reinstating log info on the chunk indices.

castelao added this to the 0.1.0 - Minimalist demonstration milestone Jun 18, 2025

castelao requested a review from ppinchuk June 18, 2025 03:20

castelao self-assigned this Jun 18, 2025

castelao added the enhancement Update to logic or general code improvements label Jun 18, 2025

castelao force-pushed the subset branch from 8f5c386 to 01bf73d Compare June 18, 2025 04:20

ppinchuk approved these changes Jun 18, 2025

View reviewed changes

castelao added 15 commits June 18, 2025 16:41

feat: Chunk

41dc4cc

refact: Using LazyChunk instead of passing a Zarr object

d466dd9

importing LazyChunk

1c0359e

style

120229e

refact: Renaming to features

df3395f

refact: Simplify for now and use chunk indices as u32

634289c

We certainly won't need u64 for this, but let's keep everything consistent on u64 for now and reduce that when it's time to optimize.

doc, fix: Updating documentation to reflect changes

2123da7

fix: Renamed to ci/cj

86bd7df

refact: Renamed to calculate instead of calculate_chunk

1bb5b30

The `calculate` is now agnostic on the domain. It just respond to the given collection subset. Whoever calls this function is in charge of defining the target domain.

fix: i/j were renamed to ci/cj

fd3879f

feat: Providing ci/cj from LazyChunk

e29f1ca

Reinstating log info on the chunk indices.

Ideas and thoughts

7e9cc2d

Not used/verified yet

95dd2c6

clean: Unecessary type conversion

ef0c0b7

style:

f4023da

castelao force-pushed the subset branch from 2f4f2f2 to f4023da Compare June 18, 2025 23:44

castelao merged commit d87244d into main Jun 19, 2025
22 checks passed

castelao deleted the subset branch June 19, 2025 03:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LazyChunk#80

LazyChunk#80
castelao merged 15 commits intomainfrom
subset

castelao commented Jun 18, 2025

Uh oh!

ppinchuk commented Jun 18, 2025 •

edited

Loading

Uh oh!

ppinchuk left a comment

Uh oh!

castelao commented Jun 18, 2025

Uh oh!

ppinchuk commented Jun 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

castelao commented Jun 18, 2025

Uh oh!

ppinchuk commented Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ppinchuk left a comment

Choose a reason for hiding this comment

Uh oh!

castelao commented Jun 18, 2025

Uh oh!

ppinchuk commented Jun 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ppinchuk commented Jun 18, 2025 •

edited

Loading