Make focal kernel memory guard dtype-aware (#3223) by brendancol · Pull Request #3232 · xarray-contrib/xarray-spatial

brendancol · 2026-06-10T17:47:53Z

_check_kernel_vs_raster_memory budgeted a flat 4 bytes per cell ("focal internals cast to float32"). That stopped being true when Preserve input float dtype in apply() and focal_stats() (#2769) #2805 made apply() and focal_stats() preserve float64, so the guard underestimated float64 allocations by 2x and a kernel + raster combo could pass the check and then use up to ~100% of available memory, the OOM the guard exists to stop (focal apply()/focal_stats()/hotspots() accept unbounded user kernels #1284).
The guard now takes an itemsize argument. apply() and focal_stats() pass np.dtype(_promote_float(agg.dtype)).itemsize (8 for float64 input, 4 otherwise). hotspots() keeps the default 4 since it computes in float32 on every backend.
The state CSV commit is from the security sweep that found this.

Backend coverage: the guard runs at the public entry points before dispatch, so all four backends get the same check. No backend code changed.

Test plan:

test_apply_oversize_kernel_accounts_for_float64_3223: float64 combo sized to pass the old 4-byte budget now raises MemoryError; the same combo as float32 still runs
test_focal_stats_oversize_kernel_accounts_for_float64_3223
Existing focal apply()/focal_stats()/hotspots() accept unbounded user kernels #1284 guard tests unchanged and passing
Full test_focal.py suite: 235 passed

brendancol

PR Review: Make focal kernel memory guard dtype-aware (#3223)

Blockers (must fix before merge)

None.

Suggestions (should fix, not blocking)

None.

Nits (optional improvements)

A companion test asserting that hotspots() with float64 input is still budgeted at 4 bytes/cell (i.e. not over-rejected) would pin down the asymmetry the comment at focal.py:1758 describes. The existing #1284 hotspots test plus the comment already make the intent clear, so this is optional.

What looks good

The itemsize is derived from _promote_float(agg.dtype), the same function the internals use to pick their compute dtype, so the budget and the actual allocations cannot drift apart again the way the hardcoded 4 did after #2805.
hotspots() keeps the default 4 with a comment explaining why (it computes in float32 on every backend); passing the promoted itemsize there would have over-rejected float64 input.
The new tests are deterministic (patched _available_memory_bytes, fixed 1 MB budget) and test both sides: float64 rejected at exactly the sizes that used to slip through, float32 still allowed.
The guard runs at the public entry points before backend dispatch, so the 3D per-band recursion and all four backends inherit the corrected budget.
np.dtype(...) works for numpy, cupy, and dask-backed DataArrays alike since .dtype is a numpy dtype in all cases.

Checklist

Algorithm matches reference (budget formula unchanged; only bytes/cell corrected)
All implemented backends produce consistent results (guard is backend-independent)
NaN handling is correct (not touched)
Edge cases are covered by tests (pass/reject boundary on both dtypes)
Dask chunk boundaries handled correctly (not touched)
No premature materialization or unnecessary copies (dtype lookup only, no data access)
Benchmark exists or is not needed (validation-path change, no kernel work)
README feature matrix: no change needed
Docstrings present and accurate (guard docstring updated with the #2805 history)

)

brendancol

Follow-up after the nit: added test_hotspots_float64_keeps_float32_budget_3223, which confirms float64 input to hotspots() stays on the 4 bytes/cell budget and is not over-rejected. Full focal suite: 236 passed. Nothing further from me.

…ocal-2026-06-10-02

…ocal-2026-06-10-02 Conflicts: xrspatial/focal.py, xrspatial/tests/test_focal.py. Combined the dtype-aware itemsize budget (#3223) with main's chunk-aware budgeting (#3228); kept both sides' new tests and bumped the _promote_float spy count in the #3231 test by one for the guard's dtype-only call.

brendancol added 2 commits June 10, 2026 10:41

Update security sweep state for focal (issues #3222, #3223)

9dc0a04

Make focal kernel memory guard dtype-aware (#3223)

e322275

github-actions Bot added the performance PR touches performance-sensitive code label Jun 10, 2026

brendancol commented Jun 10, 2026

View reviewed changes

Address review nit: pin hotspots float32 budget for float64 input (#3223

de929ec

)

brendancol commented Jun 10, 2026

View reviewed changes

brendancol added 3 commits June 10, 2026 10:49

Merge remote-tracking branch 'origin/main' into deep-sweep-security-f…

d6cf1ce

…ocal-2026-06-10-02

Merge branch 'main' into deep-sweep-security-focal-2026-06-10-02

b78034c

brendancol merged commit 377c35e into main Jun 11, 2026
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make focal kernel memory guard dtype-aware (#3223)#3232

Make focal kernel memory guard dtype-aware (#3223)#3232
brendancol merged 6 commits into
mainfrom
deep-sweep-security-focal-2026-06-10-02

brendancol commented Jun 10, 2026

Uh oh!

brendancol left a comment

Uh oh!

brendancol left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

brendancol commented Jun 10, 2026

Uh oh!

brendancol left a comment

Choose a reason for hiding this comment

PR Review: Make focal kernel memory guard dtype-aware (#3223)

Blockers (must fix before merge)

Suggestions (should fix, not blocking)

Nits (optional improvements)

What looks good

Checklist

Uh oh!

brendancol left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant