Release v2025.7.0 · scikit-hep/coffea

Important announcement

This release changes the default behavior of coffea . We are now focusing on doing analysis with the newly developed "virtual arrays" of awkward as the main backend.
For more information on virtual arrays, see this talk at PyHEP.dev 2025.
For examples of virtual array usage, see the following example repositories:
https://github.com/ikrommyd/coffea-virtual-array-tests
https://github.com/ikrommyd/coffea-virtual-array-demo
https://github.com/iris-hep/calver-coffea-agc-demo/blob/2025_IRISHEP_Training/agc-coffea-2025-virtual-arrays-and-executors.ipynb
https://github.com/ikrommyd/virtual-array-agc

The default behavior of NanoEventsFactory.from_root() has changed. It now reads the input root file using virtual arrays by default.
The backend choice is controlled by the mode argument of the method which can be set to "eager", "virtual", or "dask".
The new default is "virtual" while the delayed argument has been removed.
The old delayed=True is now equivalent to mode="dask". The old delayed=False is now equivalent to mode="eager".

At the same time, the coffea 0.7 processors and executors have been revived and analysis can be done using coffea 0.7-like syntax

from coffea.processor import ProcessorABC, Runner, DaskExecutor

class MyProcessor(ProcessorABC):
    def process(self, events):
        ...
        
    def postprocess(self, accumulator):
        ...

run = Runner(
    DaskExecutor(client=client, compression=None),
    chunksize=250_000,
    skipbadfiles=True,
    schema=NanoAODSchema,
    savemetrics=True
)

out, report = run(fileset, processor_instance=MyProcessor())

Analyses still using coffea 0.7 can and should seamlessly transition to this new release.

If you still want to use the dask interface (create task graphs), you should specify mode="dask" to NanoEventsFactory.from_root() when working on single file.
For scaling, you can still use the dataset_tools like the following

from coffea.dataset_tools import apply_to_fileset

apply_to_fileset(MyProcessor(), fileset, uproot_options={"allow_read_errors_with_report": True})

It is recommended to convert all analyses to use the new virtual arrays feature of awkward2 and not stick with packages that are unmaintained for 3 years (coffea 0.7 which still uses awkward1 ).
Please reach out for any help and to report problems.

New features

feat: EDM4HEPSchema and Newstyle FCCSchema by @prayagyadav in #1245
feat: add virtual arrays by @pfackeldey in #1277
feat: bring back iterative, futures and dask executors by @ikrommyd in #1323
feat: 0.7 style processor/executor model using ak2 virtual arrays. by @lgray in #1309
feat: bring back parsl executor by @ikrommyd in #1325
feat: add @original_array attr to events in virtual mode by @ikrommyd in #1327
feat: make column_accumulator support awkward arrays and add accumulator tests by @ikrommyd in #1352
feat: taskvine executor for new coffea by @btovar in #1360
feat: systematics handling for dask mode by @lgray in #786
feat: make max_chunks return the first N chunks per dataset (not per file per dataset like it is now) by @ikrommyd in #1359

Bug-fixes and performance

fix: properly support older numba/numpy mixtures by @lgray in #1298
fix: do not error or return None when calling min/max over length zero chunks in weight statistics, return infinities instead by @ikrommyd in #1328
fix: skip OSErrors when skipping bad files using executors by @ikrommyd in #1333
fix: executor's preprocess requires treename as input argument when there should be a default by @ikrommyd in #1334
fix: make NanoAODSchema the default in exectors for consistency with apply_to_fileset by @ikrommyd in #1335
fix: use awkward for min and max in Weights to avoid inconsistencies between eager/virtual and dask mode by @ikrommyd in #1337
fix: make nanoevents properly copiable and do not store the @original_array attribute as that will get copied by @ikrommyd in #1346
perf: use numpy only in eager weight statistics by @ikrommyd in #1351
fix: print the original processor error with the "failed processing file" exception by @ikrommyd in #1353
fix: _lazywhere was removed from scipy, use apply_where from scipy._lib.array_api_extra by @lgray in #1356
fix: make offsets start at zero forListOffsetArray coming from uproot (fix physlite entry start problem) by @ikrommyd in #1363
fix: clarify function names in systematics by @lgray in #1366
fix: Fix bugs in CorrectedMETFactory.build in coffea 202x by @ikrommyd in #1342
fix: error when delayed kwarg is used in nanoevents by @ikrommyd in #1367

Other

ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1294
ci(also docs): enforce constraints so spark / rucio build by @lgray in #1299
docs: we do not support python 3.8 any more! by @lgray in #1300
ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1301
ci: update tritonserver, use GHA arm runners by @lgray in #1302
ci: relax setuptools constraint to !=78.0.1 by @lgray in #1303
ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1305
ci: Use astral-sh/setup-uv to setup Python by @matthewfeickert in #1304
ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1310
ci: bump astral-sh/setup-uv from 5 to 6 by @dependabot[bot] in #1313
build(deps): skip dask 2025.4.0 by @lgray in #1315
ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1312
ci: bump actions/attest-build-provenance from 2.2.3 to 2.3.0 by @dependabot[bot] in #1316
ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1317
ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1320
chore: remove vector deprecation warning and add a warning for the switch to the virtual mode default by @ikrommyd in #1330
ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1324
ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1336
ci: change pre-commit schedule to monthly by @ikrommyd in #1341
ci(pre-commit): pre-commit autoupdate by @pre-commit-ci[bot] in #1344
ci: bump actions/attest-build-provenance from 2.3.0 to 2.4.0 by @dependabot[bot] in #1348
docs: return None in the cases where getattr fails in linkcode_resolve by @ikrommyd in #1349

Full Changelog: v2025.3.0...v2025.7.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v2025.7.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Important announcement

New features

Bug-fixes and performance

Other

Contributors

Uh oh!