max_chunks_per_file#

coffea.dataset_tools.max_chunks_per_file(fileset: LimitStepsProtocol | DataGroupSpec, maxchunks: int | None = None) DataGroupSpec[source]#

Modify the input fileset so that only the first “maxchunks” chunks of each file will be processed.

Parameters:
  • fileset (DataGroupSpec) – The set of datasets reduce to max-chunks row-ranges.

  • maxchunks (int or None, default None) – How many chunks to keep for each file.

Returns:

out – The reduced fileset with only the first maxchunks event ranges left in.

Return type:

DataGroupSpec