Open
Description
I have noticed performance increases in other projects when I choose default compression settings based on dtype.
Optimal compression settings depend strongly on bit patterns. Data types often strongly indicate bit pattern characteristics. For example integers often benefit more from compression than floats. Datetimes are often nearly sorted and so benefit more from shuffle.
It might improve performance to change the compression defaults in defaults.py
to come from a function that takes the dtype as an input.
Metadata
Metadata
Assignees
Labels
No labels