Closed
Labels
IO HDF5 (read_hdf, HDFStore), Needs Tests (unit tests needed to prevent regressions), good first issue
Description
Code Sample, a copy-pastable example if possible
import pandas as pd
df = pd.DataFrame({
'a': [1, 2, 3, 4],
'b': ['foo', 'bar', 'baz', 'qux'],
'c': [5, 6, 7, 8]
})
print(df['a'].values.strides)
store = pd.HDFStore('example.h5')
store['df'] = df
print(store['df']['a'].values.strides)
Output:
(8,)
(16,)
Problem description
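I ran across this while benchmarking. After a round trip through HDFStore, the values of column 'a' come back with a stride of 16 bytes instead of 8, so the column is no longer contiguous in memory. This has serious performance implications for large DataFrames. Is this the result of an underlying limitation in HDF5?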
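To illustrate what the stride difference means, here is a minimal NumPy-only sketch. It assumes (this is not confirmed above) that the two int64 columns 'a' and 'c' end up interleaved in a single row-major 2-D buffer after the read. The `block` array is a hypothetical stand-in for that buffer, not pandas' actual internal layout.

```python
import numpy as np

# Hypothetical stand-in: columns 'a' and 'c' interleaved in one
# row-major (4, 2) int64 buffer. This layout is an assumption.
block = np.array([[1, 5], [2, 6], [3, 7], [4, 8]], dtype=np.int64)

# Selecting column 'a' yields a view that skips over the 'c' values,
# so consecutive elements are 16 bytes apart instead of 8.
col_a = block[:, 0]
print(col_a.strides)                 # (16,)
print(col_a.flags["C_CONTIGUOUS"])   # False

# Copying into a dense buffer restores the 8-byte stride.
packed = np.ascontiguousarray(col_a)
print(packed.strides)                # (8,)
print(packed.flags["C_CONTIGUOUS"])  # True
```

If this is what happens, a possible workaround on the reader's side would be to copy the affected column's values into a contiguous array before running tight loops over them, at the cost of one extra copy.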