You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In [82]: pd.merge_asof(df1, df2, on='time', allow_exact_matches=False, tolerance=pd.Timedelta('10ms'))
Out[82]:
time username version
0 2016-07-15 13:30:00.030 bob NaN
However, if I change the first DataFrame to have duplicate timestamps:
In [85]: pd.merge_asof(df1, df2, on='time', allow_exact_matches=False, tolerance=pd.Timedelta('10ms'))
Out[85]:
time username version
0 2016-07-15 13:30:00.030 bob 1
1 2016-07-15 13:30:00.030 charlie 1
This is in pandas version 0.18.0+418.gc46dcfa.
The text was updated successfully, but these errors were encountered:
I just rewrote the Cython logic to compare the factorized keys directly since that was the easiest way forward. Though we don't actually have to factorize the keys at all; we could just compare the timestamps directly, which would be even faster.
This is a continuation of #13695.
Starting with the original DataFrames from that issue:
I now get the null:
However, if I change the first DataFrame to have duplicate timestamps:
then the bug reappears:
This is in pandas version 0.18.0+418.gc46dcfa.
The text was updated successfully, but these errors were encountered: