Speed bottleneck on deepcopy #152

MicaelJarniac · 2022-05-30T19:20:32Z

We're having issues with extremely slow queries while using the object mapper. The raw query itself isn't slow, but instancing the objects in Python seems to be.

While profiling, we noticed that there seems to be a bottleneck on the following line, using deepcopy:

python-driver/cassandra/cqlengine/columns.py

Line 1041 in 94b64bb

copied_value = deepcopy(value)

We're not entirely sure what that deepcopy is for, so we tried removing it locally, and it seemed to result in a huge speed boost.

I believe simply removing it might break something, so I won't suggest doing that straight away, but I believe there might be a better approach to this that could avoid deepcopy when not necessary.

The text was updated successfully, but these errors were encountered:

ultrabug · 2022-06-16T15:01:03Z

FYI @jsurloppe @dargor @wyfo since we're into Python Scylla perf analysis: maybe we're affected somehow too and investigating could help?

@MicaelJarniac please be kindly reminded that we maintain a fork of the official cassandra driver here so we don't have all the background and history on every design and legacy decision

The plan of ScyllaDB is to switch out from this fork someday and use the scylla rust driver as a sane base for other language drivers with bindings.

fruch · 2022-06-26T15:22:06Z

I think this can easily be replaced by dict comprehension

        copied_value = {name:  field.to_python(value[name]) if value is not None or isinstance(field, BaseContainerColumn) else value[name] for name, field in self.user_type._fields.items() }
        return copied_value

ultrabug · 2022-07-16T11:06:21Z

@MicaelJarniac did you try what @fruch is proposing by any chance?

wyfo · 2022-07-16T11:59:20Z

I'm just wondering why is there or instead of and in

if copied_value[name] is not None or isinstance(field, BaseContainerColumn)

I mean, to_python seems to be a method of BaseContainerColumn, it's illogical to call it if only the first part of the test is true.

I also think there is a mistake in @fruch code because if value is not None should be if value[name] is not None.

I don't know this driver, so I don't know if there is some hidden implications for deepcopy removal; but it seems to me that it would be ok.
However, performance-wise, I would not use a dict-comprehension, but just replace deepcopy by a call to dict.copy.

$ python -m timeit -s "deepcopy = __import__(\"copy\").deepcopy; N = 10; value = dict(zip(range(N), range(N)))" "copied
 = deepcopy(value)" "for i in range(N):" "    if (val := value[i]) >= 5:" "        copied[i] = val + 1"
20000 loops, best of 5: 12.8 usec per loop
$ python -m timeit -s "N = 10; value = dict(zip(range(N), range(N)))" "{i: value[i] if value[i] < 5 else value[i] + 1 for i in range(N)}"
200000 loops, best of 5: 1.8 usec per loop
$ python -m timeit -s "N = 10; value = dict(zip(range(N), range(N)))" "copied = value.copy()" "for i in range(N):" "     if value[i] >= 5:" "        copied[i] = copied[i] + 1"
200000 loops, best of 5: 1.28 usec per loop

fruch · 2022-07-17T11:58:09Z

@MicaelJarniac @wyfo, since this part of the code isn't scylla specific,
if this change is that much impact, I would suggest taking it upstream to https://github.com/datastax/python-driver (so the rest of the community would gain from it)

MicaelJarniac · 2022-07-19T20:44:34Z

@MicaelJarniac @wyfo, since this part of the code isn't scylla specific, if this change is that much impact, I would suggest taking it upstream to https://github.com/datastax/python-driver (so the rest of the community would gain from it)

I was originally going to open this issue there, but they don't use GitHub Issues; they want us to create an account on another tracker or whatever, so I didn't bother to go through their bureaucracy.

But I agree, it'd make more sense for this to go there instead.

@MicaelJarniac did you try what @fruch is proposing by any chance?

I haven't tested it yet, sadly.

MicaelJarniac · 2022-07-19T20:53:55Z

https://datastax-oss.atlassian.net/browse/PYTHON-1309

fruch · 2022-07-19T20:57:31Z

@MicaelJarniac if you'll get a cold shoulder there, we could try applying those fixes here, but I'll first would want to enable the cqlengine integration tests, we are not really running them under this fork.

also if you guys some benchmark test you are using, it would be nice if you could share them (or even contribute them as tests)

k0machi · 2022-11-03T16:20:59Z

I've also encountered this issue with a Model that contains a UDT that contains a list of nested UDTs, this deepcopy call extends the query well into dozens of seconds in such a scenario. I see that upstream didn't take a look at the issue at all so maybe we should fix this issue on our end.

k0machi · 2023-11-30T14:19:28Z

After running cqlengine tests I have not found any regressions when completely removing the deepcopy in to_python. Removing deepcopy from to_database causes several tests to fail (as the source object now mutates when it is being serialized for the database.

I believe the rationale behind those deepcopies is to protect source object from modifications during db operations, however in to_python case I do not see a reason why it needs to be copied after being deserialized, so I think we may be able to remove it with no issues. In my tests I have found an almost 12x speedup when deserializing complex tables.

fruch · 2023-11-30T14:25:41Z

After running cqlengine tests I have not found any regressions when completely removing the deepcopy in to_python. Removing deepcopy from to_database causes several tests to fail (as the source object now mutates when it is being serialized for the database.

I believe the rationale behind those deepcopies is to protect source object from modifications during db operations, however in to_python case I do not see a reason why it needs to be copied after being deserialized, so I think we may be able to remove it with no issues. In my tests I have found an almost 12x speedup when deserializing complex tables.

Do a PR with this, and we'll put it on the next release

This change makes it so newly instanced UserType during deserialization isn't immediately copied by deepcopy, which could cause huge slowdown if that UserType contains a lot of data or nested UserTypes, in which case the deepcopy calls would cascade as each to_python call would eventually clone parts of source object. As there isn't a lot of information on why this deepcopy is here in the first place this change could potentially break something. Running integration tests against this commit does not produce regressions, so this call looks safe to remove, but I'm leaving this warning here for the future reference. Fixes scylladb#152

This change makes it so newly instanced UserType during deserialization isn't immediately copied by deepcopy, which could cause huge slowdown if that UserType contains a lot of data or nested UserTypes, in which case the deepcopy calls would cascade as each to_python call would eventually clone parts of source object. As there isn't a lot of information on why this deepcopy is here in the first place this change could potentially break something. Running integration tests against this commit does not produce regressions, so this call looks safe to remove, but I'm leaving this warning here for the future reference. Issue: scylladb#152

fruch added the bug Something isn't working label Jun 26, 2022

k0machi self-assigned this Nov 30, 2023

k0machi mentioned this issue Dec 1, 2023

cqlengine: Remove deepcopy on UserType deserialization #277

Merged

k0machi mentioned this issue Dec 4, 2023

PYTHON-1309 cqlengine: Remove deepcopy on UserType deserialization datastax/python-driver#1192

Open

avelanarius closed this as completed in #277 Jan 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Speed bottleneck on deepcopy #152

Speed bottleneck on deepcopy #152

MicaelJarniac commented May 30, 2022

ultrabug commented Jun 16, 2022

Uh oh!

fruch commented Jun 26, 2022

Uh oh!

ultrabug commented Jul 16, 2022

Uh oh!

wyfo commented Jul 16, 2022 •

edited

Loading

Uh oh!

fruch commented Jul 17, 2022

Uh oh!

MicaelJarniac commented Jul 19, 2022

Uh oh!

MicaelJarniac commented Jul 19, 2022

Uh oh!

fruch commented Jul 19, 2022

Uh oh!

k0machi commented Nov 3, 2022 •

edited

Loading

Uh oh!

k0machi commented Nov 30, 2023 •

edited

Loading

Uh oh!

fruch commented Nov 30, 2023

Uh oh!

Speed bottleneck on deepcopy #152

Speed bottleneck on deepcopy #152

Comments

MicaelJarniac commented May 30, 2022

ultrabug commented Jun 16, 2022

Uh oh!

fruch commented Jun 26, 2022

Uh oh!

ultrabug commented Jul 16, 2022

Uh oh!

wyfo commented Jul 16, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fruch commented Jul 17, 2022

Uh oh!

MicaelJarniac commented Jul 19, 2022

Uh oh!

MicaelJarniac commented Jul 19, 2022

Uh oh!

fruch commented Jul 19, 2022

Uh oh!

k0machi commented Nov 3, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

k0machi commented Nov 30, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fruch commented Nov 30, 2023

Uh oh!

wyfo commented Jul 16, 2022 •

edited

Loading

k0machi commented Nov 3, 2022 •

edited

Loading

k0machi commented Nov 30, 2023 •

edited

Loading