change result and more optimizations #148

Open
wants to merge 7 commits into base: result-as-tuple

Conversation

@bymoye (Contributor) commented Jul 1, 2025

No description provided.

@bymoye (Contributor, Author) commented Jul 1, 2025

I also modified one thing in the code (after some quick research): I changed

.call((raw_bytes_data.to_vec(),), None)?

to:

.call1((PyBytes::new(py, raw_bytes_data),))?

Reason: this reduces data copying from two operations (creating a Rust Vec, then Python's internal copy) to one (a single copy directly into Python-managed memory).
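The difference can be sketched in plain Rust (no pyo3; the function names and the receiver-side copy are illustrative stand-ins, not psqlpy's actual code):

```rust
// Sketch: why passing the borrowed slice directly saves one copy.
// `old_path` models `.call((raw_bytes_data.to_vec(),), None)`,
// `new_path` models `.call1((PyBytes::new(py, raw_bytes_data),))`.

// Old path: materialize an owned Vec first (copy #1), then the receiver
// copies it into its own buffer (copy #2, standing in for Python's
// internal copy of the passed bytes).
fn old_path(raw_bytes_data: &[u8]) -> Vec<u8> {
    let intermediate: Vec<u8> = raw_bytes_data.to_vec(); // copy #1
    intermediate.clone() // copy #2
}

// New path: hand the borrowed slice straight to the receiver, which makes
// the single copy into its own memory (as PyBytes::new does).
fn new_path(raw_bytes_data: &[u8]) -> Vec<u8> {
    raw_bytes_data.to_vec() // the only copy
}

fn main() {
    let data = b"raw bytes";
    assert_eq!(old_path(data), new_path(data));
    println!("identical bytes either way; the new path copies once");
}
```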

@bymoye (Contributor, Author) commented Jul 1, 2025

@chandr-andr, in 8cc74de I made further optimizations to improve performance. I only ran simple tests with pytest, no benchmarking. Can you run some tests? If there's a performance regression, feel free to roll back the code.
It would also be good to add a memory-usage test!

@chandr-andr (Member)

@bymoye Thank you very much for the refactoring, it looks awesome.
I can't run the benchmarks right now because I have to work (unfortunately).

Will run them in the evening.

@chandr-andr (Member)

@bymoye I ran the benchmarks and found a difference. It's not big, but nothing got worse either.

@chandr-andr (Member)

The rest of the changes look awesome, thank you a lot!

@bymoye (Contributor, Author) commented Jul 2, 2025

OK! Thank you for testing.

@bymoye (Contributor, Author) commented Jul 2, 2025

@chandr-andr I looked at your benchmark code, and I don't think the current test measures the right thing. An engine benchmark should test the performance of the engine, not of the database, so the queries should not use ORDER BY RANDOM. Instead they should put as little load on the database as possible (so that the database is essentially idle) while still returning the same result to Python, where the engine does the actual work (data conversion and so on).
Returning as much data as possible then exercises the engine itself.

@bymoye (Contributor, Author) commented Jul 2, 2025

For example, a single-row query could use SELECT 1;
multi-row queries could use id < 101, id < 1001, and id < 10001;
and the benchmark should return the actual row data instead of the current "ok".

@bymoye (Contributor, Author) commented Jul 2, 2025

Time complexity: the original code is $O(N \cdot D)$ (for $N$ rows and $D$ columns), while the optimized code reduces it to $O(N)$, significantly lowering complexity and preventing performance degradation.

Space complexity: both versions are $O(N)$.

Optimized: memory is allocated once and reused throughout, ensuring stability and efficiency.
Original: repeated allocation and deallocation (memory churn) introduces additional overhead.

Memory copies:

Original: copy operations occur $N \cdot (D - 1)$ times (handled internally by Rust).
Optimized: reduced to $0$ copies, since all operations go through references.

Total copied elements:

Original: $N \cdot (D + 1)$ elements copied.
Optimized: reduced to $2N$ elements.

These optimizations should bring significant performance improvements when reading query result values (especially for large result sets).
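To make the scale concrete, a quick arithmetic check of those copy counts with illustrative sizes (N = 1000 rows, D = 10 columns; the numbers are examples, not measurements):

```rust
// Copy-count formulas from the analysis above, evaluated for sample sizes.
fn copied_elements_original(n: u64, d: u64) -> u64 {
    n * (d + 1) // original: N * (D + 1) elements copied
}

fn copied_elements_optimized(n: u64) -> u64 {
    2 * n // optimized: 2N elements copied
}

fn main() {
    let (n, d) = (1_000u64, 10u64);
    assert_eq!(copied_elements_original(n, d), 11_000);
    assert_eq!(copied_elements_optimized(n), 2_000);
    println!(
        "N={n}, D={d}: original copies {} elements, optimized copies {}",
        copied_elements_original(n, d),
        copied_elements_optimized(n)
    );
}
```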

@bymoye (Contributor, Author) commented Jul 2, 2025

Take the following table as an example:

field_one field_two field_three
a1        b1        c1
a2        b2        c2
a3        b3        c3

Query the entire table and process the data through psqlpy. The code then goes through:

  1. Unloading (9 copies; identical in both versions)
    This step produces: vec!["a1", "b1", "c1", "a2", "b2", "c2", "a3", "b3", "c3"]
  2. Transport (internal processing)
    In the original code, a new Vec is created for each row and every element is copied into it, which can hurt when there is a lot of data.
    In the optimized code, no copy is performed; each row is described by a near-zero-cost pointer (a slice) into the flat buffer, so no strings are copied.
  3. Return to Python (9 copies; identical in both versions)
    Converting Rust String values into Python str objects.
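The transport step above can be sketched in plain Rust (illustrative types, not psqlpy's internal value types): the flat buffer from step 1 is split into rows either by copying each row into a fresh Vec, or by borrowing slices into the buffer.

```rust
// Original transport: a fresh Vec per row; every String is cloned,
// i.e. N * D element copies during transport.
fn rows_as_copies(flat: &[String], width: usize) -> Vec<Vec<String>> {
    flat.chunks(width).map(|row| row.to_vec()).collect()
}

// Optimized transport: each row is a borrowed slice into the flat buffer;
// only (pointer, length) pairs are created, no String is copied.
fn rows_as_views(flat: &[String], width: usize) -> Vec<&[String]> {
    flat.chunks(width).collect()
}

fn main() {
    let flat: Vec<String> = ["a1", "b1", "c1", "a2", "b2", "c2", "a3", "b3", "c3"]
        .iter()
        .map(|s| s.to_string())
        .collect();

    let copies = rows_as_copies(&flat, 3);
    let views = rows_as_views(&flat, 3);

    assert_eq!(copies.len(), 3);
    assert_eq!(views[1], &flat[3..6]); // second row, borrowed not copied
    println!("{} rows, second row: {:?}", views.len(), views[1]);
}
```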

@bymoye changed the title from "change result" to "change result and more optimizations" on Jul 2, 2025
@chandr-andr (Member)

> I checked your code, and I think the current test is not right. Because the engine benchmark should test the performance of the engine, not the performance of the database. [...]

Hello! The psqlpy benchmark is for querying and database-communication performance. I have some raw (de)serialization tests locally; I just haven't had enough time to make them fully complete and push them to the repo and documentation.

I totally agree with you, thanks for the optimizations.

@chandr-andr (Member)

@bymoye Can you check clippy? There are some errors.

@bymoye (Contributor, Author) commented Jul 3, 2025

Hi @chandr-andr,
I'll look into the clippy issues later.
One other thing you might want to look at: I ran a test I wrote locally and got the following results:

🎯 extreme_query Performance Comparison

Library Status Throughput (ops/s) Avg Latency (ms) Memory Delta (MB)
AsyncPG Success 32.3 30916.50 0.3
Psycopg3 Success 32.2 31050.15 0.1
PSQLPy Success 16.2 61665.21 2.1
Databases Success 8.3 120346.22 0.9

🎯 bulk_insert Performance Comparison

Library Status Throughput (ops/s) Avg Latency (ms) Memory Delta (MB)
AsyncPG Success 32.4 30893.69 0.1
PSQLPy Success 32.2 31030.38 0.1
Psycopg3 Success 32.1 31117.88 0.1
Databases Success 8.1 123635.79 0.1

🎯 concurrent_read Performance Comparison

Library Status Throughput (ops/s) Avg Latency (ms) Memory Delta (MB)
AsyncPG Success 32.3 30893.80 0.1
PSQLPy Success 32.3 30953.28 0.0
Psycopg3 Success 32.2 31029.34 0.0
Databases Success 8.4 118577.62 0.4

🎯 complex_query Performance Comparison

Library Status Throughput (ops/s) Avg Latency (ms) Memory Delta (MB)
Psycopg3 Success 33.9 29478.60 0.0
PSQLPy Success 32.6 30641.29 0.0
AsyncPG Success 31.5 31770.39 0.0
Databases Success 13.2 75851.61 0.1

✅ Cross-Library Performance Comparison Complete!

I investigated the cause a little (not in depth), and the problem may be that the execute method builds a complete StatementBuilder regardless of whether any parameters were passed in. Maybe there should be a fast track for simple, parameterless queries?
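The proposed fast track could look roughly like this (plain Rust sketch; all names are hypothetical except StatementBuilder, which is only mentioned, not modeled, here):

```rust
// Sketch: dispatch parameterless queries down a cheap simple-query path
// instead of always building a full prepared statement.
fn execute(query: &str, params: &[&str]) -> String {
    if params.is_empty() {
        // Fast path: nothing to bind, so skip statement building entirely
        // and send the text as a simple query.
        format!("simple_query:{query}")
    } else {
        // Full path: stands in for the prepared-statement machinery
        // (parameter binding, statement building, etc.).
        format!("prepared({} params):{query}", params.len())
    }
}

fn main() {
    assert_eq!(execute("SELECT 1", &[]), "simple_query:SELECT 1");
    assert_eq!(execute("SELECT $1", &["42"]), "prepared(1 params):SELECT $1");
    println!("parameterless queries bypass statement building");
}
```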

@bymoye (Contributor, Author) commented Jul 3, 2025

Another thing: in the current pytest suite, it seems the connection is never disconnected at the end, so its resources stay occupied (the connection is not released correctly).
