Any Performance Results? #197

@luweizheng

I've looked at the NDS-H benchmark in this repository, and it is quite similar to TPC-H. I tested Spark RAPIDS with TPC-H at SF100 on my server with 8 NVIDIA A100 NVLink GPUs and found that running with 8 instances is not as fast as running on CPUs. I also applied tuning options, such as setting spark.sql.files.maxPartitionBytes=2gb and spark.sql.adaptive.enabled=true.
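For reference, here is a minimal sketch of how the two settings above could be passed identically to the CPU and GPU runs, so that the only difference between them is the plugin. `to_submit_flags` is a hypothetical helper, and the `spark.plugins` entry is an assumption about how the GPU run is launched; the exact launch command is not given here.

```python
# Hypothetical helper: render a dict of Spark settings as spark-submit --conf flags.
def to_submit_flags(conf):
    flags = []
    for key, value in conf.items():
        flags.extend(["--conf", f"{key}={value}"])
    return flags


# The two tuning settings named in this issue, shared by both runs.
tuning_conf = {
    "spark.sql.files.maxPartitionBytes": "2gb",
    "spark.sql.adaptive.enabled": "true",
}

# Assumption: the GPU run enables the RAPIDS Accelerator through spark.plugins.
gpu_conf = {
    "spark.plugins": "com.nvidia.spark.SQLPlugin",
    **tuning_conf,
}

cpu_flags = to_submit_flags(tuning_conf)
gpu_flags = to_submit_flags(gpu_conf)

print("CPU run:", " ".join(cpu_flags))
print("GPU run:", " ".join(gpu_flags))
```

Keeping the shared settings in one dictionary makes it easier to confirm that a CPU-versus-GPU timing difference is not caused by a mismatched configuration.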

I am using both the Pandas API on Spark and Spark SQL. Spark SQL is faster, but some queries are still not as fast as running on the same GPU server with CPUs only and no GPU.

Is this result expected?

Or is it that Spark RAPIDS only speeds up certain datasets and queries, such as some of the NDS (TPC-DS) queries?
