Skip to content

[FEA] Create NDS-H benchmark for performance analysis #182

@mattahrens

Description

@mattahrens

I would like to add another benchmark to the repository to support additional workloads for comparison. The TPC-H benchmark is used by different partners for comparison so we can enable the execution of a TPC-H similar workload benchmark. The requirements are similar to what we have for NDS:

Data generation

  • P0: Support generation of raw data at various scale factors
  • P0: Support conversion of raw data to Parquet
  • P1: Support conversion of raw data to ORC
  • P1: Support conversion of raw data to CSV

Query generation

  • P0: Support generation of queries at various scale factors

Power run execution

  • P0: Support execution of full query set given a specified input path
  • P1: Support execution of individual query given a specific query and input path

We can add additional requirements once the initial NH scripts are set up to more closely match how we execute NDS.

Relevant links of other repos that execute TPC-H workloads:

Disclaimers for TPC-H:

  • TPC-H is Copyright © 1993-2024 Transaction Processing Performance Council. The full TPC-H specification in PDF format can be found here
  • TPC, TPC Benchmark, and TPC-H are trademarks of the Transaction Processing Performance Council.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions