[FEATURE]Add a TPC-H PPL query suite #3178

Open
YANG-DB opened this issue Nov 30, 2024 · 0 comments
Labels
enhancement (New feature or request), performance (Make it fast!), PPL (Piped processing language), SQL, untriaged

Comments


YANG-DB commented Nov 30, 2024

Is your feature request related to a problem?

The TPC-H (Transaction Processing Performance Council Benchmark H) test suite is a widely recognized benchmark for evaluating the performance and efficiency of decision support systems, particularly those handling complex SQL queries on large datasets.

Incorporating the TPC-H test suite into the OpenSearch SQL repository is crucial for both PPL and SQL: it helps ensure the robustness, scalability, and optimization of SQL/PPL query processing in OpenSearch.
With it, the OpenSearch SQL engine can be tested against realistic analytical workloads, providing insight into query execution performance and into join and aggregation handling.
Over time, this suite would serve as a benchmark for identifying bottlenecks, comparing against industry standards, and giving users and developers confidence that the OpenSearch SQL/PPL engine can handle sophisticated analytical use cases effectively.

What solution would you like?
Add a TPC-H PPL query suite to:

  • ensure all 22 TPC-H SQL queries can be rewritten in PPL
  • ensure all 22 PPL queries can be executed in Spark
  • ensure all generated physical plans match the expected output
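
As a rough illustration of how the first and third checks might be automated, here is a minimal Python sketch. It is not part of the proposal itself: the `q1.ppl` … `q22.ppl` file layout, the function names, and the idea of comparing plans as normalized text are all assumptions made for the example. The normalization assumes plan dumps contain run-specific expression IDs of the form `#123`, which would otherwise make a plain string comparison fail.

```python
import re
from pathlib import Path

TPCH_QUERY_COUNT = 22  # TPC-H defines 22 queries


def missing_queries(query_dir: Path, suffix: str = ".ppl") -> list[str]:
    """Return the expected query file names that are absent from query_dir.

    The q<N>.ppl naming scheme is hypothetical, chosen for this sketch.
    """
    expected = [f"q{i}{suffix}" for i in range(1, TPCH_QUERY_COUNT + 1)]
    return [name for name in expected if not (query_dir / name).is_file()]


# Assumption: expression IDs look like "#12" or "#12L" and differ between runs.
_EXPR_ID = re.compile(r"#\d+L?")
_WHITESPACE = re.compile(r"\s+")


def normalize_plan(plan: str) -> str:
    """Replace expression IDs with a placeholder and collapse whitespace."""
    plan = _EXPR_ID.sub("#x", plan)
    return _WHITESPACE.sub(" ", plan).strip()


def plans_match(actual: str, expected: str) -> bool:
    """Compare two plan dumps after normalization."""
    return normalize_plan(actual) == normalize_plan(expected)
```

A test runner could call `missing_queries` once to confirm the suite is complete, then call `plans_match` for each query with the plan produced by the engine and a stored expected plan. Comparing normalized text is the simplest option; a structural comparison of plan trees would be stricter but needs access to the engine's plan objects.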

Do you have any additional context?

YANG-DB added the enhancement, untriaged, SQL, PPL, and performance labels on Nov 30, 2024
YANG-DB moved this to Todo in PPL Commands on Nov 30, 2024