perf : Optimize memory allocation hash join collision check#21412
perf : Optimize memory allocation hash join collision check#21412coderfender wants to merge 6 commits intoapache:mainfrom
Conversation
|
run benchmark tpch tpch10 tpcds |
|
🤖 Benchmark running (GKE) | trigger CPU Details (lscpu)Comparing optimize_mem_hash_join_arrow (9c4cea8) to 3d2e6b2 (merge-base) diff using: tpcds File an issue against this benchmark runner |
|
🤖 Benchmark running (GKE) | trigger CPU Details (lscpu)Comparing optimize_mem_hash_join_arrow (9c4cea8) to 3d2e6b2 (merge-base) diff using: tpch File an issue against this benchmark runner |
|
🤖 Benchmark running (GKE) | trigger CPU Details (lscpu)Comparing optimize_mem_hash_join_arrow (9c4cea8) to 3d2e6b2 (merge-base) diff using: tpch10 File an issue against this benchmark runner |
|
🤖 Benchmark completed (GKE) | trigger Instance: CPU Details (lscpu)Details
Resource Usagetpch10 — base (merge-base)
tpch10 — branch
File an issue against this benchmark runner |
|
🤖 Benchmark completed (GKE) | trigger Instance: CPU Details (lscpu)Details
Resource Usagetpcds — base (merge-base)
tpcds — branch
File an issue against this benchmark runner |
|
investigating why there is slowness (results on my machine showed ~ 10 % speed with Q3) |
|
Seems like SIMD comparison over continuous memory is better than custom row-by-row comparison (although the latter consumes less memory) |
Which issue does this PR close?
Rationale for this change
What changes are included in this PR?
Are these changes tested?
Are there any user-facing changes?