Skip to content

Conversation

@alamb
Copy link
Contributor

@alamb alamb commented Jun 6, 2025

Which issue does this PR close?

Rationale for this change

While reviewing the code and the concat kernel for

I realized there is a non trivial difference when there all inlined views vs some inlined views vs mostly large strings so the benchmarks should capture that

What changes are included in this PR?

  1. Add variations of benchmark with different size strings in StringViewArray

Are there any user-facing changes?

If there are user-facing changes then we may require documentation to be updated before approving the PR.

If there are any breaking changes to public APIs, please call them out.

@github-actions github-actions bot added the arrow Changes to the arrow crate label Jun 6, 2025
}
.build();

// Model mostly short strings, but some longer ones
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Previously all the benchmarks used a max size of 30. Now I have 20 (12/20 = 60% will be inlined views) and 128 where only 12/128 ~ 1% will be inlined views.

@alamb alamb changed the title Improve coalesce_kernel benchmark Improve coalesce_kernel benchmark to capture inline vs non inline views Jun 6, 2025
Copy link
Contributor

@Dandandan Dandandan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is a nice extension :)

@Dandandan Dandandan merged commit 44d7194 into apache:main Jun 6, 2025
24 checks passed
@alamb alamb deleted the alamb/improve_coalesce_benchmark branch June 6, 2025 23:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

arrow Changes to the arrow crate

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants