Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: type-erase offsets buffer type in contiguous_split #18232

Draft
wants to merge 3 commits into
base: branch-25.04
Choose a base branch
from

Conversation

wence-
Copy link
Contributor

@wence- wence- commented Mar 11, 2025

Description

Large strings have an int64 offset buffer, so we must handle that.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Large strings have an int64 offset buffer, so we must handle that.
@wence- wence- added bug Something isn't working libcudf Affects libcudf (C++/CUDA) code. non-breaking Non-breaking change labels Mar 11, 2025
Copy link

copy-pr-bot bot commented Mar 11, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@@ -160,7 +181,7 @@ __device__ void copy_buffer(uint8_t* __restrict__ dst,
std::size_t element_size,
std::size_t src_element_index,
uint32_t stride,
int value_shift,
cuda::std::int64_t value_shift,
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@nvdbaranec points out this copy_buffer implementation needs to be templated on this the type of the offsets buffer (lines 201-215 in this PR) for example, so that we load and modify the values appropriately.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working libcudf Affects libcudf (C++/CUDA) code. non-breaking Non-breaking change
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] cuDF-Polars string-column serialization is broken
1 participant