Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] For certain parquet list schemas, the root PageNestingInfo struct can end up uninitialized. #14634

Open
nvdbaranec opened this issue Dec 14, 2023 · 0 comments
Assignees
Labels
bug Something isn't working cuIO cuIO issue

Comments

@nvdbaranec
Copy link
Contributor

Inside of the allocate_nesting_info function, we allocate PageNestingInfo and PageNestingDecodeInfo structs and initialize them. However, the logic for traversing the schema in the file can sometimes leave the 0th element uninitialized. This is a mild bug that leads to a slightly wrong size calculating for output chunk sizes in the chunked reader.

The easiest way to repro is with the file python/cudf/cudf/tests/data/parquet/one_level_list.parquet

@nvdbaranec nvdbaranec added bug Something isn't working Needs Triage Need team to review and classify cuIO cuIO issue labels Dec 14, 2023
@nvdbaranec nvdbaranec self-assigned this Dec 14, 2023
@bdice bdice added Needs Triage Need team to review and classify and removed Needs Triage Need team to review and classify labels Mar 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working cuIO cuIO issue
Projects
None yet
Development

No branches or pull requests

3 participants