-
Notifications
You must be signed in to change notification settings - Fork 3k
Select columns of Dataset or DatasetDict #5480
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
The documentation is not available anymore as the PR was closed or merged. |
mariosasko
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks! I've suggested some nits for consistency with the rest of the codebase, but looks great other than that!
Co-authored-by: Mario Šaško <[email protected]>
mariosasko
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Last nit
Co-authored-by: Mario Šaško <[email protected]>
mariosasko
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks all good now, thanks!
lhoestq
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome thank you ! I just added some small suggestions :)
Co-authored-by: Quentin Lhoest <[email protected]>
lhoestq
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks :)
Show benchmarksPyArrow==6.0.0 Show updated benchmarks!Benchmark: benchmark_array_xd.json
Benchmark: benchmark_getitem_100B.json
Benchmark: benchmark_indices_mapping.json
Benchmark: benchmark_iterating.json
Benchmark: benchmark_map_filter.json
Show updated benchmarks!Benchmark: benchmark_array_xd.json
Benchmark: benchmark_getitem_100B.json
Benchmark: benchmark_indices_mapping.json
Benchmark: benchmark_iterating.json
Benchmark: benchmark_map_filter.json
|
* Allow to select columns of Dataset or DatasetDict * Add column selection method to autodoc index * Allow to select columns of IterableDataset or IterableDatasetDict * Fix formatting issues * Prefer sequential containers for building feature table Co-authored-by: Mario Šaško <[email protected]> * Rework feature descriptor update in select_columns Co-authored-by: Mario Šaško <[email protected]> * Apply suggestions from code review Co-authored-by: Quentin Lhoest <[email protected]> --------- Co-authored-by: Mario Šaško <[email protected]> Co-authored-by: Quentin Lhoest <[email protected]>
* Allow to select columns of Dataset or DatasetDict * Add column selection method to autodoc index * Allow to select columns of IterableDataset or IterableDatasetDict * Fix formatting issues * Prefer sequential containers for building feature table Co-authored-by: Mario Šaško <[email protected]> * Rework feature descriptor update in select_columns Co-authored-by: Mario Šaško <[email protected]> * Apply suggestions from code review Co-authored-by: Quentin Lhoest <[email protected]> --------- Co-authored-by: Mario Šaško <[email protected]> Co-authored-by: Quentin Lhoest <[email protected]>

Close #5474 and #5468.