Support complex feature types as features in packaged loaders
#4364
The documentation is not available anymore as the PR was closed or merged.
lhoestq
left a comment
Thanks ! Added a few comments
Thanks ! I just added more comments about pa.Table.from_arrays
The rest looks all good to me :)
lhoestq
left a comment
Thanks ! LGTM :)
This PR adds `table_cast` to the packaged loaders to fix casting to the `Image`/`Audio`, `ArrayND` and `ClassLabel` types. If these types are not present in the `builder.config.features` dictionary, the built-in `pa.Table.cast` is used for better performance. Additionally, this PR adds `cast_storage` to `ClassLabel` to support the string to int conversion in `table_cast` and ensure that integer labels are in a valid range.

Fix #4210
This PR is also a solution for these (popular) discussions: https://discuss.huggingface.co/t/converting-string-label-to-int/2816 and https://discuss.huggingface.co/t/class-labels-for-custom-datasets/15130/2
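
For readers coming from those discussions, here is a rough pure-Python sketch of the kind of string to int label conversion and range check described above. It is not the code from this PR: `encode_labels` is a hypothetical helper, and treating `0 .. len(names) - 1` as the valid range is an assumption made for illustration.

```python
from typing import List, Optional, Sequence, Union

Label = Union[str, int, None]


def encode_labels(values: Sequence[Label], names: Sequence[str]) -> List[Optional[int]]:
    """Hypothetical helper (not part of `datasets`): map label values to class ids.

    Strings are looked up in `names`; integers are kept if they fall in the
    assumed valid range 0 .. len(names) - 1; None is passed through as missing.
    """
    name_to_id = {name: idx for idx, name in enumerate(names)}
    encoded: List[Optional[int]] = []
    for value in values:
        if value is None:
            # Missing labels stay missing.
            encoded.append(None)
        elif isinstance(value, str):
            # String label: convert to its integer id, failing on unknown names.
            if value not in name_to_id:
                raise ValueError(f"Unknown label name: {value!r}")
            encoded.append(name_to_id[value])
        else:
            # Integer label: only check that it is inside the assumed range.
            if not 0 <= value < len(names):
                raise ValueError(f"Label id {value} is out of range for {len(names)} classes")
            encoded.append(int(value))
    return encoded


labels = encode_labels(["neg", "pos", 1, None], names=["neg", "pos"])
print(labels)
```

In the library itself, the conversion is driven by the `ClassLabel` feature declared in the dataset's features, so a helper like this should not be needed once the fix is in place.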
TODO: