Keep original features order #2453
The arrow writer assumed that the columns were always in sorted order. I just pushed a fix to reorder the arrays according to the schema. It was failing for many datasets, such as squad.
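To illustrate the kind of reordering described above, here is a minimal pure-Python sketch. It is not the actual writer code: `reorder_to_schema` is a hypothetical helper, plain lists stand in for Arrow arrays, and the column names are only illustrative of a squad-like dataset.

```python
from typing import Dict, List, Sequence


def reorder_to_schema(columns: Dict[str, List], schema_names: Sequence[str]) -> Dict[str, List]:
    """Return the columns in the order given by the schema (hypothetical helper)."""
    # Fail loudly if the batch lacks a column that the schema expects.
    missing = [name for name in schema_names if name not in columns]
    if missing:
        raise KeyError(f"columns missing from batch: {missing}")
    # Python dicts preserve insertion order, so building the dict by
    # iterating over the schema names yields schema order.
    return {name: columns[name] for name in schema_names}


# A batch whose columns arrive in alphabetical (sorted) order.
sorted_batch = {
    "answers": [["Paris"]],
    "context": ["Paris is the capital of France."],
    "id": ["a1"],
    "question": ["What is the capital of France?"],
    "title": ["France"],
}

# The order declared by the schema (assumed here for illustration).
schema_order = ["id", "title", "context", "question", "answers"]

reordered = reorder_to_schema(sorted_batch, schema_order)
print(list(reordered))  # ['id', 'title', 'context', 'question', 'answers']
```

The point of the sketch is only that the schema, not alphabetical sorting, decides the column order; the values themselves are left untouched.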
and obviously it broke everything
Feel free to revert my commit. I can investigate this in the coming days.
@lhoestq I do not understand when you say:
All the tests were green after my last commit.
Yes, but loading the actual squad dataset was failing :/
lhoestq left a comment:
Thanks for figuring out why some columns were missing!
When loading a Dataset from a JSON file whose column names are not sorted alphabetically, we should get the same column name order, whether we pass features (in the same order as in the file) or not.
I found this issue while working on #2366.