Skip to content

Conversation

@ryan-mangeno
Copy link

Redundant CSV Loading: The main issue with the original approach is that the CSV is being loaded twice — once for metadata_columns and once for NECESSARY_COLS. This leads to repeated IO operations and potential inefficiency. Since the columns are being merged in the same function, you can load the CSV once and manage the metadata combination more efficiently.

Instead of I loaded all metadata columns in one with loader_metadata = CSVLoader(path, metadata_columns=NECESSARY_COLS + metadata_columns)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant