Update load_csv.py #229

ryan-mangeno · 2024-11-17T22:20:31Z

Redundant CSV Loading: The main issue with the original approach is that the CSV is being loaded twice — once for metadata_columns and once for NECESSARY_COLS. This leads to repeated IO operations and potential inefficiency. Since the columns are being merged in the same function, you can load the CSV once and manage the metadata combination more efficiently.

Instead of I loaded all metadata columns in one with loader_metadata = CSVLoader(path, metadata_columns=NECESSARY_COLS + metadata_columns)

ryan-mangeno added 2 commits November 17, 2024 17:17

Update load_csv.py

b4bd8da

Merge branch 'main' into patch-1

d60c621

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update load_csv.py #229

Update load_csv.py #229

Uh oh!

ryan-mangeno commented Nov 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Update load_csv.py #229

Are you sure you want to change the base?

Update load_csv.py #229

Uh oh!

Conversation

ryan-mangeno commented Nov 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant