NASC ingestion refactoring #349

brandynlucca · 2025-05-08T20:29:06Z

This draft PR includes refactored changes to the NASC ingestion, docstrings, and associated tests.

…hoview_nasc`

echopop/nwfsc_feat/ingest_nasc.py

leewujung · 2025-05-09T15:25:31Z

@brandynlucca : testing structure looks great! thanks!

leewujung

Hey @brandynlucca : Thanks for the PR!

The flow you have under the main functions looks good so I didn't go through all the new functions carefully. I trusted that you have verified the details.

I only have a few suggestions/questions below:

For the old tests, you can use pytestmark = pytest.mark.skip(reason="Temporarily disable this module") on top of a .py to disable all tests in lieu of commenting the code out.
Why are setting column names to lowercase and impute_bad_coordinates needed in read_nasc_file? I was thinking df_nasc_all_ages (or df_nasc_no_age1) would be a "final" product so that people could just load and use those directly. Are the formatting and data in df_nasc_all_ages kept in a specific way to accommodate something else?
Is filter_transect_intervals related to the mesh polygon creation downstream? Or is it just so that people have some control over what part of the geographical regions are included?
I see many unit tests, but couldn't find integration tests. How about adding those for the functions that are exposed in feat_hake.py?
Mirrorfeat_hake.py to have a notebook as a gateway for people to interactively try out the code?

leewujung · 2025-05-30T17:37:49Z

Oh also just noticed the merge conflict - seems just small things from my PRs added after your branched out this, so they ended up not in the commit history here.

for more information, see https://pre-commit.ci

brandynlucca · 2025-06-04T19:04:23Z

Why are setting column names to lowercase and impute_bad_coordinates needed in read_nasc_file? I was thinking df_nasc_all_ages (or df_nasc_no_age1) would be a "final" product so that people could just load and use those directly. Are the formatting and data in df_nasc_all_ages kept in a specific way to accommodate something else?

This mostly ensures backwards compatibility with previous FEAT survey years where the column name schemes are somewhat inconsistent. This is somewhat in anticipation of incorporating the validation step(s), which would incorporate any required formatting changes.

Is filter_transect_intervals related to the mesh polygon creation downstream? Or is it just so that people have some control over what part of the geographical regions are included?

This relates to removing off-effort transect intervals.

brandynlucca added 4 commits May 1, 2025 11:59

Initial refactoring commits (scratch)

3773dac

Extract fixtures

3a491a5

Test and function changes for refactoring (NASC ingestion)

e101db3

Additional uncommited changes for NASC ingestion

591fdd6

brandynlucca requested a review from leewujung May 8, 2025 20:29

brandynlucca added 2 commits May 8, 2025 15:03

Updated functions for internal merge_exports and external `merge_ec…

a257c53

…hoview_nasc`

Add transect-region-haul key file reader and tests

e2e1449

leewujung reviewed May 9, 2025

View reviewed changes

echopop/nwfsc_feat/ingest_nasc.py Outdated Show resolved Hide resolved

brandynlucca added 5 commits May 9, 2025 09:56

Add region name processing

7fd31d5

Update through consolidate_echoview_nasc

4546342

Some changes to single file loading

b6efb0c

Add consolidated NASC reader

966ed0e

Update workflow

f44a26e

brandynlucca marked this pull request as ready for review May 15, 2025 20:21

leewujung reviewed May 30, 2025

View reviewed changes

brandynlucca and others added 5 commits June 4, 2025 11:21

Minor changes to feat_hake.py plus impute arguments

f5d6c7e

Merge branch 'main' into refactor_codebase

fbe526e

[pre-commit.ci] auto fixes from pre-commit.com hooks

e985206

for more information, see https://pre-commit.ci

Some pre-commit related fixes

0f866ff

Additional pre-commit and pytest changes

ac7c487

brandynlucca merged commit 33d72c6 into OSOceanAcoustics:main Jun 4, 2025
6 checks passed

brandynlucca mentioned this pull request Jun 5, 2025

Refactor Echoview NASC export #347

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NASC ingestion refactoring #349

NASC ingestion refactoring #349

Uh oh!

brandynlucca commented May 8, 2025

Uh oh!

Uh oh!

leewujung commented May 9, 2025

Uh oh!

leewujung left a comment

Uh oh!

leewujung commented May 30, 2025

Uh oh!

brandynlucca commented Jun 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

NASC ingestion refactoring #349

NASC ingestion refactoring #349

Uh oh!

Conversation

brandynlucca commented May 8, 2025

Uh oh!

Uh oh!

leewujung commented May 9, 2025

Uh oh!

leewujung left a comment

Choose a reason for hiding this comment

Uh oh!

leewujung commented May 30, 2025

Uh oh!

brandynlucca commented Jun 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants