GitHub - nasa/harmony-maskfill

Overview:

The MaskFill utility works with gridded data, applying a fill value to all pixels outside of a provided shape. This utility is now available via a Harmony service.

The utility accepts HDF-5 and NetCDF-4 files that follow CF conventions and GeoTIFFs.
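
The masking described above can be illustrated with a small, pure-Python sketch. This is not MaskFill's implementation: the function names, the list-of-lists grid, the pixel-centre convention and the ray-casting test are all assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def point_in_polygon(x: float, y: float, polygon: Sequence[Point]) -> bool:
    """Ray-casting test: True if (x, y) lies inside the polygon."""
    inside = False
    count = len(polygon)
    for index in range(count):
        x1, y1 = polygon[index]
        x2, y2 = polygon[(index + 1) % count]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def mask_fill(grid: List[List[float]], polygon: Sequence[Point],
              fill_value: float) -> List[List[float]]:
    """Return a copy of grid with fill_value in every pixel outside polygon.

    Assumption: pixel (row, col) has its centre at x = col + 0.5, y = row + 0.5.
    """
    return [
        [value if point_in_polygon(col + 0.5, row + 0.5, polygon) else fill_value
         for col, value in enumerate(row_values)]
        for row, row_values in enumerate(grid)
    ]


grid = [[1.0, 2.0, 3.0],
        [4.0, 5.0, 6.0],
        [7.0, 8.0, 9.0]]
# A square covering the lower-right 2 x 2 block of pixels.
square = [(1.0, 1.0), (3.0, 1.0), (3.0, 3.0), (1.0, 3.0)]
filled = mask_fill(grid, square, fill_value=-9999.0)
```

Pixels whose centres fall inside the square keep their values; all others receive the fill value. The input grid is left unchanged.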

Installation:

MaskFill was developed using the Anaconda distribution of Python (https://www.anaconda.com/download) and a conda virtual environment, which simplifies dependency management. Run these commands to create a MaskFill conda virtual environment and install all the needed packages:

```
conda create --name maskfill --file conda_requirements.txt \
  python=3.12 --channel conda-forge --override-channels
conda activate maskfill
pip install -r pip_requirements.txt
```

Development:

General notes:

Commit messages should use the ticket number as a prefix, e.g.: DAS-123: Awesome feature description.
Commit history should be squashed locally, to avoid minor commits (e.g.: fix typo, update README). This can be done via an interactive rebase, where N is the number of commits added during the feature development:
```
git rebase -i HEAD~N
```
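
The commit message convention above could be checked with a short sketch like the following. The regular expression is an assumption based only on the `DAS-123: ...` example (an upper-case project key, a hyphen, a ticket number, then a colon and a description); the function name is hypothetical and no such check is known to exist in this repository.

```python
import re

# Assumed pattern, inferred from the "DAS-123: ..." example only.
TICKET_PREFIX = re.compile(r"^[A-Z]+-\d+: \S.*")


def has_ticket_prefix(message: str) -> bool:
    """Return True if the first line of a commit message has a ticket prefix."""
    lines = message.splitlines()
    first_line = lines[0] if lines else ""
    return TICKET_PREFIX.match(first_line) is not None
```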

Versioning:

The MaskFill service version is a semantic version number (major.minor.patch), which should be incremented with every release. It is contained in the docker/service_version.txt file. When making any update to the service code, the version number in this file should be updated before making a pull request. The general rules for incrementing a semantic version number are:

Major: When API changes are made to the service that are not backwards compatible.
Minor: When functionality is added in a backwards compatible way.
Patch: Used for backwards compatible bug fixes or performance improvements.
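
The rules above can be sketched as a small helper. The function name and the `change` argument are hypothetical; the sketch assumes the version string is a plain `major.minor.patch` triple such as the one stored in docker/service_version.txt.

```python
def bump_version(version: str, change: str) -> str:
    """Return the next semantic version for a 'major', 'minor' or 'patch' change."""
    major, minor, patch = (int(part) for part in version.strip().split("."))
    if change == "major":
        # Backwards incompatible API change: reset minor and patch.
        return f"{major + 1}.0.0"
    if change == "minor":
        # Backwards compatible new functionality: reset patch.
        return f"{major}.{minor + 1}.0"
    if change == "patch":
        # Backwards compatible bug fix or performance improvement.
        return f"{major}.{minor}.{patch + 1}"
    raise ValueError(f"Unknown change type: {change!r}")
```

For example, `bump_version("1.4.2", "minor")` returns `"1.5.0"`.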

When the Docker image is built, it will be tagged with the semantic version number as stored in docker/service_version.txt.

CI/CD:

The CI/CD for MaskFill is contained in GitHub workflows in the .github/workflows directory:

run_tests.yml - A reusable workflow that builds the service and test Docker images, then runs the Python unit test suite in an instance of the test Docker container.
run_tests_on_pull_requests.yml - Triggered for all PRs against the main branch. It runs the workflow in run_tests.yml to ensure all tests pass for the new code.
publish_docker_image.yml - Triggered either manually or for commits to the main branch that contain changes to the docker/service_version.txt file.

The publish_docker_image.yml workflow will:

Run the full unit test suite, to prevent publication of broken code.
Extract the semantic version number from docker/service_version.txt.
Extract the release notes for the most recent version from CHANGELOG.md.
Build the service Docker image and push it to the GitHub Container Registry.
Create a GitHub release that will also tag the related git commit with the semantic version number.

Before triggering a release, ensure both the docker/service_version.txt and CHANGELOG.md files are updated. The CHANGELOG.md file requires a specific format for a new release, as the workflow looks for the following string to identify the newest release of the code (searching from the top of the file):

```
## vX.Y.Z
```
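
To show why the heading format matters, here is a minimal sketch of how release notes could be pulled from a changelog by searching for `## vX.Y.Z` headings from the top. It is not the code used by the publish_docker_image.yml workflow; the function name and the exact regular expression are assumptions.

```python
import re
from typing import Tuple

# Assumed heading format: "## v" followed by a major.minor.patch version.
VERSION_HEADING = re.compile(r"^## (v\d+\.\d+\.\d+)\s*$")


def latest_release_notes(changelog_text: str) -> Tuple[str, str]:
    """Return (version, notes) for the first '## vX.Y.Z' section in the text."""
    version = None
    notes = []
    for line in changelog_text.splitlines():
        match = VERSION_HEADING.match(line)
        if match and version is None:
            # First version heading: start collecting notes after it.
            version = match.group(1)
        elif match:
            # Second version heading: the newest section has ended.
            break
        elif version is not None:
            notes.append(line)
    if version is None:
        raise ValueError("No '## vX.Y.Z' heading found in changelog")
    return version, "\n".join(notes).strip()
```

A heading that deviates from the expected format (for example `## Version 1.2.0`) would not be recognised by this sketch, which illustrates why the format needs to be followed exactly.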

Running locally:

The best method to run MaskFill locally is to have a local instance of Harmony running that is configured to use the MaskFill service. Requests can then be made as they would for any other environment (production, UAT, SIT) via:

harmony-py
cURL
A URL placed in a browser window, pointing at localhost:3000.

Testing:

Unit tests:

This project has unit tests that utilize the standard unittest Python package. These can be run from the root directory of this repository using the following commands:

```
export ENV=test
python -m unittest discover tests
```

The environment variable ENV must be set to ensure that all unit tests that invoke the MaskFillAdapter class do not try to stage their output files.

The unit tests also contain basic tests for code style, ensuring that all Python files conform to PEP8, excluding checks on line-length.

Tests within tests/test_maskfill.py are designed to test the full use of the functionality, taking an input file, creating an output file and comparing that output file to a template. Those within tests/unit are designed as more granular unit tests of the logic and behaviour of individual functions.

Test coverage report:

To see how much of the code is covered by the unit and end-to-end tests, run the following commands:

```
export ENV=test
coverage run -m unittest discover tests
coverage report --omit=tests/*
```

A more detailed view of the test coverage is available as a set of HTML pages. The bin/run-test script generates this output automatically in the harmony-maskfill/coverage directory. Alternatively, create a coverage directory and generate the pages with the following commands:

```
export ENV=test
mkdir -p coverage
coverage run -m unittest discover tests
coverage html --omit=tests/* -d coverage
```

Then navigate in a web browser to:

file:///full/path/to/harmony-maskfill/coverage/index.html

This should display a page with a table of coverage percentages. Clicking on each file should open a further page that renders the contents of the file, indicating exactly the lines that have coverage, and those that don't.

Unit tests in Docker:

The unit tests can also be run within a Docker container:

```
# Build the service image, which is a base image for the test image
./bin/build-image

# Build the test image
./bin/build-test

# Run the tests in a container instance of the test image
./bin/run-test
```

The terminal should display the test results, including any failures reported by unittest. Additionally, XML test reports should be saved to a new test-reports directory. A test coverage report should also be displayed in the terminal, and will be saved to the coverage directory in HTML format. Coverage reports are generated for each execution of the GitHub workflow, and are saved as artefacts.

Gotchas:

New collection grid mappings:

MaskFill will try to determine the projection information for a variable by using the following metadata (in the order specified):

DIMENSION_LIST attribute. If present, and with units of 'degrees', the data are assumed to be geographic.
grid_mapping attribute. If present, this will point to a grid_mapping variable in the granule. The metadata of that variable is used to define the projection of the variable being filled.
Configuration file. If neither DIMENSION_LIST nor grid_mapping are included in the metadata attributes, the configuration file is checked for default values.

If none of the above options yields information from which a projection can be derived, MaskFill will raise an exception, and the service will fail.
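
The lookup order above can be sketched as follows. This is an illustration, not MaskFill's code: the granule is represented as a plain dictionary of variable names to attribute dictionaries, and the function name, exception class, return values and the "units contain 'degrees'" check are all assumptions.

```python
from typing import Any, Dict, Mapping


class MissingProjectionError(Exception):
    """Hypothetical exception for a variable with no derivable projection."""


def resolve_projection(variable_name: str,
                       granule: Mapping[str, Dict[str, Any]],
                       config_defaults: Mapping[str, Dict[str, Any]]) -> Dict[str, Any]:
    """Return projection metadata using the three sources, in order."""
    attributes = granule[variable_name]

    # 1. DIMENSION_LIST: geographic if every listed dimension is in degrees.
    dimension_names = attributes.get("DIMENSION_LIST")
    if dimension_names and all(
        "degrees" in granule.get(name, {}).get("units", "")
        for name in dimension_names
    ):
        return {"grid_mapping_name": "latitude_longitude"}

    # 2. grid_mapping: use the metadata of the referenced variable.
    grid_mapping = attributes.get("grid_mapping")
    if grid_mapping is not None and grid_mapping in granule:
        return dict(granule[grid_mapping])

    # 3. Configuration file defaults (here a dictionary keyed by variable name).
    if variable_name in config_defaults:
        return dict(config_defaults[variable_name])

    raise MissingProjectionError(
        f"No projection information found for {variable_name}"
    )
```

In this sketch, a variable with none of the three sources raises `MissingProjectionError`, which corresponds to the failure case described above.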

When several SMAP collections were added, new entries for the default grid mapping were needed, for cases where input data to MaskFill have not been reprojected. When adding the MaskFill service to a new collection, take care to check whether the granule format can provide the necessary grid mapping information.

pre-commit hooks:

This repository uses pre-commit to check for some coding standard best practices before each commit. These include:

Removing trailing whitespaces.
Removing blank lines at the end of a file.
Checking that JSON files have valid formats.
Formatting checks.
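
To make the effect of the first three checks concrete, the sketch below reproduces them on in-memory strings. It only imitates the outcome of such hooks; it is not how pre-commit or its hooks are implemented, and the function names are hypothetical.

```python
import json


def strip_trailing_whitespace(text: str) -> str:
    """Remove trailing spaces and tabs from every line."""
    return "\n".join(line.rstrip() for line in text.split("\n"))


def fix_end_of_file(text: str) -> str:
    """Remove blank lines at the end, leaving exactly one final newline."""
    stripped = text.rstrip("\n")
    return stripped + "\n" if stripped else ""


def is_valid_json(text: str) -> bool:
    """Return True if the text parses as JSON."""
    try:
        json.loads(text)
    except json.JSONDecodeError:
        return False
    return True
```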

To enable these checks:

```
# Install pre-commit Python package as part of test requirements:
pip install -r tests/pip_test_requirements.txt

# Install the git hook scripts:
pre-commit install

# (Optional) Run against all files:
pre-commit run --all-files
```

When you try to make a new commit locally, pre-commit will automatically run. If any of the hooks detect non-compliance (e.g., trailing whitespace), that hook will state it failed, and also try to fix the issue. You will need to review and git add the changes before you can make a commit.

It is planned to implement additional hooks, possibly including tools such as mypy.

pre-commit.ci is configured such that these same hooks will be automatically run for every pull request.

Get in touch:

You can reach out to the maintainers of this repository via email:
