Research question
To what extent do NMDC and other LBL/DOE projects actually use the OBI (Ontology for Biomedical Investigations) ontology, and how well is OBI represented in the BERDL ontology tables?
Motivation
OBI is a widely-used ontology for describing biomedical and environmental investigations, but its adoption across DOE/LBL data resources is unclear. This project does a lightweight
investigation to assess:
- Whether OBI terms are loaded into the BERDL ontology tables at all (they may not be)
- How frequently OBI terms appear in NMDC and related datasets
- Whether there are missed opportunities for richer OBI annotation
Scope and caveats
This is an exploratory, cross-cutting investigation — not a typical pangenome or fitness analysis. The BERDL lakehouse may or may not be the right place for this work; the same question
will be pursued in parallel using other systems. The goal here is to do enough poking around to determine whether BERDL has useful signal, not to build a full analysis pipeline.
Research question
To what extent do NMDC and other LBL/DOE projects actually use the OBI (Ontology for Biomedical Investigations) ontology, and how well is OBI represented in the BERDL ontology tables?
Motivation
OBI is a widely-used ontology for describing biomedical and environmental investigations, but its adoption across DOE/LBL data resources is unclear. This project does a lightweight
investigation to assess:
Scope and caveats
This is an exploratory, cross-cutting investigation — not a typical pangenome or fitness analysis. The BERDL lakehouse may or may not be the right place for this work; the same question
will be pursued in parallel using other systems. The goal here is to do enough poking around to determine whether BERDL has useful signal, not to build a full analysis pipeline.