Skip to content

aare-arnold/MuIS-object-labeling

Repository files navigation

🪐 spaCy Project: Improving the quality of museums data

Object-labeling solution to improve Estonian museum information system.

📋 project.yml

The project.yml defines the data assets required by the project, as well as the available commands and workflows. For details, see the spaCy projects documentation.

⏯ Commands

The following commands are defined by the project. They can be executed using spacy project run [name]. Commands are only re-run if their inputs have changed.

Command Description
preprocess Preproccesses traing data for evaluation
convert Convert the data to spaCy's binary format
train Train the textcat model
evaluate Evaluate the model and export metrics
package Package the trained model as a pip package
visualize-model Visualize the model's output interactively using Streamlit
test Test the trained pipeline
clean Remove intermediate files

⏭ Workflows

The following workflows are defined by the project. They can be executed using spacy project run [name] and will run the specified commands in order. Commands are only re-run if their inputs have changed.

Workflow Steps
all preprocessconverttrainevaluatepackage

🗂 Assets

The following assets are defined by the project. They can be fetched by running spacy project assets in the project directory.

File Source Description
assets/unique_types.csv Local Types list
assets/train.csv Local Training data
assets/eval.csv Local Development data

About

AI-based object-labeling solution to improve Estonian museum information system

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages