Skip to content

[API+UI] Trigger checkpointing manually #1816

@bghira

Description

@bghira

Similar vein to #1815 but we want to trigger checkpoint export in case of emergency, or for automated purposes via external tools.

The checkpoint would trigger while the training loop is going, after the next gradient sync completes.

This will allow the current gradient accumulation (if any) to complete first.

Metadata

Metadata

Assignees

No one assigned

    Labels

    apiThis issue or enhancement impacts the API.enhancementNew feature or requestwebuiThis issue or enhancement impacts the web interface.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions