Skip to content

Offline model library use #1449

@sempervictus

Description

@sempervictus

Objectives

  • Work with models in airplane mode/offline conditions
  • Execute already cached models without reaching out to HF
  • Use local models which do not have HF endpoints
  • Operate segmented runners each storing their local layers/sections in the most optimal way
    • Longer-term objective but does rely on ability to arbitrarily operate on local files unless all runners have outbound

Proposed Approach

  • Enable --from-file(s) or similar flag on all model types to bypass codepaths which would reach out to remote model sources
    • Add crate feature to build this way by default requiring --from-repo(s) or similar to invert the behavior
  • Utilize the skip method if no env:VARNAME or token file (or later analogs to other sources) is found to search the local cache instead of issuing requests which log access-denied events at HF.
  • Retain tokenizer and other relevant metadata from initial download to spool-up when there is no link

Metadata

Metadata

Assignees

No one assigned

    Labels

    new featureNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions