-
Notifications
You must be signed in to change notification settings - Fork 482
Closed
Labels
new featureNew feature or requestNew feature or request
Description
Objectives
- Work with models in airplane mode/offline conditions
- Execute already cached models without reaching out to HF
- Use local models which do not have HF endpoints
- Operate segmented runners each storing their local layers/sections in the most optimal way
- Longer-term objective but does rely on ability to arbitrarily operate on local files unless all runners have outbound
Proposed Approach
- Enable
--from-file(s)or similar flag on all model types to bypass codepaths which would reach out to remote model sources- Add crate feature to build this way by default requiring
--from-repo(s)or similar to invert the behavior
- Add crate feature to build this way by default requiring
- Utilize the skip method if no
env:VARNAMEor token file (or later analogs to other sources) is found to search the local cache instead of issuing requests which log access-denied events at HF. - Retain tokenizer and other relevant metadata from initial download to spool-up when there is no link
Metadata
Metadata
Assignees
Labels
new featureNew feature or requestNew feature or request