-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Closed
Labels
feature requestNew feature or requestNew feature or request
Description
Feature request
Enable the use of locally stored adapters as created by huggingface/peft. Ideally, this should be compatible with the most notable benefits of TGI (e.g. sharing and flash attention).
Motivation
Using models fine-tuned with PEFT is possible only by merging the adapter back in the original weights of the model. This is especially cumbersome in terms of disk space for use-cases where the user has many adapters for just one model.
Your contribution
I'm not sure how much work this may induce or if it is at all feasible (notably enabling sharding with adapters). I'll gladly read any insights on the complexity and the relevance of adding this feature.
sld, tienthanhdhcn, bilelomrani1, dongs0104, pavan07533 and 14 more
Metadata
Metadata
Assignees
Labels
feature requestNew feature or requestNew feature or request