Closed
Labels
quant request: Evaluate impact of quantizing a model
Description
llama.cpp is gaining Falcon support via GGUF: ggml-org/llama.cpp#2717