Skip to content

feat: IQ quants support #2631

@mr-september

Description

@mr-september

Problem
GGUF models quantized with IQ quants fail to load.

Success Criteria
Load and play as usual

Additional context
IQ quants: ggml-org/llama.cpp#4773

Example model with both traditional Q and new IQ quants: https://huggingface.co/bartowski/Starling_Monarch_Westlake_Garten-7B-v0.1-GGUF

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions