Commit 1dfed1e

Merge branch 'mudler:master' into Nold360-fix-qwen-for-real
2 parents 99bb10f + 94b47a9 commit 1dfed1e

1 file changed: +32 -0 lines changed

gallery/index.yaml

Lines changed: 32 additions & 0 deletions
@@ -1,4 +1,36 @@
 ---
+- name: "minimax-m2.1-i1"
+  url: "github:mudler/LocalAI/gallery/virtual.yaml@master"
+  urls:
+    - https://huggingface.co/mradermacher/MiniMax-M2.1-i1-GGUF
+  description: |
+    The model **MiniMax-M2.1** (base model: *MiniMaxAI/MiniMax-M2.1*) is a large language model quantized for efficient deployment. It is optimized for speed and memory usage, with quantized versions available in various formats (e.g., GGUF) for different performance trade-offs. The quantization is done by the user, and the model is licensed under the *modified-mit* license.
+
+    Key features:
+    - **Quantized versions**: Includes low-precision (IQ1, IQ2, Q2_K, etc.) and high-precision (Q4_K_M, Q6_K) options.
+    - **Usage**: Requires GGUF files; see [TheBloke's documentation](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for details on integration.
+    - **License**: Modified MIT (see [license link](https://github.com/MiniMax-AI/MiniMax-M2.1/blob/main/LICENSE)).
+
+    For gallery use, emphasize its quantized variants, performance trade-offs, and licensing.
+  overrides:
+    parameters:
+      model: llama-cpp/models/MiniMax-M2.1.i1-Q4_K_M.gguf
+    name: MiniMax-M2.1-i1-GGUF
+    backend: llama-cpp
+    template:
+      use_tokenizer_template: true
+    known_usecases:
+      - chat
+    function:
+      grammar:
+        disable: true
+    description: Imported from https://huggingface.co/mradermacher/MiniMax-M2.1-i1-GGUF
+    options:
+      - use_jinja:true
+  files:
+    - filename: llama-cpp/models/MiniMax-M2.1.i1-Q4_K_M.gguf
+      sha256: dba387e17ddd9b4559fb6f14459fcece7f00c66bbe4062d7ceea7fb9568e3282
+      uri: https://huggingface.co/mradermacher/MiniMax-M2.1-i1-GGUF/resolve/main/MiniMax-M2.1.i1-Q4_K_M.gguf
 - name: "tildeopen-30b-instruct-lv-i1"
   url: "github:mudler/LocalAI/gallery/virtual.yaml@master"
   urls:
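
Note on the `files` entry added in this hunk: it pairs a download `uri` with a `sha256` digest. As a minimal sketch of how that digest could be checked by hand after downloading the GGUF file, the Python below hashes a local file in chunks and compares the result with the value from the diff. The helper names `sha256_of` and `matches_gallery_digest` are hypothetical and chosen for illustration; this is not LocalAI's own verification code.

```python
import hashlib

# Digest copied from the `sha256` field of the gallery entry in this diff.
EXPECTED_SHA256 = "dba387e17ddd9b4559fb6f14459fcece7f00c66bbe4062d7ceea7fb9568e3282"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 of a file, read in chunks.

    Chunked reading keeps memory use flat, which matters for
    multi-gigabyte model files.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns b"" at end of file.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_gallery_digest(path, expected=EXPECTED_SHA256):
    """Compare a file's digest with the expected one, ignoring hex case."""
    return sha256_of(path) == expected.lower()
```

Calling `matches_gallery_digest("MiniMax-M2.1.i1-Q4_K_M.gguf")` on a downloaded copy (the local path here is an assumption) would return `True` only if the file is byte-identical to the one the gallery entry was written against.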

0 commit comments
