Skip to content

Commit 3ce9cb5

Browse files
localai-botmudler
andauthored
chore(model gallery): 🤖 add 1 new models via gallery agent (#7040)
chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <[email protected]>
1 parent ee7638a commit 3ce9cb5

File tree

1 file changed

+25
-0
lines changed

1 file changed

+25
-0
lines changed

gallery/index.yaml

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22981,3 +22981,28 @@
2298122981
- filename: GroveMoE-Base.i1-Q4_K_M.gguf
2298222982
sha256: 9d7186ba9531bf689c91176468d7a35c0aaac0cd52bd44d4ed8f7654949ef4f4
2298322983
uri: huggingface://mradermacher/GroveMoE-Base-i1-GGUF/GroveMoE-Base.i1-Q4_K_M.gguf
22984+
- !!merge <<: *qwen3
22985+
name: "nvidia.qwen3-nemotron-32b-rlbff"
22986+
urls:
22987+
- https://huggingface.co/DevQuasar/nvidia.Qwen3-Nemotron-32B-RLBFF-GGUF
22988+
description: |
22989+
The **nvidia/Qwen3-Nemotron-32B-RLBFF** is a large language model based on the Qwen3 architecture, fine-tuned by NVIDIA using Reinforcement Learning from Human Feedback (RLHF) for improved alignment with human preferences. With 32 billion parameters, it excels in complex reasoning, instruction following, and natural language generation, making it suitable for advanced tasks such as code generation, dialogue systems, and content creation.
22990+
22991+
This model is part of NVIDIA’s Nemotron series, designed to deliver high performance and safety in real-world applications. It is optimized for efficient deployment while maintaining strong language understanding and generation capabilities.
22992+
22993+
**Key Features:**
22994+
- **Base Model**: Qwen3-32B
22995+
- **Fine-tuning**: Reinforcement Learning from Human Feedback (RLBFF)
22996+
- **Use Case**: Advanced text generation, coding, dialogue, and reasoning
22997+
- **License**: MIT (check Hugging Face for full details)
22998+
22999+
👉 [View on Hugging Face](https://huggingface.co/nvidia/Qwen3-Nemotron-32B-RLBFF)
23000+
23001+
*Note: The GGUF version hosted by DevQuasar is a quantized variant for efficient local inference. The original, unquantized model is available at the link above.*
23002+
overrides:
23003+
parameters:
23004+
model: nvidia.Qwen3-Nemotron-32B-RLBFF.Q4_K_M.gguf
23005+
files:
23006+
- filename: nvidia.Qwen3-Nemotron-32B-RLBFF.Q4_K_M.gguf
23007+
sha256: 5dfc9f1dc21885371b12a6e0857d86d6deb62b6601b4d439e4dfe01195a462f1
23008+
uri: huggingface://DevQuasar/nvidia.Qwen3-Nemotron-32B-RLBFF-GGUF/nvidia.Qwen3-Nemotron-32B-RLBFF.Q4_K_M.gguf

0 commit comments

Comments
 (0)