
Conversation

@Vikranth3140

This PR closes #143 by adding support for NVIDIA GPU acceleration in local LLM inference, allowing users with NVIDIA hardware to leverage CUDA for faster inference when running local models.

The integration makes local LLM usage optional and configurable via environment variables. NVIDIA GPUs are used transparently: if the local server (e.g., Ollama, LM Studio) is configured with CUDA support, inference runs on the GPU without any GPU-specific code in this project. No new dependencies are added, keeping the codebase lightweight.
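
For reviewers, here is a minimal sketch of what the opt-in path could look like from the application side, using only the standard library (consistent with the no-new-dependencies goal). The environment variable names (`LOCAL_LLM_BASE_URL`, `LOCAL_LLM_MODEL`) and the default Ollama endpoint are illustrative assumptions, not necessarily what this PR ships:

```python
import json
import os
import urllib.request

# Hypothetical variable names, for illustration only; the PR may use different ones.
# Ollama's OpenAI-compatible endpoint typically lives at http://localhost:11434/v1.
BASE_URL = os.environ.get("LOCAL_LLM_BASE_URL", "http://localhost:11434/v1")
MODEL = os.environ.get("LOCAL_LLM_MODEL", "llama3")


def chat(prompt: str) -> str:
    """Send one chat request to the local server using only the stdlib."""
    payload = json.dumps({
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    req = urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # If the server (Ollama, LM Studio, ...) was started with CUDA support,
    # the model ran on the NVIDIA GPU; nothing GPU-specific appears here.
    return body["choices"][0]["message"]["content"]


if __name__ == "__main__":
    print(chat("Hello from a local model"))
```

With this pattern, a user who launched their local server with a CUDA-enabled build gets GPU-accelerated responses with no change to the calling code; pointing `LOCAL_LLM_BASE_URL` at a CPU-only server works identically, just slower.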

Please review and merge if everything looks good! Let me know if any adjustments are needed.

