Fix/openrouter rate limit and dmr #196

rendyhd · 2025-11-20T14:03:16Z

This PR fixes the following:

- The function now includes a retry mechanism with exponential backoff for handling rate limits (429 errors) and empty responses.
- Included the DMR (Docker Model Runner) compatibility fix.
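
For readers unfamiliar with the pattern, the retry behaviour described above could look roughly like the sketch below. This is not the code from this PR: `call_with_retry`, `RetriesExhausted`, the `send` callable, and all default values are hypothetical names and numbers chosen for illustration. It assumes `send` returns a `(status_code, body_text)` pair, retries on HTTP 429 and on an empty 200 body, and doubles the wait after each failed attempt.

```python
import random
import time
from typing import Callable, Tuple


class RetriesExhausted(Exception):
    """Raised when every attempt was rate limited or returned an empty body."""


def call_with_retry(
    send: Callable[[], Tuple[int, str]],   # hypothetical: performs one API call
    max_retries: int = 5,                  # assumed default, not from the PR
    base_delay: float = 1.0,               # seconds before the first retry
    max_delay: float = 30.0,               # upper bound for a single wait
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    for attempt in range(max_retries):
        status, text = send()

        # Success: HTTP 200 with a non-empty body.
        if status == 200 and text.strip():
            return text

        # Anything other than a rate limit or an empty 200 is not retried here.
        if status not in (200, 429):
            raise RuntimeError(f"non-retryable status {status}")

        # Exponential backoff: base_delay * 2^attempt, capped at max_delay,
        # plus up to 10% random jitter so parallel workers do not retry in sync.
        delay = min(max_delay, base_delay * (2 ** attempt))
        delay += random.uniform(0, delay * 0.1)

        # No point sleeping after the final attempt.
        if attempt < max_retries - 1:
            sleep(delay)

    raise RetriesExhausted(f"gave up after {max_retries} attempts")
```

Passing `sleep` as a parameter keeps the sketch easy to exercise without real waiting; a caller would normally leave it at the default.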

…s8mGAn5UghuFemm5ydF3 Claude/add openrouter support 01 c fs8m g an5 ughu femm5yd f3

CONFIRMED: GPU (CUDA) is currently only used for Analysis, not Clustering

RESEARCH FINDINGS:
- GPU acceleration can provide 10-30x speedup for clustering tasks
- KMeans: 10-50x faster, DBSCAN: 5-100x faster, PCA: 10-40x faster
- Example: 5000 clustering runs that take 2-4 hours on CPU can complete in 5-15 minutes on GPU

IMPLEMENTATION:
Implemented GPU-accelerated clustering using RAPIDS cuML as an optional feature:

New Features:
- GPU-accelerated KMeans, DBSCAN, and PCA using RAPIDS cuML
- Automatic fallback to CPU if GPU unavailable or encounters errors
- New environment variable USE_GPU_CLUSTERING (default: false)
- Maintains all existing clustering options and settings
- Compatible with CUDA 12.2+ and NVIDIA GPUs

Files Modified:
1. config.py:
   - Added USE_GPU_CLUSTERING configuration flag
2. tasks/clustering_gpu.py (NEW):
   - Created GPU clustering module with RAPIDS cuML implementations
   - GPU-accelerated classes: GPUKMeans, GPUDBSCAN, GPUPCA
   - CPU-only wrappers: GPUGaussianMixture, GPUSpectralClustering
   - Factory functions for model creation with GPU/CPU selection
   - Automatic GPU availability detection and graceful fallback
3. tasks/clustering_helper.py:
   - Imported GPU clustering module with fallback handling
   - Updated _perform_single_clustering_iteration to use GPU PCA when enabled
   - Modified _apply_clustering_model to support GPU clustering
   - Maintains full backward compatibility with CPU-only mode
4. Dockerfile:
   - Added cupy-cuda12x and cuml-cu12 installation for NVIDIA builds
   - Only installs GPU packages when BASE_IMAGE is nvidia/cuda
   - CPU builds remain unchanged and lightweight
5. deployment/.env.example:
   - Added USE_GPU_CLUSTERING configuration with documentation
   - Default: false (CPU only, backward compatible)
6. README.md:
   - Added "GPU Acceleration for Clustering" section
   - Documented performance improvements and usage instructions
   - Listed supported algorithms and compatibility requirements
   - Noted that GaussianMixture and SpectralClustering use CPU (no GPU version)

Notes:
- GPU clustering is OPTIONAL and disabled by default
- CPU clustering remains the default for backward compatibility
- All existing clustering parameters and settings are preserved
- GaussianMixture and SpectralClustering always use CPU (no cuML implementation)
- GPU usage: Analysis (ONNX inference) + Clustering (RAPIDS cuML, optional)
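
As a rough illustration of the opt-in flag and CPU fallback described in this commit message, a factory could be sketched as follows. This is not the content of `tasks/clustering_gpu.py`: `env_flag`, `select_backend`, and `make_kmeans` are hypothetical names, and the set of accepted truthy strings is an assumption. Only `USE_GPU_CLUSTERING`, cuML, and the CPU-by-default behaviour come from the text above.

```python
import importlib
import os


def env_flag(name: str, default: bool = False) -> bool:
    """Read a boolean environment variable (accepted spellings are assumed)."""
    raw = os.environ.get(name)
    if raw is None:
        return default
    return raw.strip().lower() in {"1", "true", "yes", "on"}


# Disabled by default, matching the documented default of "false".
USE_GPU_CLUSTERING = env_flag("USE_GPU_CLUSTERING")


def select_backend(use_gpu: bool) -> str:
    """Return "gpu" only when requested AND cuML imports cleanly."""
    if not use_gpu:
        return "cpu"
    try:
        importlib.import_module("cuml")
        return "gpu"
    except Exception:
        # Missing package, missing driver, or no device: fall back to CPU.
        return "cpu"


def make_kmeans(n_clusters: int, use_gpu: bool = USE_GPU_CLUSTERING):
    """Hypothetical factory: build a KMeans model on the selected backend."""
    if select_backend(use_gpu) == "gpu":
        from cuml.cluster import KMeans as GpuKMeans
        return GpuKMeans(n_clusters=n_clusters, random_state=0)
    # Imported lazily so the module loads even where scikit-learn is absent.
    from sklearn.cluster import KMeans
    return KMeans(n_clusters=n_clusters, n_init=10, random_state=0)
```

Resolving the backend at call time rather than at import time means a CPU-only image never needs cuML installed, which is in line with the note that CPU builds stay lightweight.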

…Ju9JeTX1N4popdd3Kt6FoM

The bash -lc subshell prevented access to the BASE_IMAGE ARG, causing GPU packages (cupy, cuml) to never be installed. Changed to use 'set -ux;' pattern (matching base stage) which properly accesses Docker ARG variables in the current shell. This ensures cuML and cupy are installed when building with nvidia/cuda base images, enabling GPU-accelerated clustering.

The new Docker image needed packages that were quite large, resulting in a very slow Docker build.

Feat/gpu clustering

…R fix

rendyhd · 2025-11-20T14:05:08Z

Oh, this is on top of #195, sorry for the mess.

NeptuneHub · 2025-11-20T17:48:49Z

Is this still for clustering on GPU? Can you add this change to #195, so that when it's finished I can download everything at once and do only "one round of tests"? Thanks.

rendyhd · 2025-11-20T20:32:52Z

Included with #195

rendyhd and others added 9 commits November 19, 2025 11:33

Merge pull request #6 from rendyhd/claude/add-openrouter-support-01CF…

63f34e2

…s8mGAn5UghuFemm5ydF3 Claude/add openrouter support 01 c fs8m g an5 ughu femm5yd f3

Merge branch 'NeptuneHub:main' into claude/gpu-clustering-research-01…

7ba7ced

…Ju9JeTX1N4popdd3Kt6FoM

Fix clustering GPU acceleration, but slow docker build

393086c

Improve DockerFile for GPU Clustering build

08ccca0

The new Docker image needed packages that were quite large, resulting in a very slow Docker build.

Merge branch 'NeptuneHub:main' into feat/gpu-clustering

d3a643d

Merge pull request #7 from rendyhd/feat/gpu-clustering

968ad88

Feat/gpu clustering

Fix for openrouter "max retries" and empty responses, included the DM…

849c68f

…R fix

rendyhd mentioned this pull request Nov 20, 2025

Add Docker Model Runner–Ready Compose Configuration #194

Open

Changed base image to cudnn

8739f81

rendyhd closed this Nov 20, 2025

rendyhd mentioned this pull request Nov 20, 2025

GPU Accelaration for Clustering #195

Open
