sci-ml/ollama: fix broken CUDA support via dynamic GPU detection#409

Open
nabbi wants to merge 1 commit into gentoo:master from nabbi:ollama-cuda-gpu

Conversation

@nabbi

@nabbi nabbi commented Dec 29, 2025

Fix an issue where USE=cuda builds failed to provide GPU acceleration. Previous "native" build attempts were non-functional due to sandbox restrictions and incorrect architecture targeting.

  • Implement smart CUDAARCHS detection using __nvcc_device_query.
  • Add sandbox-aware hardware check to prevent 0x64 (No Device) errors.
  • Disable GGML_NATIVE to ensure specific GPU kernels are generated.
  • Default to 'all' (fat binary) if hardware is inaccessible.
  • Add pkg_pretend guidance for binary package (binpkg) portability.
  • Remove the duplicate backend install that caused CUDA tensor upload failures during model load (ollama/ollama#13614).
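A rough sketch of the detection logic the first bullets describe (the helper name `detect_cudaarchs` is illustrative, not the ebuild's actual code; `__nvcc_device_query` is the nvidia-cuda-toolkit helper that prints the compute capability of the visible GPU):

```shell
#!/bin/sh
# Pick CMAKE_CUDA_ARCHITECTURES from the device query output, falling back
# to "all" (fat binary) when no GPU is visible, e.g. inside the sandbox.
detect_cudaarchs() {
    # $1: output of __nvcc_device_query, empty if the query failed
    if [ -n "$1" ]; then
        printf '%s\n' "$1"
    else
        printf 'all\n'
    fi
}
```

Usage would be along the lines of `CUDAARCHS=$(detect_cudaarchs "$(__nvcc_device_query 2>/dev/null)")`, with the sandbox check deciding whether the query is attempted at all.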

Thanks :)

@nabbi
Author

nabbi commented Jan 5, 2026

@negril FYA

@nabbi
Author

nabbi commented Jan 5, 2026

We still need to address the concern of "backends end up in /usr/bin otherwise"

@negril
Contributor

negril commented Jan 5, 2026

Building without -DGGML_BACKEND_DIR="${EPREFIX}/usr/$(get_libdir)/${PN}/backends" yields (via app-portage/iwdevtools):

 * CMP: =sci-ml/ollama-9999 with sci-ml/ollama-9999/image
 *  FILES:+usr/bin/libggml-cpu-haswell.so
 *  FILES:+usr/bin/libggml-cpu-sandybridge.so
 *  FILES:+usr/bin/libggml-cpu-sse42.so
 *  FILES:+usr/bin/libggml-cpu-x64.so
 *  FILES:+usr/bin/libggml-cuda.so
 *  FILES:+usr/bin/libggml-vulkan.so
 *  FILES:-usr/lib64/ollama/backends/libggml-cpu-haswell.so
 *  FILES:-usr/lib64/ollama/backends/libggml-cpu-sandybridge.so
 *  FILES:-usr/lib64/ollama/backends/libggml-cpu-sse42.so
 *  FILES:-usr/lib64/ollama/backends/libggml-cpu-x64.so
 *  FILES:-usr/lib64/ollama/backends/libggml-cuda.so
 *  FILES:-usr/lib64/ollama/backends/libggml-vulkan.so
 * ------> FILES(+6,-6)

This is an issue upstream needs to fix (and is most likely caused by the incomplete import and abuse of ggml by ollama).

You can try setting -DGGML_BACKEND_DIR="${EPREFIX}/usr/$(get_libdir)/${PN}" and see if that fixes your issue. Otherwise we need to work out how to correct or remove the dupes.
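For reference, the suggested change would look roughly like this in the ebuild's src_configure (a sketch only; the rest of mycmakeargs is omitted, and EPREFIX, get_libdir, and PN are standard Portage ebuild facilities):

```shell
# Sketch: point GGML's backend dir at the package libdir itself rather than
# a backends/ subdirectory, so the duplicated install collapses onto one path.
src_configure() {
    local mycmakeargs=(
        -DGGML_BACKEND_DIR="${EPREFIX}/usr/$(get_libdir)/${PN}"
    )
    cmake_src_configure
}
```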


addpredict "/dev/char/" is needed once we remove the SANDBOX_PREDICT entries from nvidia-cuda-toolkit.


Passing -DGGML_NATIVE=OFF has no effect for cuda once we pass CMAKE_CUDA_ARCHITECTURES, see https://github.com/ollama/ollama/blob/v0.13.5/ml/backend/ggml/ggml/src/ggml-cuda/CMakeLists.txt#L25
But it will change behaviour for the cpu backend. So I'd rather not pass that.


The cuda changes are an amalgamation of my cuda stuff in various ebuilds. I'll see how much it differs from the wip eclass and if it makes sense to add that to ::guru for the time being.

@nabbi
Author

nabbi commented Jan 5, 2026

Okay, I'll tweak with your feedback and retest. Yes, I think that'll resolve the duplication.

Fix an issue where USE=cuda builds failed to provide GPU acceleration.
Previous "native" build attempts were non-functional due to sandbox
restrictions and incorrect architecture targeting.

- Implement smart CUDAARCHS detection using __nvcc_device_query.
- Add sandbox-aware hardware check to prevent 0x64 (No Device) errors.
- Disable GGML_NATIVE to ensure specific GPU kernels are generated.
- Default to 'all' (fat binary) if hardware is inaccessible.
- Add pkg_pretend guidance for binary package (binpkg) portability.
- Fix duplicate library install.

Signed-off-by: Nic Boet <nic@boet.cc>
@nabbi
Author

nabbi commented Jan 5, 2026

Update. lmk if I missed the mark.

Yes. I found various approaches to handling CMAKE_CUDA_ARCHITECTURES in the tree. It would be awesome to have some standardization for this in cuda.eclass, plus documentation of a global CUDAARCHS :)
So I made a best guess to keep this aligned, and also added a bit more logging to confirm it was no longer building as "native". Changes welcome.

@nabbi
Author

nabbi commented Jan 5, 2026

/var/tmp/portage/sci-ml/ollama-9999# tree image/
image/
├── etc
│   ├── conf.d
│   │   └── ollama
│   └── init.d
│       └── ollama
└── usr
    ├── bin
    │   └── ollama
    ├── lib
    │   └── systemd
    │       └── system
    │           └── ollama.service
    ├── lib64
    │   └── ollama
    │       ├── libggml-base.so -> libggml-base.so.0
    │       ├── libggml-base.so.0 -> libggml-base.so.0.0.0
    │       ├── libggml-base.so.0.0.0
    │       ├── libggml-cpu-x64.so
    │       └── libggml-cuda.so
    └── share
        └── doc
            └── ollama-9999
                └── README.md.bz2

14 directories, 10 files

@negril
Contributor

negril commented Jan 5, 2026

But does it work? I'll look at the cuda stuff tomorrow.

@nabbi
Author

nabbi commented Jan 5, 2026

But does it work?

Yes! It's a little quirky: it attempts to reinstall on top of libggml-cpu-x64.so and libggml-cuda.so. The copy operation happens twice; the second time reports "Up-to-date":

-- Installing: /var/tmp/portage/sci-ml/ollama-9999/image/usr/lib64/ollama/libggml-cpu-x64.so
-- Set non-toolchain portion of runtime path of "/var/tmp/portage/sci-ml/ollama-9999/image/usr/lib64/ollama/libggml-cpu-x64.so" to ""
-- Up-to-date: /var/tmp/portage/sci-ml/ollama-9999/image/usr/lib64/ollama/libggml-cpu-x64.so

-- Installing: /var/tmp/portage/sci-ml/ollama-9999/image/usr/lib64/ollama/libggml-cuda.so
-- Set non-toolchain portion of runtime path of "/var/tmp/portage/sci-ml/ollama-9999/image/usr/lib64/ollama/libggml-cuda.so" to ""
-- Up-to-date: /var/tmp/portage/sci-ml/ollama-9999/image/usr/lib64/ollama/libggml-cuda.so
