ggml+cuda+force+cublas+no

2025-03-30 11:22:03

拼音 [ 拼音 ]

简拼 [ 简拼 ]

含义

Misc. bug: CUDA error: CUDA-capable device(s) is/are busy or...

Name and Version ./llama-cli --version [bin]$ ./llama-cli --version ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 3 CUDA devices: Device 0: GRID A100D-16C, compute capability 8.0...
...with cuda12.8 and deepseekr1q6 · Issue #11965 · ggml-org...

ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 1 CUDA devices: Device 0: NVIDIA GeForce RTX 2080 Ti, compute capability 7.5, VMM: yes|model|size|params|backend|ngl|mmap|test|t/s||---|---:|---:|---|--:|---:|---:|---:|main: error: failed to load ...
...GGUF model when using llama-cli · Issue #11111 · ggml...

Relevant log output ./build/bin/llama-cli -m Meta-llama3-8B-fp16.gguf -p"you are an assiatant"-ngl 33 -c 8192 -cnv --no-context-shift ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 1 CUDA devices: Device 0: NVID...
...on Qwen 2 with HIP/ROCm · Issue #11153 · ggml-org/llama...

ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 1 ROCm devices: Device 0: AMD Radeon RX 7900 XTX, compute capability 11.0, VMM: no | model | size | params | backend | ngl | test | t/s | | --- | ---: | ---...
...against DeepSeek-R1 GGUF · Issue #11635 · ggml-org/llama...

Name and Version $ ./build/bin/llama-cli --version ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 1 CUDA devices: Device 0: NVIDIA L40S, compute capability 8.9, VMM: yes version: ...
...when cache quantization specified · Issue #11200 · ggml...

Name and Version llama-cli --version ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 2 CUDA devices: Device 0: NVIDIA GeForce RTX 3090 Ti, compute capability 8.6, VMM: yes Device 1...
...template will miss the <think> tag · Issue #12107 · ggml...

Name and Version ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 1 CUDA devices: Device 0: Tesla T4, compute capability 7.5, VMM: yes version: 4790 (438a839) Operating systems Linu...
Eval bug: GGML_ASSERT(hparams.n_embd_head_k % ggml_blck_size...

Name and Version ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 3 CUDA devices: Device 0: Tesla P40, compute capability 6.1, VMM: yes Device 1: Tesla P40, compute capability 6.1, ...
...57b-a14b-instruct-fp16. · Issue #9628 · ggml-org/llama.cpp

ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 1 CUDA devices: Device 0: NVIDIA GeForce RTX 3050 Laptop GPU, compute capability 8.6, VMM: yes register_backend: registered backend CUDA (1 devices) ...
...api first query very slow · Issue #9492 · ggml-org/llama...

ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no ggml_cuda_init: found 2 CUDA devices: Device 0: NVIDIA L20, compute capability 8.9, VMM: yes Device 1: NVIDIA L20, compute capability 8.9, VMM: yes ...

快搜汉语词典

ggml+cuda+force+cublas+no

拼音 [ 拼音 ]

简拼 [ 简拼 ]

含义

Misc. bug: CUDA error: CUDA-capable device(s) is/are busy or...

...with cuda12.8 and deepseekr1q6 · Issue #11965 · ggml-org...

...GGUF model when using llama-cli · Issue #11111 · ggml...

...on Qwen 2 with HIP/ROCm · Issue #11153 · ggml-org/llama...

...against DeepSeek-R1 GGUF · Issue #11635 · ggml-org/llama...

...when cache quantization specified · Issue #11200 · ggml...

...template will miss the <think> tag · Issue #12107 · ggml...

Eval bug: GGML_ASSERT(hparams.n_embd_head_k % ggml_blck_size...

...57b-a14b-instruct-fp16. · Issue #9628 · ggml-org/llama.cpp

...api first query very slow · Issue #9492 · ggml-org/llama...

缩写

今日热搜

上海网友集中晒蘑菇

近反义词

相关词语

相关搜索