llama-cpp-python使用方法

2025-05-25 23:23:48

拼音 [ 拼音 ]

使用llama.cpp进行GGUF量化及基于llama-cpp-python的部署方法

RUN git clone https://github.com/ggerganov/llama.cpp RUN pip install gguf -i https://pypi.tuna.tsinghua.edu.cn/simple WORKDIR /llama.cpp RUN mkdir build WORKDIR /llama.cpp/build RUN cmake .. -DLLAMA_CUDA=ON RUN cmake --build . --config Release # python build RUN CMAKE_ARGS="-...