4 changes: 4 additions & 0 deletions 4 vllm/config.py Original file line numberDiff line numberDiff line change @@ -282,6 +282,10 @@ def verify_with_parallel_config( raise ValueError( "BitAndBytes quantization with TP or PP is not supported yet.")...
Temporarily enforce eager mode for GPTQ models (vllm-project#2154) Browse files main (vllm-project/vllm#2154) WoosukKwon committed Dec 17, 2023 Verified 1 parent 26c52a5 commit 3a765bd Showing 1 changed file with 5 additions and 0 deletions. Whitespace Ignore whitespace Split Unified...
Thank you for contributing to the vLLM project. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only runfastcheckCI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by un...