Starting from version 2.1.0, Intel® Extension for PyTorch* includes optimizations targeted at specific large language models (LLMs). See LLM optimizations for details.

Optimized Model List

MODEL FAMILY | MODEL NAME (Huggingface hub) | FP32 | BF16 | Static quantization INT8 | Weight only quantization INT8 | Weight only ...