usually an empty string for the top-level model and a string like ``"vision"`` or ``"language"`` for the sub-models. In general, it matches the name of the different parts of the model are quantized differently. The `prefix` is usually an empty string for the top-level model and ...