🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. - Fix bug in apply_rotary_pos_emb_flashatt: in Qwen2-5-VL (#36065) · huggingface/transformers@014047e
定义apply_rotary_pos_emb函数: 该函数接受两个参数:query_layer和rotary_pos_emb。 query_layer的形状通常为[seq_len, batch_size, num_heads, head_dim]。 rotary_pos_emb的形状通常为[seq_len, num_heads, head_dim // 2, 2],其中2代表复数的实部和虚部。 在函数内部实现旋转位置嵌入的逻辑: 首先...
Assign User on Comment torch.onnx.export (dynamo=False) fails with uninformative error when exporting apply_rotary_pos_emb/repeat_interleave #147296 Sign in to view logs Summary Jobs assign Run details Usage Workflow file Triggered via issue February 11, 2025 20:13 xenova commented on #14...