node01: VM-0-14-centos:106:196 [0] transport/net_ib.cc:73 NCCL WARN NET/IB : Got async event : GID table change node01: node01: VM-0-14-centos:107:194 [0] transport/net_ib.cc:73 NCCL WARN NET/IB : Got async event : GID table change node01: node01: VM-0-14-centos:108...
IBPB: disabled, STIBP: disabled Vulnerability Srbds: Not affected Vulnerability Tsx async abort: Not affected Versions of relevant libraries: [pip3] mypy==1.9.0 [pip3] mypy-extensions==1.0.0 [pip3] numpy==1.24.4 [pip3] onnx==1.15.0rc2 [pip3] onnx-graphsurgeon==0.3.27 [pip3] onn...
ib_modify_qp_is_ok 也被更新以考虑链路层。 有些参数对于以太网链路层是必需的,而对于IB来说则无关。 修改供应商驱动程序以支持新的函数签名 rdma_lag_get_ah_roce_slave rdma_read_gid_attr_ndev_rcu rdma_get_xmit_slave_udp rdma_build_skb netdev_get_xmit_slave RDMA_LAG_FLAGS_HASH_ALL_SLAVES ...
ucx建连 staticucs_status_tuct_ud_iface_create_qp(uct_ud_iface_t*self,constuct_ud_iface_config_t*config){uct_ud_iface_ops_t*ops=ucs_derived_of(self->super.ops,uct_ud_iface_ops_t);uct_ib_qp_attr_tqp_init_attr={};structibv_qp_attrqp_attr;staticucs_status_tstatus;intret;qp_...
vllm 0.4.0.post1 docker image how ran: docker run -d \ --runtime=nvidia \ --gpus '"device=0,1"' \ --shm-size=10.24gb \ -p 5002:5002 \ -e NCCL_IGNORE_DISABLED_P2P=1 \ -v /etc/passwd:/etc/passwd:ro \ -v /etc/group:/etc/group:ro \ -u `id -u`:`id -g` \ -v...
IBPB conditional, RSB filling, PBRSB-eIBRS SW sequence Vulnerability Srbds: Not affected Vulnerability Tsx async abort: Mitigation; Clear CPU buffers; SMT Host state unknown Versions of relevant libraries: [pip3] mypy==1.9.0 [pip3] mypy-extensions==1.0.0 [pip3] numpy==1.26.4 [pip3] to...