和你一样设置的。我用样本数的倒数当做权重,但是效果比不设置权重使用交叉熵loss的模型差2% 可能我的...
weight_decay: 1 # weight decay multiplier for the filters weight_decay: 0 # weight decay multiplier for the biases convolution_param { num_output: 96 # learn 96 filters kernel_size: 11 # each filter is 11x11 stride: 4 # step 4 pixels between each filter application weight_filler { type...