How We Scaled Bert To Serve 1+ Billion Daily Requests on CPUs Roblox 2020 Permute, Quantize, and Fine-tune: Efficient Compression of Neural Networks (Paper) Uber 2021 GPU-accelerated ML Inference at Pinterest Pinterest 2022 Ethics Building Inclusive Products Through A/B Testing (Paper) LinkedIn ...