TensorFlow Serving performance optimization
Wei Wei, Developer Advocate at Google, shares general principles and best practices for improving TensorFlow Serving performance. He covers how to reduce latency across API surfaces, how to use batching, and which other parameters you can tune.

Resources:
TensorFlow Serving performance guide → https://goo.gle/3zW168E
Profile Inference Requests with TensorBoard → https://goo.gle/3zWjluJ
TensorFlow Serving batching configuration → https://goo.gle/3xT2SVz
[…]