Guide To Optimize Machine Learning System: Introduction
WIP In this article, I am going to introduce some basic concepts and methods on performance engineering. Basic Workflow Measure the performance of Program A Calculate the theoretical peek performance of Program A with current input and hardware Profile Program A Think about possible changes and how much they would theoretically improve the performance Make a change to Program A to produce a hopefully faster Program A* Measure the performance of Program A* Compare the output of A* to that of A If A* get the same output as that of A and is faster than A, set A = A* If A is still not fast enough, go to Step 3 Measure and Profiling The “performance” here specifically refers to the latency and throughput of a program. I will not cover other metrics like QPS. ...