This directory includes the code used for estimating the latencies of computing the forward pass of off-the-shelf LLMs.