# per-token latency calculation