From 3ecae420ac3569f7feee6ab2577811ea01959d66 Mon Sep 17 00:00:00 2001
From: Darinka <39233990+Darinochka@users.noreply.github.com>
Date: Tue, 7 May 2024 00:39:58 +0300
Subject: [PATCH] Update api.md (#3945)

* Update api.md

Changed the calculation of tps (token/s) in the documentation

* Update docs/api.md

---------

Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
---
 docs/api.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/api.md b/docs/api.md
index e79b6f5a..2f52c55a 100644
--- a/docs/api.md
+++ b/docs/api.md
@@ -95,7 +95,7 @@ The final response in the stream also includes additional data about the generat
 - `context`: an encoding of the conversation used in this response, this can be sent in the next request to keep a conversational memory
 - `response`: empty if the response was streamed, if not streamed, this will contain the full response
 
-To calculate how fast the response is generated in tokens per second (token/s), divide `eval_count` / `eval_duration`.
+To calculate how fast the response is generated in tokens per second (token/s), divide `eval_count` / `eval_duration` * `10^9`.
 
 ```json
 {