# llama.cpp/examples/high_level_api_streaming.py
"""Example: stream a completion from a llama.cpp model via the high-level API.

Loads a model, runs one prompt with ``stream=True``, and prints each partial
completion chunk as pretty-printed JSON as it arrives.
"""
import json
import argparse

from llama_cpp import Llama


def main() -> None:
    """Parse CLI args, run one streaming completion, print each chunk."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "-m", "--model", type=str, default="./models/...",
        help="Path to the model file to load.",
    )
    args = parser.parse_args()

    llm = Llama(model_path=args.model)

    # stream=True makes the call return an iterator of partial-completion
    # dicts instead of a single response dict.
    stream = llm(
        "Question: What are the names of the planets in the solar system? Answer: ",
        max_tokens=48,
        stop=["Q:", "\n"],  # halt at the next question or end of line
        stream=True,
    )

    # Dump each chunk as indented JSON so its structure is visible.
    for output in stream:
        print(json.dumps(output, indent=2))


if __name__ == "__main__":
    main()