llama.cpp/llama_cpp/server/__main__.py

"""Example FastAPI server for llama.cpp.

To run this example:

```bash
pip install fastapi uvicorn sse-starlette
export MODEL=../models/7B/...
```

Then run:

```bash
uvicorn llama_cpp.server.app:app --reload
```

or

```bash
python3 -m llama_cpp.server
```

Then visit http://localhost:8000/docs to see the interactive API docs.

"""
import os
import argparse

import uvicorn

from llama_cpp.server.app import create_app, Settings

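if __name__ == "__main__":
    # Expose every Settings field as an optional --<field> CLI flag.
    parser = argparse.ArgumentParser()
    for name, field in Settings.__fields__.items():
        description = field.field_info.description
        # Append the default to the help text when both are available.
        if field.default is not None and description is not None:
            description += f" (default: {field.default})"
        # Caveat: argparse calls type(value) on the raw string, so a field typed
        # as bool would treat any non-empty string (even "False") as True.
        parser.add_argument(
            f"--{name}",
            dest=name,
            type=field.type_,
            help=description,
        )

    args = parser.parse_args()
    # Drop flags that were not passed so Settings can fall back to environment
    # variables and its own defaults.
    settings = Settings(**{k: v for k, v in vars(args).items() if v is not None})
    app = create_app(settings=settings)

    # HOST and PORT environment variables, when set, override settings.host/port.
    uvicorn.run(
        app,
        host=os.getenv("HOST", settings.host),
        port=int(os.getenv("PORT", settings.port)),
    )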
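
Because the parser above passes each field's type straight to `argparse` as `type=`, boolean options deserve a closer look: `argparse` simply calls the type on the raw string, and `bool("False")` is `True`. The sketch below illustrates the pitfall and one possible workaround. The helper `parse_bool` and the flags `--naive` and `--strict` are hypothetical names used only for illustration; they are not part of `llama_cpp.server`.

```python
import argparse


def parse_bool(value: str) -> bool:
    """Hypothetical helper: map common true/false spellings to a bool."""
    lowered = value.strip().lower()
    if lowered in {"1", "true", "t", "yes", "y", "on"}:
        return True
    if lowered in {"0", "false", "f", "no", "n", "off"}:
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    # --naive uses type=bool, so argparse evaluates bool("False"), which is True.
    parser.add_argument("--naive", type=bool)
    # --strict uses the helper, so the string "False" is parsed as False.
    parser.add_argument("--strict", type=parse_bool)
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(["--naive", "False", "--strict", "False"])
    print(args.naive, args.strict)  # prints: True False
```

In a loop like the one over `Settings.__fields__`, one option would be to substitute such a helper whenever the field type is `bool` and keep the field's own type otherwise.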