llama.cpp/llama_cpp/server/__main__.py

"""Example FastAPI server for llama.cpp.

To run this example:

```bash
pip install fastapi uvicorn sse-starlette
export MODEL=../models/7B/...
```

Then run:
```
uvicorn llama_cpp.server.app:app --reload
```

or

```
python3 -m llama_cpp.server
```

Then visit http://localhost:8000/docs to see the interactive API docs.

"""
import os
import uvicorn

from llama_cpp.server.app import create_app

if __name__ == "__main__":
    app = create_app()

    uvicorn.run(
        app, host=os.getenv("HOST", "localhost"), port=int(os.getenv("PORT", 8000))
    )
Add server as a subpackage 2023-04-05 16:23:25 -04:00			`"""Example FastAPI server for llama.cpp.`

			`To run this example:`

			```bash
			`pip install fastapi uvicorn sse-starlette`
			`export MODEL=../models/7B/...`
			```

llama_cpp server: app is now importable, still runnable as a module 2023-04-28 22:43:37 -07:00			`Then run:`
			```
			`uvicorn llama_cpp.server.app:app --reload`
			```
Add server as a subpackage 2023-04-05 16:23:25 -04:00
llama_cpp server: app is now importable, still runnable as a module 2023-04-28 22:43:37 -07:00			`or`
Add server as a subpackage 2023-04-05 16:23:25 -04:00
llama_cpp server: app is now importable, still runnable as a module 2023-04-28 22:43:37 -07:00			```
			`python3 -m llama_cpp.server`
			```
Add server as a subpackage 2023-04-05 16:23:25 -04:00
llama_cpp server: app is now importable, still runnable as a module 2023-04-28 22:43:37 -07:00			`Then visit http://localhost:8000/docs to see the interactive API docs.`
Add server as a subpackage 2023-04-05 16:23:25 -04:00
llama_cpp server: app is now importable, still runnable as a module 2023-04-28 22:43:37 -07:00			`"""`
			`import os`
			`import uvicorn`
Add server as a subpackage 2023-04-05 16:23:25 -04:00
Refactor server to use factory 2023-05-01 22:38:46 -04:00			`from llama_cpp.server.app import create_app`
Add server as a subpackage 2023-04-05 16:23:25 -04:00
			`if __name__ == "__main__":`
Refactor server to use factory 2023-05-01 22:38:46 -04:00			`app = create_app()`
Add server as a subpackage 2023-04-05 16:23:25 -04:00
Handle prompt list 2023-04-06 21:07:35 -04:00			`uvicorn.run(`
			`app, host=os.getenv("HOST", "localhost"), port=int(os.getenv("PORT", 8000))`
			`)`