This website requires JavaScript.
Explore
Help
Sign in
baalajimaestro
/
llama.cpp
Watch
1
Star
0
Fork
You've already forked llama.cpp
0
Code
Issues
Pull requests
Projects
Releases
Packages
Wiki
Activity
48c3b77e6f
llama.cpp
/
llama_cpp
/
server
History
Andrei Betlen
48c3b77e6f
Offload KQV by default
2024-01-18 11:08:57 -05:00
..
__init__.py
llama_cpp server: app is now importable, still runnable as a module
2023-04-29 11:41:25 -07:00
__main__.py
[Feat] Multi model support (
#931
)
2023-12-22 05:51:25 -05:00
app.py
Support Accept text/event-stream in chat and completion endpoints,
resolves
#1083
(
#1088
)
2024-01-16 12:52:52 -05:00
cli.py
[Feat] Multi model support (
#931
)
2023-12-22 05:51:25 -05:00
errors.py
server: Support none defaulting to infinity for completions (
#111
)
2023-12-22 14:05:13 -05:00
model.py
Implement GGUF metadata KV overrides (
#1011
)
2024-01-15 12:29:29 -05:00
settings.py
Offload KQV by default
2024-01-18 11:08:57 -05:00
types.py
server: Support none defaulting to infinity for completions (
#111
)
2023-12-22 14:05:13 -05:00