This website requires JavaScript.
Explore
Help
Sign in
baalajimaestro
/
llama.cpp
Watch
1
Star
0
Fork
You've already forked llama.cpp
0
Code
Issues
Pull requests
Projects
Releases
Packages
Wiki
Activity
c96b2daebf
llama.cpp
/
llama_cpp
/
server
History
ddh0
c96b2daebf
feat: Use all available CPUs for batch processing (
#1345
)
2024-04-17 10:05:54 -04:00
..
__init__.py
llama_cpp server: app is now importable, still runnable as a module
2023-04-29 11:41:25 -07:00
__main__.py
feat: Add support for yaml based configs
2024-04-10 02:47:01 -04:00
app.py
feat: Add support for yaml based configs
2024-04-10 02:47:01 -04:00
cli.py
Fix python3.8 support
2024-01-19 08:17:49 -05:00
errors.py
misc: Format
2024-02-28 14:27:40 -05:00
model.py
feat: add support for KV cache quantization options (
#1307
)
2024-04-01 10:19:28 -04:00
settings.py
feat: Use all available CPUs for batch processing (
#1345
)
2024-04-17 10:05:54 -04:00
types.py
feat: Add logprobs support to chat completions (
#1311
)
2024-03-31 13:30:13 -04:00