baalajimaestro/llama.cpp
3ca86ab390
llama.cpp/llama_cpp/server
Andrei Betlen
b8fc1c7d83
feat: Add ability to load chat format from huggingface autotokenizer or tokenizer_config.json files.
2024-01-18 21:21:37 -05:00
__init__.py
llama_cpp server: app is now importable, still runnable as a module
2023-04-29 11:41:25 -07:00
__main__.py
[Feat] Multi model support (#931)
2023-12-22 05:51:25 -05:00
app.py
Support Accept text/event-stream in chat and completion endpoints, resolves #1083 (#1088)
2024-01-16 12:52:52 -05:00
cli.py
[Feat] Multi model support (#931)
2023-12-22 05:51:25 -05:00
errors.py
server: Support none defaulting to infinity for completions (#111)
2023-12-22 14:05:13 -05:00
model.py
feat: Add ability to load chat format from huggingface autotokenizer or tokenizer_config.json files.
2024-01-18 21:21:37 -05:00
settings.py
feat: Add ability to load chat format from huggingface autotokenizer or tokenizer_config.json files.
2024-01-18 21:21:37 -05:00
types.py
server: Support none defaulting to infinity for completions (#111)
2023-12-22 14:05:13 -05:00