This website requires JavaScript.
Explore
Help
Sign in
baalajimaestro
/
llama.cpp
Watch
1
Star
0
Fork
You've already forked llama.cpp
0
Code
Issues
Pull requests
Projects
Releases
Packages
Wiki
Activity
f9b7221c8f
llama.cpp
/
llama_cpp
/
server
History
Andrei Betlen
f9b7221c8f
Merge branch 'main' of github.com:abetlen/llama_cpp_python into main
2024-05-03 19:07:54 -04:00
..
__init__.py
llama_cpp server: app is now importable, still runnable as a module
2023-04-29 11:41:25 -07:00
__main__.py
feat: Add support for yaml based configs
2024-04-10 02:47:01 -04:00
app.py
feat: add
disable_ping_events
flag (
#1257
)
2024-04-17 10:08:19 -04:00
cli.py
Fix python3.8 support
2024-01-19 08:17:49 -05:00
errors.py
misc: Format
2024-02-28 14:27:40 -05:00
model.py
fix(server): Propagate
flash_attn
to model load. (
#1424
)
2024-05-03 12:17:07 -04:00
settings.py
feat: Add option to enable
flash_attn
to Lllama params and ModelSettings
2024-04-30 09:29:16 -04:00
types.py
feat(server): Remove temperature bounds checks for server.
Closes
#1384
2024-05-03 15:23:06 -04:00