.. |
ext_server
|
IBM granite/granitemoe architecture support (#6760)
|
2024-10-17 11:59:52 -07:00 |
generate
|
llm: Remove GGML_CUDA_NO_PEER_COPY for ROCm (#7174)
|
2024-10-12 09:56:49 -07:00 |
llama.cpp@3f1ae2e32c
|
IBM granite/granitemoe architecture support (#6760)
|
2024-10-17 11:59:52 -07:00 |
patches
|
IBM granite/granitemoe architecture support (#6760)
|
2024-10-17 11:59:52 -07:00 |
filetype.go
|
Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322)
|
2024-05-23 13:21:49 -07:00 |
ggla.go
|
image processing for llama3.2 (#6963)
|
2024-10-18 16:12:35 -07:00 |
ggml.go
|
image processing for llama3.2 (#6963)
|
2024-10-18 16:12:35 -07:00 |
ggml_test.go
|
llm: speed up gguf decoding by a lot (#5246)
|
2024-06-24 21:47:52 -07:00 |
gguf.go
|
image processing for llama3.2 (#6963)
|
2024-10-18 16:12:35 -07:00 |
llm_darwin.go
|
Optimize container images for startup (#6547)
|
2024-09-12 12:10:30 -07:00 |
llm_linux.go
|
Optimize container images for startup (#6547)
|
2024-09-12 12:10:30 -07:00 |
llm_windows.go
|
runner: Set windows above normal priority (#6905)
|
2024-09-21 16:54:49 -07:00 |
memory.go
|
image processing for llama3.2 (#6963)
|
2024-10-18 16:12:35 -07:00 |
memory_test.go
|
Rename gpu package discover (#7143)
|
2024-10-16 17:45:00 -07:00 |
server.go
|
image processing for llama3.2 (#6963)
|
2024-10-18 16:12:35 -07:00 |
status.go
|
Catch one more error log
|
2024-08-05 09:28:07 -07:00 |