baalajimaestro/llama.cpp

Fork 0

Commit graph

Author	SHA1	Message	Date
MillionthOdin16	c283edd7f2	Set n_batch to default values and reduce thread count: Change batch size to the llama.cpp default of 8. I've seen issues in llama.cpp where batch size affects quality of generations. (It shouldn't) But in case that's still an issue I changed to default. Set auto-determined num of threads to 1/2 system count. ggml will sometimes lock cores at 100% while doing nothing. This is being addressed, but can cause bad experience for user if pegged at 100%	2023-04-05 18:17:29 -04:00
MillionthOdin16	76a82babef	Set n_batch to the default value of 8. I think this is leftover from when n_ctx was missing and n_batch was 2048.	2023-04-05 17:44:53 -04:00
Andrei Betlen	44448fb3a8	Add server as a subpackage	2023-04-05 16:23:25 -04:00

Author

SHA1

Message

Date

MillionthOdin16

c283edd7f2

Set n_batch to default values and reduce thread count:

Change batch size to the llama.cpp default of 8. I've seen issues in llama.cpp where batch size affects quality of generations. (It shouldn't) But in case that's still an issue I changed to default.

Set auto-determined num of threads to 1/2 system count. ggml will sometimes lock cores at 100% while doing nothing. This is being addressed, but can cause bad experience for user if pegged at 100%

2023-04-05 18:17:29 -04:00

MillionthOdin16

76a82babef

Set n_batch to the default value of 8. I think this is leftover from when n_ctx was missing and n_batch was 2048.

2023-04-05 17:44:53 -04:00

Andrei Betlen

44448fb3a8

Add server as a subpackage

2023-04-05 16:23:25 -04:00

1 2 3 4

153 commits