
Update max_new_tokens slider in web gui to support at least 16k #3835

Closed
marmelade500 opened this issue Sep 7, 2023 · 4 comments
Labels: enhancement (New feature or request), stale

@marmelade500

maxnewtoken

marmelade500 added the enhancement (New feature or request) label Sep 7, 2023
@Flanua

Flanua commented Sep 7, 2023

I don't think there are any AI models that support that yet. You can use RoPE scaling for that if your model supports it.

P.S: More info here:
ggml-org/llama.cpp#2054

@berkut1
Contributor

berkut1 commented Sep 12, 2023

@Flanua I think the OP is talking about token generation (i.e., a long response), not context length.

If this is equal to llama.embedding_length, then some models support 5120 or even 16k.

@marmelade500
Author

marmelade500 commented Sep 19, 2023

For anyone who wants a workaround: create a copy of the settings YAML template, rename it to settings.yaml, and put your own custom settings in it. Those settings will be loaded when the program starts.
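A minimal sketch of what such a settings.yaml could contain, assuming the web UI reads the slider's default and upper bound from keys named max_new_tokens and max_new_tokens_max (key names based on the default settings template; verify them against your version's template before relying on this):

```yaml
# settings.yaml — hypothetical example; check your version's settings template for the exact key names
max_new_tokens: 512         # default value shown on the slider
max_new_tokens_max: 16384   # upper bound of the max_new_tokens slider in the web UI
```

Restart the web UI after saving the file so the new slider range is picked up.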

@github-actions github-actions bot added the stale label Nov 1, 2023

github-actions bot commented Nov 1, 2023

This issue has been closed due to inactivity for 6 weeks. If you believe it is still relevant, please leave a comment below. You can tag a developer in your comment.

@github-actions github-actions bot closed this as completed Nov 1, 2023