Llama 3.1 producing invalid output from Chat #9

Closed
jmont-dev opened this issue Aug 10, 2024 · 3 comments

@jmont-dev (Owner)

Llama 3.1 appears to have issues with the Chat endpoint, often producing empty or incoherent responses. The model works as expected with the standard generate endpoint.

Other models such as mistral and phi work without issues. There may be options or a special request structure required for Llama 3.
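
For anyone trying to reproduce this, a rough sketch of the two calls being compared. The ollama::generate/ollama::chat names follow the README-style single-header interface; treat the exact signatures as assumptions, and llama3.1 must already be pulled locally:

```cpp
// Rough reproduction sketch; call signatures are assumed from the README,
// not verified against this exact revision.
#include "ollama.hpp"
#include <iostream>

int main() {
    // Plain completion against /api/generate: works for llama3.1.
    std::cout << ollama::generate("llama3.1", "Why is the sky blue?") << std::endl;

    // Chat against /api/chat: frequently empty or incoherent for llama3.1.
    ollama::message msg("user", "Why is the sky blue?");
    std::cout << ollama::chat("llama3.1", msg) << std::endl;
}
```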

@jmont-dev jmont-dev added the bug Something isn't working label Aug 10, 2024
@jmont-dev jmont-dev added this to the 1.0 Release milestone Aug 10, 2024
@jmont-dev jmont-dev self-assigned this Aug 10, 2024
@jmont-dev (Owner, Author)

This results from providing a format value in the chat request, e.g. "format": "json". The API documentation says this should be acceptable, but it appears to produce incoherent responses from some models. The field has been commented out for now, and the issue will be raised upstream with the Ollama team so they can address it or comment further.

The change removing the format field from chat requests was merged into main with #11.
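
For context, this is roughly the request body involved, sketched here with nlohmann::json. The field names follow the public Ollama API docs; the model and prompt are just placeholders:

```cpp
// Illustration only: the /api/chat request body in question. Leaving the
// "format" field out mirrors the workaround merged in #11.
#include <nlohmann/json.hpp>
#include <iostream>

int main() {
    nlohmann::json chat_request;
    chat_request["model"] = "llama3.1";
    chat_request["messages"] = nlohmann::json::array({
        { {"role", "user"}, {"content", "Why is the sky blue?"} }
    });

    // Valid per the API docs, but correlated with the empty/incoherent chat
    // replies described above, so it is no longer sent by default:
    // chat_request["format"] = "json";

    std::cout << chat_request.dump(2) << std::endl;
}
```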

@JG-Adams

Removing format made both Llama 3.1 and Llama 3 work for me with MinGW on Windows. What is format supposed to do?

@jmont-dev (Owner, Author)

Format represents the type of message returned from the Ollama server. Ollama currently only provides responses as JSON in HTTP replies to clients. The API lists this field as part of the specification and states that "json" is the only valid option. It's probably there to future-proof the API in case support for other formats (e.g. YAML, TOML, or XML) is added later.
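
To make that concrete, every reply from the server already arrives as a JSON document. A non-streaming /api/chat reply can be unpacked like this (sketch with nlohmann::json; the reply body below is hand-written from the documented response shape, not captured from a real run):

```cpp
// Sketch: pulling the assistant text out of a non-streaming /api/chat reply.
#include <nlohmann/json.hpp>
#include <iostream>
#include <string>

int main() {
    std::string body = R"({
        "model": "llama3.1",
        "message": { "role": "assistant", "content": "The sky is blue because..." },
        "done": true
    })";

    nlohmann::json reply = nlohmann::json::parse(body);
    std::cout << reply["message"]["content"].get<std::string>() << std::endl;
}
```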

It definitely causes issues with some model responses, so the field may be inadvertently passed on to the model itself, which causes it to produce incoherent output. Marking this as closed for now; I'll push this up to the Ollama team to investigate in the base project.
