-
Notifications
You must be signed in to change notification settings - Fork 11.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to convert Mistral-7B-OpenOrca to GGUF #3583
Comments
find this in SentencePieceVocab (convert.py) and just use
mine is |
wrote a PR #3585 |
For a quick workaround, for this model specifically, in added_tokens.json delete lines with tokens 0,1,2, because those particular tokens are basically always assumed to be in the vocab already, and you are not removing them, that definition isn't needed in this specific case. |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
Hello,
I am attempting to convert the Mistral-7B-OpenOrca to GGUF using "convert.py"
I understand that TheBloke has released a GGUF version, however I am wanting to convert it myself on my local computer.
However I keep getting the error:
Exception: Expected added token IDs to be sequential and start at 6; got [0, 1, 2, 32000, 32001, 32002]
I believe this is due to the additional "added_tokens.json" file that Mistral-7B-OpenOrca has.
A similar issue was reported here
However there was no fix?
Any solutions or pointers would be greatly appreciated.
Thanks,
Nate
The text was updated successfully, but these errors were encountered: