OpenHermes 2.5 Mistral 7B
teknium/openhermes-2.5-mistral-7b
A continuation of OpenHermes 2 model, trained on additional code datasets. Potentially the most interesting finding from training on a good ratio (est. of around 7-14% of the total dataset) of code instruction was that it has boosted several non-code benchmarks, including TruthfulQA, AGIEval, and GPT4All suite. It did however reduce BigBench benchmark score, but the net gain overall is significant.
Modalities
Context
Low
4K
Released
Nov 20, 2023
Knowledge Cutoff
Sep 2023
Activity
Token volume and request traffic to this model over time.