Sips'@slrpnk.net to Selfhosted@lemmy.world, English · 1 year ago
Can't relate at all. (slrpnk.net)
209 comments
brucethemoose@lemmy.world · 1 year ago
Depends which 14B. Arcee's 14B SuperNova Medius model (which is a Qwen 2.5 with some training distilled from larger models) is really incredible, but old Llama 2-based 13B models are awful.

Hackworth@lemmy.world · 1 year ago
I'll try it out! It's been a hot minute, and it seems like there are new options all the time.

brucethemoose@lemmy.world · 1 year ago
Try a new quantization as well! Like an IQ4-M, depending on the size of your GPU, or even better, a 4.5bpw exl2 if you can manage to set up TabbyAPI.
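For anyone curious what "setting up TabbyAPI" for an exl2 quant actually involves: it is mostly editing its config.yml before launching the server. A minimal sketch, assuming the stock config.yml layout; the model directory name below is a placeholder for whichever exl2 quant you downloaded, and the exact keys may differ between TabbyAPI versions:

```yaml
# Minimal TabbyAPI config.yml sketch (keys assumed from the stock example
# config; model_name is a placeholder, not a real repo).
network:
  host: 127.0.0.1
  port: 5000

model:
  model_dir: models                          # folder holding exl2 quant directories
  model_name: SuperNova-Medius-4.5bpw-exl2   # placeholder directory name
  max_seq_len: 8192                          # lower this if you run out of VRAM
```

The bpw (bits per weight) figure is what brucethemoose is referring to: 4.5bpw is roughly comparable in size to an IQ4-M GGUF, but exl2 tends to run faster on a single GPU.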