Qwen/QwQ-32B · Hugging Face (huggingface.co)
@Lantier@jlai.lu to LocalLLaMA@sh.itjust.works • English • 4 months ago • 118 upvotes • 5 comments
insane, absolutely insane

suoko • 4 months ago
Why insane? For quality, speed, size? I find the coder 1.5b and 3b light and good

@morrowind@lemm.ee • 4 months ago
It matches R1 in the given benchmarks. R1 has 671B params (37B activated) while this only has 32B