Please add IQ3_KT and IQ4_KT
1 reaction
1
#8 opened 5 months ago by KeinNiemand
CUDA error when prompt start processing
1 reaction
1
#7 opened 7 months ago by illeniumx
Custom jinja template and draft model usage
1 reaction
9
#6 opened 8 months ago by ubergarm
KL Divergence as Performance Metric
1
#5 opened 9 months ago by joaquinrfs
IQ2_KL Testing - Runs Great Until The Model The Model The Model (lol)
🔥 1
8
#4 opened 9 months ago by phakio
Can you provide some low-precision quantization options?
3 reactions
11
#3 opened 9 months ago by lingyezhixing
Good job
56
#2 opened 9 months ago by huccjj
Works like a charm on ik_llama.cpp server with PR 668
🔥 3
11
#1 opened 9 months ago by Nexesenex