Steve Li
CHNtentes
AI & ML interests
None yet
Recent Activity
new activity 2 days ago
unsloth/MiniMax-M2.7-GGUF:larger file size for same quant new activity 3 days ago
Tongyi-MAI/MAI-UI-8B:will we ever get 32b and 235b versions? new activity 4 days ago
zai-org/GLM-5.1:GLM5.1角色问题-重要Organizations
None yet
larger file size for same quant
4
#4 opened 3 days ago
by
CHNtentes
will we ever get 32b and 235b versions?
#4 opened 3 days ago
by
CHNtentes
GLM5.1角色问题-重要
9
#17 opened 5 days ago
by
liuyt6515
can we get minimax-m2.7
🤗 13
5
#49 opened 28 days ago
by
CHNtentes
35b variant?
👍 4
9
#2 opened about 1 month ago
by
dagbs
FP8 Version for running on vLLM with hardware optimizations from Ada+ generation GPUs
4
#14 opened about 1 month ago
by
AQLabs
Could someone make Qwen/Qwen3.5-0.4B?
3
#4 opened about 1 month ago
by
MihaiPopa-1
Can we get a 9B-FP8 version next
👍 14
4
#5 opened about 1 month ago
by
kq
Massive clipping damage? Why is Q8KXL have F16 tensors/layers when it's a native BF16 model?
👍 1
8
#3 opened about 1 month ago
by
RyanoSaurus-Wrex
Is `qwen3_nonthinking.jinja` available for disabling thinking?
7
#23 opened about 2 months ago
by
kraftDong
what's this?
11
#1 opened about 2 months ago
by
Simon716
Quantization AWQ INT4
1
#17 opened about 2 months ago
by
abbas381366
数字和中文之间多了空格,在某些场景下完全用不了。
14
#15 opened about 2 months ago
by
waxwax0099
Update: Should now be Fixed - Bug in UD-Q4_K_XL recipe using MXFP4 for attn tensors and experts?
👍 8
26
#5 opened about 2 months ago
by
ubergarm
Official FP8
👀👍 14
4
#4 opened about 2 months ago
by
retowyss
llama cpp Error: Unknown (built-in) filter 'items' for type String
9
#2 opened about 2 months ago
by
fullstack
Multimodality not working
3
#4 opened about 2 months ago
by
meganoob1337
Prompt cache not working correctly
👀👍 6
1
#4 opened about 2 months ago
by
guiopen
Will there be an Instruct version?
3
#6 opened about 2 months ago
by
a-r-c
Instruct mode metrics
2
#3 opened about 2 months ago
by
Baskermas