Convert this model to FP8-Dynamic with llm-compressor but failed
#3 opened 6 days ago
by
elichen-skymizer
RuntimeError: indices should be either on cpu or on the same device as the indexed tensor (cuda:5)
#2 opened 2 months ago
by
freebsdx
Please make an official GGUF
#1 opened 4 months ago
by
PlayAI