RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published 16 days ago • 7
Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference Paper • 2202.10408 • Published Feb 21, 2022 • 5