-
Phi-4 Technical Report
Paper • 2412.08905 • Published • 122 -
Evaluating and Aligning CodeLLMs on Human Preference
Paper • 2412.05210 • Published • 50 -
Evaluating Language Models as Synthetic Data Generators
Paper • 2412.03679 • Published • 48 -
Yi-Lightning Technical Report
Paper • 2412.01253 • Published • 28
Collections
Discover the best community collections!
Collections including paper arxiv:2409.12186
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 297k • • 1.96k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 677k • • 571 -
Qwen/Qwen2.5-Coder-32B-Instruct-GGUF
Text Generation • 33B • Updated • 18.5k • 177 -
dphn/dolphin-2.9.2-qwen2-72b
Text Generation • 73B • Updated • 787 • 170
-
LLMs + Persona-Plug = Personalized LLMs
Paper • 2409.11901 • Published • 35 -
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper • 2409.12183 • Published • 39 -
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper • 2402.12875 • Published • 13 -
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper • 2410.00531 • Published • 34
-
Qwen2.5 Coder Artifacts
🐢1.7kCreate and view code for applications using text prompts
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 297k • • 1.96k -
Qwen/Qwen2.5-Coder-32B
Text Generation • 33B • Updated • 13k • • 136 -
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 152
-
Phi-4 Technical Report
Paper • 2412.08905 • Published • 122 -
Evaluating and Aligning CodeLLMs on Human Preference
Paper • 2412.05210 • Published • 50 -
Evaluating Language Models as Synthetic Data Generators
Paper • 2412.03679 • Published • 48 -
Yi-Lightning Technical Report
Paper • 2412.01253 • Published • 28
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 297k • • 1.96k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 677k • • 571 -
Qwen/Qwen2.5-Coder-32B-Instruct-GGUF
Text Generation • 33B • Updated • 18.5k • 177 -
dphn/dolphin-2.9.2-qwen2-72b
Text Generation • 73B • Updated • 787 • 170
-
LLMs + Persona-Plug = Personalized LLMs
Paper • 2409.11901 • Published • 35 -
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper • 2409.12183 • Published • 39 -
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper • 2402.12875 • Published • 13 -
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper • 2410.00531 • Published • 34
-
Qwen2.5 Coder Artifacts
🐢1.7kCreate and view code for applications using text prompts
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 297k • • 1.96k -
Qwen/Qwen2.5-Coder-32B
Text Generation • 33B • Updated • 13k • • 136 -
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 152