From Word to World: Can Large Language Models be Implicit Text-based World Models? Paper • 2512.18832 • Published 17 days ago • 11
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 15 days ago • 16
Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation Paper • 2512.20352 • Published 15 days ago • 2
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing Paper • 2512.17909 • Published 19 days ago • 36
HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices Paper • 2512.14052 • Published 23 days ago • 40
Reveal Hidden Pitfalls and Navigate Next Generation of Vector Similarity Search from Task-Centric Views Paper • 2512.12980 • Published 24 days ago • 27
Comparative Analysis of LLM Abliteration Methods: A Cross-Architecture Evaluation Paper • 2512.13655 • Published 23 days ago • 2
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 24 days ago • 103
Fairy2i: Training Complex LLMs from Real LLMs with All Parameters in {pm 1, pm i} Paper • 2512.02901 • Published Dec 2, 2025 • 5
Causal Judge Evaluation: Calibrated Surrogate Metrics for LLM Systems Paper • 2512.11150 • Published 27 days ago • 5
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models Paper • 2512.07783 • Published about 1 month ago • 36
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published about 1 month ago • 57