blizzard-neel's Collections: papers
Meta-Learning a Dynamical Language Model
Paper
• 1803.10631
• Published
• 1
TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation
Paper
• 2003.11963
• Published
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model
Paper
• 2212.04960
• Published
• 1
Continuous Learning in a Hierarchical Multiscale Neural Network
Paper
• 1805.05758
• Published
• 2
HuggingFace's Transformers: State-of-the-art Natural Language Processing
Paper
• 1910.03771
• Published
• 21
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
Paper
• 2210.01970
• Published
• 13
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
Paper
• 1901.08149
• Published
• 3
Datasets: A Community Library for Natural Language Processing
Paper
• 2109.02846
• Published
• 14
Large Language Models Can Self-Improve in Long-context Reasoning
Paper
• 2411.08147
• Published
• 65
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
Paper
• 2203.05482
• Published
• 7
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Paper
• 2410.19168
• Published
• 24
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
Paper
• 2409.02813
• Published
• 33
JuStRank: Benchmarking LLM Judges for System Ranking
Paper
• 2412.09569
• Published
• 20
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation
Paper
• 2412.11919
• Published
• 36
Are Your LLMs Capable of Stable Reasoning?
Paper
• 2412.13147
• Published
• 93
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Paper
• 2412.19723
• Published
• 87
Large Language Model-Brained GUI Agents: A Survey
Paper
• 2411.18279
• Published
• 30
Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation
Paper
• 2412.18176
• Published
• 16
Token-Budget-Aware LLM Reasoning
Paper
• 2412.18547
• Published
• 46
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Paper
• 2412.18319
• Published
• 39
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
Paper
• 2412.14922
• Published
• 88
Learning to Reason via Self-Iterative Process Feedback for Small Language Models
Paper
• 2412.08393
• Published
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
Paper
• 2501.01245
• Published
• 5
Xmodel-2 Technical Report
Paper
• 2412.19638
• Published
• 27
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Paper
• 2412.21187
• Published
• 40
Executable Code Actions Elicit Better LLM Agents
Paper
• 2402.01030
• Published
• 188
Advancing LLM Reasoning Generalists with Preference Trees
Paper
• 2404.02078
• Published
• 46
Training Software Engineering Agents and Verifiers with SWE-Gym
Paper
• 2412.21139
• Published
• 25
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Paper
• 2405.20974
• Published
Resolving Interference When Merging Models
Paper
• 2306.01708
• Published
• 17
Editing Models with Task Arithmetic
Paper
• 2212.04089
• Published
• 7
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
Paper
• 2311.03099
• Published
• 30