MTP-LM - a tomg-group-umd Collection

tomg-group-umd 's Collections

Retrofitting Recurrence

Refusal Token Models

Efficient_inference_models

Gemstone Models

Recurrent Models

Style Descriptors

Goldfish Loss: Mitigating Memorization in LLMs

Zero-Shot Grafting

MTP-LM

updated 4 days ago

Models to accompany research paper on training multi token prediction language models using self-distillation.