
HASS Baseline Checkpoint (Used in PosS)

This repository provides a reproduced HASS model checkpoint that is used as a baseline in PosS (Position Specialist) experiments.

PosS is a speculative decoding method proposed in the paper:

PosS: Position Specialist Generates Better Draft for Speculative Decoding

In our experiments, this HASS checkpoint serves as the baseline draft model for comparison with the proposed position-specialized draft models.


🔗 Code

The full implementation of PosS, along with training details and evaluation scripts (including EAGLE-2 and HASS baselines), is available at:

👉 GitHub: https://github.com/shrango/PosS


📦 Files

If the model is not automatically downloaded by your framework, you may manually download the following files from this repository:

  • pytorch_model.bin: model weights
  • config.json: model configuration
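
As a minimal sketch of the manual download, the two files can be fetched with `hf_hub_download` from the `huggingface_hub` package. The repository ID below is taken from this page and is an assumption about where the files live; `fetch_checkpoint`, its `local_dir` default, and the `download` parameter are hypothetical names introduced only for illustration.

```python
# Assumed repository ID (taken from this model page) and the two files listed above.
REPO_ID = "HINT-lab/HASS-Llama3-8B-Instruct-Reproduce"
FILENAMES = ("pytorch_model.bin", "config.json")


def fetch_checkpoint(local_dir="hass_checkpoint", download=None):
    """Download each checkpoint file and return a dict of filename -> local path.

    `download` is a hypothetical hook for swapping in another downloader;
    by default it is huggingface_hub.hf_hub_download.
    """
    if download is None:
        # Imported lazily so the module can be loaded without huggingface_hub.
        from huggingface_hub import hf_hub_download
        download = hf_hub_download

    return {
        name: download(repo_id=REPO_ID, filename=name, local_dir=local_dir)
        for name in FILENAMES
    }
```

Calling `fetch_checkpoint()` should place both files under `hass_checkpoint/`; the resulting paths can then be passed to the PosS evaluation scripts in whatever way those scripts expect.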

📖 Citation

If you use this checkpoint in the context of PosS or refer to the PosS method, please cite:

@misc{huang2025posspositionspecialistgenerates,
  title        = {PosS: Position Specialist Generates Better Draft for Speculative Decoding},
  author       = {Langlin Huang and Chengsong Huang and Jixuan Leng and Di Huang and Jiaxin Huang},
  year         = {2025},
  eprint       = {2506.03566},
  archivePrefix= {arXiv},
  primaryClass = {cs.CL},
  url          = {https://arxiv.org/abs/2506.03566}
}
Collection including HINT-lab/HASS-Llama3-8B-Instruct-Reproduce