Chess GPT - Prof's Architecture

Params: 998,656 Vocab: 1604 (TOP_K=2000) Dataset: 1M samples x 5 epochs

Config:

  • n_embd: 128
  • n_layer: 4
  • n_head: 4
  • LR: 5e-4
  • UNK rate: 25.7%

Target: 60-70% legal rate

Downloads last month
12
Safetensors
Model size
999k params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support