boris

boris-50M-superlight-cubscout

boris-50M-superlight-cubscout (Boris) is a lightweight, ~50M parameter text generation model trained entirely on the roneneldan/TinyStories dataset.

It was developed entirely on one NVIDIA RTX 3060 in ~2 hours. Boris's primary use case is generating bad children's short stories.


Traning Details:

  • Trained on TinyStories (1000 steps)
  • Trained using one NVIDIA RTX 3060 (12GB VRAM)
  • Precision: FP16
  • Final Traning Loss: ~1.76

Advice:

  1. Set max tokens to ~50-100.
  2. This is a base model, and does not know how to stop. Add stop sequences like "the end." or ###

Evaluation Results:

Final Training Loss: ~1.76 TinyStories (Train)

Perplexity (PPL): 8.52 TinyStories (Validation)


Copyright & License:

Copyright 2026 Joseph Jones

This project and all associated files (the "Work") are licensed under the Apache License, Version 2.0 (the "License"); you may not use this project except in compliance with the License. You may obtain a copy of the License at:

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Downloads last month
73
Safetensors
Model size
58.1M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train KlondikeDev/boris-50M-superlight-cubscout

Space using KlondikeDev/boris-50M-superlight-cubscout 1