nvidia/Nemotron-Post-Training-Dataset-v1
The SFT datasets for KORMo-10B were collected from diverse, publicly available sources.
Note: SFT datasets
English
- nvidia/Nemotron-Post-Training-Dataset-v1 (~2.8B)
- HuggingFaceTB/smoltalk2 (~259.5M)
Korean
- kormo-lm/reasoning_ko_filter_0710 (3.37B)
English & Korean
- kormo-lm/KORMo-SFT-datasets (~175M)
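
To get a feel for how the mix is balanced across languages, the sizes listed in the note above can be tallied with a short script. This is only a minimal sketch: the source does not state the unit behind the figures, so the numbers are treated as plain approximate counts, and the names `SFT_SOURCES`, `parse_size`, and `totals_by_language` are hypothetical helpers introduced here for illustration.

```python
# Approximate sizes copied from the note above.
# Assumption: the unit is not stated in the source, so these are plain counts.
SFT_SOURCES = {
    "English": {
        "nvidia/Nemotron-Post-Training-Dataset-v1": "2.8B",
        "HuggingFaceTB/smoltalk2": "259.5M",
    },
    "Korean": {
        "kormo-lm/reasoning_ko_filter_0710": "3.37B",
    },
    "English & Korean": {
        "kormo-lm/KORMo-SFT-datasets": "175M",
    },
}

# Multipliers for the size suffixes used in the note.
_SCALE = {"M": 1e6, "B": 1e9}


def parse_size(text: str) -> float:
    """Convert a string such as '259.5M' or '2.8B' into a plain number."""
    return float(text[:-1]) * _SCALE[text[-1]]


def totals_by_language(sources: dict) -> dict:
    """Sum the listed sizes for each language group."""
    return {
        language: sum(parse_size(size) for size in repos.values())
        for language, repos in sources.items()
    }


if __name__ == "__main__":
    per_language = totals_by_language(SFT_SOURCES)
    grand_total = sum(per_language.values())
    for language, total in per_language.items():
        share = total / grand_total
        print(f"{language}: {total / 1e9:.2f}B ({share:.1%})")
    print(f"Total: {grand_total / 1e9:.2f}B")
```

Because most of the listed figures are marked as approximate, the resulting shares are rough estimates rather than exact proportions.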