Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_16384_epoch_1 Text Generation • 4B • Updated about 1 month ago • 4
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_8192_epoch_1 Text Generation • 4B • Updated about 1 month ago • 5
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_4096_epoch_1 Text Generation • 4B • Updated Nov 25 • 5
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_16384_epoch_1 Text Generation • 4B • Updated Nov 25 • 15
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_4096_epoch_1 Text Generation • 4B • Updated Nov 25 • 69
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_8192_epoch_1 Text Generation • 4B • Updated Nov 25 • 14
Ujan/lts_DeepMath-103K_samples_10000_seq_16384_Qwen3-30B-A3B-Thinking-2507_22_23_24_0.8 Viewer • Updated 25 days ago • 11k • 12
Ujan/lts_pruned_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_17_18_19_0.5 Viewer • Updated 25 days ago • 11k • 14
Ujan/lts_pruned_processed_DeepMath-103K_samples_50000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated 26 days ago • 51k • 17
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.8 Viewer • Updated 27 days ago • 11k • 13
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated 27 days ago • 11k • 30