MoeReward/combined_preference_dataset_qwen2.5_sft_alpaca_heavy
Viewer
•
Updated
•
10k
MoeReward/combined_preference_dataset_qwen2.5_sft_qa_heavy
Viewer
•
Updated
•
9.23k
MoeReward/combined_preference_dataset_qwen2.5_sft_coding_heavy
Viewer
•
Updated
•
10k
MoeReward/combined_preference_dataset_qwen2.5_sft_math_heavy
Viewer
•
Updated
•
10k
•
1
MoeReward/combined_preference_dataset_qwen2.5_sft_equal_dist
Viewer
•
Updated
•
10k
•
1
MoeReward/combined_preference_dataset_qwen2.5_sft
Viewer
•
Updated
•
81.3k
MoeReward/combined_preference_dataset_olmoe_sft
Viewer
•
Updated
•
61.7k
MoeReward/combined_preference_dataset_olmoe_base
Viewer
•
Updated
•
66.7k
MoeReward/combined_preference_dataset_olmoe_base_alpaca_heavy
Viewer
•
Updated
•
10k
MoeReward/combined_preference_dataset_olmoe_base_qa_heavy
Viewer
•
Updated
•
9.23k
MoeReward/combined_preference_dataset_olmoe_base_coding_heavy
Viewer
•
Updated
•
9.92k
MoeReward/combined_preference_dataset_olmoe_base_math_heavy
Viewer
•
Updated
•
10k
MoeReward/combined_preference_dataset_olmoe_base_equal_dist
Viewer
•
Updated
•
10k
MoeReward/combined_preference_dataset_qwen1.5_base_alpaca_heavy
Viewer
•
Updated
•
10k
MoeReward/combined_preference_dataset_qwen1.5_base_qa_heavy
Viewer
•
Updated
•
9.23k
MoeReward/combined_preference_dataset_qwen1.5_base_coding_heavy
Viewer
•
Updated
•
10k
MoeReward/combined_preference_dataset_qwen1.5_base_math_heavy
Viewer
•
Updated
•
10k
MoeReward/combined_preference_dataset_qwen1.5_base_equal_dist
Preview
•
Updated
MoeReward/combined_preference_dataset_qwen1.5_base
Viewer
•
Updated
•
61.9k
MoeReward/combined_preference_dataset_qwen
Viewer
•
Updated
•
50k
MoeReward/combined_preference_dataset_olmoe
Viewer
•
Updated
•
56.6k
MoeReward/combined_sft_dataset
Viewer
•
Updated
•
115k
MoeReward/combined_preference_dataset
Viewer
•
Updated
•
52k
MoeReward/combined_rlhf_dataset
Viewer
•
Updated
•
125k
•
1