AI & ML interests
None defined yet.
AdversarialRLHF/sffop_1706381144_410msft_relabel_pythia6.9b_logprobs_cond3emojiepropallprefix • 130k • 1
AdversarialRLHF/sffop_1706381144_410msft_relabel_pythia6.9b_logprobs_cond3emojieallprefix • 130k • 3
AdversarialRLHF/summarize_from_feedback_oai_preprocessing_1706381144_410msft_trained_allprefix • 300 • 3
AdversarialRLHF/summarize_from_feedback_oai_preprocessing_1706381144_410msft_trained • 300 • 4
AdversarialRLHF/summarize_from_feedback_tldr_3_filtered_oai_preprocessing_1706381144_allprefix • 130k • 1
AdversarialRLHF/sffop_1706381144_410msft_relabel_pythia6.9b_logprobs_cond3emojieboth • 130k • 3
AdversarialRLHF/sffop_1706381144_410msft_relabel_pythia6.9b_logprobs_cond3emojieprefix • 130k • 1
AdversarialRLHF/sffop_1706381144_410msft_relabel_pythia6.9b_logprobs_cond3emojiesuffix • 130k • 3
AdversarialRLHF/sffop_1706381144_410msft_relabel_pythia6.9b_3emojieprefix_randomize • 130k • 4
AdversarialRLHF/summarize_from_feedback_oai_preprocessing_1706381144_410msft_relabel_pythia6.9b_logprobs • 130k • 1
AdversarialRLHF/summarize_from_feedback_oai_preprocessing_1706381144_410msft_relabel_pythia6.9b • 130k • 1
AdversarialRLHF/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia6.9b_3emojieprefix_chosen • 177k • 1
AdversarialRLHF/summarize_from_feedback_oai_preprocessing_1706381144_410msft • 130k • 1