Ayush Mangal

ayushtues

·

AI & ML interests

None yet

Organizations

upvoted 2 articles over 1 year ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

natolambert, LouisCastricato, lvwerra, Dahoas

•

Dec 9, 2022

• 419

Article

The N Implementation Details of RLHF with PPO

+1

vwxyzjn, tianlinliu0121, lvwerra

•

Oct 24, 2023

• 72

upvoted a paper almost 3 years ago

InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Paper • 2309.06380 • Published Sep 12, 2023 • 34