Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Gen-PRM

university
https://ricardokevins.github.io/
Activity Feed

AI & ML interests

None defined yet.

Shuaijie She's profile picture

kevinpro 
authored 5 papers 4 months ago

Question Translation Training for Better Multilingual Reasoning

Paper • 2401.07817 • Published Jan 15, 2024 • 1

R-PRM: Reasoning-Driven Process Reward Modeling

Paper • 2503.21295 • Published Mar 27

How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

Paper • 2505.21505 • Published May 27 • 18

Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters

Paper • 2507.13618 • Published Jul 18 • 16

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20 • 85
kevinpro 
authored a paper almost 2 years ago

MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization

Paper • 2401.06838 • Published Jan 12, 2024
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs