Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

AceCoder

community
https://jdf-prog.github.io/
DongfuJiang
jdf-prog
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

chiruan  updated a model about 1 hour ago
CodeDPO/filtered_original_acecoderv3
chiruan  updated a model about 1 hour ago
CodeDPO/filtered_original_acecoderv3
DongfuJiang  authored a paper 3 months ago
Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning
View all activity

Dongfu Jiang's profile picture Wyett's profile picture Haozhe Wang's profile picture chiruan's profile picture

CodeDPO 's models 42

CodeDPO/qwen25-ins-7b-coderm_new_margin_scalebt-7b-reinforce-plus-episode_1

Text Generation • 8B • Updated Jan 28 • 5

CodeDPO/qwen25-coder-base-7b-testcaserm-7b-new-dataset-hard

8B • Updated Jan 27 • 6

CodeDPO/Qwen2.5-Coder-7B-binarized

7B • Updated Jan 26 • 8

CodeDPO/Qwen2.5-Coder-7B-new_with_margin_scalebt

7B • Updated Jan 26 • 3

CodeDPO/Qwen2.5-Coder-7B_with_margin_scalebt

7B • Updated Jan 26 • 6

CodeDPO/qwen25-coder-base-7b-testcaserm-7b-ppo-binary

8B • Updated Jan 26 • 7

CodeDPO/qwen25-ins-7b-testcaserm-7b-reinforce-plus_new_dataset

8B • Updated Jan 26 • 5

CodeDPO/qwen25-ins-7b-testcaserm-7b-reinforce-plus-binary

8B • Updated Jan 26 • 4

CodeDPO/qwen25-ins-7b-coderm-7b-ppo

8B • Updated Jan 26 • 4

CodeDPO/qwen25-ins-7b-testcaserm-7b-reinforce-plus

8B • Updated Jan 25 • 2

CodeDPO/qwen_coder_2.5_rm_openrlhf

7B • Updated Jan 23 • 7

CodeDPO/llama3-RL-both-E2-0117-ckpt1624

8B • Updated Jan 22 • 5
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs