AgentGym

Activity Feed

AI & ML interests

LLM Agent

Recent Activity

WooooDyy authored a paper 7 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

KYLN24 authored a paper 7 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

KYLN24 authored a paper 11 days ago

OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

View all activity

WooooDyy

authored a paper 7 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published 11 days ago • 63

KYLN24

authored a paper 7 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published 11 days ago • 63

KYLN24

authored a paper 11 days ago

OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

Paper • 2601.10343 • Published 12 days ago

KYLN24

authored a paper about 1 month ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 145

KYLN24

authored 3 papers about 2 months ago

Better Process Supervision with Bi-directional Rewarding Signals

Paper • 2503.04618 • Published Mar 6, 2025

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 57

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 84

WooooDyy

authored a paper about 2 months ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 80

KYLN24

authored a paper about 2 months ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 80

KYLN24

in AgentGym/AgentGym-RL-Data-ID 5 months ago

Upload webarena_train.json

#3 opened 5 months ago by

SixPlusSeven13

Add comprehensive dataset card for AgentGym-RL-Data-ID

#2 opened 5 months ago by

nielsr

KYLN24

published a dataset 5 months ago

AgentGym/AgentGym-RL-Data-ID

Viewer • Updated Sep 12, 2025 • 186k • 141 • 4

KYLN24

authored 2 papers 5 months ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7, 2025 • 39

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset

Paper • 2507.03483 • Published Jul 4, 2025 • 24

KYLN24

in AgentGym/AgentTraj-L 5 months ago

Update sciworld_train.json

#3 opened 5 months ago by

SixPlusSeven13

KYLN24

updated a dataset 5 months ago

AgentGym/AgentGym-RL-Data-ID

Viewer • Updated Sep 12, 2025 • 186k • 141 • 4

KevinChenwx

authored 2 papers 11 months ago

The Rise and Potential of Large Language Model Based Agents: A Survey

Paper • 2309.07864 • Published Sep 14, 2023 • 8

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Paper • 2402.05808 • Published Feb 8, 2024

KYLN24

authored 2 papers 11 months ago

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Paper • 2503.00784 • Published Mar 2, 2025 • 13

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Paper • 2402.05808 • Published Feb 8, 2024

AI & ML interests

Recent Activity

Team members 4

AgentGym's activity

Upload webarena_train.json

Add comprehensive dataset card for AgentGym-RL-Data-ID

Update sciworld_train.json