Fangyuan Yu's picture

Fangyuan Yu PRO

Ksgk-fy

·

fangyuan-ksgk

AI & ML interests

AGI

Recent Activity

upvoted a paper about 12 hours ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

updated a collection about 12 hours ago

Representation & Optimization

updated a model about 14 hours ago

Ksgk-fy/gpt2-xl-fineweb10B

View all activity

Organizations

upvoted a paper about 12 hours ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published 3 days ago • 64

upvoted a paper 1 day ago

I-Con: A Unifying Framework for Representation Learning

Paper • 2504.16929 • Published Apr 23, 2025 • 30

upvoted 2 papers 7 days ago

Pretraining Frame Preservation in Autoregressive Video Memory Compression

Paper • 2512.23851 • Published 10 days ago • 22

Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Paper • 2512.23988 • Published 10 days ago • 15

upvoted a paper 11 days ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published 21 days ago • 83

upvoted 2 papers 12 days ago

Latent Implicit Visual Reasoning

Paper • 2512.21218 • Published 15 days ago • 66

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published 26 days ago • 42

upvoted a paper 17 days ago

Bolmo: Byteifying the Next Generation of Language Models

Paper • 2512.15586 • Published 22 days ago • 14

upvoted 2 papers 2 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 221

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29, 2025 • 45

upvoted 8 papers 3 months ago

Memory Retrieval and Consolidation in Large Language Models through Function Tokens

Paper • 2510.08203 • Published Oct 9, 2025 • 9

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 271

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 184

The Three Regimes of Offline-to-Online Reinforcement Learning

Paper • 2510.01460 • Published Oct 1, 2025 • 1

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Paper • 2510.02263 • Published Oct 2, 2025 • 8

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26, 2025 • 41

Generalized Parallel Scaling with Interdependent Generations

Paper • 2510.01143 • Published Oct 1, 2025 • 5

Mem-α: Learning Memory Construction via Reinforcement Learning

Paper • 2509.25911 • Published Sep 30, 2025 • 14

upvoted 2 papers 4 months ago

Scalable Reinforcement Post-Training Beyond Static Human Prompts: Evolving Alignment via Asymmetric Self-Play

Paper • 2411.00062 • Published Oct 31, 2024 • 1

Distilled Pretraining: A modern lens of Data, In-Context Learning and Test-Time Scaling

Paper • 2509.01649 • Published Sep 1, 2025 • 2