ByteDance Seed

company

https://opensource.bytedance.com

AI & ML interests

None defined yet.

Recent Activity

wondervictor authored a paper 1 day ago

Mixture-of-Depths Attention

Facico new activity 3 days ago

ByteDance-Seed/Stable-DiffCoder-8B-Instruct:fix bug for generaion(empty response and PyTorch exception)

Aboriginer new activity 5 days ago

ByteDance-Seed/ReSA:Add links to paper and project page

View all activity

Papers

Mixture-of-Depths Attention

Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

View all Papers

authored a paper 1 day ago

Mixture-of-Depths Attention

Paper • 2603.15619 • Published 1 day ago • 57

in ByteDance-Seed/Stable-DiffCoder-8B-Instruct 3 days ago

fix bug for generaion(empty response and PyTorch exception)

#8 opened 3 days ago by

in ByteDance-Seed/ReSA 5 days ago

Add links to paper and project page

#2 opened 9 days ago by

submitted a paper to Daily Papers 5 days ago

Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

Paper • 2603.11103 • Published 7 days ago • 8

in ByteDance-Seed/Stable-DiffCoder-8B-Base 9 days ago

Update to adapt transformers v5.3.0

#1 opened 9 days ago by

in ByteDance-Seed/Stable-DiffCoder-8B-Instruct 9 days ago

Update to adapt transformers v5.3.0

#5 opened 9 days ago by

authored 4 papers 9 days ago

AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning

Paper • 2510.06261 • Published Oct 5, 2025 • 6

Reasoned Safety Alignment: Ensuring Jailbreak Defense via Answer-Then-Check

Paper • 2509.11629 • Published Sep 15, 2025 • 1

Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection

Paper • 2406.00806 • Published Jun 2, 2024

Noisy Test-Time Adaptation in Vision-Language Models

Paper • 2502.14604 • Published Feb 20, 2025

updated a dataset 9 days ago

ByteDance-Seed/SCFbench

Updated 9 days ago • 182 • 5

in ByteDance-Seed/Stable-DiffCoder-8B-Instruct 23 days ago

Issues running your model in LM Studio

#2 opened about 1 month ago by

submitted a paper to Daily Papers about 1 month ago

VideoWorld 2: Learning Transferable Knowledge from Real-world Videos

Paper • 2602.10102 • Published Feb 10 • 14

submitted 3 papers to Daily Papers about 1 month ago

Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better

Paper • 2602.05393 • Published Feb 5 • 8

BABE: Biology Arena BEnchmark

Paper • 2602.05857 • Published Feb 5 • 10

Protein Autoregressive Modeling via Multiscale Structure Generation

Paper • 2602.04883 • Published Feb 4 • 3

updated a dataset about 1 month ago

ByteDance-Seed/ReSA

Viewer • Updated 5 days ago • 79.7k • 86 • 3

authored a paper about 1 month ago

LoopViT: Scaling Visual ARC with Looped Transformers

Paper • 2602.02156 • Published Feb 2 • 12

submitted a paper to Daily Papers about 1 month ago

SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning

Paper • 2602.02472 • Published Feb 2 • 46

submitted a paper to Daily Papers about 2 months ago

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published Jan 29 • 42