VidEoMT: Your ViT is Secretly Also a Video Segmentation Model Paper • 2602.17807 • Published Feb 19 • 7
Article How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism Feb 12 • 20
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published Dec 18, 2025 • 89
Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 • 68
Article Transformers v5: Simple model definitions powering the AI ecosystem Dec 1, 2025 • 310