Shreshth

shreshthsaini

https://shreshthsaini.github.io/

AI & ML interests

computer vision, video processing, ML, DL, MLOps

Recent Activity

liked 2 models 2 days ago

arcee-ai/Trinity-Large-Preview

Text Generation • 399B • Updated 2 days ago • 43 • 75

moonshotai/Kimi-K2.5

Image-Text-to-Text • Updated about 17 hours ago • 21.4k • 1.09k

liked a model 7 days ago

ariG23498/moe-routing-algorithm

Updated 8 days ago • 3

liked a model 15 days ago

kernels-community/flash-attn3

Updated 8 days ago • 166k • 33

liked a dataset 21 days ago

facebook/research-plan-gen

Viewer • Updated 27 days ago • 22.5k • 3.37k • 295

liked 3 models 22 days ago

liked a model 3 months ago

W2GenAI/LucidFlux

Image-to-Image • Updated Oct 28, 2025 • 26

liked a Space 3 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

Improve model performance by transferring knowledge between different model families

liked a Space 4 months ago

The Ultra-Scale Playbook

🌌

3.67k

The ultimate guide to training LLM on large GPU Clusters

liked a model 4 months ago

Wan-AI/Wan2.2-Animate-14B

Video-to-Video • Updated Nov 5, 2025 • 65.9k • 1k

liked 3 models 5 months ago

Qwen/Qwen-Image-Edit

Image-to-Image • Updated Aug 25, 2025 • 43.8k • 2.29k

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • 685B • Updated Aug 26, 2025 • 13k • 1.01k

deepseek-ai/DeepSeek-V3.1

Text Generation • 685B • Updated Sep 5, 2025 • 72.2k • 813

liked 3 models 6 months ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 3.17M • 1.44k

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 2.88M • 4.41k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 6.43M • 4.27k

liked a model 7 months ago

Qwen/Qwen3-8B

Text Generation • 8B • Updated Jul 26, 2025 • 4.22M • 883

liked a model 8 months ago

ByteDance-Seed/BAGEL-7B-MoT

Any-to-Any • 15B • Updated 21 days ago • 590 • 1.18k