Building on HF

4 11 75

Sumit Yadav

rockerritesh

https://sumityadav.com.np

AI & ML interests

AI(GAN) || LLM RAG

Recent Activity

updated a Space 1 day ago

rockerritesh/memory-dashboard

updated a Space 3 days ago

rockerritesh/sumit-server

published a dataset 4 days ago

Maithili-Computational-Linguistics-Lab/maithili_poem

View all activity

Organizations

upvoted 2 papers 2 months ago

Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics

Paper • 2602.02343 • Published Feb 2 • 13

On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

Paper • 2602.00130 • Published Jan 28 • 3

upvoted 2 papers 6 months ago

Can maiBERT Speak for Maithili?

Paper • 2509.15048 • Published Sep 18, 2025 • 1

LongCat-Video Technical Report

Paper • 2510.22200 • Published Oct 25, 2025 • 34

upvoted a collection about 1 year ago

Cogito v1 Preview

Collection

5 items • Updated Apr 8, 2025 • 119

upvoted a paper about 1 year ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 207

upvoted 3 collections about 1 year ago

upvoted 2 articles about 1 year ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12, 2025

•

494