Indian AI Developers

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

nauman-tih authored a paper 28 days ago

BhashaBench V1: A Comprehensive Benchmark for the Quadrant of Indic Domains

nauman-tih authored a paper 28 days ago

AyurParam: A State-of-the-Art Bilingual Language Model for Ayurveda

kalashshah19 new activity about 1 month ago

IndianAIDevs/README:Welcomes and Greetings

View all activity

KingNish

posted an update about 9 hours ago

Post

353

Muon vs MuonClip vs Muon+Adamw

Muon has gone from an experiment to a mainstream optimizer, but does it hold up for fine‑tuning? We ran head‑to‑head tests on Qwen3‑4B (10k+ high‑quality instruction rows) to find out.

Short story: Pure Muon converged fastest at the start, but its gradient‑norm spikes made training unstable. MuonClip (Kimi K2’s clipping) stabilizes long pretraining runs, yet in our small‑scale fine‑tune it underperformed, lower token accuracy and slower convergence. The winner was the hybrid: Muon for 2D layers + AdamW for 1D layers. It delivered the best balance of stability and final performance and even beat vanilla AdamW.

Takeaway: for small-scale fine-tuning, hybrid = practical and reliable.

Next Step: scale to larger models/datasets to see if Muon’s spikes become catastrophic or if clipping wins out.

Full Blog Link: https://huggingface.co/blog/KingNish/optimizer-part1

KingNish

posted an update 2 days ago

Post

2322

I tested Muon vs MuonClip vs Muon+AdamW for fine-tuning LLMs
Just published a blog on that, Read here 👉 https://huggingface.co/blog/KingNish/optimizer-part1

1 reply

Parveshiiii

posted an update 23 days ago

Post

1609

Another banger from XenArcAI! 🔥

We’re thrilled to unveil three powerful new releases that push the boundaries of AI research and development:

🔗 XenArcAI/SparkEmbedding-300m

- A lightning-fast embedding model built for scale.
- Optimized for semantic search, clustering, and representation learning.

🔗 XenArcAI/CodeX-7M-Non-Thinking

- A massive dataset of 7 million code samples.
- Designed for training models on raw coding patterns without reasoning layers.

🔗 XenArcAI/CodeX-2M-Thinking

- A curated dataset of 2 million code samples.
- Focused on reasoning-driven coding tasks, enabling smarter AI coding assistants.

Together, these projects represent a leap forward in building smarter, faster, and more capable AI systems.

💡 Innovation meets dedication.
🌍 Knowledge meets responsibility.

nauman-tih

authored 2 papers 28 days ago

BhashaBench V1: A Comprehensive Benchmark for the Quadrant of Indic Domains

Paper • 2510.25409 • Published Oct 29 • 3

AyurParam: A State-of-the-Art Bilingual Language Model for Ayurveda

Paper • 2511.02374 • Published Nov 4 • 3

Parveshiiii

posted an update about 1 month ago

Post

3023

SparkEmbedding - SoTA cross lingual retrieval

Iam very happy to announce our latest embedding model sparkembedding-300m base on embeddinggemma-300m we fine tuned it on 1m extra examples spanning over 119 languages and result is this model achieves exceptional cross lingual retrieval

Model: XenArcAI/SparkEmbedding-300m

kalashshah19

in IndianAIDevs/README about 1 month ago

Welcomes and Greetings

#2 opened 4 months ago by

kalashshah19

Let's Talk about AI

#1 opened 4 months ago by

kalashshah19

Neural-Hacker

in IndianAIDevs/README about 2 months ago

Let's Talk about AI

#1 opened 4 months ago by

kalashshah19

in IndianAIDevs/README about 2 months ago

General

#5 opened 2 months ago by

kalashshah19

Abhaykoul

in IndianAIDevs/README about 2 months ago

General

#5 opened 2 months ago by

kalashshah19

Neural-Hacker

in IndianAIDevs/README about 2 months ago

General

#5 opened 2 months ago by

kalashshah19

Parveshiiii

posted an update about 2 months ago

Post

200

AIRealNet - SoTA - Image detection model

We’re proud to release AIRealNet — a binary image classifier built to detect whether an image is AI-generated or a real human photograph. Based on SwinV2 and fine-tuned on the AI-vs-Real dataset, this model is optimized for high-accuracy classification across diverse visual domains.

If you care about synthetic media detection or want to explore the frontier of AI vs human realism, we’d love your support. Please like the model and try it out. Every download helps us improve and expand future versions.

Model page: XenArcAI/AIRealNet

JDhruv14

in IndianAIDevs/README about 2 months ago

General

#5 opened 2 months ago by

kalashshah19

JDhruv14

in IndianAIDevs/README 2 months ago

Let's Talk about AI

#1 opened 4 months ago by

kalashshah19

Parveshiiii

posted an update 2 months ago

Post

4486

Ever wanted an open‑source deep research agent? Meet Deepresearch‑Agent 🔍🤖

1. Multi‑step reasoning: Reflects between steps, fills gaps, iterates until evidence is solid.

2. Research‑augmented: Generates queries, searches, synthesizes, and cites sources.

3. Fullstack + LLM‑friendly: React/Tailwind frontend, LangGraph/FastAPI backend; works with OpenAI/Gemini.

🔗 GitHub: https://github.com/Parveshiiii/Deepresearch-Agent

shashank23088

in IndianAIDevs/README 2 months ago

General

#5 opened 2 months ago by

kalashshah19

Parveshiiii

in IndianAIDevs/README 2 months ago

Let's Talk about AI

#1 opened 4 months ago by

kalashshah19

Welcomes and Greetings

#2 opened 4 months ago by

kalashshah19

Parveshiiii

posted an update 2 months ago

Post

3103

🚀 Big news from XenArcAI!

We’ve just released our new dataset: **Bhagwat‑Gita‑Infinity** 🌸📖

✨ What’s inside:
- Verse‑aligned Sanskrit, Hindi, and English
- Clean, structured, and ready for ML/AI projects
- Perfect for research, education, and open‑source exploration

🔗 Hugging Face: XenArcAI/Bhagwat-Gita-Infinity

Let’s bring timeless wisdom into modern AI together 🙌

AI & ML interests

Recent Activity

Team members 41

IndianAIDevs's activity

Welcomes and Greetings

Let's Talk about AI

Let's Talk about AI

General

General

General

General

Let's Talk about AI

General

Let's Talk about AI

Welcomes and Greetings