Bilingual LMs ( L1 {es fr de pl tr ar zh} + L2 en ) trained on Cultura-X for L1 and FineWebEdu (L2)
Suchir Salhan
suchirsalhan
AI & ML interests
Multilinguality and Cognitively-Inspired AI. Tokenization, Pretraining, Interpretability & Alignment.
Recent Activity
authored
a paper
3 days ago
BabyLM Turns 4: Call for Papers for the 2026 BabyLM Workshop updated
a collection
4 days ago
BabyLM 2026 published
a dataset 4 days ago
BabyLM-community/BabyLM-2026-Strict