Center for Language and Speech Processing @ JHU

university

https://www.clsp.jhu.edu/

Recent Activity

mmarone updated a collection 5 days ago

mmBERT: a modern multilingual encoder

TaiMingLu authored a paper 9 days ago

Strong Teacher Not Needed? On Distillation in LLM Pretraining

TaiMingLu authored a paper 9 days ago

i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models

Papers

DAR: Deontic Reasoning with Agentic Harnesses

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Collections 3

Spaces 1

Science Hierarchography

Explore academic paper hierarchies and details

Models 53

jhu-clsp/mmBERT-small

Fill-Mask • Updated Oct 17, 2025 • 29.9k • 76

jhu-clsp/mmBERT-base

Fill-Mask • Updated Oct 7, 2025 • 342k • 214

jhu-clsp/mmBERT-checkpoints

Updated Sep 9, 2025 • 4

jhu-clsp/ettin-decoder-1b

Fill-Mask • Updated Jul 21, 2025 • 16 • 5

jhu-clsp/ettin-decoder-32m

Text Generation • Updated Jul 18, 2025 • 398

jhu-clsp/ettin-encoder-1b

Feature Extraction • Updated Jul 18, 2025 • 1.94k • 22

jhu-clsp/ettin-encoder-68m

Fill-Mask • Updated Jul 18, 2025 • 3.54k • 5

jhu-clsp/ettin-dec-from-enc-32m

Text Generation • Updated Jul 18, 2025 • 4

jhu-clsp/ettin-encoder-150m

Fill-Mask • Updated Jul 18, 2025 • 6.38k • 13

jhu-clsp/ettin-decoder-400m

Text Generation • Updated Jul 18, 2025 • 6.97k • 4

Datasets 40

jhu-clsp/ManyIH-Bench

Preview • Updated Apr 13 • 49 • 3

jhu-clsp/robust04-instructions

Viewer • Updated Mar 12 • 136k • 816 • 2

jhu-clsp/core17-instructions

Viewer • Updated Mar 12 • 49.4k • 707 • 2

jhu-clsp/news21-instructions

Viewer • Updated Mar 12 • 71.5k • 652 • 1

jhu-clsp/SciTaRC

Viewer • Updated Mar 6 • 371 • 48 • 1

jhu-clsp/megawika-2

Updated Mar 3 • 51 • 4

jhu-clsp/mmBERT-decay-data

Updated Dec 11, 2025 • 33k • 6

jhu-clsp/mmBERT-midtraining-data

Updated Oct 13, 2025 • 1.95k • 1

jhu-clsp/ettin-pretraining-data

Updated Jul 18, 2025 • 58.8k • 9

jhu-clsp/ettin-decay-data

Updated Jul 18, 2025 • 1.24k • 1
