113 GB

Ctrl+K

1 contributor

Fold RMS norm weight into encoder weight, changing hook point from attn/mlp in to ln1.hook_normalized/ln2.hook_normalized (aligned with TransformerLens hook points)

ebc8c39 verified about 2 months ago

32x
Fold RMS norm weight into encoder weight, changing hook point from attn/mlp in to ln1.hook_normalized/ln2.hook_normalized (aligned with TransformerLens hook points) 2 months ago
8x
Fold RMS norm weight into encoder weight, changing hook point from attn/mlp in to ln1.hook_normalized/ln2.hook_normalized (aligned with TransformerLens hook points) about 2 months ago