Peter Szemraj PRO

pszemraj

501 367 1064

https://pszemraj.carrd.co/

AI & ML interests

metallic intuition

Recent Activity

published a model 2 days ago

pszemraj/Fara1.5-9B-Q6_K-GGUF

updated a model 2 days ago

pszemraj/Fara1.5-9B-Q6_K-GGUF

upvoted a paper about 19 hours ago

SLAI T-Rex: Full-Parameter Post-training of the DeepSeek-V4 Family on Ascend SuperPOD

Paper • 2607.20145 • Published 3 days ago • 56

upvoted a paper 2 days ago

Fara-1.5: Scalable Learning Environments for Computer Use Agents

Paper • 2606.20785 • Published Jun 18 • 3

upvoted a collection 2 days ago

Fara1.5

Collection

Collection of Fara1.5 CUA models in three sizes - 4B, 9B and 27B. • 3 items • Updated 2 days ago • 6

upvoted a paper 5 days ago

Perceiver IO: A General Architecture for Structured Inputs & Outputs

Paper • 2107.14795 • Published Jul 30, 2021 • 2

upvoted 2 articles 29 days ago

Article

Introducing the FFASR Leaderboard: Benchmarking ASR in the Real World

daniel-treble, whojavumusic, alessia-treble, georg-goetz, bezzam • Jun 24 • 9

Article

Which tokens does a hybrid model predict better?

allenai • 29 days ago • 8

upvoted a paper about 1 month ago

Freeing the Law with LOCUS: A Local Ordinance Corpus for the United States

Paper • 2606.19334 • Published Jun 17 • 8

upvoted 2 collections about 1 month ago

SWE-FastContext

Collection

A family of code-search models powering the Explore subagent for coding agents. (It will be made public later.) • 3 items • Updated 25 days ago • 18

Gemma 4 QAT Mobile

Collection

4 items • Updated 3 days ago • 46

upvoted 6 articles about 1 month ago

Article

MTEB Leaderboard: From a slow demo to feature-rich leaderboard

Samoed • Jun 12 • 22

Article

Unlocking asynchronicity in continuous batching

ror, pcuenq, ariG23498 • May 14 • 65

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato • Nov 25, 2025 • 423

Article

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

ariG23498, ror, sergiopaniego, pcuenq, sayakpaul • Jun 11 • 56

Article

Introducing North Mini Code: Cohere’s First Model For Developers

CohereLabs • Jun 9 • 83

Article

olmo-eval: An evaluation workbench for the model development loop

allenai • Jun 12 • 17

upvoted 5 papers about 2 months ago

Audio Interaction Model

Paper • 2606.05121 • Published Jun 3 • 121

NITP: Next Implicit Token Prediction for LLM Pre-training

Paper • 2605.24956 • Published May 24 • 36

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

Paper • 2605.28424 • Published May 27 • 32

LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training

Paper • 2605.29888 • Published May 28 • 34

LongDS-Bench: On the Failure of Long-Horizon Agentic Data Analysis

Paper • 2605.30434 • Published May 28 • 23
