LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement • arXiv:2504.16053 • Published Apr 22, 2025
Minifinetuning: Low-Data Generation Domain Adaptation through Corrective Self-Distillation • arXiv:2506.15702 • Published May 30, 2025
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training • arXiv:2507.12507 • Published Jul 16, 2025
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model • arXiv:2508.14444 • Published Aug 20, 2025
FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting • arXiv:2506.04174 • Published Jun 4, 2025
Universal Deep Research: Bring Your Own Model and Strategy • arXiv:2509.00244 • Published Aug 29, 2025
BroRL: Scaling Reinforcement Learning via Broadened Exploration • arXiv:2510.01180 • Published Oct 1, 2025
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models • arXiv:2507.14204 • Published Jul 14, 2025
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM • arXiv:2510.15870 • Published Oct 17, 2025
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning • arXiv:2510.15110 • Published Oct 16, 2025
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge • arXiv:2510.18941 • Published Oct 21, 2025
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs • arXiv:2511.16664 • Published Nov 20, 2025
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models • arXiv:2511.18890 • Published Nov 24, 2025