BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 14
Chitrarth: Bridging Vision and Language for a Billion People Paper • 2502.15392 • Published Feb 21, 2025
LitLLMs, LLMs for Literature Review: Are we there yet? Paper • 2412.15249 • Published Dec 15, 2024 • 2
IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs Paper • 2511.04727 • Published Nov 6, 2025
VoiceAgentBench: Are Voice Assistants ready for agentic tasks? Paper • 2510.07978 • Published Oct 9, 2025
Seeing Straight: Document Orientation Detection for Efficient OCR Paper • 2511.04161 • Published Nov 6, 2025
Designing Production-Scale OCR for India: Multilingual and Domain-Specific Systems Paper • 2602.16430 • Published Feb 18
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation Paper • 2502.20420 • Published Feb 27, 2025
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Paper • 2603.24440 • Published Mar 25 • 98
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Paper • 2603.13594 • Published Mar 13 • 148
BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages Paper • 2511.10338 • Published Nov 13, 2025
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3, 2025 • 39
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation Paper • 2407.06423 • Published Jul 8, 2024
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction Paper • 2503.15661 • Published Mar 19, 2025 • 3