Massachusetts Institute of Technology

university

Verified

https://www.mit.edu/

AI & ML interests

None defined yet.

Recent Activity

ahn1376 authored a paper about 1 month ago

Optimize Any Topology: A Foundation Model for Shape- and Resolution-Free Structural Topology Optimization

ahn1376 authored a paper about 1 month ago

BIKED++: A Multimodal Dataset of 1.4 Million Bicycle Image and Parametric CAD Designs

ahn1376 authored a paper about 1 month ago

CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation

View all activity

Papers

What does it mean to understand language?

Back to Basics: Let Denoising Generative Models Denoise

View all Papers

frankzydou

authored a paper 3 months ago

PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation

Paper • 2509.20358 • Published Sep 24 • 14

halleewong

authored a paper 3 months ago

MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance

Paper • 2412.15058 • Published Dec 19, 2024

voidism

authored a paper 4 months ago

MetaCLIP 2: A Worldwide Scaling Recipe

Paper • 2507.22062 • Published Jul 29 • 36

dczhao

authored a paper 5 months ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 122

frankzydou

authored 2 papers 5 months ago

MOSPA: Human Motion Generation Driven by Spatial Audio

Paper • 2507.11949 • Published Jul 16 • 24

Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Paper • 2507.07095 • Published Jul 9 • 54

voidism

authored a paper 6 months ago

Don't "Overthink" Passage Reranking: Is Reasoning Truly Necessary?

Paper • 2505.16886 • Published May 22 • 6

frankzydou

authored 6 papers 6 months ago

Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

Paper • 2412.03079 • Published Dec 4, 2024 • 2

ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking

Paper • 2501.03220 • Published Jan 6 • 4

SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation

Paper • 2411.19921 • Published Nov 29, 2024

MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow

Paper • 2502.11697 • Published Feb 17

LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment

Paper • 2403.13307 • Published Mar 20, 2024

CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects

Paper • 2505.21437 • Published May 27 • 21

luisnaka

authored 2 papers 7 months ago

Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs

Paper • 2412.14304 • Published Dec 18, 2024 • 1

WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation

Paper • 2410.12722 • Published Oct 16, 2024 • 5

frankzydou

authored a paper 8 months ago

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization

Paper • 2503.19901 • Published Mar 25 • 41

nielsr

in MIT/ast-finetuned-audioset-10-10-0.448 10 months ago

Adding `safetensors` variant of this model

#1 opened 10 months ago by

nielsr

in MIT/ast-finetuned-audioset-12-12-0.447 10 months ago

Adding `safetensors` variant of this model

#1 opened 10 months ago by

frankzydou

authored a paper 11 months ago

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Paper • 2501.03847 • Published Jan 7 • 23

nielsr

in MIT/ast-finetuned-audioset-10-10-0.448-v2 12 months ago

Adding `safetensors` variant of this model

#1 opened about 1 year ago by