make-a-audio

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

akhaliq submitted a paper 8 days ago

V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

akhaliq submitted a paper 11 days ago

Multimodal OCR: Parse Anything from Documents

akhaliq submitted a paper about 2 months ago

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

View all activity

submitted a paper to Daily Papers 8 days ago

V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

Paper • 2603.16792 • Published 9 days ago • 3

submitted a paper to Daily Papers 11 days ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published 13 days ago • 35

submitted 3 papers to Daily Papers about 2 months ago

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

Paper • 2602.04811 • Published Feb 4 • 2

Visual Personalization Turing Test

Paper • 2601.22680 • Published Jan 30 • 2

Causal World Modeling for Robot Control

Paper • 2601.21998 • Published Jan 29 • 30

submitted 3 papers to Daily Papers 2 months ago

Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

Paper • 2601.14253 • Published Jan 20 • 10

V-DPM: 4D Video Reconstruction with Dynamic Point Maps

Paper • 2601.09499 • Published Jan 14 • 9

UM-Text: A Unified Multimodal Model for Image Understanding

Paper • 2601.08321 • Published Jan 13 • 11

submitted 4 papers to Daily Papers 3 months ago

ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Paper • 2601.03955 • Published Jan 7 • 3

FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation

Paper • 2512.24724 • Published Dec 31, 2025 • 8

Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow

Paper • 2512.24766 • Published Dec 31, 2025 • 9

What matters for Representation Alignment: Global Information or Spatial Structure?

Paper • 2512.10794 • Published Dec 11, 2025 • 9

submitted 2 papers to Daily Papers 4 months ago

Towards a Science of Scaling Agent Systems

Paper • 2512.08296 • Published Dec 9, 2025 • 16

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 22

authored a paper 5 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

posted an update over 1 year ago

Post

53479

Google drops Gemini 2.0 Flash Thinking

a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more

now available in anychat, try it out: https://huggingface.co/spaces/akhaliq/anychat

5 replies

·

posted an update over 1 year ago

Post

52492

QwQ-32B-Preview is now available in anychat

A reasoning model that is competitive with OpenAI o1-mini and o1-preview

try it out: https://huggingface.co/spaces/akhaliq/anychat

2 replies

·

posted an update over 1 year ago

Post

5114

New model drop in anychat

allenai/Llama-3.1-Tulu-3-8B is now available

try it here: https://huggingface.co/spaces/akhaliq/anychat

posted an update over 1 year ago

Post

3856

anychat

supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app

try it out there: https://huggingface.co/spaces/akhaliq/anychat

authored a paper over 1 year ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50