Aryan L.L. Horizon

aarmn

http://aarmn.com

AI & ML interests

Integration of Graph and VectorDB with Transformers, Asymmetric Train/Inference Systems, Transfer Learning, Quantization, Edge-user applications

Recent Activity

liked a model 26 days ago

nvidia/Nemotron-Labs-Diffusion-14B

liked a model 26 days ago

nvidia/Nemotron-Labs-Diffusion-3B

liked a model about 1 month ago

appvoid/palmer-005-nano

upvoted a paper 6 months ago

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30, 2025 • 72

upvoted 8 collections 6 months ago

Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability

A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated Dec 16, 2025 • 24

🎯 Liquid Nanos

Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 34 items • Updated 3 days ago • 117

R-HORIZON

The training and evaluation datasets for Paper "How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?" • 6 items • Updated Oct 22, 2025 • 10

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 170

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 149

NextCoder

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9, 2025 • 79

Qwen3-VL

37 items • Updated Dec 31, 2025 • 745

Qwen3

84 items • Updated Dec 31, 2025 • 1.81k

upvoted a paper about 2 years ago

SUTRA: Scalable Multilingual Language Model Architecture

Paper • 2405.06694 • Published May 7, 2024 • 38