How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="kshitijthakkar/qwen3.5-tiny-test")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)
# Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM

processor = AutoProcessor.from_pretrained("kshitijthakkar/qwen3.5-tiny-test")
model = AutoModelForMultimodalLM.from_pretrained("kshitijthakkar/qwen3.5-tiny-test")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

Qwen3.5 Tiny Test Model

A tiny Qwen3.5 hybrid MoE model for testing and validation purposes.

This model has random weights and is not trained. It exists to validate the architecture implementation and hub upload pipeline.

Architecture

  • Type: Hybrid MoE (Gated DeltaNet + Gated Attention)
  • Parameters: 138,261,536 total
  • Layers: 8 (6 DeltaNet + 2 Full Attention)
  • Experts: 8 routed (top-2) + 1 shared
  • Embedding dim: 256
  • Vocab size: 248,320
  • Context: 4096 tokens
Downloads last month
4
Safetensors
Model size
0.1B params
Tensor type
F32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including kshitijthakkar/qwen3.5-tiny-test