Can't run Janus with HuggingFaceEndpoint

I’m trying to integrate a Hugging Face endpoint into my LangChain project using the HuggingFaceEndpoint class provided by LangChain.

from langchain_huggingface import HuggingFaceEndpoint

llm = HuggingFaceEndpoint(
    repo_id="deepseek-ai/Janus-Pro-7B",
    task="image-text-to-text",
    max_new_tokens=512,
    do_sample=False,
    repetition_penalty=1.03,
)

But it returns a 500 Server Error with the message unknown variant ‘any-to-any’. I’ve tried task values like ‘image-text-to-text’ and ‘image-to-text’, and they both return the same error.


It’s not just Janus-Pro-7B: other VLMs like Qwen2.5-VL-7B-Instruct and glm-4v-9b also fail. I suspect HuggingFaceEndpoint cannot run VLMs at all.


Oh…

This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The HF Inference API does not support any-to-any models for transformers library.
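You can see why every task value fails by checking the model’s pipeline tag with huggingface_hub, a minimal sketch:

from huggingface_hub import model_info

# Janus-Pro-7B is tagged "any-to-any" on the Hub, a task the
# HF Inference API does not serve for transformers models, so
# overriding `task` on the client side cannot help.
print(model_info("deepseek-ai/Janus-Pro-7B").pipeline_tag)  # any-to-any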

Thanks! Which VLMs can I use with the HF Inference API?


This? Qwen/Qwen2-VL-7B-Instruct
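If it is served by an inference provider, a vision chat call through huggingface_hub’s InferenceClient would look roughly like this (a sketch; the image URL is just a placeholder):

from huggingface_hub import InferenceClient

client = InferenceClient(model="Qwen/Qwen2-VL-7B-Instruct")

# OpenAI-style multimodal chat message: text plus an image URL.
response = client.chat_completion(
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Describe this image."},
            {"type": "image_url", "image_url": {"url": "https://example.com/cat.png"}},
        ],
    }],
    max_tokens=256,
)
print(response.choices[0].message.content)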

Sorry! This model returns this:

huggingface_hub.errors.HfHubHTTPError: 503 Server Error: Service Temporarily Unavailable for url: https://router.huggingface.co/hf-inference/models/Qwen/Qwen2.5-VL-7B-Instruct/v1/chat/completions

And I can rule out a network problem, because I can run microsoft/Phi-3-mini-4k-instruct.
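For comparison, the text-only call that works for me looks roughly like this (a sketch; the task value is an assumption):

from langchain_huggingface import HuggingFaceEndpoint

# Text-only control: this model responds normally, so the 503
# above is specific to the VLM route, not my network.
llm = HuggingFaceEndpoint(
    repo_id="microsoft/Phi-3-mini-4k-instruct",
    task="text-generation",  # assumed; the standard tag for this model
    max_new_tokens=128,
)
print(llm.invoke("Hello!"))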
