Instructions to use deepseek-ai/DeepSeek-Prover-V2-671B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use deepseek-ai/DeepSeek-Prover-V2-671B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="deepseek-ai/DeepSeek-Prover-V2-671B", trust_remote_code=True) messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-Prover-V2-671B", trust_remote_code=True) model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-Prover-V2-671B", trust_remote_code=True) messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- HuggingChat
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use deepseek-ai/DeepSeek-Prover-V2-671B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "deepseek-ai/DeepSeek-Prover-V2-671B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "deepseek-ai/DeepSeek-Prover-V2-671B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/deepseek-ai/DeepSeek-Prover-V2-671B
- SGLang
How to use deepseek-ai/DeepSeek-Prover-V2-671B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "deepseek-ai/DeepSeek-Prover-V2-671B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "deepseek-ai/DeepSeek-Prover-V2-671B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "deepseek-ai/DeepSeek-Prover-V2-671B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "deepseek-ai/DeepSeek-Prover-V2-671B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use deepseek-ai/DeepSeek-Prover-V2-671B with Docker Model Runner:
docker model run hf.co/deepseek-ai/DeepSeek-Prover-V2-671B
They really want to solve math, don't they?
Self evident from the research direction of that model
who knows? all of a sudden
671B,maybe train from deepseek v3 as base model ,hope this just start
can this solve the question of why girls won't talk to me at my college??
can this solve the question of why girls won't talk to me at my college??
If you can help em solve math problems with this model
can this solve the question of why girls won't talk to me at my college??
This is too difficult for LLM😁
can this solve the question of why girls won't talk to me at my college??
This is too difficult for LLM😁
much more difficult for human ...
can this solve the question of why girls won't talk to me at my college??
Top-tier question for humanity,too difficult for LLM
"can this solve the question of why girls won't talk to me at my college??"
easy answer: you found yourself in a discussion section of math prover model 10 minutes after release 😭
can this solve the question of why girls won't talk to me at my college??
Top Ten Unsolved Mysteries of Humanity
671B,maybe train from deepseek v3 as base model ,hope this just start
config.json says so "architectures": [
"DeepseekV3ForCausalLM"
],
can this solve the question of why girls won't talk to me at my college??
Bro, please accept my condolences and stay strong.
can this solve the question of why girls won't talk to me at my college??
AGI might able to solve that
or We might need ASI for that task stay strong till then