Instructions to use YonatanDavidov/qasem-he-dictalm2-full with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use YonatanDavidov/qasem-he-dictalm2-full with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="YonatanDavidov/qasem-he-dictalm2-full")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("YonatanDavidov/qasem-he-dictalm2-full")
model = AutoModelForCausalLM.from_pretrained("YonatanDavidov/qasem-he-dictalm2-full")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use YonatanDavidov/qasem-he-dictalm2-full with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "YonatanDavidov/qasem-he-dictalm2-full"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "YonatanDavidov/qasem-he-dictalm2-full",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/YonatanDavidov/qasem-he-dictalm2-full

SGLang

How to use YonatanDavidov/qasem-he-dictalm2-full with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "YonatanDavidov/qasem-he-dictalm2-full" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "YonatanDavidov/qasem-he-dictalm2-full",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "YonatanDavidov/qasem-he-dictalm2-full" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "YonatanDavidov/qasem-he-dictalm2-full",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use YonatanDavidov/qasem-he-dictalm2-full with Docker Model Runner:
```
docker model run hf.co/YonatanDavidov/qasem-he-dictalm2-full
```

QASem Hebrew Full Model (DictaLM 2.0)

This model performs QA-based semantic parsing (QASem) in Hebrew.

Overview

This repository provides a fully fine-tuned model for performing QA-based semantic parsing (QASem) in Hebrew.

QASem represents predicate–argument structure using natural-language question–answer pairs, rather than predefined semantic role labels. This makes the representation more interpretable and flexible across languages.

The model is based on:

Base model: dicta-il/dictalm2.0-instruct

and was fully fine-tuned for QA-based semantic parsing.

✨ Why this model matters

Traditional semantic role labeling methods rely on fixed label schemas and costly expert annotation.

This model takes a different approach by:

Representing semantics using natural-language question–answer pairs
Enabling automatic dataset construction via cross-lingual projection
Supporting scalable semantic parsing across languages
Achieving strong performance with efficient fine-tuned models

This makes it possible to build semantic parsers for new languages with minimal cost.

Use Cases

This model can be used for:

Research in QA-based semantic parsing (QASem) and semantic representation learning
Extraction of predicate–argument structures from Hebrew text
Automatic dataset creation for training semantic models in new languages
Downstream NLP applications such as:
- Information extraction
- Text understanding
- Factuality and attribution evaluation

Language

Hebrew 🇮🇱

Training Data

The model was trained on the Multilingual QASem Dataset:

👉 https://huggingface.co/datasets/biu-nlp/Multilingual_QASem_Datasets

The dataset includes:

Automatically generated QASem annotations
Train / Development / Test splits
Multiple languages: French, Hebrew, Russian
Tens of thousands of QA pairs per language

The data was constructed using a cross-lingual projection approach, ensuring scalability across languages.

📄 Associated Work

This model and the underlying dataset are introduced in: Effective QA-Driven Annotation of Predicate-Argument Relations Across Languages.

The paper presents the full methodology, dataset construction process, and evaluation across multiple languages.

🚀 Quick Start (Recommended)

Using the XQASem Parser

For a simple and structured interface, you can use the XQASem parser.

Installation

pip install xqasem

Basic Example

from xqasem import XQasemParser

parser = XQasemParser.from_language("he")

sentences = [
    "המומחים הדגישו שהאלגוריתם החדש מאיץ משמעותית את עיבוד הבקשות המורכבות."
]

df = parser(sentences)

print(df)

Output Format

The model produces structured predicate–argument representations in the form of:

A predicate (verb or nominal)
A natural-language question
A corresponding answer span from the sentence

This structure can be easily converted into tabular or JSON format for downstream use.

Example Output

sentence	predicate	predicate_type	question	answer
המומחים הדגישו שהאלגוריתם החדש מאיץ משמעותית את עיבוד הבקשות המורכבות.	הדגישו	verb	מי הדגיש משהו?	המומחים
המומחים הדגישו שהאלגוריתם החדש מאיץ משמעותית את עיבוד הבקשות המורכבות.	הדגישו	verb	מה מישהו הדגיש?	שהאלגוריתם החדש מאיץ משמעותית את עיבוד הבקשות המורכבות
המומחים הדגישו שהאלגוריתם החדש מאיץ משמעותית את עיבוד הבקשות המורכבות.	מאיץ	verb	מה מאיץ משהו?	האלגוריתם החדש
המומחים הדגישו שהאלגוריתם החדש מאיץ משמעותית את עיבוד הבקשות המורכבות.	מאיץ	verb	מה משהו מאיץ?	את עיבוד הבקשות המורכבות

👉 For more details and advanced usage, see the project repository:
https://github.com/JohnnieDavidov/xqasem

Manual Model Loading (Advanced)

from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "YonatanDavidov/qasem-he-dictalm2-full"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

Limitations

Performance may degrade on out-of-domain text
Complex or ambiguous predicates may lead to inconsistent outputs
The model is optimized for QASem-style generation and not for general-purpose text generation

📄 Citation

If you use this model, please cite our work:

@inproceedings{davidov-etal-2026-effective,
    title = "Effective {QA}-Driven Annotation of Predicate{--}Argument Relations Across Languages",
    author = "Davidov, Jonathan  and
      Slobodkin, Aviv  and
      Klein, Shmuel Tomi  and
      Tsarfaty, Reut  and
      Dagan, Ido  and
      Klein, Ayal",
    editor = "Demberg, Vera  and
      Inui, Kentaro  and
      Marquez, Llu{\'i}s",
    booktitle = "Proceedings of the 19th Conference of the {E}uropean Chapter of the {A}ssociation for {C}omputational {L}inguistics (Volume 1: Long Papers)",
    month = mar,
    year = "2026",
    address = "Rabat, Morocco",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2026.eacl-long.112/",
    doi = "10.18653/v1/2026.eacl-long.112",
    pages = "2484--2502",
    ISBN = "979-8-89176-380-7",
}

Downloads last month: 147

Safetensors

Model size

7B params

Tensor type

F16

Model tree for YonatanDavidov/qasem-he-dictalm2-full

Base model

dicta-il/dictalm2.0

Finetuned

dicta-il/dictalm2.0-instruct