The bilingual English/Chinese Baichuan2-7B-Chat VLM trained via LORA for https://arxiv.org/abs/2406.11665.

The Chinese half of the training data used for multimodal alignment and visual instruction tuning is sampled from here.

Safetensors

Model size

8B params

Tensor type

F32

F16

Dataset used to train amitha/mllava-baichuan2-en-zh

Paper for amitha/mllava-baichuan2-en-zh