Rziane/mbert-kreyol-RH
Continued masked language modeling (MLM) pre-training of bert-base-multilingual-cased on a custom text corpus.
Training
- base model: bert-base-multilingual-cased
- epochs: 8.0
- batch size (per device): 64
- gradient accumulation: 1
- learning rate: 5e-05
- scheduler: cosine (warmup ratio 0.06)
- weight decay: 0.01
- max length: 128
- mlm probability: 0.15
- precision: fp16
- seed: 42
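For readers who want to reproduce a similar run, the settings above can be sketched as keyword arguments for `transformers.TrainingArguments` plus a `DataCollatorForLanguageModeling`. This is a minimal sketch, not the original training script: mapping "warmup 0.06" to `warmup_ratio` is an assumption, and `build_trainer_inputs`, `effective_batch_size`, and the output directory name are hypothetical helpers introduced for illustration. The corpus loading and tokenization step (truncating to `MAX_LENGTH`) is left out because the card does not describe it.

```python
# Hyperparameters copied from the list above. The keys are intended to match
# transformers.TrainingArguments fields; "warmup 0.06" -> warmup_ratio is an
# assumption about how the warmup was configured.
TRAINING_KWARGS = {
    "num_train_epochs": 8.0,
    "per_device_train_batch_size": 64,
    "gradient_accumulation_steps": 1,
    "learning_rate": 5e-5,
    "lr_scheduler_type": "cosine",
    "warmup_ratio": 0.06,
    "weight_decay": 0.01,
    "fp16": True,  # mixed precision; requires a CUDA-capable GPU
    "seed": 42,
}
BASE_MODEL = "bert-base-multilingual-cased"
MAX_LENGTH = 128        # truncation length to use when tokenizing the corpus
MLM_PROBABILITY = 0.15  # fraction of tokens selected for masking


def build_trainer_inputs(output_dir="mbert-mlm-out"):
    """Hypothetical helper: returns (tokenizer, model, collator, args)."""
    # Imported lazily so the constants above can be inspected without
    # transformers/torch installed.
    from transformers import (
        AutoModelForMaskedLM,
        AutoTokenizer,
        DataCollatorForLanguageModeling,
        TrainingArguments,
    )

    tok = AutoTokenizer.from_pretrained(BASE_MODEL)
    model = AutoModelForMaskedLM.from_pretrained(BASE_MODEL)
    # The collator applies dynamic masking to each batch at training time.
    collator = DataCollatorForLanguageModeling(
        tokenizer=tok, mlm=True, mlm_probability=MLM_PROBABILITY
    )
    args = TrainingArguments(output_dir=output_dir, **TRAINING_KWARGS)
    return tok, model, collator, args


def effective_batch_size(num_devices):
    """Sequences contributing to one optimizer step across all devices."""
    return (
        TRAINING_KWARGS["per_device_train_batch_size"]
        * TRAINING_KWARGS["gradient_accumulation_steps"]
        * num_devices
    )
```

The returned objects can be passed to `transformers.Trainer` together with a tokenized dataset. The number of devices used for the original run is not stated, so the effective batch size is only known per device.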
Final eval
- eval_loss: 1.5256
- eval_runtime: 406.4867
- eval_samples_per_second: 584.1000
- eval_steps_per_second: 4.5630
- epoch: 8.0000
- perplexity: 4.5978
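The reported perplexity is consistent with the exponential of the evaluation loss, the usual convention for MLM evaluation. The short check below recomputes it from the rounded figures above and also derives two quantities the card does not state directly, so treat them as estimates: the approximate number of evaluation samples and the number of samples per evaluation step. The variable names are illustrative.

```python
import math

# Values copied from the "Final eval" list (already rounded in the card).
eval_loss = 1.5256
eval_runtime_s = 406.4867
eval_samples_per_second = 584.1
eval_steps_per_second = 4.563

# Perplexity is exp(mean cross-entropy loss in nats).
perplexity = math.exp(eval_loss)

# Derived estimates, not figures reported by the card.
approx_eval_samples = eval_runtime_s * eval_samples_per_second
approx_samples_per_step = eval_samples_per_second / eval_steps_per_second

print(f"perplexity: {perplexity:.4f}")
print(f"approx. eval samples: {approx_eval_samples:,.0f}")
print(f"approx. samples per eval step: {approx_samples_per_step:.1f}")
```

The recomputed perplexity can differ from the reported 4.5978 in the last digit because `eval_loss` is itself rounded to four decimals.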
Usage
from transformers import AutoTokenizer, AutoModelForMaskedLM

# Load the tokenizer and the masked-LM model from the Hugging Face Hub
tok = AutoTokenizer.from_pretrained('Rziane/mbert-kreyol-RH')
model = AutoModelForMaskedLM.from_pretrained('Rziane/mbert-kreyol-RH')