bertin-project/bertin-gpt-j-6B
Tags: Text Generation, Transformers, PyTorch, Safetensors, Spanish, gptj, causal-lm
Dataset: bertin-project/mc4-es-sampled
arXiv: 2104.09864, 2101.00027
License: apache-2.0
Branch: main
Directory: bertin-gpt-j-6B (48.4 GB)
History: 27 commits, 2 contributors
Latest commit: e47a5d9 (verified) by versae, "Update README.md", about 1 year ago
File                      Size            Last commit message                               Updated
.gitattributes            1.23 kB         Adding `safetensors` variant of this model (#2)   over 1 year ago
README.md                 10.6 kB         Update README.md                                  about 1 year ago
config.json               836 Bytes       Checkpoint 850001 float32                         over 3 years ago
merges.txt                456 kB          Add tokenizer                                     almost 4 years ago
model.safetensors         24.2 GB (xet)   Adding `safetensors` variant of this model (#2)   over 1 year ago
pytorch_model.bin         24.2 GB (xet)   Checkpoint 1000000 float32                        over 3 years ago
special_tokens_map.json   90 Bytes        Add tokenizer                                     almost 4 years ago
tokenizer.json            2.11 MB         Add tokenizer                                     almost 4 years ago
tokenizer_config.json     236 Bytes       Set model_max_length to 2048 as per the model     almost 3 years ago
vocab.json                798 kB          Add tokenizer                                     almost 4 years ago