Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Concyclics
/
PeoplesDaily-Qwen3-4B-Base
like
0
Text Generation
Safetensors
Concyclics/PeoplesDaily
Chinese
qwen3
news
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
SFT on Concyclics/PeoplesDaily:
SFT on Concyclics/PeoplesDaily:
batch_size: 96
epochs: 2
learning_rate: 1.0e-5
lr_scheduler_type: cosine
warmup_ratio: 0.1
total_flops: 483TFlops
train_loss: 1.646
Downloads last month
26
Safetensors
Model size
4B params
Tensor type
BF16
·
Chat template
Files info
Inference Providers
NEW
Text Generation
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for
Concyclics/PeoplesDaily-Qwen3-4B-Base
Base model
Qwen/Qwen3-4B-Base
Finetuned
(
162
)
this model
Finetunes
1 model
Quantizations
2 models
Dataset used to train
Concyclics/PeoplesDaily-Qwen3-4B-Base
Concyclics/PeoplesDaily
Viewer
•
Updated
1 day ago
•
109k
•
11