Can you share the learning settings

#23
by Forceless - opened

Thanks for this great work for @zai-org-3 !
I've been trying to fine-tune this model lately, but noticed the model seems to lack robustness during both training and inference.
I could only find the batch size and sequence length mentioned in the paper, but other important hyperparameters (e.g., learning rate) seem to be missing.

It would be very helpful for the community and me to use, thx!

Sign up or log in to comment